Approximate Tensorization of the Relative Entropy for Noncommuting Conditional Expectations

Bardet, Ivan; Capel, Ángela; Rouzé, Cambyse

doi:10.1007/s00023-021-01088-3

Approximate Tensorization of the Relative Entropy for Noncommuting Conditional Expectations

Original Paper
Open access
Published: 31 July 2021

Volume 23, pages 101–140, (2022)
Cite this article

Download PDF

You have full access to this open access article

Annales Henri Poincaré Aims and scope Submit manuscript

Approximate Tensorization of the Relative Entropy for Noncommuting Conditional Expectations

Download PDF

1788 Accesses
7 Citations
1 Altmetric
Explore all metrics

Abstract

In this paper, we derive a new generalisation of the strong subadditivity of the entropy to the setting of general conditional expectations onto arbitrary finite-dimensional von Neumann algebras. This generalisation, referred to as approximate tensorization of the relative entropy, consists in a lower bound for the sum of relative entropies between a given density and its respective projections onto two intersecting von Neumann algebras in terms of the relative entropy between the same density and its projection onto an algebra in the intersection, up to multiplicative and additive constants. In particular, our inequality reduces to the so-called quasi-factorization of the entropy for commuting algebras, which is a key step in modern proofs of the logarithmic Sobolev inequality for classical lattice spin systems. We also provide estimates on the constants in terms of conditions of clustering of correlations in the setting of quantum lattice spin systems. Along the way, we show the equivalence between conditional expectations arising from Petz recovery maps and those of general Davies semigroups.

Probability structures in subspace lattice approach to foundations of quantum theory

Article 01 April 2015

Calculus for Non-Compatible Observables, Construction Through Conditional States

Article 11 July 2014

On natural density, orthomodular lattices, measure algebras and non-distributive $$L^p$$ spaces

Article 03 September 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In the last few decades, entropy has been proven to be a fundamental object in various fields of mathematics and theoretical physics. Its quantum analogue characterizes the optimal rate at which two different states of a system can be discriminated when an arbitrary number of copies of the system is available. Given two states $\rho ,\sigma $ of a finite-dimensional von Neumann algebra ${{\mathcal {N}}}\subset {{\mathcal {B}}}({{\mathcal {H}}})$, it is given by

$$\begin{aligned} D(\rho \Vert \sigma ):= \mathop {\mathrm{Tr}}\nolimits [\rho \,(\ln \rho -\ln \sigma )]\,, \end{aligned}$$

whenever $\mathop {\mathrm{supp}}\nolimits (\rho )\subset \mathop {\mathrm{supp}}\nolimits (\sigma )$, where $\mathop {\mathrm{Tr}}\nolimits $ denotes the unnormalized trace on ${{\mathcal {B}}}({{\mathcal {H}}})$. When $\sigma :={\mathbb {1}}_{{{\mathcal {H}}}}/d_{{\mathcal {H}}}$ is the completely mixed state of ${{\mathcal {B}}}({{\mathcal {H}}})$, the relative entropy can be written in terms of the von Neumann entropy $S(\rho ):=-\mathop {\mathrm{Tr}}\nolimits [\rho \ln \rho ]$ of the state $\rho $:

$$\begin{aligned} D(\rho \Vert {\mathbb {1}}_{{\mathcal {H}}}/d_{{\mathcal {H}}})=-S(\rho )+\ln (d_{{\mathcal {H}}})\,. \end{aligned}$$

Probably the most fundamental property of entropy is the following strong subadditivity inequality (SSA) [34]: given a tripartite system ${{\mathcal {H}}}_{ABC}:={{\mathcal {H}}}_A\otimes {{\mathcal {H}}}_B\otimes {{\mathcal {H}}}_C$ and a state $\rho \equiv \rho _{ABC}$ on ${{\mathcal {H}}}_{ABC}$,

$$\begin{aligned} S(\rho _{ABC})+S(\rho _B)\le S(\rho _{AB})+S(\rho _{BC})\,, \end{aligned}$$

(SSA)

where for any subsystem D of ABC, $\rho _{D}:=\mathop {\mathrm{Tr}}\nolimits _{D^c}[\rho _{ABC}]$ denotes the marginal state on D. Restated in terms of the quantum relative entropy, (SSA) takes the following form:

$$\begin{aligned}&D\left( \rho _{ABC}\Big \Vert \rho _B\otimes \frac{{\mathbb {1}}_{AC}}{d_{{{\mathcal {H}}}_{AC}}}\right) \nonumber \\&\quad \le D\left( \rho _{ABC}\Big \Vert \rho _{AB}\otimes \frac{{\mathbb {1}}_{C}}{d_{{{\mathcal {H}}}_C}} \right) +D\left( \rho _{ABC}\Big \Vert \rho _{BC}\otimes \frac{{\mathbb {1}}_{A}}{d_{{{\mathcal {H}}}_{A}}} \right) \,. \end{aligned}$$

(1.1)

In the present paper, we consider the following more general framework: let ${\mathcal {M}}\subset {{\mathcal {N}}}_1,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}$ be four von Neumann subalgebras of the algebra of linear operators acting on a finite-dimensional Hilbert space ${{\mathcal {H}}}$, and let $E^{\mathcal {M}},E_1,E_2$ be conditional expectations onto ${\mathcal {M}},{{\mathcal {N}}}_1,{{\mathcal {N}}}_2$, respectively. When the quadruple $({\mathcal {M}},{{\mathcal {N}}}_1,{{\mathcal {N}}}_2,{{\mathcal {N}}})$ forms a commuting square, that is when $E_{1}\circ E_2=E_2\circ E_1=E^{\mathcal {M}}$, the following generalization of SSA occurs: for any state $\rho $ on ${{\mathcal {N}}}$,

$$\begin{aligned} D(\rho \Vert E^{{\mathcal {M}}}_*(\rho ))\le D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho ))\,, \end{aligned}$$

(1.2)

where the maps $E^{\mathcal {M}}_*,E_{1*}$, $E_{2*}$ are the Hilbert-Schmidt duals of $E^{\mathcal {M}}, E_{1}, E_2$, also known as coarse-graining maps [38]. One can easily recover the previous (SSA) inequality from (1.2) by taking ${{\mathcal {N}}}\equiv {{\mathcal {B}}}({{\mathcal {H}}}_{ABC})$, and the coarse-graining maps to be the partial traces onto the subalgebras ${{\mathcal {N}}}_1\equiv {{\mathcal {B}}}({{\mathcal {H}}}_{AB}) $, ${{\mathcal {N}}}_2\equiv {{\mathcal {B}}}({{\mathcal {H}}}_{BC})$ and ${\mathcal {M}}\equiv {{\mathcal {B}}}({{{\mathcal {H}}}_B})$, respectively. Thus, inequality (1.2) can be seen as an operator algebraic generalization of the (SSA) inequality.

However, the commuting square assumption and subsequently inequality (1.2) are not satisfied in most of the cases of interest that appear in information-theoretical settings or quantum many-body systems. Indeed, in the context of interacting lattice spin systems, conditional expectations arising, e.g. from the large time limit of a dissipative evolution on subregions of the lattice generally do not satisfy the commuting square assumption. In this case, approximations of the (SSA) were found in the classical case (i.e. when all algebras are commutative) and when ${\mathcal {M}}\equiv {\mathbb {C}}{\mathbb {1}}_{{{\mathcal {H}}}}$ [14]. For classical lattice spin systems, these inequalities, termed as approximate tensorization of the relative entropy (also known in the literature as quasi-factorization of the relative entropy [14, 17]), take the following form

$$\begin{aligned} D(\rho \Vert \sigma )\le \frac{1}{1-2c_1}\,\big (D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho ))\big )\,, \end{aligned}$$

(1.3)

where $\sigma := E^{{\mathcal {M}}}_*(\rho )$ for all states $\rho $, and $c_1:=\Vert E_{1}\circ E_{2}-E^{\mathcal {M}}:\,{\mathbb {L}}_1(\sigma )\rightarrow {\mathbb {L}}_\infty ({{\mathcal {N}}})\Vert $ is a constant that measures the violation of the commuting square condition for the quadruple $({\mathcal {M}},{{\mathcal {N}}}_1,{{\mathcal {N}}}_2,{{\mathcal {N}}})$. For reasons that will become clear in the remaining parts of the article, we refer to the constant $c_1$ as the clustering of correlations constant in this introduction.

An inequality of the form of (1.3) is the main ingredient in modern proofs of modified logarithmic Sobolev inequalities (MLSI) which govern the rapid thermalization of classical lattice spin systems evolving according to a Glauber dynamics and in the high temperature regime [14, 17]. Furthermore, the aforementioned quantum versions of (1.3) for different conditional relative entropies have been used in the past years to obtain some examples of positive MLSI for quantum spin systems [3, 10, 11]. Our main motivation in the current paper is a continuation of those results by further generalizing (1.3) to a more abstract setting, with the aim of providing new interesting examples of positive MLSI. In fact, after the first version of this manuscript, the main results contained here have allowed some of the authors to solve a long-standing open problem regarding a system-size independent MLSI for certain evolutions that converge to Gibbs states of nearest-neighbour commuting Hamiltonians at high enough temperature in [12].

Main results: In this paper, building on the previous results of approximate tensorization of the form of (1.3), we take one step further and introduce a weak approximate tensorization for the relative entropy, denoted throughout the text by AT(c, d), which amounts to the existence of positive constants $c\ge 1$ and $d\ge 0$ such that (see Theorem 2)^{Footnote 1}

$$\begin{aligned} D(\rho \Vert E^{\mathcal {M}}_*(\rho ))\le c\,\big (D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho ))\big )+d\,. \end{aligned}$$

(AT(c,d))

Whenever $d=0$, we refer to the previous bound as a strong approximate tensorization for the relative entropy. Nevertheless, as opposed to the classical setting, conditional expectations arising from dissipative evolutions on quantum lattice spin systems generically do not satisfy the commuting square condition even at infinite temperature. This difference is exclusively due to the non-commutativity of the underlying algebras. The additive constant d is meant to take into account this correction from the classical case.

Note that, at infinite temperature, the conditional expectations are selfadjoint with respect to the Hilbert-Schmidt inner product, a property referred to as symmetric in [4, 21]. Under this condition, in [20], a different extension of (SSA) was proposed. In our framework, the inequality derived in [20] leads to an AT(1, d), which can be regarded as measuring the violation of the commutative square condition at infinite temperature. On the other hand, our strong approximate tensorization constant c can be regarded as a finite temperature relaxation of the case $c=1$ in [20].

The first AT(c, d) inequality that we obtain is presented in Proposition 2, where we use the change of measure argument from [27] in order to directly connect the previous AT(1, d) inequality from [20] for symmetric conditional expectations to an AT$(c,d')$ inequality for the general case, where c is a spectral quantity depending solely on the invariant states of the smallest algebra ${\mathcal {M}}$ and $d'$ is proportional to d. In particular, whenever $d=0$, this results allows us to transfer strong approximate tensorization for symmetric conditional expectations to strong approximate tensorization for general conditional expectations. However, in this inequality the multiplicative constant cannot be related to the clustering of correlations constant $c_1$ in the case of interacting systems, and can be in general exponentially larger. Our main result, stated in Theorem 2, precisely fills this gap. Moreover, the inequality reduces to the classical inequality of [14] for commutative algebras.

In Sect. 5, we apply the previous results on weak approximate tensorization to the context of lattice spin systems with commuting Hamiltonians. In particular, we show in Theorem 3 that classical evolutions over quantum systems (termed embedded Glauber dynamics) satisfy AT(c, 0) with the same constant as in the classical case. As an independent but important result, we also prove in Theorem 1, that the conditional expectations associated to the heat-bath dynamics and Davies dynamics coincide. This, in particular, allows us to transfer various results of remarkable interest that have been proven in the past years for one of the dynamics to the other, and vice versa.

Applications As mentioned previously, the main application of these inequalities is in the context of mixing times of continuous-time local Markovian evolutions over quantum lattice spin systems - although we expect these inequalities and their proof techniques to find other applications in quantum information theory. In [14], Cesi used his inequality in order to show the exponential convergence in relative entropy of classical Glauber dynamics on lattice systems towards equilibrium, independently of the lattice size, in the form of a positive MLSI constant (defined in Sect. 4.1). In a subsequent paper [12] that appeared after the first version of the current manuscript, we made use of the approximate tensorization inequality to show similar convergences for dissipative quantum Gibbs samplers.

Moreover, in this paper we illustrate the potential of these techniques in the aforementioned context of mixing times by estimating the MLSI constant whenever the generator of the dynamics is constructed from Pinching onto a pair of different, orthonormal bases. Additionally, we use our main results in approximate tensorization to obtain new entropic uncertainty relations in Sect. 4.2.

Outline of the paper In Sect. 2, we review basic mathematical concepts used in this paper, and more particularly the notion of a non-commutative conditional expectation. We derive theoretical expressions on the strong (c) and weak (d) constants for general von Neumann algebras in Sect. 3, where our main result is stated as Theorem 2. We subsequently apply them to obtain strengthenings of uncertainty relations and examples of positivity of MLSI in Sect. 4. Moreover, in Sect. 5, we derive explicit bounds on the constants c and d for conditional expectations associated to Gibbs samplers on lattice spin systems in terms of the interactions of the corresponding Hamiltonian. In Sect. 6, we discuss the results presented in our paper and how they have been applied to different contexts after the appearance of the first version of our manuscript. Finally, in Appendix A, we review the conditional expectations arising from Petz recovery maps and from Davies generators and show in that both conditional expectations coincide. We conclude by collecting the proofs of some technical results in Appendix B.

2 Notations and Definitions

In this section, we fix the basic notation used in the paper, and introduce the necessary definitions.

2.1 Basic Notations

Let $({{\mathcal {H}}},\langle .|.\rangle )$ be a finite-dimensional Hilbert space of dimension $d_{{\mathcal {H}}}$. We denote by ${{\mathcal {B}}}({{\mathcal {H}}})$ the Banach space of bounded operators on ${{\mathcal {H}}}$, by ${{\mathcal {B}}}_{\mathrm{sa}}({{\mathcal {H}}})$ the subspace of self-adjoint operators on ${{\mathcal {H}}}$, and by ${{\mathcal {B}}}_+({{\mathcal {H}}})$ the cone of positive semidefinite operators on ${{\mathcal {H}}}$. The adjoint of an operator Y is written as $Y^*$. We will also use the same notations ${{\mathcal {N}}}_{\mathrm{sa}}$ and ${{\mathcal {N}}}_+$ in the case of a von Neumann subalgebra ${{\mathcal {N}}}$ of ${{\mathcal {B}}}({{\mathcal {H}}})$. The identity operator on ${{\mathcal {N}}}$ is denoted by ${\mathbb {1}}_{{\mathcal {N}}}$, dropping the index ${{\mathcal {N}}}$ when it is unnecessary. In the case of ${{\mathcal {B}}}({\mathbb {C}}^\ell )$, $\ell \in {\mathbb {N}}$, we will also use the notation ${\mathbb {1}}$ for ${\mathbb {1}}_{{\mathbb {C}}^\ell }$. Similarly, given a map $\Phi :{{\mathcal {B}}}({{\mathcal {H}}})\rightarrow {{\mathcal {B}}}({{\mathcal {H}}})$, we denote its dual with respect to the Hilbert-Schmidt inner product as $\Phi _*$. We also denote by ${\mathrm{id}}_{{{\mathcal {B}}}({{\mathcal {H}}})}$, or simply ${\mathrm{id}}$, resp. ${\mathrm{id}}_\ell $, the identity superoperator on ${{\mathcal {B}}}({{\mathcal {H}}})$, resp. ${{\mathcal {B}}}({\mathbb {C}}^\ell )$. We denote by $\mathcal {D}({{\mathcal {H}}})$ the set of positive semidefinite, trace-one operators on ${{\mathcal {H}}}$, also called density operators, by ${{\mathcal {D}}}_+({{\mathcal {H}}})$ the subset of full-rank density operators, and by ${{\mathcal {D}}}_{\le }({{\mathcal {H}}})$ the set of subnormalized density operators. In the following, we will often identify a density matrix $\rho \in \mathcal {D}({{\mathcal {H}}})$ and the state it defines, that is the positive linear functional ${{\mathcal {B}}}({{\mathcal {H}}})\ni X\mapsto \mathop {\mathrm{Tr}}\nolimits (\rho \,X)$. More generally, given a von Neumann subalgebra ${{\mathcal {N}}}\subseteq {{\mathcal {B}}}({{\mathcal {H}}})$ with block decomposition ${{\mathcal {N}}}:=\bigoplus _l{\mathbb {M}}_{n_l}\otimes {\mathbb {1}}_{m_l}$, we denote by ${{\mathcal {D}}}({{\mathcal {N}}})$ the set of states of the form

$$\begin{aligned} \sigma :=\bigoplus _{l} p_l\,\rho _l\otimes \tau _l\,, \end{aligned}$$

(2.1)

for some $n_l\times n_l$ states $\rho _l$ and $ m_l\times m_l$ full-rank states $\tau _l$. The sets ${{\mathcal {D}}}({{\mathcal {N}}})_+$ and ${{\mathcal {D}}}({{\mathcal {N}}})_{\le }$ are defined similarly.

2.2 Entropic Quantities and ${\mathbb {L}}_p$ Spaces

Throughout this paper, we will use various distance measures between states and between observables: given a state $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$, its von Neuman entropy is defined by

$$\begin{aligned} S(\rho ):=-\mathop {\mathrm{Tr}}\nolimits \big [ \rho \,\ln \rho \big ]\,. \end{aligned}$$

When $\rho \equiv \rho _{AB}\in {{\mathcal {D}}}({{\mathcal {H}}}_A\otimes {{\mathcal {H}}}_B)$ is the state of a bipartite quantum system, its conditional entropy is defined by

$$\begin{aligned} S(A|B)_\rho :=S(\rho _{AB})-S(\rho _B)\,, \end{aligned}$$

where $\rho _B:=\mathop {\mathrm{Tr}}\nolimits _A(\rho )$ corresponds to the marginal of $\rho $ over the subsystem ${{\mathcal {H}}}_B$. More generally, given two positive semidefinite operators $\rho ,\sigma \in {{\mathcal {B}}}_+({{\mathcal {H}}})$, the relative entropy between $\rho $ and $\sigma $ is defined as follows [44]:

$$\begin{aligned} D(\rho \Vert \sigma ):=\left\{ \begin{aligned}&\mathop {\mathrm{Tr}}\nolimits [\rho \,(\ln \rho -\ln \sigma )]\,\,\,\,\mathop {\mathrm{supp}}\nolimits (\rho )\subset \mathop {\mathrm{supp}}\nolimits (\sigma )\\&+\infty \,\,\qquad \qquad \qquad \text {else} \end{aligned}\right. \end{aligned}$$

Moreover, given (possibly subnormalized) positive semidefinite operators $\rho \ge 0$ and $\sigma >0$, their max-relative entropy is defined as [18]:

$$\begin{aligned} D_{\max }(\rho \Vert \sigma ):= \inf \{\lambda |\,\rho \le {rm e}^{\lambda }\sigma \} \equiv \ln \,(\Vert \sigma ^{-\frac{1}{2}}\,\rho \,\sigma ^{-\frac{1}{2}}\Vert _\infty )\,. \end{aligned}$$

From the max-relative entropy, we can define the max-information of a (possibly subnormalized) bipartite state $\rho _{AB}\in {{\mathcal {D}}}_{\le }({{\mathcal {H}}}_A\otimes {{\mathcal {H}}}_B)$ as follows [8]:

$$\begin{aligned} I_{\max }(A:B)_{\rho }\equiv I_{\max }({{\mathcal {H}}}_A:{{\mathcal {H}}}_B)_{\rho } :=\inf _{\tau _{B}\in {{\mathcal {D}}}({{\mathcal {H}}})}\,D_{\max }(\rho _{AB}\Vert \rho _A\otimes \tau _B)\,. \end{aligned}$$

Given a subalgebra ${{\mathcal {N}}}$ of ${{\mathcal {B}}}({{\mathcal {H}}})$ and $\sigma \in {{\mathcal {D}}}_+({{\mathcal {N}}})$, we define the modular maps $\Gamma _\sigma :{{\mathcal {N}}}\rightarrow {{\mathcal {B}}}({{\mathcal {H}}})$ and $\Delta _\sigma :{{\mathcal {N}}}\rightarrow {{\mathcal {N}}}$ as follows

$$\begin{aligned} \Gamma _\sigma (X):=\sigma ^{1/2}\,X\,\sigma ^{1/2}\, , \qquad \Delta _\sigma (X)=\sigma \,X\,\sigma ^{-1}\,. \end{aligned}$$

Then for any $p\ge 1$ and $X\in {{\mathcal {N}}}$, its non-commutative weighted ${\mathbb {L}}_p(\sigma )$-norm is defined as [30]:

$$\begin{aligned} \Vert X\Vert _{{\mathbb {L}}_p(\sigma )}:=\mathop {\mathrm{Tr}}\nolimits \left[ \big |\Gamma _\sigma ^{\frac{1}{p}}(X) \big |^p \right] ^{\frac{1}{p}}\, , \end{aligned}$$

and $\Vert X\Vert _{{\mathbb {L}}_{\infty }(\sigma )}=\Vert X\Vert _\infty $, the operator norm of X, which we will also often more simply denote by $\Vert X \Vert $. We call the space ${{\mathcal {B}}}({{\mathcal {H}}})$ endowed with the norm $\Vert .\Vert _{{\mathbb {L}}_p(\sigma )}$ the quantum ${\mathbb {L}}_p(\sigma )$ space. In the case $p=2$, we have a Hilbert space, with corresponding $\sigma $-KMS scalar product

$$\begin{aligned} \langle X,\,Y\rangle _\sigma :=\mathop {\mathrm{Tr}}\nolimits \left[ \sigma ^{1/2}X^*\sigma ^{1/2}Y \right] \,. \end{aligned}$$

(2.2)

Weighted ${\mathbb {L}}_p$ norms enjoy the following useful properties:

Hölder’s inequality: for any $p,{\hat{p}}\ge 1$ such that $p^{-1}+{\hat{p}}^{-1}=1$, and any $X,Y\in {{\mathcal {N}}}$:
$$\begin{aligned} \langle X,\,Y\rangle _\sigma \le \Vert X\Vert _{{\mathbb {L}}_p(\sigma )}\,\Vert Y\Vert _{{\mathbb {L}}_{{\hat{p}}}(\sigma )}\,. \end{aligned}$$
Here, ${\hat{p}}$ is the Hölder conjugate of p.
Duality of norms: for any $p\ge 1$ of Hölder conjugate ${\hat{p}}$, and any $X\in {{\mathcal {N}}}$:
$$\begin{aligned} \Vert X\Vert _{{\mathbb {L}}_p(\sigma )}=\sup _{\Vert Y\Vert _{{\mathbb {L}}_{{\hat{p}}}(\sigma )}\le 1}\,\langle Y,\,X\rangle _\sigma \,. \end{aligned}$$
For any completely positive, unital linear map $\Phi :{{\mathcal {N}}}\rightarrow {{\mathcal {N}}}$ such that $\Phi _*(\sigma )=\sigma $, any $p\ge 1$ and any $X\in {{\mathcal {N}}}$:
$$\begin{aligned} \Vert \Phi (X)\Vert _{{\mathbb {L}}_p(\sigma )}\le \Vert X\Vert _{{\mathbb {L}}_p(\sigma )}\,. \end{aligned}$$
(2.3)

2.3 Conditional Expectations

Here, we introduce the main object studied in this paper:

Definition 1

(Conditional expectations [37]). Let ${\mathcal {M}}\subset {{\mathcal {N}}}$ be a von Neumann subalgebra of ${{\mathcal {N}}}$. Given a state $\sigma \in {{\mathcal {D}}}_+({\mathcal {M}})$, a linear map $E:{{\mathcal {N}}}\rightarrow {\mathcal {M}}$ is called a conditional expectation with respect to $\sigma $ of ${{\mathcal {N}}}$ onto ${\mathcal {M}}$ if the following conditions are satisfied:

For all $X\in {{\mathcal {N}}}$, $\Vert E[X]\Vert \le \Vert X\Vert $;
For all $X\in {\mathcal {M}}$, $E[X]=X$;
For all $X\in {{\mathcal {N}}}$, $\mathop {\mathrm{Tr}}\nolimits [\sigma E[X]]=\mathop {\mathrm{Tr}}\nolimits [\sigma X]$.

A conditional expectation satisfies the following useful properties (see [42] for proofs and more details):

Proposition 1

Conditional expectations generically satisfy the following properties:

(i)
The map E is completely positive and unital.
(ii)
For any $X\in {{\mathcal {N}}}$ and any $Y,Z\in {\mathcal {M}}$, $E[YXZ]=Y E[X]Z$.
(iii)
E is self-adjoint with respect to the scalar product $\langle .,\,.\rangle _\sigma $. In other words:
$$\begin{aligned} \Gamma _\sigma \circ E=E_*\circ \Gamma _\sigma \,, \end{aligned}$$
where $E_*$ denotes the adjoint of E with respect to the Hilbert-Schmidt inner product.
(iv)
E commutes with the modular automorphism group of $\sigma $: for any $s\in {\mathbb {R}}$,
$$\begin{aligned} \Delta _\sigma ^{is}\circ E=E\circ \Delta ^{is}_\sigma \,. \end{aligned}$$
(2.4)
(v)
Uniqueness: given a von Neumann subalgebra ${\mathcal {M}}\subset {{\mathcal {N}}}$ and a faithful state $\sigma $, the existence of a conditional expectation E is equivalent to the invariance of ${\mathcal {M}}$ under the modular automorphism group $(\Delta _\sigma ^{is})_{s\in {\mathbb {R}}}$. In this case, E is uniquely determined by $\sigma $.

From now on, and with a slight abuse of notations, given the finite-dimensional von Neumann subalgebra ${{\mathcal {N}}}=E[{{\mathcal {B}}}({{\mathcal {H}}})]$ of ${{\mathcal {B}}}({{\mathcal {H}}})$, we denote by ${{\mathcal {D}}}({{\mathcal {N}}}):= E_{*}({{\mathcal {D}}}({{\mathcal {H}}}))$ its corresponding set of states that are invariant by E, so that ${{\mathcal {D}}}({{\mathcal {H}}})\equiv {{\mathcal {D}}}({{\mathcal {B}}}({{\mathcal {H}}}))$. In other words, the states $\tau _l$ in the decomposition (2.1) are now fixed by E. Similarly, the set of subnormalized states on the algebra ${{\mathcal {N}}}$ is defined as ${{\mathcal {D}}}_{\le }({{\mathcal {N}}}) $. We also introduce the concept of a conditional covariance: given a von Neumann-subalgebra ${\mathcal {M}}\subset {{\mathcal {N}}}$, a conditional expectation $E^{\mathcal {M}}$ from ${{\mathcal {N}}}$ onto ${\mathcal {M}}$ and a quantum state $\sigma \in {{\mathcal {D}}}_+({\mathcal {M}})$, where ${{\mathcal {D}}}({\mathcal {M}})$ is defined with respect to $E^{\mathcal {M}}$, we define the conditional covariance functional as follows: for any two $X,Y\in {{\mathcal {N}}}$,

$$\begin{aligned} \mathrm{Cov}_{{\mathcal {M}},\sigma }(X,Y):=\langle X-E^{\mathcal {M}}[X],\,Y-E^{\mathcal {M}}[Y]\rangle _\sigma \,. \end{aligned}$$

(2.5)

2.4 Two Examples of Classes of Conditional Expectations

In this subsection, we provide more details about the conditional expectations that we will consider in the case of Gibbs states on lattice spin systems in Sect. 5. Some properties and new results of independent interest regarding these conditional expectations are deferred to Appendix A for sake of clarity.

2.4.1 Conditional Expectations Generated by a Petz Recovery Map

Let $\sigma $ be a faithful density matrix on a finite-dimensional algebra ${{\mathcal {N}}}$ and let ${\mathcal {M}}\subset {{\mathcal {N}}}$ be a subalgebra. We denote by $E_\tau $ the conditional expectation onto ${\mathcal {M}}$ with respect to the completely mixed state (i.e. $E_\tau $ is self-adjoint with respect to the Hilbert-Schmidt inner product). We also adopt the following notations: we write $\sigma _{\mathcal {M}}=E_\tau (\sigma )$ and

$$\begin{aligned} {\mathcal {A}}_\sigma (X):=\sigma _{\mathcal {M}}^{-\frac{1}{2}}\,E_\tau [\sigma ^{\frac{1}{2}}\,X\,\sigma ^{\frac{1}{2}}]\,\sigma _{\mathcal {M}}^{-\frac{1}{2}}\,. \end{aligned}$$

Remark that ${\mathcal {A}}_\sigma $ is also the unique map such that for all $X\in {{\mathcal {N}}}$ and all $Y\in {\mathcal {M}}$:

$$\begin{aligned} \mathop {\mathrm{Tr}}\nolimits [\sigma ^{\frac{1}{2}}\,X\,\sigma ^{\frac{1}{2}}\,Y]=\mathop {\mathrm{Tr}}\nolimits [\sigma _{\mathcal {M}}^{\frac{1}{2}}\,{\mathcal {A}}_\sigma (X)\,\sigma _{\mathcal {M}}^{\frac{1}{2}}\,Y]\,. \end{aligned}$$

The adjoint of ${\mathcal {A}}_\sigma $ is the Petz recovery map of $E_\tau $ with respect to $\sigma $, denoted by ${{\mathcal {R}}}_\sigma $:

$$\begin{aligned} \mathcal {R}_{\sigma }(\rho _{\mathcal {M}}):=\sigma ^{\frac{1}{2}}\sigma _{\mathcal {M}}^{-\frac{1}{2}}\rho _{\mathcal {M}}\sigma _{\mathcal {M}}^{-\frac{1}{2}}\sigma ^{\frac{1}{2}}\,, \end{aligned}$$

where $\rho _{\mathcal {M}}:=E_\tau (\rho )$. It is proved in [13] that ${\mathcal {A}}_\sigma $ is a conditional expectation if and only if $\sigma \,X\,\sigma ^{-1}\in {\mathcal {M}}$ for all $X\in {\mathcal {M}}$. In the general case, we denote by

$$\begin{aligned} E_\sigma :=\lim _{n\rightarrow \infty }{\mathcal {A}}_\sigma ^n \end{aligned}$$

(2.6)

the projection on its fixed-point algebra for the $\sigma $-KMS inner product, which is a conditional expectation as we assumed $\sigma $ to be faithful. That is, $E_\sigma $ is the orthogonal projection for $\langle \cdot ,\cdot \rangle _\sigma $ on the algebra:

$$\begin{aligned} {\mathcal {F}}({\mathcal {A}}_\sigma )=\{X\in {{\mathcal {N}}}\,;\,{\mathcal {A}}_\sigma (X)=X\}\,. \end{aligned}$$

2.4.2 Conditional Expectations Coming from Davies Semigroups

The basic model for the evolution of an open system in the Markovian regime is given by a quantum Markov semigroup (or QMS) $(\mathcal {P}_t)_{t\ge 0}$ acting on ${{\mathcal {B}}}({{\mathcal {H}}})$. Such a semigroup is characterised by its generator, called the Lindbladian $\mathcal {L}$, which is defined on ${{\mathcal {B}}}({{\mathcal {H}}})$ by

$$\begin{aligned} {{\mathcal {L}}}(X)={\lim }_{t\rightarrow 0}\,\frac{1}{t}\,(\mathcal {P}_t(X)-X) \end{aligned}$$

for all $X\in {{\mathcal {B}}}({{\mathcal {H}}})$. Recall that by the GKLS Theorem [25, 35], ${{\mathcal {L}}}$ takes the following form: for all $X\in {{\mathcal {B}}}({{\mathcal {H}}})$,

$$\begin{aligned} {{\mathcal {L}}}(X)=i[H,X]+\frac{1}{2}\sum _{k=1}^l{\left[ 2\,L_k^*XL_k-\left( L_k^*L_k\,X+X\,L_k^*L_k\right) \right] }\, , \end{aligned}$$

(2.7)

where $H\in {{\mathcal {B}}}_{\mathrm{sa}}({{\mathcal {H}}})$, the sum runs over a finite number of Lindblad operators $L_k\in {{\mathcal {B}}}({{\mathcal {H}}})$, and $[\cdot ,\cdot ]$ denotes the commutator defined as $[X,Y]:=XY-YX$, $\forall X,Y\in {{\mathcal {B}}}({{\mathcal {H}}})$. The QMS is said to be faithful if it admits a full-rank invariant state $\sigma $. When the state $\sigma $ is the unique invariant state, the semigroup is called primitive. Further assuming the self-adjointness of the generator ${{\mathcal {L}}}$ with respect to the inner product (2.2) (or detailed balance condition), there exists a conditional expectation $E\equiv E_{\mathcal {F}}$ onto the fixed-point subalgebra ${\mathcal {F}}({{\mathcal {L}}}):=\{X\in {{\mathcal {B}}}({{\mathcal {H}}}):\,{{\mathcal {L}}}(X)=0\}$ such that

$$\begin{aligned} \mathcal {P}_t(X)\underset{t\rightarrow \infty }{\rightarrow }E[X]\, \end{aligned}$$

for all $X\in {{\mathcal {B}}}({{\mathcal {H}}})$.

We now focus on a particular class of QMS called Davies QMS. Such semigroups are obtained in the weak coupling limit of a system and a heat bath. Let H be a selfadjoint operator on ${{\mathcal {H}}}$, representing the Hamiltonian of the system. The corresponding Gibbs state at inverse temperature $\beta $ is defined as

$$\begin{aligned} \sigma =\frac{\hbox {e}^{-\beta H}}{\mathop {\mathrm{Tr}}\nolimits [\hbox {e}^{-\beta H}]}\,. \end{aligned}$$

(2.8)

Next, consider the Hamiltonian $H^{\mathrm{HB}}$ of the heat bath, as well as a set of system-bath interactions $\{ S_{\alpha }\otimes B_{\alpha } \}$, for some label $\alpha $. Here, we do not assume anything on the $S_\alpha $’s. The Hamiltonian of the universe composed of the system and its heat-bath is given by

$$\begin{aligned} H=H_\Lambda +H^{\mathrm{HB}}+\sum _{\alpha \in \Lambda }S_{\alpha }\otimes B_{\alpha }\,. \end{aligned}$$

(2.9)

Assuming that the bath is in a Gibbs state, by a standard argument (e.g. weak coupling limit, see [39]), the evolution on the system can be approximated by a quantum Markov semigroup whose generator is of the following form:

$$\begin{aligned} {{\mathcal {L}}}^{\mathrm{D},\beta }(X)=\sum _{\omega ,\alpha }\,\chi ^{\beta }_{\alpha }(\omega )\,\Big ( S_{\alpha }^*(\omega )XS_{\alpha }(\omega )-\frac{1}{2}\,\big \{ S_{\alpha }^*(\omega )S_{\alpha }(\omega ),X \big \} \Big )\,. \end{aligned}$$

(2.10)

The Fourier coefficients of the two-point correlation functions of the environment $\chi _{\alpha }^\beta $ satisfy the following KMS condition:

$$\begin{aligned} \chi _{\alpha }^\beta (-\omega )=\hbox {e}^{-\beta \omega }\, \chi _{\alpha }^\beta (\omega )\,. \end{aligned}$$

(2.11)

The operators $S_{\alpha }(\omega )$ are the Fourier coefficients of the system couplings $S_{\alpha }$, which means that they satisfy the following equation for any $t\in {\mathbb {R}}$:

$$\begin{aligned} \hbox {e}^{-itH}\,S_{\alpha }\hbox {e}^{it H}=\sum _\omega \hbox {e}^{it\omega }S_{\alpha }(\omega )\,\qquad \Leftrightarrow \qquad S_\alpha (\omega )=\sum _{\varepsilon -\varepsilon '=\omega }P_\varepsilon \,S_\alpha \,P_{\varepsilon '}\,. \end{aligned}$$

(2.12)

where the sum is over a finite number of frequencies. This implies in particular the following useful relation:

$$\begin{aligned} \Delta _{\sigma }(S_{\alpha }(\omega ))=\hbox {e}^{\beta \omega }\,S_{\alpha }(\omega )\,. \end{aligned}$$

(2.13)

The above identity means that the operators $S_{\alpha }(\omega )$ form a basis of eigenvectors of $\Delta _\sigma $. Next, we define the conditional expectation onto the algebra ${\mathcal {F}}({{\mathcal {L}}})$ of fixed points of ${{\mathcal {L}}}$ with respect to the Gibbs state $\sigma =\sigma ^\beta $ as follows [28]:

$$\begin{aligned} E^{\mathrm{D},\beta }:=\lim _{t\rightarrow \infty }\hbox {e}^{t{{\mathcal {L}}}^{\mathrm{D},\beta }}\,. \end{aligned}$$

(2.14)

Some results regarding the fixed-point algebra associated to this conditional expectation are contained in Appendix A. In particular, we prove the following theorem which is of independent interest.

Theorem 1

Define the algebra ${\mathcal {M}}=\{S_\alpha \}'$, $E^{\mathrm{D},\beta }$ as above and $E_{\sigma }$ as in Eq. (2.6) with respect to the inclusion ${\mathcal {M}}\subset {{\mathcal {B}}}({{\mathcal {H}}})$. Then both conditional expectations coincide.

3 Weak Approximate Tensorization of the Relative Entropy

This section is devoted to the main results of this article, namely approximate tensorization inequalities for the relative entropy.

Definition 2

Let ${\mathcal {M}}\subset {{\mathcal {N}}}_1,\,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}$ be finite-dimensional von Neumann algebras and $E^{{\mathcal {M}}},\,E_1 ,\, E_2$ associated conditional expectations onto ${\mathcal {M}}$, resp. ${{\mathcal {N}}}_1,\,{{\mathcal {N}}}_2$. These conditional expectations are said to satisfy a weak approximate tensorization with constants $c \ge 1$ and $d\ge 0$, denoted by AT(c, d), if, for any state $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$:

$$\begin{aligned} D(\rho \Vert E^{\mathcal {M}}_*(\rho ))\le c\,\big (D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho ))\big )+d \end{aligned}$$

(AT(c,d))

The approximate tensorization is said to be strong if $d=0$.

Remark 1

One can easily get similar inequalities for $k\ge 2$ algebras ${\mathcal {M}}\subset {{\mathcal {N}}}_1,\dots {{\mathcal {N}}}_k\subset {{\mathcal {N}}}$ by simply averaging over each inequality for two $k_1\ne k_2\in [k]$. Denoting by c and d as the maximal constants we get by considering two algebras ${{\mathcal {N}}}_{k_1}$ and ${{\mathcal {N}}}_{k_2}$ pairwise, we would thus obtain

$$\begin{aligned} D(\rho \Vert E^{\mathcal {M}}_*(\rho ))\le \frac{2c}{k}\,\sum _{j=1}^k\,D(\rho \Vert E_{j*}(\rho ))+d\,. \end{aligned}$$

(3.1)

For sake of clarity, we will restrict to the case $k=2$ in the rest of the article.

The first technical result presented in this section is Lemma 1, derived from the so-called multivariate trace inequalities [41]. It takes the form

$$\begin{aligned} D(\rho \Vert E^{\mathcal {M}}_*(\rho ))\le D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho )) + \xi (E_{1*}(\rho ), \, E_{2*}(\rho ), \, E^{\mathcal {M}}_*(\rho ))\, , \end{aligned}$$

where $\xi (E_{1*}(\rho ), \, E_{2*}(\rho ), \, E^{\mathcal {M}}_*(\rho ))$ is an additive error term that we subsequently estimate via different approaches in the subsequent Sects. 3.2–3.4: Lemma 1 directly yields a generalization of a result of [20] for conditional expectations with respect to non-tracial states in Corollary 1. Moreover, using a noncommutative change of measure argument [4], we provide in Proposition 2 some first estimates of the strong and weak constants c and d in AT(c, d) in terms of the maximal and minimal eigenvalues of a common invariant state of the three conditional expectations involved.

Next, in Theorem 2, we use a different technique involving Pinching maps onto certain subspaces that appear in a block-diagonal decomposition of $\mathcal {M}$ (this setting is properly introduced in Sect. 3.3) to obtain the inequality:

$$\begin{aligned} D(\rho \Vert E^{\mathcal {M}}_*(\rho ))&\le \frac{1}{1-c_1} \left( D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho )) \right) \\&\quad + \xi _2 (E_{1*}(\rho ), \, E_{2*}(\rho ), \, E^{\mathcal {M}}_*(\rho ))\, , \end{aligned}$$

where $\xi _2 (E_{1*}(\rho ), \, E_{2*}(\rho ), \, E^{\mathcal {M}}_*(\rho ))$ strongly depends on the Pinching map with respect to $E^{\mathcal {M}}_*(\rho )$ and it is subsequently estimated in Proposition 3. Furthermore, the multiplicative error term above can be interpreted as arising from a condition of clustering of correlations for the state $E_*^{\mathcal {M}}(\rho )$ (see Sect. 3.4).

3.1 A Technical Lemma

In the next result, we derive a bound on the difference between $D(\rho \Vert E^{\mathcal {M}}_*(\rho ))$ and the sum of the relative entropies $D(\rho \Vert E_{i*}(\rho ))$, which is our key tool in finding constants c and d for which AT(c,d) is satisfied. The result is inspired by the work of [14, 17] and makes use of the multivariate trace inequalities introduced in [41]:

Lemma 1

Let ${\mathcal {M}}\subset {{\mathcal {N}}}_1 ,\,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}$ be finite-dimensional von Neumann algebras and $E^{{\mathcal {M}}},E_1 ,\, E_2$ their corresponding conditional expectations. Then the following inequality holds for any $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$, writing $\rho _j:=E_{j*}(\rho )$ and $\rho _{\mathcal {M}}:=E^{\mathcal {M}}_{*}(\rho )$:

$$\begin{aligned} D(\rho \Vert \rho _{\mathcal {M}})&\le \,D(\rho \Vert \rho _1)+D(\rho \Vert \rho _2)\nonumber \\&\quad +\,\ln \left\{ \int _{-\infty }^\infty \,\mathop {\mathrm{Tr}}\nolimits \left[ \rho _{1}\, \rho _{{\mathcal {M}}}^{\frac{-1-it}{2}}\, \rho _{2}\, \rho _{{\mathcal {M}}}^{\frac{-1+it}{2}} \right] \,\beta _0(t)\,dt \right\} \,, \end{aligned}$$

(3.2)

with the probability distribution function

$$\begin{aligned} \beta _0(t)= \frac{\pi }{2} (\cosh (\pi t)+ 1)^{-1}\,. \end{aligned}$$

Proof

The first step of the proof consists in showing the following bound:

$$\begin{aligned} D(\rho \Vert \rho _{\mathcal {M}}) \le D(\rho \Vert \rho _1)+D(\rho \Vert \rho _2)+ \ln \mathop {\mathrm{Tr}}\nolimits [M]\,, \end{aligned}$$

(3.3)

where $ M = \exp \left[ - \ln \rho _{\mathcal {M}}+ \ln \rho _1 + \ln \rho _2 \right] $. Indeed,

$$\begin{aligned} \displaystyle D(\rho \Vert \rho _{\mathcal {M}})- D(\rho \Vert \rho _1)- D(\rho \Vert \rho _2)&=\mathop {\mathrm{Tr}}\nolimits \left[ {\rho } \left( - \ln {\rho } \underbrace{ - \ln \rho _{\mathcal {M}}+ \ln \rho _1+\ln \rho _2}_{\ln M} \right) \right] \\&= - D(\rho \Vert M). \end{aligned}$$

Moreover, since $\mathop {\mathrm{Tr}}\nolimits [M]\ne 1$ in general, from the non-negativity of the relative entropy of two states it follows that:

$$\begin{aligned} D(\rho \Vert M) \ge -\log \mathop {\mathrm{Tr}}\nolimits [M]. \end{aligned}$$

In the next step, we bound the error term making use of [33, Theorem 7] and [41, Lemma 3.4], concerning Lieb’s extension of Golden-Thompson inequality and Sutter, Berta and Tomamichel’s rotated expression for Lieb’s pseudo-inversion operator using multivariate trace inequalities, respectively: Let us recall that Theorem 7 of [33] states that for observables f, g and h, we have

$$\begin{aligned} \mathop {\mathrm{Tr}}\nolimits \left[ \mathrm{exp}\left( - f + g + h \right) \right] \le \mathop {\mathrm{Tr}}\nolimits \left[ \mathrm{e}^g {\mathcal {T}}_{\mathrm{e}^f} ({\text {e}}^h)\right] , \end{aligned}$$

where ${\mathcal {T}}_{f} $ is given by:

$$\begin{aligned} {\mathcal {T}}_{f} (h) := \int _0^\infty (f + t)^{-1} h (f + t)^{-1} dt \, . \end{aligned}$$

An alternative definition of this superoperator in terms of multivariate trace inequalities was provided in Lemma 3.4 of [41], namely

$$\begin{aligned} {\mathcal {T}}_{f} (h) = \int _{- \infty }^\infty \beta _0 (t) \, f^{\frac{-1-it}{2}} h f^{\frac{-1+it}{2}} dt \, , \end{aligned}$$

with $\beta _0$ as in the statement of the lemma. Now, we apply both results to inequality (3.3), to obtain

$$\begin{aligned} \mathop {\mathrm{Tr}}\nolimits [M]&= \mathop {\mathrm{Tr}}\nolimits \left[ \mathrm{exp}\left( - \ln {\rho _{\mathcal {M}}} + \ln {\rho _1} + \ln {\rho _2} \right) \right] \\&\quad \le \int _{- \infty }^\infty \mathop {\mathrm{Tr}}\nolimits \left[ \rho _1 \, \rho _{\mathcal {M}}^{\frac{-1-it}{2}} \rho _2 \, \rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] \,\beta _0 (t) \, dt\,, \end{aligned}$$

which concludes the proof of the lemma. $\square $

Note that, if a constant $d>0$ is such that

$$\begin{aligned} \ln \int _{-\infty }^\infty \,\mathop {\mathrm{Tr}}\nolimits \left[ \rho _{1}\rho _{{\mathcal {M}}}^{\frac{-1-it}{2}}\rho _{2}\rho _{{\mathcal {M}}}^{\frac{-1+it}{2}} \right] \,\beta _0(t)\,dt \le d \end{aligned}$$

for every $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$, then inequality (3.2) constitutes a result of approximate tensorization AT(1, d). Using this observation, we obtain an arguably more direct proof of a result appearing in [20], that we generalize to the case of non-tracial states. Indeed, the proof of [20] required the introduction of so-called amalgamated ${\mathbb {L}}_p$ spaces, a technical tool that we do not require.

Corollary 1

With the notations of Lemma 1, define the constant

$$\begin{aligned} d:= & {} \sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}}_{2})}\inf \big \{\ln (\lambda )|\,E_{1*}(\rho )\le \lambda \eta \,\text { for some }\,\eta \in {{\mathcal {D}}}({\mathcal {M}}) \big \}\\\equiv & {} \sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}}_{2})}\inf _{\eta \in {{\mathcal {D}}}({\mathcal {M}})}\,D_{\max }(E_{1*}(\rho )\Vert \eta ) \,. \end{aligned}$$

Then the following weak approximate tensorization $\mathrm{AT}(1,d)$ holds:

$$\begin{aligned} D(\rho \Vert \rho _{\mathcal {M}})\le D(\rho \Vert \rho _1)+D(\rho \Vert \rho _2)+ d\,. \end{aligned}$$

Proof

We focus on the last term on the right-hand side of (3.2). First, note that:

$$\begin{aligned}&\mathop {\mathrm{Tr}}\nolimits \left[ \rho _{1}\,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\rho _{2}\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] \\&\quad = \mathop {\mathrm{Tr}}\nolimits \left[ \rho \,E_1^*\left( \rho _{\mathcal {M}}^{\frac{-1-it}{2}}\rho _{2}\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right) \right] = \mathop {\mathrm{Tr}}\nolimits \left[ \rho \,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}E_{1*}(\rho _{2})\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] . \end{aligned}$$

We have by definition of d that there exists a state $\eta \in {{\mathcal {D}}}({\mathcal {M}})$ such that for any $t\in {\mathbb {R}}$:

$$\begin{aligned} \mathop {\mathrm{Tr}}\nolimits \left[ \rho \,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}E_{1*}(\rho _{2})\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] \le \,{rm e}^{d} \mathop {\mathrm{Tr}}\nolimits \left[ \rho \,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\eta \,\rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] ={rm e}^{d}\mathop {\mathrm{Tr}}\nolimits [\rho X_{\mathcal {M}}]\,, \end{aligned}$$

for some density $X_{\mathcal {M}}\in {\mathcal {M}}$ given by $\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\eta \,\rho _{\mathcal {M}}^{\frac{-1+it}{2}}$. Since ${\mathcal {M}}\subset {{\mathcal {N}}}$, $\mathop {\mathrm{Tr}}\nolimits [\rho X_{\mathcal {M}}]=\mathop {\mathrm{Tr}}\nolimits [\rho _{\mathcal {M}}\,X_{\mathcal {M}}]=\mathop {\mathrm{Tr}}\nolimits [\eta ]=1$. The result follows. $\square $

Remark 2

In [23], the authors showed that, for doubly stochastic conditional expectations (i.e. $E_{i*}=E_i$, $E^{\mathcal {M}}_*=E^{\mathcal {M}}$), the following equation holds: Given the following block decomposition of the algebras ${{\mathcal {N}}}_2$ and ${\mathcal {M}}$,

$$\begin{aligned} {{\mathcal {N}}}_2&\equiv \bigoplus _{l\in I_{{{\mathcal {N}}}_2}}\,{\mathbb {M}}_{m_l}\otimes {\mathbb {1}}_{t_l}\,\qquad \qquad {\mathcal {M}}\equiv \bigoplus _{k\in I_{{\mathcal {M}}}}\,{\mathbb {M}}_{n_k}\otimes {\mathbb {1}}_{s_k}\,,\\ D({{\mathcal {N}}}_2\Vert {\mathcal {M}})&:= \sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}}_2)} \inf _{\eta \in {{\mathcal {D}}}({\mathcal {M}})}D_{\max }(\rho \Vert \eta )\equiv \max _{l\in I_{{{\mathcal {N}}}_2}}\ln \Big (\sum _{k\in I_{{\mathcal {M}}}} \min (a_{kl},\,n_k)\,s_k/t_l \Big )\,, \end{aligned}$$

where $a_{kl}$ denotes the number of copies of the block ${\mathbb {M}}_{n_k}$ contained in the block ${\mathbb {M}}_{m_l}$. In the context of lattice spin systems, this typically corresponds to the infinite temperature regime.

3.2 Approximate Tensorization via Noncommutative Change of Measure

Corollary 1 states a correction to exact tensorization with a unique weak constant. We expect this result to be relevant for doubly stochastic conditional expectations, where this additive term is purely quantum. However, the weak constant d is suboptimal in general. In this section and the following one, we provide tools to improve the latter at the cost of replacing the optimal strong constant by $c>1$. This intuition is inspired by the classical setting, where the weak constant can be removed at the cost of a worsening of the strong constant [14, 17].

Given a state $\sigma $ that is invariant for the conditional expectations $E^{\mathcal {M}}, E_1$ and $E_2$, we define the doubly stochastic conditional expectations ${E}^{(0),{\mathcal {M}}}, E_1^{(0)}$ and $E_2^{(0)}$ onto the same fixed-point algebras ${\mathcal {M}}\subset {{\mathcal {N}}}_1,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}$. Then, the following proposition is a direct consequence of a recent noncommutative change of measure argument in [27] under the assumption that strong approximate tensorization for the relative entropy holds for ${E}^{(0),{\mathcal {M}}}, E_1^{(0)}$ and $E_2^{(0)}$.

Proposition 2

As in Corollary 1, we define the constant

$$\begin{aligned} d:= & {} \sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}}_{2})}\inf \big \{\ln (\lambda )|\,E_{1*}^{(0)}(\rho )\le \lambda \eta \,\text { for some }\,\eta \in {{\mathcal {D}}}({\mathcal {M}}) \big \}\\\equiv & {} \sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}}_{2})}\inf _{\eta \in {{\mathcal {D}}}({\mathcal {M}})}\,D_{\max }(E^{(0)}_{1*}(\rho )\Vert \eta ) \,. \end{aligned}$$

Let us assume that $\mathrm{AT}(1,d)$ holds for the doubly stochastic conditional expectations, i.e. for every $\rho \in {{\mathcal {D}}}({{\mathcal {H}}})$

$$\begin{aligned} D( \rho \Vert E^{(0),{\mathcal {M}}}_{*}(\rho )) \le D(\rho \Vert E^{(0)}_{1*}(\rho )) + D(\rho \Vert E^{(0)}_{2*}(\rho )) +d \, . \end{aligned}$$

(3.4)

Then, the following result of $\mathrm{AT}(c,d')$ with $c=\frac{\lambda _{\max }(\sigma )}{\lambda _{\min }(\sigma )}$ and $d'= \lambda _{\max }(\sigma )\,d_{{\mathcal {H}}}\,d$ holds:

$$\begin{aligned} D(\rho \Vert E^{\mathcal {M}}_{*}(\rho ))\le \frac{\lambda _{\max }(\sigma )}{\lambda _{\min }(\sigma )}\,\big (D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho ))\big )+\lambda _{\max }(\sigma )\,d_{{\mathcal {H}}}\,d\,. \end{aligned}$$

(3.5)

In particular, if $\mathrm{AT}(1,0)$ holds for the doubly stochastic conditional expectations ${E}^{(0),{\mathcal {M}}}, E_1^{(0)}$ and $E_2^{(0)}$, then the conditional expectations $E^{\mathcal {M}}, E_1$ and $E_2$ satisfy $\mathrm{AT}(c,0)$ with $c=\frac{\lambda _{\max }(\sigma )}{\lambda _{\min }(\sigma )}$.

We defer the proof of this result to Appendix B.1, as it merely follows the lines of [27].

3.3 Approximate Tensorization via Pinching Map

Proposition 2 states an approximate tensorization inequality with the advantage over Corollary 1 that the weak constant d vanishes when the doubly stochastic conditional expectations projecting onto the same subalgebras form a commuting square. However, the multiplicative constant typically explodes when increasing the size of the system. In the following theorem, we take care of this issue by employing a pinching argument in place of the change of measure argument laid in Proposition 2.

Before stating the result, let us fix some notations. As before, we are interested in proving (weak) approximate tensorization results for the quadruple of algebras ${\mathcal {M}}\subset {{\mathcal {N}}}_1\,,\,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}$. As a subalgebra of ${{\mathcal {B}}}({{\mathcal {H}}})$ for some Hilbert space ${{\mathcal {H}}}$, ${\mathcal {M}}$ bears the following block diagonal decomposition: given ${{\mathcal {H}}}=\bigoplus _{i\in I_{\mathcal {M}}}{{\mathcal {H}}}_i\otimes {{\mathcal {K}}}_i$:

$$\begin{aligned}&{\mathcal {M}}\equiv \bigoplus _{i\in I_{\mathcal {M}}}\,{{\mathcal {B}}}({{\mathcal {H}}}_i)\otimes {\mathbb {1}}_{{{\mathcal {K}}}_i}\,,\qquad \text { so that }\qquad \forall \rho \in {{\mathcal {D}}}({{\mathcal {N}}})\,,\, \nonumber \\&\rho _{\mathcal {M}}:=\sum _{i\in I_{\mathcal {M}}}\,\mathop {\mathrm{Tr}}\nolimits _{{{\mathcal {K}}}_i}[P_i\rho P_i]\,\otimes \tau _i\,, \end{aligned}$$

(3.6)

where $P_i$ corresponds to the projection onto the i-th diagonal block in the decomposition of ${\mathcal {M}}$, and each $\tau _i$ is a full-rank state on ${{\mathcal {K}}}_i$. We further make the observation that, since the restrictions of the conditional expectations $E_1$, $E_2$ and $E^{\mathcal {M}}$ on ${{\mathcal {B}}}({{\mathcal {H}}}_i\otimes {{\mathcal {K}}}_i)$ only act non-trivially on the factor ${{\mathcal {B}}}({{\mathcal {K}}}_i)$, there exist conditional expectations ${E}_j^{(i)}$ and $({E}^{{\mathcal {M}}})^{(i)}$ acting on ${{\mathcal {B}}}( {{\mathcal {K}}}_i)$ and such that

$$\begin{aligned} E_j|_{{{\mathcal {B}}}({{\mathcal {H}}}_i\otimes {{\mathcal {K}}}_i)}:={\mathrm{id}}_{{{\mathcal {B}}}({{\mathcal {H}}}_i)}\otimes {E}_j^{(i)}\,,\quad \text {resp.}\quad E^{\mathcal {M}}|_{{{\mathcal {B}}}({{\mathcal {H}}}_i\otimes {{\mathcal {K}}}_i)}:={\mathrm{id}}_{{{\mathcal {B}}}({{\mathcal {H}}}_i)}\otimes ({E}^{\mathcal {M}})^{(i)}\,. \end{aligned}$$

(3.7)

In order to get another form of approximate tensorization, we wish to compare the state $\rho $ with a classical-quantum state according to the decomposition given by ${\mathcal {M}}$. To this end we introduce the Pinching map with respect to each ${{\mathcal {H}}}_i$: define $\rho _{{{\mathcal {H}}}_i}\equiv \mathop {\mathrm{Tr}}\nolimits _{{{\mathcal {K}}}_i}[P_i\,\rho \,P_i]$. Then each $\rho _{{{\mathcal {H}}}_i}$ can be diagonalized individually:

$$\begin{aligned} \rho _{{{\mathcal {H}}}_i}\equiv \sum _{\lambda ^{(i)}\in \mathrm{Spec}(\rho _{{{\mathcal {H}}}_i})}\,\lambda ^{(i)}\,|\lambda ^{(i)}\rangle \!\langle \lambda ^{(i)}|\,. \end{aligned}$$

The Pinching map we are interested in is then:

$$\begin{aligned}&\mathcal {P}_{\rho _{\mathcal {M}}}(X)\\&\quad \equiv \sum _{i\in I_{\mathcal {M}}}\,\sum _{\lambda ^{(i)}\in \mathrm{Spec}(\rho _{{{\mathcal {H}}}_i})}\,\left( |\lambda ^{(i)}\rangle \!\langle \lambda ^{(i)}|\otimes \mathbb {1}_{{{\mathcal {K}}}_i}\right) \,X\,\left( |\lambda ^{(i)}\rangle \!\langle \lambda ^{(i)}|\otimes \mathbb {1}_{{{\mathcal {K}}}_i}\right) \,,\\&\quad X\in {{\mathcal {B}}}({{\mathcal {H}}})\,. \end{aligned}$$

Remark that we have for all $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$:

$$\begin{aligned} \mathop {\mathrm{Tr}}\nolimits _{{{\mathcal {H}}}_i}[P_i\,\rho \,P_i]=\mathop {\mathrm{Tr}}\nolimits _{{{\mathcal {H}}}_i}[P_i\,\mathcal {P}_{\rho _{\mathcal {M}}}(\rho )\,P_i]\,. \end{aligned}$$

Theorem 2

Assume

$$\begin{aligned}&c_1:=\max _{i\in I_{\mathcal {M}}}\Vert E_{1}^{(i)}\circ E_{2}^{(i)}-(E^{\mathcal {M}})^{(i)}:\,{\mathbb {L}}_1(\tau _i)\rightarrow {\mathbb {L}}_\infty \Vert <1\,. \end{aligned}$$

(3.8)

Then, the following inequality holds:

$$\begin{aligned} D(\rho \Vert \rho _{\mathcal {M}})&\le \frac{1}{(1-{c_1})}\,\big (D(\rho \Vert \rho _1)+D(\rho \Vert \rho _2)+D_{\max }\big (E_{1*}\circ E_{2*}(\rho )\Vert E_{1*}\circ E_{2*}(\eta )\big )\nonumber \\&\quad +\,c_1 D(\eta \Vert \mathcal {P}_{\rho _{\mathcal {M}}}(\rho ))\big )\,, \end{aligned}$$

(3.9)

for any $\eta \in {{\mathcal {D}}}({{\mathcal {N}}})$ such that $\eta =\mathcal {P}_{\rho _{\mathcal {M}}}(\eta )$ and $\mathop {\mathrm{Tr}}\nolimits _{{{\mathcal {K}}}_i}[P_i\,\eta \,P_i]=\rho _{{{\mathcal {H}}}_i}$. In particular, any state $\eta $ of the form $\eta := \sum _{i\in I_{\mathcal {M}}}\,\rho _{{{\mathcal {H}}}_i}\otimes \tau _i'$, for an arbitrary family of subnormalized states $\tau _i'$, satisfies these conditions.

Alternatively, we can get

$$\begin{aligned} D(\rho \Vert \rho _{\mathcal {M}})\le \frac{1}{(1-{c_1})}\,\big (D(\rho \Vert \rho _1)+D(\rho \Vert \rho _2)\big )+D\big (\rho \Vert \mathcal {P}_{\rho _{\mathcal {M}}}(\rho ) )\,. \end{aligned}$$

(3.10)

Consequently, AT(c,d) holds with

$$\begin{aligned}&c:=\frac{1}{(1-{c_1})}\,, \nonumber \\&d:= \frac{1}{(1-{c_1})}\left( \underset{\rho \in {{\mathcal {D}}}({{\mathcal {N}}})}{\sup }\,\underset{\eta \in {{\mathcal {D}}}({{\mathcal {N}}})}{\inf }\,D_{\max }\big (E_{1*}\circ E_{2*}(\rho )\Vert E_{1*}\circ E_{2*}(\eta )\big )+\,c_1 D(\eta \Vert \mathcal {P}_{\rho _{\mathcal {M}}}(\rho ))\big )\right) \,, \end{aligned}$$

(3.11)

where the infimum in the second line runs over $\eta $ such that $\eta =\mathcal {P}_{\rho _{\mathcal {M}}}(\eta )$ and $\mathop {\mathrm{Tr}}\nolimits _{{{\mathcal {K}}}_i}[P_i\,\eta \,P_i]=\rho _{{{\mathcal {H}}}_i}$.

Proof

The proof starts similarly to that of Corollary 1. We once again simply need to bound the integral on the right hand side of (3.2). By considering $\eta $ as in the statement of the theorem and writing for the moment ${\tilde{d}}:=D_{\max }\big (E_{1*}\circ E_{2*}(\rho )\Vert E_{1*}\circ E_{2*}(\eta )\big )$, we obtain

$$\begin{aligned} \mathop {\mathrm{Tr}}\nolimits \Big [ \rho _{1}\,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\,\rho _{2}\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}}\Big ]&= \mathop {\mathrm{Tr}}\nolimits \Big [ \rho \,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\,E_{1*}(\rho _{2})\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}}\Big ]\\&\le {rm e}^{{\tilde{d}}} \, \mathop {\mathrm{Tr}}\nolimits \Big [ \rho \,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\,E_{1*}\circ E_{2*}(\eta )\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}}\Big ]\,. \end{aligned}$$

To simplify the notation, let us write: $\eta _{12}:=E_{1*}\circ E_{2*}(\eta )$. Now, note that the following holds:

$$\begin{aligned} \mathop {\mathrm{Tr}}\nolimits \left[ \left( \rho - \rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1-it}{2}} \left( \eta _{12} - \rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] = \mathop {\mathrm{Tr}}\nolimits \left[ \rho \, \rho _{\mathcal {M}}^{\frac{-1-it}{2}} \, \eta _{12} \, \rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] -1 -1 + 1, \end{aligned}$$

since $ E^{\mathcal {M}}_*$, $E_{1*}$ and $E_{2*}$ are conditional expectations in the Schrödinger picture and, thus, trace preserving. Therefore,

$$\begin{aligned}&\ln \int _{-\infty }^{+\infty } {rm e}^{{\tilde{d}}} \, \mathop {\mathrm{Tr}}\nolimits \left[ \rho \, \rho _{\mathcal {M}}^{\frac{-1-it}{2}} \, \eta _{12} \, \rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] \,\beta _0 (t) \,dt \, \\&\quad = \ln \int _{-\infty }^{+\infty } {rm e}^{{\tilde{d}}} \, \left( \mathop {\mathrm{Tr}}\nolimits \left[ \left( \rho - \rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1-it}{2}} \left( \eta _{12} - \rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] + 1 \right) \,\beta _0 (t) \,dt \\&\quad \le {\tilde{d}} + \int _{-\infty }^{+\infty } \mathop {\mathrm{Tr}}\nolimits \left[ \left( \rho - \rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1-it}{2}} \left( \eta _{12} -\rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] \, \beta _0 (t)\,dt\,, \end{aligned}$$

where we have used that $ \ln (x +1)\le x$ for positive real numbers. Defining $X:=\Gamma _{\rho _{\mathcal {M}}}^{-1}(\rho )$ and $Y_t:=\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\,\eta \, \rho _{\mathcal {M}}^{\frac{-1+it}{2}}$, we note that

$$\begin{aligned} E_1 \circ E_2 [Y_t] = \rho _{\mathcal {M}}^{\frac{-1-it}{2}}\,\eta _{12}\, \rho _{\mathcal {M}}^{\frac{-1+it}{2}}, \end{aligned}$$

(3.12)

and we can rewrite the previous expression as

$$\begin{aligned}&\int _{-\infty }^{+\infty } \mathop {\mathrm{Tr}}\nolimits \left[ \left( \rho - \rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1-it}{2}} \left( \eta _{12} - \rho _{\mathcal {M}}\right) \rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] \, \beta _0 (t)\,dt\nonumber \\&\quad = \int _{-\infty }^{+\infty } \mathop {\mathrm{Tr}}\nolimits \left[ \left( X - E^{\mathcal {M}}[X] \right) \Delta _{\rho _{\mathcal {M}}}^{-it/2} \left( \eta _{12} - \rho _{\mathcal {M}}\right) \right] \, \beta _0 (t)\,dt \nonumber \\&\quad = \int _{-\infty }^{+\infty } \mathop {\mathrm{Tr}}\nolimits \left[ \left( X - E^{\mathcal {M}}[X] \right) \rho _{\mathcal {M}}^{\frac{1}{2}} \left( E_1 \circ E_2 [Y_t] -E^{\mathcal {M}}[Y_t] \right) \rho _{\mathcal {M}}^{\frac{1}{2}} \right] \, \beta _0 (t)\,dt\nonumber \\&\quad = \int _{- \infty }^{+ \infty } \left\langle X - E^{\mathcal {M}}[X] , \ E_1 \circ E_2 [Y_t] -E^{\mathcal {M}}[Y_t] \right\rangle _{\rho _{\mathcal {M}}}\, \beta _0 (t)\,dt\, , \end{aligned}$$

(3.13)

thus obtaining the following inequality

$$\begin{aligned}&\ln \, \int _{-\infty }^{\infty } {rm e}^{{\tilde{d}}} \, \mathop {\mathrm{Tr}}\nolimits \Big [ \rho _{1}\,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}\,\eta _{12}\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}}\Big ]\,\beta _0(t)\,dt\\&\quad \le {\tilde{d}}+\int _{-\infty }^\infty \,\langle X-E^{\mathcal {M}}[X]\,,E_1\circ E_2[Y_t]\\&\qquad -E^{\mathcal {M}}[Y_t]\rangle _{\rho _{\mathcal {M}}}\,\beta _0(t)\,dt . \end{aligned}$$

Now, we focus on the integrand on the right-hand side of the above inequality. Denote for any $A\in {{\mathcal {B}}}({{\mathcal {H}}})$,

$$\begin{aligned} A^{(\lambda ,i)}:=\left( |\lambda ^{(i)}\rangle \!\langle \lambda ^{(i)}|\otimes \mathbb {1}_{{{\mathcal {K}}}_i}\right) P_i\,A\,P_i\left( |\lambda ^{(i)}\rangle \!\langle \lambda ^{(i)}|\otimes \mathbb {1}_{{{\mathcal {K}}}_i}\right) \,. \end{aligned}$$

We also write $A^{(\lambda ,i)}=|\lambda ^{(i)}\rangle \!\langle \lambda ^{(i)}|\otimes A^{(\lambda ,i)}$ by a slight abuse of notation. Then

$$\begin{aligned}&\langle X-E^{\mathcal {M}}[X]\,,\, E_1\circ E_2[Y_t]-E^{\mathcal {M}}[Y_t]\rangle _{\rho _{\mathcal {M}}}\\&\quad = \sum _{i\in I_{\mathcal {M}}}\,\sum _{\lambda ^{(i)}\in \mathrm{Spec}(\rho _{{{\mathcal {H}}}_i})}\,\lambda ^{(i)}\,\langle X^{(\lambda ,i)}-E^{\mathcal {M}}[X^{(\lambda ,i)}],\,E_1\circ E_2[Y_t^{(\lambda ,i)}]\\&\qquad -E^{\mathcal {M}}[Y_t^{(\lambda ,i)}]\rangle _{\tau _i}\,. \end{aligned}$$

Next, by Hölder’s inequality each summand in the right-hand side above is upper bounded by

$$\begin{aligned}&\Vert ({\mathrm{id}}-(E^{\mathcal {M}})^{(i)})[X^{(\lambda ,i)}] \Vert _{{\mathbb {L}}_1(\tau _i)}\,\Vert (E_1^{(i)}\circ E_2^{(i)}-(E^{\mathcal {M}})^{(i)}) [Y_t^{(\lambda ,i)}] \Vert _\infty \nonumber \\&\quad \le c_1\, \Vert ({\mathrm{id}}-(E^{\mathcal {M}})^{(i)})[X^{(\lambda ,i)}] \Vert _{{\mathbb {L}}_1(\tau _i)}\,\Vert ({\mathrm{id}}-(E^{\mathcal {M}})^{(i)})[Y_t^{(\lambda ,i)}]\Vert _{{\mathbb {L}}_1(\tau _i)}\nonumber \\&\quad = c_1\,\Vert \rho ^{(\lambda ,i)}-E^{\mathcal {M}}_{*}(\rho ^{(\lambda ,i)})\Vert _1\,\Vert \tau _i'-\tau _i\Vert _{1}\\&\quad \le \frac{c_1}{2}\,\left( \Vert \rho ^{(\lambda ,i)}-E^{\mathcal {M}}_{*}(\rho ^{(\lambda ,i)})\Vert _1^2+\Vert \tau _i'-\tau _i\Vert _{1}^2\right) \,,\nonumber \end{aligned}$$

where we use Young’s inequality in the last line. Using Pinsker’s inequality and summing over the indices i and $\lambda ^{(i)}$, we find that

$$\begin{aligned} \ln \int _{-\infty }^\infty \mathop {\mathrm{Tr}}\nolimits \left[ \rho \,\rho _{\mathcal {M}}^{\frac{-1-it}{2}}E_{1*}\circ E_{2*}(\mathcal {P}_{\mathcal {M}}(\rho ))\,\rho _{\mathcal {M}}^{\frac{-1+it}{2}} \right] \beta _0(t)\,dt&\le {\tilde{d}} +c_1\,D(\rho \Vert \rho _{\mathcal {M}})\\&\quad + c_1\,D(\eta \Vert \mathcal {P}_{\rho _{\mathcal {M}}}(\rho ))\,. \end{aligned}$$

Equation (3.9) follows after rearranging the term. In order to obtained Eq. (3.10), we exploit that $\rho _{\mathcal {M}}$ is a fixed point of $\mathcal {P}_{\rho _{\mathcal {M}}}$ and therefore

$$\begin{aligned} D(\rho \Vert \rho _{\mathcal {M}})=D(\rho \Vert \mathcal {P}_{\rho _{\mathcal {M}}}(\rho ))+D(\mathcal {P}_{\rho _{\mathcal {M}}}(\rho )\Vert \rho _{\mathcal {M}})\,. \end{aligned}$$

We can then apply Eq. (3.9) to $\mathcal {P}_{\rho _{\mathcal {M}}}(\rho )$ and remark that the weak constant vanishes. The result follows after remarking that $\mathcal {P}_{\rho _{\mathcal {M}}}\circ E_*^{\mathcal {M}}=E_*^{\mathcal {M}}\circ \mathcal {P}_{\rho _{\mathcal {M}}}$ and applying the data-processing inequality to the map $\mathcal {P}_{\rho _{\mathcal {M}}}$. $\square $

Remark 3

In the case of a classical evolution over a classical system, taking $\eta =\mathcal {P}_{\rho _{\mathcal {M}}}(\rho )$ shows that $d=0$ in Eq. (3.11), and thus we get back the strong approximate tensorization of [14]. In Sect. 5.2, we will see that this remains also true for classical evolution over quantum systems. The estimation of the constant c under a condition of clustering of correlations is discussed in the next section.

The next proposition provides a short analysis of the weak constant in Theorem 2. We note that the interpretation of this term as a deviation to the classical case is direct from the pinching argument, which explicitly “pinches” on a classical basis. However, as opposed to Proposition 2, we were unable to prove that the weak constant necessarily vanishes when the doubly stochastic conditional expectations form a commuting square.

Proposition 3

With the notations of Lemma 1 and Theorem 2,

$$\begin{aligned} \sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}})}\,D_{\max }\big (E_{1*}\circ E_{2*}(\rho )\Vert E_{1*}\circ E_{2*}(\eta )\big )\le d_1 + d_2 \end{aligned}$$

where

$$\begin{aligned}&d_1:=\sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}})}\,D_{\max }\big (E_{1*}\circ E_{2*}(\rho )\Vert E_{1*}\circ E_{2*}(\mathcal {P}_{\mathcal {M}}(\rho ))\big )\,,\\&d_2:=\,\max _{i\in I_{\mathcal {M}}}\sup _{\rho ^{(i)}\in {{\mathcal {D}}}(P_i{{\mathcal {N}}}P_i)}\,I_{\max }\big ( {{\mathcal {H}}}_i:{{\mathcal {K}}}_i \big )_{\rho ^{(i)}}\,, \end{aligned}$$

and where $\mathcal {P}_{\mathcal {M}}:=\sum _{i\in I_{\mathcal {M}}}P_i(\cdot )P_i$.

Furthermore, given $i\in I_{{\mathcal {N}}}$, denote by $I^{(i)}_{\mathcal {M}}$ the set of indices corresponding to the minimal projectors in ${\mathcal {M}}$ contained in the i-th block of ${{\mathcal {N}}}$. Moreover, for each of the blocks i of ${{\mathcal {N}}}$, of corresponding minimal projector $P^{{{\mathcal {N}}}}_i$, decompose $P^{{\mathcal {N}}}_i{\mathcal {M}}P^{{\mathcal {N}}}_i$ as follows: letting $P_i^{{\mathcal {N}}}{{\mathcal {H}}}:= \bigoplus _{j\in I^{(i)}_{\mathcal {M}}} \,{{\mathcal {H}}}^{(i)}_{j}\otimes {{\mathcal {K}}}^{(i)}_j$,

$$\begin{aligned} P_i^{{\mathcal {N}}}{\mathcal {M}}P_i^{{\mathcal {N}}}:=\bigoplus _{j\in I^{(i)}_{\mathcal {M}}} {{\mathcal {B}}}({{\mathcal {H}}}^{(i)}_j)\otimes {\mathbb {1}}_{{{\mathcal {K}}}_j^{(i)}}\,. \end{aligned}$$

Then,

$$\begin{aligned}&d_1\le \max _{i\in I_{{\mathcal {N}}}}\ln (|I^{(i)}_{\mathcal {M}}|)\,\\&d_2\le 2\max _{i\in I_{{\mathcal {N}}}}\max _{j\in I^{(i)}_{\mathcal {M}}}\min \left\{ \ln (d_{{{\mathcal {H}}}^{(i)}_j}),\ln (d_{{{\mathcal {K}}}^{(i)}_j}) \right\} \,. \end{aligned}$$

The proof of this result is deferred to Appendix B.2.

3.4 Clustering of Correlations

In this section we shift slightly our focus and study the multiplicative constant of the previous results, instead of the additive one. More specifically, we provide an interpretation of the multiplicative constant appearing in the last section in terms of certain notions of clustering of correlations. The latter play a particularly relevant role when applied in the context of quantum spin lattices [12].

The constant $c_1:=\max _{i\in I_{\mathcal {M}}}\Vert E_{1}^{(i)}\circ E_{2}^{(i)}-(E^{\mathcal {M}})^{(i)}:\,{\mathbb {L}}_1(\tau _i)\rightarrow {\mathbb {L}}_\infty \Vert $ appearing in Theorem 2 provides a bound on the following covariance-type quantity: For any $i\in I_{\mathcal {M}}$ and any $X, Y\in {\mathbb {L}}_1(\tau _i)$,

$$\begin{aligned}&\mathrm{Cov}_{{\mathbb {C}}{\mathbb {1}}_{{{\mathcal {K}}}_i},\tau _i}(E_1^{(i)}[X],E_2^{(i)}[Y])\nonumber \\&\quad :=\langle E_1^{(i)}[X]-(E^{\mathcal {M}})^{(i)}[X],\,E_2^{(i)}[Y]-(E^{\mathcal {M}})^{(i)}[Y]\rangle _{\tau _i}\nonumber \\&\quad \le \,c_1\,\Vert X\Vert _{{\mathbb {L}}_1(\tau _i)}\,\Vert Y\Vert _{{\mathbb {L}}_1(\tau _i)}\,. \end{aligned}$$

(3.14)

We call the above property conditional ${\mathbb {L}}_1$ clustering of correlations, and denote it by $\mathrm{cond}{\mathbb {L}}_1(c_1)$. Conversely, one can show by duality of ${\mathbb {L}}_p$-norms that if $\mathrm{cond}{\mathbb {L}}_1(c_1')$ holds for some positive constant $c_1'$, then $c_1\le c_1'$: for all $i\in I_{\mathcal {M}}$

$$\begin{aligned}&\Vert E_{1}^{(i)}\circ E_{2}^{(i)}-(E^{\mathcal {M}})^{(i)}:\,{\mathbb {L}}_1(\tau _i)\rightarrow {\mathbb {L}}_\infty \Vert \\&\quad =\sup _{X\in {\mathbb {L}}_1(\tau _i)}\,\Vert E_{1}^{(i)}\circ E_{2}^{(i)}[X]-(E^{\mathcal {M}})^{(i)}[X]\Vert _\infty \\&\quad =\sup _{X,Y\in {\mathbb {L}}_1(\tau _i)}\,\langle Y,\,E_1^{(i)}\circ E_2^{(i)}[X]-(E^{\mathcal {M}})^{(i)}[X]\rangle _{\tau _i}\\&\quad =\sup _{X,Y\in {\mathbb {L}}_1(\tau _i)}\,\langle E_1^{(i)}[Y]-(E^{\mathcal {M}})^{(i)}[Y],\, E_2^{(i)}[X]\\&\qquad -(E^{\mathcal {M}})^{(i)}[X]\rangle _{\tau _i}\\&\quad \le c_1'\,. \end{aligned}$$

In [28], the authors introduced a different notion of clustering of correlation in order to show the positivity of the spectral gap of Gibbs samplers^{Footnote 2}.

Definition 3

We say that ${\mathcal {M}}\subset {{\mathcal {N}}}_1,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}$ satisfies strong ${\mathbb {L}}_2$ clustering of correlations with respect to the state $\sigma \in {{\mathcal {D}}}({\mathcal {M}})$ with constant $c_{2}>0$ if for all $X,Y\in {{\mathcal {N}}}$,

$$\begin{aligned} \mathrm{Cov}_{{\mathcal {M}},\sigma }(E_1[X],E_2[Y])\le \,c_{2}\, \Vert X\Vert _{{\mathbb {L}}_2(\sigma )}\,\Vert Y\Vert _{{\mathbb {L}}_2(\sigma )}\,. \end{aligned}$$

(3.15)

Equivalently, $\Vert E_1\circ E_2-E^{\mathcal {M}}:\,{\mathbb {L}}_2(\sigma )\rightarrow {\mathbb {L}}_2(\sigma )\Vert \le c_2$.

Definition 3 does not depend on the state $\sigma \in {{\mathcal {D}}}({\mathcal {M}})$ chosen. This is the content of the next theorem, whose proof is presented in Appendix B.3.

Lemma 2

Let ${\mathcal {M}}\subset {{\mathcal {N}}}_1,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}\subset {{\mathcal {B}}}({{\mathcal {H}}})$ be von Neumann subalgebras of the algebra ${{\mathcal {B}}}({{\mathcal {H}}})$ so that ${{\mathcal {N}}}_1 \cap {{\mathcal {N}}}_2 \ne \emptyset $. Then, for any two states $\sigma ,\sigma '\in {{\mathcal {D}}}({\mathcal {M}})$:

$$\begin{aligned} \sup _{X\in {{\mathcal {N}}}}\,\frac{\mathrm{Cov}_{{\mathcal {M}},\,\sigma }(E_1[X],E_2[X])}{\Vert X\Vert _{{\mathbb {L}}_2(\sigma )}^2} = \sup _{X\in {{\mathcal {N}}}}\,\frac{\mathrm{Cov}_{{\mathcal {M}},\,\sigma '}(E_1[X],E_2[X])}{\Vert X\Vert _{{\mathbb {L}}_2(\sigma ')}^2} \end{aligned}$$

Remark 4

As a consequence of the previous theorem, we realize that the condition assumed in [28] of strong ${\mathbb {L}}_2$ clustering of correlation with respect to one invariant state, to prove positivity of the spectral gap for the Davies dynamics, would be analogous to assuming strong ${\mathbb {L}}_2$ clustering of correlation with respect to any invariant state.

It is easy to see that the above notion of strong ${\mathbb {L}}_2$ clustering of correlation implies that of a conditional ${\mathbb {L}}_2$ clustering, denoted by ${\mathrm{cond}{\mathbb {L}}_2}(c_2)$, simply defined by replacing the ${\mathbb {L}}_1$ norms by ${\mathbb {L}}_2$ norms in Eq. (3.14), or equivalently by assuming that

$$\begin{aligned} \max _{i\in I_{\mathcal {M}}}\Vert E_1^{(i)}\circ E_2^{(i)}-(E^{\mathcal {M}})^{(i)}:\,{\mathbb {L}}_2(\tau _i)\rightarrow {\mathbb {L}}_2(\tau _i)\Vert \le c_2\,. \end{aligned}$$

One can ask whether the converse holds. We prove it under the technical assumption that the composition of conditional expectations $E_1\circ E_2$ cancels off-diagonal terms in the decomposition of ${\mathcal {M}}$:

$$\begin{aligned} E_{1*}\circ E_{2*}=E_{1*}\circ E_{2*}\circ \mathcal {P}_{\mathcal {M}}\,. \end{aligned}$$

(3.16)

This is for instance the case when ${\mathcal {M}}\subset {{\mathcal {N}}}_1,{{\mathcal {N}}}_2\subset {{\mathcal {N}}}$ forms a commuting square.

Proposition 4

Assume that Eq. (3.16) holds. Then:

1.
$d_1=0$ in Proposition 3 and
2.
strong $\mathcal {L}_2$ clustering is equivalent to conditional $\mathcal {L}_2$ clustering.

The proof for this result is also deferred to Appendix B.3.

We conclude this section by noting a crucial difference between ${\mathbb {L}}_2$ and ${\mathbb {L}}_1$ clusterings: similarly to Definition 3, one could define a notion of strong ${\mathbb {L}}_1$ clustering of correlation with respect to a state $\sigma \in {{\mathcal {D}}}({\mathcal {M}})$:

$$\begin{aligned} \Vert E_1\circ E_2-E^{\mathcal {M}}:\,{\mathbb {L}}_1(\sigma )\rightarrow {\mathbb {L}}_\infty ({{\mathcal {N}}})\Vert \equiv c_1(\sigma )\,<\infty . \end{aligned}$$

This would in particular imply ${\mathrm{cond}{\mathbb {L}}_1}(c_1(\sigma ))$. With this notion, and from an argument very similar to that of the proof of Theorem 2, we could show the following bound on the error term in Lemma 1:

$$\begin{aligned}&\ln \left\{ \int _{-\infty }^\infty \,\mathop {\mathrm{Tr}}\nolimits \left[ \rho _{1}\rho _{{\mathcal {M}}}^{\frac{-1-it}{2}}\rho _{2}\rho _{{\mathcal {M}}}^{\frac{-1+it}{2}} \right] \,\beta _0(t)\,dt \right\} \\&\quad \le 2\,\Vert E_{1}\circ E_2\\&\quad \quad -E^{{\mathcal {M}}}:\,{\mathbb {L}}_1(\rho _{\mathcal {M}})\rightarrow {\mathbb {L}}_\infty \Vert \,D(\rho \Vert \rho _{\mathcal {M}})\,\\&\quad \equiv 2 \,c_1(\rho _{\mathcal {M}})\,D(\rho \Vert \rho _{\mathcal {M}})\,. \end{aligned}$$

From this, one would conclude a strong approximate tensorization result if one could find a uniform bound on $c_1(\sigma )$ for any $\sigma \in {{\mathcal {D}}}({\mathcal {M}})$. However, and as opposed to the case of strong ${\mathbb {L}}_2$ clustering, the constant $c_1(\sigma )$ depends on the state $\sigma $, and can in particular diverge: this is the case whenever there exists $i\in I_{\mathcal {M}}$ such that $\dim ({{\mathcal {H}}}_i)<\infty $, and for a state $\sigma :=|\psi \rangle \langle \psi |_{{{\mathcal {H}}}_i}\otimes \tau _i$ that is pure on ${{\mathcal {H}}}_i$. This justifies our choice of $\mathrm{cond}{\mathbb {L}}_1$ as the better notion of ${\mathbb {L}}_1$ clustering in the quantum setting. After the submission of this manuscript, new insights into this particular problem were shed in [24]. We defer a discussion of their results to Sect. 6.

4 Applications

This section is devoted to two applications of the results of last section. In Sect. 4.1, we show the usefulness of Theorem 2 in the context of modified logarithmic Sobolev inequalities. Then, we derive new entropic uncertainty relations in Sect. 4.2.

4.1 Modified Logarithmic Sobolev Inequalities for Biased Bases

Take ${{\mathcal {H}}}={{\mathbb {C}}}^l$ and assume that the algebra ${{\mathcal {N}}}_1$ is the diagonal onto some orthonormal basis $|e^{(1)}_k\rangle $, whereas ${{\mathcal {N}}}_2$ is the diagonal onto the basis $|e^{(2)}_k\rangle $. Moreover, choose ${\mathcal {M}}$ to be the trivial algebra ${\mathbb {C}}{\mathbb {1}}_\ell $. Hence for each $i\in \{1,2\}$, $E_i$ denotes the Pinching map onto the diagonal $\mathrm{span}\left\{ |e_k^{(i)}\rangle \langle e_k^{(i)}|\right\} $ and $E^{\mathcal {M}}=\frac{{\mathbb {1}}}{\ell }\mathop {\mathrm{Tr}}\nolimits [\cdot ]$. Then, for any $X\ge 0$:

$$\begin{aligned} \Vert (E_1\circ E_2-E^{\mathcal {M}})(X)\Vert _\infty&=\Big \Vert \sum _{k,k'}|e^{(1)}_k\rangle \langle e^{(1)}_k|e^{(2)}_{k'}\rangle \langle e^{(2)}_{k'}| X| e^{(2)}_{k'}\rangle \langle e^{(2)}_{k'}|e^{(1)}_k\rangle \langle e^{(1)}_k|\\&\quad -\frac{1}{\ell }\,|e^{(1)}_k\rangle \langle e^{(1)}_k|\,\langle e^{(2)}_{k'}|X|e^{(2)}_{k'}\rangle \Big \Vert _\infty \\&=\Big \Vert \sum _{k,k'}\,\big ( |\langle e^{(1)}_k|e^{(2)}_{k'}\rangle |^2-\frac{1}{\ell } \big )|e^{(1)}_k\rangle \langle e^{(1)}_k|\,\langle e^{(2)}_{k'}| X| e^{(2)}_{k'}\rangle \Big \Vert _\infty \\&=\max _{k}\,\sum _{k'}\,\Big | |\langle e^{(1)}_k|e^{(2)}_{k'}\rangle |^2-\frac{1}{\ell } \Big | \quad \langle e^{(2)}_{k'}| X| e^{(2)}_{k'}\rangle \\&\le \varepsilon \,\frac{1}{\ell }\mathop {\mathrm{Tr}}\nolimits [X] \end{aligned}$$

where $\varepsilon :=\ell \,\max _{k,k'}\,\Big | |\langle e^{(1)}_k|e^{(2)}_{k'}\rangle |^2-\frac{1}{\ell } \Big |$. Hence

$$\begin{aligned} \Vert (E_1\circ E_2-E^{\mathcal {M}}):{\mathbb {L}}_1(\ell ^{-1}{\mathbb {1}})\rightarrow {\mathbb {L}}_\infty \Vert \le \varepsilon \,, \end{aligned}$$

so that by choosing $\eta =\rho =\mathcal {P}_{\rho _{\mathcal {M}}}(\rho )$ in Theorem 2, as long as $\varepsilon <1$, for any $\rho \in {{\mathcal {D}}}({\mathbb {C}}^\ell )$, AT($(1-\varepsilon )^{-1},0$) holds:

$$\begin{aligned} D(\rho \Vert \ell ^{-1}{\mathbb {1}})\le \frac{1}{1-\varepsilon }\,(D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho )))\,. \end{aligned}$$

(4.1)

This result is related to Example 4.5 of [31]. There, the author obtains an inequality that can be rewritten in the following form:

$$\begin{aligned} D(\rho \Vert \ell ^{-1}{\mathbb {1}})\le 4 \left( \ln _{1-\delta } \left( \frac{2}{3 \ell + 5} \right) + 1 \right) \,(D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho )))\,, \end{aligned}$$

(4.2)

where $\delta $ here is related with $\varepsilon $ in our example by:

$$\begin{aligned} \delta \ge \frac{1}{\ell } (1- \varepsilon ). \end{aligned}$$

The approximate tensorization derived in (4.1) can be used to find exponential convergence in relative entropy of the primitive quantum Markov semigroup ${rm e}^{t{{\mathcal {L}}}}$, where

$$\begin{aligned} {{\mathcal {L}}}(X):=E_1(X)+E_2(X)-2X\,. \end{aligned}$$

Indeed, for any state $\rho \in {{\mathcal {D}}}({{\mathcal {H}}})$, denoting by $\rho _t$ the evolved state ${rm e}^{t{{\mathcal {L}}}}(\rho )$ up to time t, the fact that $D(\rho _t\Vert \ell ^{-1}{\mathbb {1}})\le {rm e}^{-\alpha t}D(\rho \Vert \ell ^{-1}{\mathbb {1}})$ holds for some $\alpha >0$ is equivalent to the so-called modified logarithmic Sobolev inequality. Let us recall that ${{\mathcal {L}}}$ is said to satisfy a positive modified logarithmic Sobolev inequality (MLSI for short) if there exists a constant $\alpha >0$ such that the following inequality holds for every $\rho \in \mathcal {D}(\mathcal {H})$:

$$\begin{aligned} \alpha D(\rho \Vert \ell ^{-1}{\mathbb {1}}) \le - \mathop {\mathrm{Tr}}\nolimits [ {{\mathcal {L}}}(\rho ) ( \ln \rho - \ln \ell ^{-1}{\mathbb {1}})] \, . \end{aligned}$$

In such a case, the optimal $\alpha $ for which the previous inequality holds is called the modified logarithmic Sobolev constant. In this particular setting, by [27, Lemma 3.4], the MLSI for ${{\mathcal {L}}}$ can be written as

$$\begin{aligned} \alpha D(\rho \Vert \ell ^{-1}{\mathbb {1}})\le D(\rho \Vert E_{1*}(\rho ))+D(E_{1*}(\rho )\Vert \rho )+D(\rho \Vert E_{2*}(\rho ))+D(E_{2*}(\rho )\Vert \rho )\,. \end{aligned}$$

By positivity of the relative entropy, it suffices to prove the existence of a constant $\alpha >0$ such that

$$\begin{aligned} \alpha D(\rho \Vert \ell ^{-1}{\mathbb {1}})\le D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho ))\,. \end{aligned}$$

This last inequality is equivalent to (4.1) for $\alpha =1-\varepsilon $. Therefore, Theorem 2 yields as a consequence the fact that the generator ${{\mathcal {L}}}$ defined above satisfies a MLSI of constant bounded by $1-\varepsilon $.

4.2 Tightened Entropic Uncertainty Relations

Given a function $f\in {\mathbb {L}}_2({\mathbb {R}})$ and its Fourier transform ${\mathcal {F}}[f]$ with $\Vert f\Vert _{{\mathbb {L}}_2({\mathbb {R}})}=\Vert {\mathcal {F}}[f]\Vert _{{\mathbb {L}}_2({\mathbb {R}})}=1$, Weyl proved in [45] the following uncertainty relation:

$$\begin{aligned} V(|f|^2)\,V(|{\mathcal {F}}[f]|^2)\ge \frac{1}{16\pi ^2}\,, \end{aligned}$$

(4.3)

where, given a probability distribution function g, V(g) denotes its variance. The uncertainty inequality means that $|f|^2$ and $|{\mathcal {F}}[f]|^2$ cannot both be concentrated arbitrarily close to their corresponding means. An entropic strengthening of (4.3) was derived independently by Hirschmann [26] and Stam [40], and tightened later on by Beckner [6]:

$$\begin{aligned} H(|f|^2)+H(|{\mathcal {F}}[f]|^2)\ge \ln \frac{\mathrm{e}}{2}\,, \end{aligned}$$

where $H(g):=-\int _{{\mathbb {R}}} \,g(x)\ln g(x)\,dx$ stands for the differential entropy functional. In the quantum mechanical setting, this inequality has the interpretation that the total amount of uncertainty, as quantified by the entropy, of non-commuting observables (i.e. the position and momentum of a particle) is uniformly lower bounded by a positive constant independently of the state of the system. For an extensive review of entropic uncertainty relations for classical and quantum systems, we refer to the recent survey [15].

More generally, given two POVMs ${\mathbf {X}}:= \{X_x\}_{x}$ and ${\mathbf {Y}}:=\{Y_y\}_{y}$ on a quantum system A, and in the presence of side information M that might help to better predict the outcomes of ${\mathbf {X}}$ and ${\mathbf {Y}}$, the following state-dependent tightened bound was found in [19] (see also [7] for the special case of measurements in two orthonormal bases and [36] for the case without memory): for any bipartite state $\rho _{AM}\in {{\mathcal {D}}}({{\mathcal {H}}}_A\otimes {{\mathcal {H}}}_M)$,

$$\begin{aligned} S(X|M)_{(\Phi _{{\mathbf {X}}}\otimes {\mathrm{id}}_M)(\rho )}+S(Y|M)_{(\Phi _{{\mathbf {Y}}}\otimes {\mathrm{id}}_M)(\rho )}\ge - \ln c'+S(A|M)_\rho \,, \end{aligned}$$

(4.4)

with $c'=\max _{x,y}\{\mathop {\mathrm{Tr}}\nolimits (X_x\,Y_x)\}$, where $\Phi _{{\mathbf {Z}}}$ denotes the quantum-classical channel corresponding to the measurement ${\mathbf {Z}}\in \{{\mathbf {X}},{\mathbf {Y}}\}$:

$$\begin{aligned} \Phi _{{\mathbf {Z}}}(\rho _A):=\sum _{z}\,\mathop {\mathrm{Tr}}\nolimits (\rho _A Z_z)\,|z\rangle \langle z|_Z\,. \end{aligned}$$

The above inequality has been recently extended to the setting where the POVMs are replaced by two arbitrary quantum channels in [22]. In this section, we restrict ourselves to the setting of [7], so that the measurement channels reduce to the Pinching maps of Sect. 4.1. First of all, we notice that the relation (4.4) easily follows from Corollary 1:

Example 1

Take ${{\mathcal {H}}}_{AM}={{\mathcal {H}}}_A \otimes {{\mathcal {H}}}_M$ a bipartite system and, as in the case of Sect. 4.1, assume that the algebra ${{\mathcal {N}}}_1$ is the diagonal onto some orthonormal basis $|e^{(\mathcal {X})}_x\rangle $ in ${{\mathcal {H}}}_A $, whereas ${{\mathcal {N}}}_2$ is the diagonal onto the basis $|e^{(\mathcal {Y})}_y\rangle $ also in ${{\mathcal {H}}}_A $. Moreover, choose ${\mathcal {M}}$ to be the algebra ${\mathbb {C}}{\mathbb {1}}_\ell \otimes \mathcal {B}({{\mathcal {H}}}_M)$. Hence for each alphabet $\mathcal {Z}\in \{\mathcal {X},\mathcal {Y}\}$, $E_\mathcal {Z}$ denotes the Pinching map onto the diagonal $\mathrm{span}\left\{ |e_z^{(\mathcal {Z})}\rangle \langle e_z^{(\mathcal {Z})}|\right\} $, which we tensorize with the identity map in M, and $E^{\mathcal {M}}\otimes {\mathrm{id}}_M=\frac{1}{d_A}{\mathbb {1}}_A \otimes \mathop {\mathrm{Tr}}\nolimits _A[\cdot ] $. Then, for every $\rho _{AM} \in \mathcal {D}({{\mathcal {H}}}_{AM})$,

$$\begin{aligned}&S(X|M)_{(E_{\mathcal {X}}\otimes {\mathrm{id}}_M)(\rho _{AM})} \\&\quad = - D\left( (E_{\mathcal {X}}\otimes {\mathrm{id}}_M)(\rho _{AM}) \Big | \Big | \frac{{\mathbb {1}}_A}{d_A} \otimes \rho _M \right) +\ln d_A \\&\quad = D(\rho _{AM} || (E_{\mathcal {X}}\otimes {\mathrm{id}}_M)(\rho _{AM})) - D(\rho _{AM} || (E^{\mathcal {M}}\otimes {\mathrm{id}}_M)(\rho _{AM})) +\ln d_A, \end{aligned}$$

where the last equality is derived from [27, Lemma 3.4]. Hence, since

$$\begin{aligned} D(\rho _{AM} || (E^{\mathcal {M}}\otimes {\mathrm{id}}_M)(\rho _{AM})) = - S(A| M)_{\rho _{AM}} + \ln d_A , \end{aligned}$$

by virtue of Corollary 1 we have

$$\begin{aligned}&S(X|M)_{(E_{\mathcal {X}}\otimes {\mathrm{id}}_M)(\rho _{AM})} + S(Y|M)_{(E_{\mathcal {Y}}\otimes {\mathrm{id}}_M)(\rho _{AM})} \\&\quad \ge D(\rho _{AM} || (E_{\mathcal {X}}\otimes {\mathrm{id}}_M)(\rho _{AM}))+ D(\rho _{AM} || (E_{\mathcal {Y}}\otimes {\mathrm{id}}_M)(\rho _{AM}))\\&\qquad - 2 D(\rho _{AM} || (E^{\mathcal {M}}\otimes {\mathrm{id}}_M)(\rho _{AM})) + 2 \ln d_A\\&\quad \ge S(A|M)_{\rho _{AM}} - d + \ln d_A , \end{aligned}$$

where

$$\begin{aligned} d:=\sup _{\rho \in {{\mathcal {D}}}({{\mathcal {N}}}_{2})}\inf \big \{\ln (\lambda )|\,E_{\mathcal {X}} \otimes {\mathrm{id}}_M (\rho )\le \lambda \eta \,\text { for some }\,\eta \in {{\mathcal {D}}}({\mathcal {M}}) \big \} . \end{aligned}$$

Now, taking into account the computations of Sect. 4.1, notice that

$$\begin{aligned} d= \ln \left( d_A \, \underset{x, \, y}{\text {max}}\, |\langle e^{(\mathcal {X})}_x|e^{(\mathcal {Y})}_{y}\rangle |^2 \right) \, , \end{aligned}$$

obtaining thus expression (4.4).

However, close to the completely mixed state, this inequality is not tight whenever ${\mathbf {X}}$ and ${\mathbf {Y}}$ are not mutually unbiased bases (i.e. $\exists x\in {{\mathcal {X}}},y\in \mathcal {Y}$ such that $|\langle X^x|Y^y\rangle |^2>\frac{1}{d_A}$). Here, we derive the following strengthening of Eq. (4.4) when $d_M=1$ as a direct consequence of Theorem 2:

Corollary 2

Given a finite alphabet $\mathcal {Z}\in \{\mathcal {X},\mathcal {Y}\}$, let $E_{\mathcal {Z}}$ denote the Pinching channels onto the orthonormal basis $\{|e^{(\mathcal {Z})}_z\rangle \}_{z\in \mathcal {Z}}$ corresponding to the measurement ${\mathbf {Z}}$. Assume further that $c_1= d_A\max _{x,y}\big | |\langle e^{(\mathcal {X})}_x|e^{(\mathcal {Y})}_y\rangle |^2-\frac{1}{d_A} \big |<1$. Then the following strengthened entropic uncertainty relation holds for any state $\rho \in {{\mathcal {D}}}({{\mathcal {H}}}_{A})$,

$$\begin{aligned} S(X)_{E_{\mathcal {X}}(\rho )}+S(Y)_{E_{\mathcal {Y}}(\rho )}\ge (1+c_1)\,S(A)_\rho +(1-c_1)\ln d_A\,. \end{aligned}$$

(4.5)

Proof

Following the first lines of Example 1 for $d_M=1$, we have

$$\begin{aligned} S(X)_{E_{\mathcal {X}}(\rho )}+ S(Y)_{E_{\mathcal {Y}}(\rho )}= & {} D(\rho || E_{\mathcal {X}}(\rho )) + D(\rho || E_{\mathcal {Y}}(\rho )) \\&- 2 D(\rho || E^{\mathcal {M}}(\rho )) + 2 \ln d_A , \end{aligned}$$

where $E^{\mathcal {M}}=\frac{{\mathbb {1}}}{\ell }\mathop {\mathrm{Tr}}\nolimits [\cdot ]$. Then, by virtue of Theorem 2,

$$\begin{aligned}&D(\rho || E_{\mathcal {X}}(\rho )) + D(\rho || E_{\mathcal {Y}}(\rho )) - 2 D(\rho || E^{\mathcal {M}}(\rho )) \\&\quad \ge (-1 - c_1) D(\rho || E^{\mathcal {M}}(\rho )) - D_\text {max} (E_\mathcal {X} \circ E_\mathcal {Y} (\rho ) \Vert E_\mathcal {X} \circ E_\mathcal {Y} (\eta ) ) \\&\qquad - c_1 D (\eta \Vert \mathcal {P}_{\rho _\mathcal {M}}(\rho ) ) \, \end{aligned}$$

for any $\eta = \mathcal {P}_{\rho _\mathcal {M}}(\rho )$, and by further choosing $\eta =\rho $, the last two terms above vanish. Thus, we have:

$$\begin{aligned} S(X)_{E_{\mathcal {X}}(\rho )}+ S(Y)_{E_{\mathcal {Y}}(\rho )} \ge (-1 - c_1) \, D(\rho || E^{\mathcal {M}}(\rho )) + 2 \ln d_A . \end{aligned}$$

To conclude, just notice that

$$\begin{aligned} D(\rho || E^{\mathcal {M}}(\rho )) = - S(A)_{\rho } + \ln d_A , \end{aligned}$$

$\square $

Analogously, we can study the case for three different orthonormal bases (see [7]). For that, let us recall that given $ {{\mathcal {N}}}_1, {{\mathcal {N}}}_2, {{\mathcal {N}}}_3 \subset {{\mathcal {N}}}$ von Neumann subalgebras and ${\mathcal {M}}\subset {{\mathcal {N}}}_1 \cap {{\mathcal {N}}}_2 \cap {{\mathcal {N}}}_3$, if we consider their associated conditional expectations $E_i$ with respect to a state $\sigma $, and for each pair $({{\mathcal {N}}}_i, {{\mathcal {N}}}_j)$ a result of AT($c_{ij}, d_{ij}$) holds, then for every $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$:

$$\begin{aligned} D(\rho || E_*^{\mathcal {M}}(\rho ))\le & {} \frac{2}{3} \, \underset{i, \, j \in \left\{ 1,2,3 \right\} }{\text {max}} \left\{ c_{ij} \right\} \left( \, D(\rho || E_{1*} (\rho ) ) + D(\rho || E_{2*} (\rho ) ) \right. \nonumber \\&\left. + D(\rho || E_{3*} (\rho ) ) \, \right) + \frac{d_{12} + d_{13} + d_{23}}{3}. \end{aligned}$$

(4.6)

Corollary 3

Given a finite alphabet $I \in \{\mathcal {X},\mathcal {Y}, \mathcal {Z}\}$, and using the same notation that in Corollary 2, assume that

$$\begin{aligned} c_1= d_A \underset{I, \, J \in \left\{ \mathcal {X}, \mathcal {Y}, \mathcal {Z} \right\} }{\text {max}} \left| |\langle e^{(I)}_i|e^{(J)}_j\rangle |^2-\frac{1}{d_A} \right| <1 . \end{aligned}$$

Then the following strengthened entropic uncertainty relation holds for any state $\rho \in {{\mathcal {D}}}({{\mathcal {H}}}_{A})$,

$$\begin{aligned} S(X)_{E_{\mathcal {X}}(\rho )}+S(Y)_{E_{\mathcal {Y}}(\rho )}+S(Z)_{E_{\mathcal {Z}}(\rho )}\ge (2+c_1)\,S(A)_\rho +(1-c_1)\ln d_A\,. \end{aligned}$$

5 Lattice Spin Systems with Commuting Hamiltonians

In this section, we further control the strong and weak constants appearing in Theorem 2 in the context of lattice spin systems, and compare them with previous conditions in the classical and quantum literature. The main result presented in this section is Theorem 3, where we show that the classical Glauber dynamics embedded in a quantum system satisfies a strong approximate tensorization AT(1, 0) at infinite temperature and presents an approximate tensorization AT(c, 0) with small multiplicate constant when the temperature is high enough. This result is contained in Sect. 5.2.

Given a finite lattice $\Lambda \subset \subset {\mathbb {Z}}^d$, we define the tensor product Hilbert space ${{\mathcal {H}}}:={{\mathcal {H}}}_\Lambda \equiv \bigotimes _{k\in \Lambda }{{\mathcal {H}}}_k$, where for each $k\in \Lambda $, ${{\mathcal {H}}}_k\simeq {\mathbb {C}}^\ell $, $\ell \in {\mathbb {N}}$. Then, let $\Phi :\Lambda \rightarrow {{\mathcal {B}}}_{\mathrm{sa}}({{\mathcal {H}}}_{\Lambda }) $ be an r-local potential, i.e. for any $j\in \Lambda $, $\Phi (j)$ is self-adjoint and supported on a ball of radius r around site j. We assume further that $\Vert \Phi (j) \Vert \le K$ for some constant $K<\infty $. The potential $\Phi $ is said to be a commuting potential if for any $i,j\in \Lambda $, $[\Phi (i),\Phi (j)]=0$. Given such a local, commuting potential, the Hamiltonian on a subregion $A\subseteq \Lambda $ is defined as

$$\begin{aligned} H_A=\sum _{j\in A}\,\Phi (j)\,. \end{aligned}$$

(5.1)

Next, the corresponding Gibbs state corresponding to the region A and at inverse temperature $\beta $ is defined as

$$\begin{aligned} \sigma _A=\frac{\mathrm{e}^{-\beta H_A}}{\mathop {\mathrm{Tr}}\nolimits [\mathrm{e}^{-\beta H_A}]}\,. \end{aligned}$$

(5.2)

Note that this is in general not equal to the state $\mathop {\mathrm{Tr}}\nolimits _B[\sigma _\Lambda ]$.

We begin by introducing Davies semigroups on lattice spin systems. These are the most studied examples of Markovian dynamics studied in this context, together with heat-bath generators defined through Petz recovery maps [3, 29, 43]. Thanks to Theorem 1, we know that the conditional expectations arising from both dynamics coincide. Hence, for the rest of the paper, all the results presented will be independent of the choice of underlying dynamics.

5.1 Davies Generators on Lattice Spin Systems

Consider the setting introduced in Sect. 2.3 and, in particular, the Hamiltonian modelling the system-bath interaction. As mentioned before, the evolution on the system can be approximated by a quantum Markov semigroup whose generator is of the following form:

$$\begin{aligned} {{\mathcal {L}}}^{\mathrm{D},\beta }_\Lambda (X)=i[H_\Lambda ,X]+\sum _{k\in \Lambda }\,{{\mathcal {L}}}^{\mathrm{D},\beta }_k(X)\,, \end{aligned}$$

(5.3)

where

$$\begin{aligned} {{\mathcal {L}}}^{\mathrm{D},\beta }_k(X)=\sum _{\omega ,\alpha }\,\chi ^{\beta }_{\alpha ,k}(\omega )\,\Big ( S_{\alpha ,k}^*(\omega )XS_{\alpha ,k}(\omega )-\frac{1}{2}\,\big \{ S_{\alpha ,k}^*(\omega )S_{\alpha ,k}(\omega ),X \big \} \Big )\,. \end{aligned}$$

(5.4)

Similarly, define the generator ${{\mathcal {L}}}^\beta _A$ by restricting the sum in Eq. (5.3) to the sublattice A:

$$\begin{aligned} {{\mathcal {L}}}^{\mathrm{D},\beta }_A(X)=i[H_A,X]+\sum _{k\in A}\,{{\mathcal {L}}}^{\mathrm{D},\beta }_k(X)\,. \end{aligned}$$

(5.5)

Note that ${{\mathcal {L}}}^{\mathrm{D},\beta }_A$ acts non-trivially on the boundary of A, denoted by $A_\partial :=\{k\in \Lambda :\,d(k,A)\le r\}$. Then, for any region $A\subset \Lambda $, we define the conditional expectation onto the algebra ${{\mathcal {N}}}_A$ of fixed points of ${{\mathcal {L}}}_A$ with respect to the Gibbs state $\sigma =\sigma _\Lambda $ as follows [28]: given an adequate decomposition ${{\mathcal {H}}}_\Lambda :=\bigoplus _{i\in I_{{{\mathcal {N}}}_A}}\,{{\mathcal {H}}}_i^A\otimes {{\mathcal {K}}}_i^A$ of the total Hilbert space ${{\mathcal {H}}}_\Lambda $ of the lattice spin system,

$$\begin{aligned} {{\mathcal {N}}}_A&=\bigoplus _{i\in I_{{{\mathcal {N}}}_A}}\,{{\mathcal {B}}}({{\mathcal {H}}}_i^A)\otimes {\mathbb {1}}_{{{\mathcal {K}}}_i^A}\,, \nonumber \\ E^{\mathrm{D},\beta }_A[X]&:=\lim _{t\rightarrow \infty }\mathrm{e}^{t{{\mathcal {L}}}^{\mathrm{D},\beta }_A}(X)\equiv \sum _{i\in I_{{{\mathcal {N}}}_A}}\, \mathop {\mathrm{Tr}}\nolimits _{{{\mathcal {K}}}^A_i}(P^{A}_i ({\mathbb {1}}_{{{\mathcal {H}}}_i^A}\otimes \sigma _i^A)\,X P^{A}_i) \otimes {\mathbb {1}}_{{{\mathcal {K}}}^A_i}\,, \end{aligned}$$

(5.6)

for some fixed full-rank states $\sigma _i^A$ on ${{\mathcal {K}}}_i^A$. It was shown in Lemma 11 of [28] that the generator of the Davies semigroups corresponding to a local commuting potential is frustration-free. This means that the state $\sigma $ is in the kernels of all ${{\mathcal {L}}}_A^{\mathrm{D},\beta }$, $A\subseteq \Lambda $. Therefore, the conditional expectations $E^{\mathrm{D},\beta }_A$ are all defined with respect to $\sigma $.

In the next section, we study the weak approximate tensorization of the conditional expectations $E_A^{D,\beta }\equiv E_A^\beta $ in the case of a classical Hamiltonian. We start with the following simple observation for commuting Hamiltonians.

Proposition 5

Let $A,B\subset \Lambda $ be two regions separated by at least a distance 2r, that is such that $A_\partial \cap B_\partial =\emptyset $. Then ${{\mathcal {N}}}_A$ and ${{\mathcal {N}}}_B$ form a commuting square, that is,

$$\begin{aligned} E_A^{\beta }\circ E_B^{\beta }=E_B^{\beta }\circ E_A^{\beta } = E_{A\cup B}^{\beta }\,. \end{aligned}$$

(5.7)

Consequently, for all $\rho \in {{\mathcal {D}}}({{\mathcal {H}}}_\Lambda )$,

$$\begin{aligned} D\big (\rho \Vert E_{A\cup B*}^{\beta }(\rho )\big )\le D\big (\rho \Vert E_{A*}^{\beta }(\rho )\big )+D\big (\rho \Vert E_{B*}^{\beta }(\rho )\big )\,. \end{aligned}$$

(5.8)

Proof

Remark that by definition of the map ${{\mathcal {L}}}_A^{D,\beta }$, it only acts non-trivially on $A_\partial $ and as identity on $(A_\partial )^c$. Consequently, as $E_A=\lim _{t\rightarrow \infty } e^{t{{\mathcal {L}}}_A^{D,\beta }}$, this property carries over to the conditional expectation and we have $E_A=E_A\otimes \mathbb {1}_{{{\mathcal {H}}}_{A_\partial ^c}}$ by slight abuse of notations. Similarly, $E_B=E_B\otimes \mathbb {1}_{{{\mathcal {H}}}_{B_\partial ^c}}$. This shows the result since $A_\partial \cap B_\partial =\emptyset $. $\square $

5.2 Classical Hamiltonian Over Quantum Systems

In this section, we investigate the case of a quantum lattice spin system undergoing a classical Glauber dynamics, whose framework was already studied in [16]. These semigroups correspond to Davies generators whose Hamiltonian is classical, that is, diagonal in a product basis of ${{\mathcal {H}}}_\Lambda $. In order to make the connection with the classical Glauber dynamics over a classical system (i.e. initially diagonal in the product basis), we introduce the generator more explicitly: consider a lattice spin system over $\Gamma ={\mathbb {Z}}^d$ with classical configuration space $S=\{+1,-1\}$, and, for each $\Lambda \subset \Gamma $, denote by $\Omega _\Lambda =S^\Lambda $ the space of configurations over $\Lambda $. Next, given a classical finite-range, translationally invariant potential $\{J_A\}_{A\in \Gamma }$ and a boundary condition $\tau \in \Omega _{\Lambda ^c}$, define the Hamiltonian over $\Lambda $ as

$$\begin{aligned} H_\Lambda ^\tau (\sigma )=-\sum _{A\cap \Lambda \ne 0}\,J_A(\sigma \times \tau ),~~~~~~\forall \sigma \in \Omega _\Lambda \,. \end{aligned}$$

The classical Gibbs state corresponding to such Hamiltonian is then given by

$$\begin{aligned} \mu _\Lambda ^\tau (\sigma )=(Z_{\Lambda }^\tau )^{-1}\,\exp \big ( -H_{\Lambda }^\tau (\sigma )\big )\,, \end{aligned}$$

Next, define the Glauber dynamics for a potential J as the Markov process on $\Omega _\Lambda $ with the generator

$$\begin{aligned} (L_\Lambda f)(\sigma )=\sum _{x\in \Lambda }\,c_{J}(x,\sigma )\nabla _xf(\sigma )\,, \end{aligned}$$

where $\nabla _xf(\sigma )=f(\sigma ^x)-f(\sigma )$ and $\sigma ^x$ is the configuration obtained by flipping the spin at position x. The numbers $c_J(x,\sigma )$ are called transition rates and must satisfy the following assumptions:

1.
There exist $c_m,c_M$ such that $0<c_m\le c_J(x,\sigma )\le c_M<\infty $ for all $x,\sigma $.
2.
$c_J(x,.)$ depends only on spin values in $b_r(x)$.
3.
For all $k\in \Gamma $, $c_J(x,\sigma ')=c_J(x+k,\sigma )$ id $\sigma '(y)=\sigma (y+k)$ for all y.
4.
Detailed balance: for all $x\in \Gamma $, and all $\sigma $
$$\begin{aligned} \exp \left( -\sum _{A\ni x}J_A(\sigma )\right) c_J(x,\sigma )=c_J(x,\sigma ^x)\exp \left( -\sum _{A\ni x}J_A(\sigma ^x)\right) \,. \end{aligned}$$

These assumptions constitute sufficient conditions for the corresponding Markov process to have the Gibbs states over $\Lambda $ as stationary points. Next, we introduce the notion of a quantum embedding of the aforementioned classical Glauber dynamics. This is the Lindbladian of corresponding Lindblad operators given by

$$\begin{aligned} L_{x,\eta }:=\sqrt{c_J(x,\eta )}\,|\eta ^x\rangle \langle \eta |\otimes {\mathbb {1}}\,,~~~\forall x\in \Lambda ,\,\eta \in \Omega _{b_x(r)}\,. \end{aligned}$$

(5.9)

It was shown in [16] that such a dynamics is KMS-symmetric with respect to the state $\mu _\Lambda ^\tau $ as embedded into the computational basis. Moreover, the set of fixed points in the Schrödinger picture corresponds to the convex hull of the set of Gibbs states over $\Lambda $, $\{\mu _\Lambda ^\tau |\tau \in \Omega _{\Lambda ^c}\}$. In the Heisenberg picture, this implies that the fixed-point algebras ${\mathcal {F}}({{\mathcal {L}}}_A)$ are expressed as

$$\begin{aligned} {\mathcal {F}}({{\mathcal {L}}}_A):=\bigoplus _{\omega \in \Omega _{\partial A}}\,|\omega \rangle \langle \omega |_{\partial A}\otimes {\mathbb {1}}_{A}\otimes {{\mathcal {B}}}({{\mathcal {H}}}_{A_\partial ^c})\,. \end{aligned}$$

(5.10)

Equivalently,

$$\begin{aligned} E_{A*}(\rho )&=\sum _{\omega \in \Omega _{\partial A}}\,|\omega \rangle \langle \omega |_{\partial A}\otimes \sigma ^\omega _{A}\otimes \mathop {\mathrm{Tr}}\nolimits _A(\langle \omega |\rho |\omega \rangle )\nonumber \\&\equiv \sum _{\omega \in \Omega _{\partial A}}\,|\omega \rangle \langle \omega |_{\partial A}\otimes E_{A*}^\omega (\langle \omega |\rho |\omega \rangle ) \,, \end{aligned}$$

(5.11)

where $\sigma ^\omega _A$ denotes the Gibbs state $\mu ^\omega _A$ embedded into the computational basis.

With this expression at hand we can prove that classical Hamiltonians over quantum systems satisfy the same approximate tensorization than in the classical case.

Theorem 3

Let $A,B\subset \Lambda $. Then, at $\beta =0$, ${{\mathcal {N}}}_A$ and ${{\mathcal {N}}}_B$ form a commuting square, that is,

$$\begin{aligned} E_A^{\beta =0}\circ E_B^{\beta =0}=E_B^{\beta =0}\circ E_A^{\beta =0} = E_{A\cup B}^{\beta =0}\, \end{aligned}$$

(5.12)

and consequently, for all $\rho \in {{\mathcal {D}}}({{\mathcal {H}}}_\Lambda )$,

$$\begin{aligned} D\big (\rho \Vert E_{A\cup B*}^{\beta =0}(\rho )\big )\le D\big (\rho \Vert E_{A*}^{\beta =0}(\rho )\big )+D\big (\rho \Vert E_{B*}^{\beta =0}(\rho )\big )\,. \end{aligned}$$

(5.13)

At finite temperature $\beta >0$, $\mathrm{AT}(c,0)$ holds with

$$\begin{aligned} c=\frac{1}{1-c_1}\qquad \text {where}\quad c_1=\sup _{\omega \in \Omega _{\partial (A\cup B)}}\,\Vert E_A^\omega \circ E_B^\omega -E_{A\cup B}^\omega \,:\,{\mathbb {L}}_1(\mu _{A\cup B}^\omega )\rightarrow {\mathbb {L}}_\infty \Vert \,. \end{aligned}$$

(5.14)

Proof

Equation (5.12) is a direct consequence of the definition of the conditional expectations at $\beta =0$: i.e. $E_{A}^{\beta =0}={\mathbb {1}}_A\otimes \mathop {\mathrm{Tr}}\nolimits _A$. In order to prove that AT(c, 0) holds at positive temperature, we use our main result on approximate tensorization based on Pinching techniques, namely Theorem 2. More specifically, for every $\rho \in {{\mathcal {D}}}({{\mathcal {H}}}_{\Lambda })$, we denote $\rho _{\mathcal {M}}:= E_{A \cup B^*} (\rho )$ and apply Eq. (3.9) to $\eta =\mathcal {P}_{\rho _{\mathcal {M}}}(\rho )$. Thus, we only need to check that $D_{\max }\big (E_{A*}\circ E_{B*}(\rho )\Vert E_{A*}\circ E_{B*}(\eta )\big )=0$. We denote by $\mathcal {P}_A$ the pinching map on the computational basis on a subset A of $\Lambda $. By a simple computation we see that $\mathcal {P}_{(A\cup B)_\partial }\circ \mathcal {P}_{\rho _{\mathcal {M}}}=\mathcal {P}_{(A\cup B)_\partial }$ and

$$\begin{aligned} E_{A*}\circ E_{B*}= E_{A*}\circ E_{B*}\circ \mathcal {P}_{(A\cup B)_\partial }\,, \end{aligned}$$

so that $E_{A*}\circ E_{B*}(\rho )=E_{A*}\circ E_{B*}(\mathcal {P}_{\rho _{\mathcal {M}}}(\rho ))$, which completes the proof. $\square $

In Theorem 3, we have shown that strong approximate tensorization AT(1, 0) holds at infinite temperature for classical Hamiltonians. However, let us remark that it is not clear (and we strongly believe the opposite) that this remains true for non-classical commuting Gibbs states. A first idea to support this intuition has been shown in Proposition 5. We leave a thorough study of this fact for future work.

6 Outlook

In this paper, we introduce and study an extension of the celebrated strong subadditivity of the entropy: given algebras ${{\mathcal {N}}}={{\mathcal {N}}}_1\cap {{\mathcal {N}}}_2$, ${{\mathcal {N}}}_1,{{\mathcal {N}}}_2\subseteq {\mathcal {M}}$, with corresponding conditional expectations $E_1:{\mathcal {M}}\rightarrow {{\mathcal {N}}}_1$, $E_2:{\mathcal {M}}\rightarrow {{\mathcal {N}}}_2$ and $E_{{\mathcal {N}}}:{\mathcal {M}}\rightarrow {{\mathcal {N}}}$, there exist constants $c\ge 1$ and $d\ge 0$ such that

$$\begin{aligned} D(\rho \Vert E_*^{\mathcal {M}}(\rho ))\le c\,\Big (D(\rho \Vert E_{1*}(\rho ))+D(\rho \Vert E_{2*}(\rho ))\Big )+d\qquad \forall \rho \,. \end{aligned}$$

(6.1)

In analogy with its classical analogue, we dubbed this inequality approximate tensorization of the relative entropy.

Since the first submission of this paper, (6.1) has found several extensions and applications in the fields of quantum information theory and many body quantum systems: first, the inequality was used to derive the first proof of the positivity of the modified logarithmic Sobolev inequality constant independently of the system size for Gibbs states of nearest neighbour commuting Hamiltonians on a regular lattice [12]. For this specific class of Gibbs states, the authors showed that the analysis can indeed be reduced to the case of states $\rho $ for which the additive error term in Theorem 2 vanishes, hence providing a direct application to our main result.

More recently, [24] (as well as a new version of [31]) proved a strong approximate tensorization result with multiplicative constant depending on the ${\mathbb {L}}_2$ clustering of the conditional expectations as well as the dimension of the system. Their approximate tensorization was then used to find asymptotically tight exponential entropic decay to equilibrium for various models of noise including quantum Markov semigroups generated by classical graph Laplacians, approximate k-designs, or the quantum Kac master equation. In their extension of (6.1), the noisy system can also be coupled to an arbitrarily large noiseless environment. Although providing a tight approximate tensorization result in the sense that $d=0$ and that it reduces to the exact tensorization in the commuting square setting, their bound however still provides a poor control of the multiplicative constant in the context of Gibbs samplers. We expect that both methods combined will prove useful in proving the uniform positivity of the MLSI constant for generic quantum Gibbs samplers in the near future. Indeed, these techniques, together with a version of Theorem 2, will be used soon to derive positivity of a MLSI for Davies generators in 1D systems [2].

Notes

The definition of (strong) approximate tensorization recently arose in a first version of the paper [31], where it was coined as “adjusted subadditivity of relative entropy”. As explained by the author himself, this definition was already present in an earlier draft of our present article, which we had shared with him (see also the recently published thesis [9]). Furthermore, the techniques that we introduce here are different from his, and more in line with the classical literature on the subject.
Here, we formulate it in our general framework of finite-dimensional $*$-algebras.
Compared to [13], the role of $\rho $ and $\sigma $ is exchanged. The result nevertheless stays the same, as can be readily checked from their proof.

References

Bardet, I.: Estimating the decoherence time using non-commutative Functional Inequalities. arXiv preprint arXiv:1710.01039 (2017)
Bardet, I., Capel, Á., Gao, L., Lucia, A., Pérez-García, D., Rouzé, C.: Logarithmic Sobolev inequality for Davies generators. in preparation (2021)
Bardet, I., Capel, Á., Lucia, A., Pérez-García, D., Rouzé, C.: On the modified logarithmic Sobolev inequality for the heat-bath dynamics for 1D systems. J. Math. Phys. 62(6), 061901 (2021)
Bardet, I., Junge, M., LaRacuente, N., Rouzé, C., França, D.S.: Group transference techniques for the estimation of the decoherence times and capacities of quantum markov semigroups. IEEE Trans. Inf, Theory 67(5), 2878–2909 (2021)
Bardet, I., Rouzé, C.: Hypercontractivity and logarithmic Sobolev Inequality for non-primitive quantum Markov semigroups and estimation of decoherence rates. arXiv preprint arXiv:1803.05379 (2018)
Beckner, W.: Inequalities in Fourier analysis. Ann. Math. 102(1), 159–182 (1975)
Article MathSciNet Google Scholar
Berta, M., Christandl, M., Colbeck, R., Renes, J.M., Renner, R.: The uncertainty principle in the presence of quantum memory. Nature Phys. 6(9), 659–662 (2010)
Article ADS Google Scholar
Berta, M., Christandl, M., Renner, R.: The quantum reverse Shannon theorem based on one-shot information theory. Commun. Math. Phys. 306(3), 579 (2011)
Article ADS MathSciNet Google Scholar
Capel, Á.: Quantum logarithmic sobolev inequalities for quantum many-body systems: An approach via quasi-factorization of the relative entropy. Ph.D. thesis at Universidad Autónoma de Madrid (2019)
Capel, Á., Lucia, A., Pérez-García, D.: Quantum conditional relative entropy and quasi-factorization of the relative entropy. J. Physics A: Math. Theor. 51, 484001 (2018)
Article MathSciNet Google Scholar
Capel, Á., Lucia, A., Pérez-García, D.: Superadditivity of quantum relative entropy for general states. IEEE Trans. Inf. Theory 64(7), 4758–4765 (2018)
Article MathSciNet Google Scholar
Capel, Á., Rouzé, C., França, D. S.: The modified logarithmic Sobolev inequality for quantum spin systems: classical and commuting nearest neighbour interactions. arXiv preprint arXiv:2009.11817 (2020)
Carlen, E.A., Vershynina, A.: Recovery map stability for the data processing inequality. J. Physics A: Math. Theor. 53(3), 035204 (2017)
Article ADS MathSciNet Google Scholar
Cesi, F.: Quasi-factorization of the entropy and logarithmic Sobolev inequalities for Gibbs random fields. Probab. Theory Relat. Fields 120(4), 569–584 (2001)
Article MathSciNet Google Scholar
Coles, P.J., Berta, M., Tomamichel, M., Wehner, S.: Entropic uncertainty relations and their applications. Rev. Mod. Phys. 89(1), 015002 (2017)
Article ADS MathSciNet Google Scholar
Cubitt, T.S., Lucia, A., Michalakis, S., Perez-Garcia, D.: Stability of local quantum dissipative systems. Commun. Math. Phys. 337(3), 1275–1315 (2015)
Article ADS MathSciNet Google Scholar
Dai Pra, P., Paganoni, A.M., Posta, G.: Entropy inequalities for unbounded spin systems. Ann. Probab. 30(4), 1959–1976 (2002)
Article MathSciNet Google Scholar
Datta, N.: Min- and max- relative entropies and a new entanglement monotone. IEEE Trans. Inf. Theory 55, 2816–2826 (2009)
Article MathSciNet Google Scholar
Frank, R.L., Lieb, E.H.: Extended quantum conditional entropy and quantum uncertainty inequalities. Commun. Math. Phys. 323(2), 487–495 (2013)
Article ADS MathSciNet Google Scholar
Gao, L., Junge, M., LaRacuente, N.: Unifying Entanglement with Uncertainty via Symmetries of Observable Algebras. arXiv preprint arXiv:1710.10038 (2017)
Gao, L., Junge, M., LaRacuente, N.: Fisher information and logarithmic Sobolev inequality for matrix valued functions. Ann. Henri Poincaré 21, 3409–3478 (2020)
Gao, L., Junge, M., LaRacuente, N.: Uncertainty principle for quantum channels. 2018 IEEE International Symposium on Information Theory (ISIT), 996–1000 (2018)
Gao, L., Junge, M., LaRacuente, N.: Relative entropy for von Neumann subalgebras. Int. J. Math. 31(06), 2050046 (2020)
Article MathSciNet Google Scholar
Gao, L., Rouzé, C.: Spectral methods for entropy contraction coefficients. arXiv preprint arXiv:2102.04146 (2021)
Gorini, V., Kossakowski, A., Sudarshan, E.C.G.: Complete positive dynamical semigroups of N-level systems. J. Math. Phys. 17, 821 (1976)
Article ADS Google Scholar
Hirschman, I.I.: A note on entropy. Am. J. Math. 79(1), 152–156 (1957)
Article MathSciNet Google Scholar
Junge, M., LaRacuente, N., Rouzé, C.: Stability of logarithmic sobolev inequalities under a noncommutative change of measure. arXiv preprint arXiv:1911.08533 (2019)
Kastoryano, M.J., Brandão, F.G.S.L.: Quantum Gibbs samplers: The commuting case. Commun. Math. Phys. 344(3), 915–957 (2016)
Article ADS MathSciNet Google Scholar
Kastoryano, M.J., Brandão, F.G.S.L.: Quantum Gibbs samplers: The commuting case. Commun. Math. Phys. 344(3), 915–957 (2016)
Article ADS MathSciNet Google Scholar
Kosaki, H.: Application of the complex interpolation method to a von neumann algebra: non-commutative $\mathbb{L}_p$-spaces. J. Funct. Anal. 56, 29–78 (1984)
Article MathSciNet Google Scholar
Laracuente, N.: Quasi-factorization and Multiplicative Comparison of Subalgebra-Relative Entropy. arXiv preprint arXiv:1912.00983 (2019)
Ledoux, M.: Logarithmic Sobolev inequalities for unbounded spin systems revisited. Séminaire de probabilités (Strasbourg) 35, 167–194 (2001)
MathSciNet MATH Google Scholar
Lieb, E.H.: Convex trace functions and the Wigner–Yanase–Dyson conjecture. Adv. Math. 11(3), 267–288 (1973)
Article MathSciNet Google Scholar
Lieb, E.H., Ruskai, M.B.: Proof of the strong subadditivity of quantum mechanical entropy. J. Math. Phys. 14, 1938–1941 (1973)
Article ADS MathSciNet Google Scholar
Lindblad, G.: On the generators of quantum dynamical semigroups. Commun. Math. Phys. 48(2), 119–130 (1976)
Article ADS MathSciNet Google Scholar
Maasen, H., Uffink, J.B.M.: Generalized entropic uncertainty relations. Phys. Rev. Lett. 60, 1103–1106 (1988)
Article ADS MathSciNet Google Scholar
Ohya, M., Petz, D.: Quantum entropy and its use. Texts and Monographs in Physics. Springer Verlag, Berlin (1993)
Book Google Scholar
Petz, D.: Quantum information theory and quantum statistics. Theoretical and Mathematical Physics. Springer, Berlin (2008)
MATH Google Scholar
Spohn, H., Lebowitz, J.L.: Irreversible thermodynamics for quantum systems weakly coupled to thermal reservoirs. Adv. Chem. Phys. 38, 109–142 (1978)
Google Scholar
Stam, A.J.: Some inequalities satisfied by the quantities of information of Fisher and Shannon. Inf. Control 2(2), 101–112 (1959)
Article MathSciNet Google Scholar
Sutter, D., Berta, M., Tomamichel, M.: Multivariate trace inequalities. Commun. Math. Phys. 352(1), 37–58 (2017)
Article ADS MathSciNet Google Scholar
Takesaki, M.: Theory of Operator Algebras II. Encyclopaedia of Mathematical Sciences, vol. 125. Springer, Berlin Heidelberg, Berlin, Heidelberg (2003)
Book Google Scholar
Temme, K.: Thermalization time bounds for Pauli stabilizer Hamiltonians. Commun. Math. Phys. 350(2), 603–637 (2017)
Article ADS MathSciNet Google Scholar
Umegaki, H.: Conditional expectation in an operator algebra IV. Entropy and information. Kodai Math. Sem. Rep. 14, 59–85 (1962)
Article MathSciNet Google Scholar
Weyl, H.: The Theory of Groups and Quantum Mechanics. Courier Corporation (1950)

Download references

Acknowledgements

IB was supported by Region Ile- de-France in the framework of DIM SIRTEQ. AC was partially supported by a La Caixa-Severo Ochoa grant (ICMAT Severo Ochoa project SEV-2011-0087, MINECO) and acknowledges support from MINECO (grant MTM2017-88385-P), from Comunidad de Madrid (grant QUITEMAD-CM, ref. P2018/TCS-4342) and from ICMAT Severo Ochoa project SEV-2015-0554 (MINECO). CR is grateful to Federico Pasqualotto for useful discussions, and acknowledges financial support from the TUM university Foundation Fellowship. CR and AC acknowledge funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germanys Excellence Strategy EXC-2111 390814868. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 648913).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Inria Paris, Paris, France
Ivan Bardet
Departamento de Análisis Matemático y Matemática Aplicada, Universidad Complutense de Madrid, Madrid, Spain
Ángela Capel
Instituto de Ciencias Matemáticas (CSIC-UAM-UC3M-UCM), Madrid, Spain
Ángela Capel
Department of Mathematics, Technische Universität München, 85748, Garching, Germany
Ángela Capel & Cambyse Rouzé
Munich Center for Quantum Science and Technology (MCQST), München, Germany
Ángela Capel & Cambyse Rouzé

Authors

Ivan Bardet
View author publications
You can also search for this author in PubMed Google Scholar
Ángela Capel
View author publications
You can also search for this author in PubMed Google Scholar
Cambyse Rouzé
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ángela Capel.

Additional information

Communicated by Matthias Christandl.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A Conditional Expectations on Fixed-points of Markovian Evolution

In this section, we consider conditional expectations arising from Petz recovery maps and from Davies generators as introduced in Sect. 2.4. The main result, Theorem 1, states that the corresponding conditional expectations coincide.

1.1 A.1 Conditional Expectations Generated by a Petz Recovery Map

Here, we further discuss the notion of conditional expectations coming from the Petz recovery map. The discussion is largely inspired by some results in [13].

Let $\sigma $ be a faithful density matrix on the finite-dimensional algebra ${{\mathcal {N}}}$ and let ${\mathcal {M}}\subset {{\mathcal {N}}}$ be a subalgebra. We denote by $E_\tau $ the conditional expectation onto ${\mathcal {M}}$ with respect to the completely mixed state (i.e. $E_\tau $ is self-adjoint with respect to the Hilbert-Schmidt inner product). Let us recall the notations and notions introduced in Sect. 2.3 regarding the adjoint of the Petz recovery map and the conditional expectation constructed from it. We show below the form that these concepts take for a bipartite system.

Example 2

Our main example is the case of a bipartite system AB. In this case, ${{\mathcal {N}}}={{\mathcal {B}}}({{\mathcal {H}}}_{AB})$ and ${\mathcal {M}}={\mathbb {1}}_{{{\mathcal {H}}}_A}\otimes {{\mathcal {B}}}({{\mathcal {H}}}_{B})$. Let $\sigma =\sigma _{AB}$ be a faithful density matrix on AB. The partial trace with respect to ${{\mathcal {H}}}_A$ is an example of a conditional expectation $E_\tau $ which is not compatible with $\sigma _{AB}$, in general. With this choice, we obtain:

$$\begin{aligned}&\sigma _{\mathcal {M}}=\sigma _B\,,\\&{\mathcal {A}}_{\sigma _{AB}}(X)=\sigma _{B}^{-\frac{1}{2}}\,\mathop {\mathrm{Tr}}\nolimits _A[\sigma _{AB}^{\frac{1}{2}}\,X\,\sigma _{AB}^{\frac{1}{2}}]\,\sigma _{B}^{-\frac{1}{2}}\,,\qquad \forall X\in {{\mathcal {B}}}({{\mathcal {H}}}_{AB})\, , \\&{{\mathcal {R}}}_{\sigma _{AB}}(\rho _B)=\sigma _{AB}^{\frac{1}{2}}\sigma _{B}^{-\frac{1}{2}}\,\rho _B\,\sigma _{B}^{-\frac{1}{2}}\sigma _{AB}^{\frac{1}{2}}\,,\qquad \forall \rho _{AB}\in {{\mathcal {D}}}({{\mathcal {H}}}_{AB})\,, \end{aligned}$$

where here we identify an operator $X_B$ with ${\mathbb {1}}_A\otimes X_B$ for sake of simplicity. An important remark is that, in general, $E_{\sigma _{AB}*}$ is not a recovery map.

We are now ready to state a first technical proposition, whose content is mostly contained in [13].

Proposition 6

Let $\rho $ be a density matrix on ${{\mathcal {N}}}$. Then the following assertions are equivalent:

1.
$D(\rho \Vert \sigma )=D(\rho _{\mathcal {M}}\Vert \sigma _{\mathcal {M}})$;
2.
$\rho ={{\mathcal {R}}}_\sigma (\rho _{\mathcal {M}})$;
3.
$\rho =E_{\sigma *}(\rho )$;
4.
$D(\rho \Vert E_{\sigma *}(\rho ))=0$.
5.
$D(\rho \Vert \sigma )=D(E_{\sigma *}(\rho )\Vert \sigma )$;

Remark that $(1)\Leftrightarrow (2)$ is Petz condition for equality in the data processing inequality. The equivalence $(3)\Leftrightarrow (4)$ is obvious, and $(4)\Leftrightarrow (5)$ is a consequence of Lemma 3.4 in [27]:

$$\begin{aligned} D(\rho \Vert \sigma )-D(E_{\sigma *}(\rho )\Vert \sigma )=D(\rho \Vert E_{\sigma *}(\rho ))\,. \end{aligned}$$

(A.1)

We shall now give a direct proof of $(2)\Leftrightarrow (3)$.

Proof of Proposition 6

We only prove $(2)\Leftrightarrow (3)$. Note that for $X\in {{\mathcal {N}}}$, by definition $X={\mathcal {A}}_\sigma (X)$ iff $X=E_\sigma (X)$. Then let $\rho $ be a density matrix on ${{\mathcal {N}}}$ and define $X=\sigma ^{-\frac{1}{2}}\,\rho \,\sigma ^{-\frac{1}{2}}$. We have:

$$\begin{aligned} \rho ={{\mathcal {R}}}_\sigma (\rho _{\mathcal {M}})&\Leftrightarrow X={\mathcal {A}}_\sigma (X) \\&\Leftrightarrow X=E_\sigma (X) \\&\Leftrightarrow \rho = \sigma ^{\frac{1}{2}}\,X\,\sigma ^{\frac{1}{2}} = \sigma ^{\frac{1}{2}}\,E_\sigma (X)\,\sigma ^{\frac{1}{2}}=E_{\sigma *}(\sigma ^{\frac{1}{2}}\,X\,\sigma ^{\frac{1}{2}})=E_{\sigma *}(\rho )\,. \end{aligned}$$

where in the last line we use property 3 in Proposition 6. $\square $

It would be interesting to compare the two notions of “conditional” relative entropies $D(\rho \Vert \sigma )-D(\rho _{\mathcal {M}}\Vert \sigma _{\mathcal {M}})$ (introduced in [3, 10, 11]) and $D(\rho \Vert E_{\sigma *}(\rho ))$. This is the content of the following proposition.

Proposition 7

For any state $\eta \in {{\mathcal {D}}}({{\mathcal {N}}})$ such that $E_{\sigma *}(\eta )=\eta $ and any state $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$, we have

$$\begin{aligned} D(\rho \Vert \sigma )-D(\rho _{\mathcal {M}}\Vert \sigma _{\mathcal {M}})=D(\rho \Vert \eta )-D(\rho _{\mathcal {M}}\Vert \eta _{\mathcal {M}})\,, \end{aligned}$$

(A.2)

i.e. the difference of relative entropies does not depend on the choice of the invariant state for $E_\sigma $. Consequently,

$$\begin{aligned} D(\rho \Vert \sigma )-D(\rho _{\mathcal {M}}\Vert \sigma _{\mathcal {M}})\le D(\rho \Vert E_{\sigma *}(\rho ))\,. \end{aligned}$$

(A.3)

Proof

Equation (A.3) is a direct consequence of Eq. (A.2) when applied to $\eta =E_{\sigma *}(\rho )$, so we focus on the first equation (remark that it can be seen as a counterpart of Eq. (A.1) for the difference of relative entropies). To this end, we need the following state $\sigma _{\mathop {\mathrm{Tr}}\nolimits }$ defined in [1] and heavily exploited in [5]:

$$\begin{aligned} \sigma _{\mathop {\mathrm{Tr}}\nolimits }=E_{\sigma *}\Big (\frac{\mathbb {1}}{d_{{\mathcal {H}}}}\Big )\,. \end{aligned}$$

It has the property that for all $X\in {\mathcal {F}}({\mathcal {A}}_\sigma )$, $[X,\sigma _{\mathop {\mathrm{Tr}}\nolimits }]=0$ (see Lemma 3.1 in [1]). Then it is enough to prove that for all $\eta \in {{\mathcal {D}}}({{\mathcal {N}}})$ such that $E_{\sigma *}(\eta )=\eta $, we have:

$$\begin{aligned} D(\rho \Vert \sigma _{\mathop {\mathrm{Tr}}\nolimits })-D(\rho _{\mathcal {M}}\Vert (\sigma _{\mathop {\mathrm{Tr}}\nolimits })_{\mathcal {M}})=D(\rho \Vert \eta )-D(\rho _{\mathcal {M}}\Vert \eta _{\mathcal {M}})\,. \end{aligned}$$

Now any such $\eta $ can be written $\eta =X\sigma _{\mathop {\mathrm{Tr}}\nolimits }$ with $X\in {\mathcal {F}}({\mathcal {A}}_\sigma )$. Remark that by definition of ${\mathcal {F}}({\mathcal {A}}_\sigma )$, $X\in {\mathcal {M}}$ so that $E_\tau (\eta )=X E_\tau (\sigma _{\mathop {\mathrm{Tr}}\nolimits })$. Using the commutation between X and $\sigma _{\mathop {\mathrm{Tr}}\nolimits }$ and developing the RHS of the previous equation we get the result. $\square $

1.2 A.2 Davies Semigroups

Here we consider the conditional expectation associated to the Davies dynamics that was presented in Sect. 2.4. Our first result is a characterization of the fixed-point algebra in the Davies case.

Proposition 8

One has

$$\begin{aligned} {\mathcal {F}}({{\mathcal {L}}}^{\mathrm{D},\beta })=\{\sigma ^{it}\,S_\alpha \,\sigma ^{-it}\,;\,\forall \,t\ge 0,\,\forall \alpha \}'\,, \end{aligned}$$

(A.4)

where the notation $\{ \cdot \}'$ denotes the centralizer of the set.

Proof

We recall that ${\mathcal {F}}({{\mathcal {L}}}^{{\mathrm{D},\beta }})=\{S_\alpha (\omega )\}'$. Hence, since $\sigma ^{it}S_\alpha \sigma ^{-it}$ can be expressed as a linear combination of the $S_\alpha (\omega )$’s by Eq. (2.12), it directly follows that

$$\begin{aligned} {\mathcal {F}}({{\mathcal {L}}}^{{\mathrm{D},\beta }})\subseteq \{\sigma ^{it}\,S_\alpha \,\sigma ^{-it}\,;\,t\ge 0\}' \end{aligned}$$

To prove the opposite direction, we let $X\in \{\sigma ^{it}\,S_\alpha \,\sigma ^{-it}\,;\,t\ge 0\}'$. This means in particular that, for all $t\in {\mathbb {R}}$, and all $\alpha $:

$$\begin{aligned}{}[X, \sigma ^{it}S_{\alpha }\sigma ^{-it}]=\sum _{\omega }\,{rm e}^{it\omega }\,[X,S_{\alpha }(\omega )]=0\,. \end{aligned}$$

(A.5)

Since the equation holds for all $t\in {\mathbb {R}}$, we can differentiate it $N\equiv |\{\omega \}|$ times at 0 to get that, for any $0\le n\le N-1$:

$$\begin{aligned} \sum _{\omega }\,\omega ^n\,[X,S_\alpha (\omega )]=0\,. \end{aligned}$$

Using an arbitrary labelling of the N distinct frequencies $\omega _1,...,\omega _N$, the resulting N linear equations can be rewritten as

Since all the frequencies $\omega _i$ are distinct, their Vandermonde matrix is invertible. Hence, $[X,S_\alpha (\omega )]=0$ for all $\omega $, so that $X\in {\mathcal {F}}({{\mathcal {L}}}^{\mathrm{D},\beta })$. $\square $

Combining this result with a result from [13], we can finally show the result stated in Theorem 1 that the conditional expectations in the Davies and the Petz cases coincide.

Proof of Theorem 1

First, we remark that both conditional expectations are self-adjoint with respect to the $\sigma $-KMS inner product. Therefore, by uniqueness of the conditional expectation, it is enough to prove that ${\mathcal {F}}({{\mathcal {L}}}^{\mathrm{D},\beta })={\mathcal {F}}(A_{\sigma })$. The analysis of the algebra ${\mathcal {F}}(A_{\sigma })$ was carried out in [13]^{Footnote 3}. In particular, they proved (Theorem 3.3) that ${\mathcal {F}}(A_{\sigma })$ is the largest $*$-sub-algebra of ${\mathcal {M}}$ left-invariant by the modular operator. From this characterization, it is easy to see that ${\mathcal {F}}({{\mathcal {L}}}^{\mathrm{D},\beta })\subseteq {\mathcal {F}}({{\mathcal {A}}}_\sigma )$: indeed ${\mathcal {F}}({{\mathcal {L}}}^{\mathrm{D},\beta })=\{S_\alpha (\omega )\}'\subseteq \{S_\alpha \}'\equiv {\mathcal {M}}$. Moreover, for any $X\in {\mathcal {F}}({{\mathcal {L}}}^{\mathrm{D},\beta })$

$$\begin{aligned}{}[\Delta _\sigma (X),S_\alpha (\omega )]&=\sigma \,X\,\sigma ^{-1}\,S_\alpha (\omega )-S_\alpha (\omega )\sigma X\sigma ^{-1}\\&=\mathrm{e}^{-\beta \omega }\,\big ( \sigma XS_{\alpha }(\omega )\sigma ^{-1}-\sigma \,S_\alpha (\omega )X\sigma ^{-1} \big )\\&=\mathrm{e}^{-\beta \omega }\,\sigma \,[X,S_\alpha (\omega )]\,\sigma ^{-1}\\&=0. \end{aligned}$$

It remains to show that any $*$-sub-algebra $\mathcal {V}$ of ${\mathcal {M}}$ which is invariant by $\Delta _\sigma $ is contained in ${\mathcal {F}}({{\mathcal {L}}}^{\mathrm{D},\beta })$. This directly follows from (A.4): since for any $X\in \mathcal {V}$, $\Delta (X)\in \mathcal {V}$, we have that

$$\begin{aligned}{}[X,\sigma ^{it}\,S_\alpha \,\sigma ^{-it}]&=X\,\sigma ^{it}\,S_\alpha \,\sigma ^{-it}-\sigma ^{it}\,S_\alpha \,\sigma ^{-it}X\\&=\sigma ^{it}\Delta _{\sigma }^{-it}(X)S_\alpha \sigma ^{-it}-\sigma ^{it}S_\alpha \Delta _{\sigma }^{-it}(X)\sigma ^{-it}\\&=\sigma ^{it}\,[\Delta _{\sigma }^{-it}(X),S_\alpha ]\,\sigma ^{-it}\\&=0\, \end{aligned}$$

and the result follows. $\square $

B Proofs

1.1 B.1 Proof of Proposition 2

Proof of Proposition 2

The proof of this result relies on the Holley-Stroock perturbative argument for the Lindblad relative entropy proved in [27]. This entropic distance is defined for two positive semi-definite operators $X,Y\in {{\mathcal {B}}}({{\mathcal {H}}})$ such that Y is full rank as

$$\begin{aligned} D_{\mathrm{Lin}}(X\Vert Y):=\mathop {\mathrm{Tr}}\nolimits [X(\log X-\log Y)]-\mathop {\mathrm{Tr}}\nolimits [X]+\mathop {\mathrm{Tr}}\nolimits [Y]\,. \end{aligned}$$

Next, we use a direct adaptation of the proof of Proposition 4.2 in [27] in order to relate the Lindblad relative entropies $D_{\mathrm{Lin}}(\rho \Vert E_{*}^{\mathcal {M}}(\rho ))$ and $D_{\mathrm{Lin}}(\rho \Vert E_{*}^{(0),{\mathcal {M}}}(\rho ))$. More precisely, we have that for any positive, semidefinite operators X and Y,

$$\begin{aligned}&\frac{1}{\lambda _{\max }(\sigma )\,d_{{\mathcal {H}}}} \,D_{\mathrm{Lin}}(\Gamma _{\sigma }(X)\Vert E_{*}^{{\mathcal {M}}}(\Gamma _\sigma (Y)) )\nonumber \\&\quad \le D_{\mathrm{Lin}}(X\Vert E_{*}^{(0),{\mathcal {M}}}(Y))\nonumber \\&\quad \le \frac{1}{\lambda _{\min }(\sigma )d_{{{\mathcal {H}}}}}\,D_{\mathrm{Lin}}(\Gamma _{\sigma }(X)\Vert E_{*}^{{\mathcal {M}}}(\Gamma _\sigma (Y)) )\,, \end{aligned}$$

(B.1)

where $\lambda _{\min }(\sigma )$, resp. $\lambda _{\max }(\sigma )$, denotes the smallest, resp. largest, eigenvalue of the state $\sigma $. In words, the proof of [27, Proposition 4.2] consists in the observation that the conditional expectations $E^{(0),{\mathcal {M}}}$ and $E^{{\mathcal {M}}}$ are related via $d_{{\mathcal {H}}}E^{{\mathcal {M}}}_*=\Gamma _\sigma ^{-1}E^{(0),{\mathcal {M}}}_*$, together with the monotonicity of $D_{\mathrm{Lin}}$ under completely positive, trace non-increasing maps. Analogous inequalities hold for $E_1$ and $E_2$. Similarly to what is done for classical spin systems in [32], the previous inequality can be rewritten in the following way. Consider the generalization of the relative entropy for $X=\Gamma _\sigma ^{-1}(\rho )$ given by:

$$\begin{aligned} \mathrm{Ent}_{1,{\mathcal {M}}}(X):=D(\rho \Vert E_*^{\mathcal {M}}(\rho ))\,, \end{aligned}$$

with analogous expressions for ${{\mathcal {N}}}_1$ and ${{\mathcal {N}}}_2$ with their respective conditional expectations. Then, we can express this relative entropy as an infimum over $D_{\mathrm{Lin}}$. Indeed, Lemma 3.4 in [27] states that for all full-rank positive semi-definite $Y\in {\mathcal {M}}$,

$$\begin{aligned} D_{\mathrm{Lin}}(\rho \Vert \Gamma _{\sigma }(Y))=D_{\mathrm{Lin}}(\rho \Vert E_*^{\mathcal {M}}(\rho ))+D_{\mathrm{Lin}}(E_*^{\mathcal {M}}(\rho )\Vert \Gamma _{\sigma }(Y))\,. \end{aligned}$$

(B.2)

It shows in particular that $D_{\mathrm{Lin}}(\rho \Vert \Gamma _{\sigma }(Y))\ge D_{\mathrm{Lin}}(\rho \Vert E_*^{\mathcal {M}}(\rho ))=\mathrm{Ent}_{1,{\mathcal {M}}}(X)$, with equality for $Y=E^{\mathcal {M}}(X)$. Thus, we obtain

$$\begin{aligned} \mathrm{Ent}_{1,{\mathcal {M}}}(X)=\underset{Y\in {\mathcal {M}}\,,Y>0}{\inf }\,D_{\mathrm{Lin}}(\rho \Vert \Gamma _{\sigma }(Y))\, \end{aligned}$$

(B.3)

and optimizing over all Y we can rewrite Eq. (B.1) as

$$\begin{aligned} \frac{1}{\lambda _{\max }(\sigma )\,d_{{\mathcal {H}}}} \,\mathrm{Ent}_{1,{\mathcal {M}}}(X)\le D(\rho \Vert E_{*}^{(0),{\mathcal {M}}}(\rho ))\le \frac{1}{\lambda _{\min }(\sigma )d_{{{\mathcal {H}}}}}\,\mathrm{Ent}_{1,{\mathcal {M}}}(X)\,. \end{aligned}$$

Finally, using the approximate tensorization at infinite temperature and rearranging the terms leads to the result:

$$\begin{aligned} \mathrm{Ent}_{1,{\mathcal {M}}}(X) \le \frac{\lambda _{\max }(\sigma )}{\lambda _{\min }(\sigma )}\,\big ( \mathrm{Ent}_{1,{{\mathcal {N}}}_1}(X)+\mathrm{Ent}_{1,{{\mathcal {N}}}_2}(X) \big ) +\lambda _{\max }(\sigma )\,d_{{\mathcal {H}}}\,d\,. \end{aligned}$$

$\square $

1.2 B.2 Proof of Proposition 3

Proof of Proposition 3

We first proceed by proving the bound $d\le d_1+d_2$. For all $\rho \in {{\mathcal {D}}}({{\mathcal {N}}})$, we can use the chain rule on the max-relative entropy to obtain:

$$\begin{aligned}&D_{\max }\big (E_{1*}\circ E_{2*}(\rho )\Vert E_{1*}\circ E_{2*}(\eta )\big ) \\&\quad \le D_{\max }\big (E_{1*}\circ E_{2*}(\rho )\Vert E_{1*}\circ E_{2*}(\mathcal {P}_{\mathcal {M}}(\rho ))\big )\\&\qquad +D_{\max }\big (E_{1*}\circ E_{2*}(\mathcal {P}_{\mathcal {M}}(\rho ))\Vert E_{1*}\circ E_{2*}(\eta )\big )\\&\quad \le d_1 + D_{\max }\big (\mathcal {P}_{\mathcal {M}}(\rho )\Vert \eta \big )\,, \end{aligned}$$

where the second inequality follows from the data processing inequality for $D_{\max }$. Then

$$\begin{aligned} D_{\max }\big (\mathcal {P}_{\mathcal {M}}(\rho )\Vert \eta \big )\le \max _{i\in I_{\mathcal {M}}}D_{\max }\big (\mathcal {P}_{\mathcal {M}}(\rho )^{(i)}\Vert \eta ^{(i)}\big )\,, \end{aligned}$$

where we write $A^{(i)}:=P_i\,A\,P_i$ for any $A\in {{\mathcal {B}}}({{\mathcal {H}}})$. This last $D_{\max }$ is exactly $I_{\max }\big ( {{\mathcal {H}}}_i:{{\mathcal {K}}}_i \big )_{\rho ^{(i)}}$ after minimizing on $\eta $. We are left with proving the two separate bounds on $d_1$ and $d_2$ respectively. The first bound is a simple consequence of the data processing inequality for $D_{\max }$ and the Pinching inequality. The second bound is a consequence of Lemma B.7 in [8]. $\square $

1.3 B.3 Proofs of Lemma 2 and Proposition 4

Before proving Lemma 2, we need to prove a technical lemma.

Lemma 3

Given a conditional expectation $E:{{\mathcal {N}}}\rightarrow {\mathcal {M}}\subset {{\mathcal {N}}}\subset {{\mathcal {B}}}({{\mathcal {H}}})$ that is invariant with respect to two different full-rank states, $\rho $ and $\sigma $, the following holds:

$$\begin{aligned} \Gamma _\rho ^{1/2}\circ E\circ \Gamma _{\rho }^{-1/2}=\Gamma _\sigma ^{1/2}\circ E\circ \Gamma _\sigma ^{-1/2} \end{aligned}$$

Proof of Lemma 3

Since we are in finite dimension, the von Neumann algebra ${\mathcal {M}}$ takes the following form:

$$\begin{aligned} {\mathcal {M}}=\bigoplus _i\,{{\mathcal {B}}}({{\mathcal {H}}}_i)\otimes {\mathbb {1}}_{{{\mathcal {K}}}_i}\,, \end{aligned}$$

for some decomposition ${{\mathcal {H}}}:=\bigoplus _i\,{{\mathcal {H}}}_i\otimes {{\mathcal {K}}}_i$ of ${{\mathcal {H}}}$. Therefore, since $\rho $ and $\sigma $ are invariant stats of E, they can be decomposed as follows:

$$\begin{aligned} \rho =\bigoplus _{i}\, \rho _i\otimes \tau _i\,,~~~ \sigma =\bigoplus _{i}\, \sigma _i\otimes \tau _i\,, \end{aligned}$$

for given positive definite operators $\sigma _i$, $\rho _i$ and where $\tau _i$ is given by ${\mathbb {1}}_{\mathcal {K}_i}/d_{\mathcal {K}_i}$. Hence,

$$\begin{aligned} \rho ^{-1/4}\sigma ^{1/4}=\bigoplus _i\,\rho _i^{-1/4}\sigma _i^{1/4}\otimes {\mathbb {1}}_{{{\mathcal {K}}}_i}\in {{\mathcal {N}}}. \end{aligned}$$

Then, it is clear that the following string of identities hold for all $Y\in {{\mathcal {B}}}({{\mathcal {H}}})$:

$$\begin{aligned}&\rho ^{-1/4}\,\sigma ^{1/4}\,E\big [\sigma ^{-1/4}\rho ^{1/4}\,Y\,\rho ^{1/4}\sigma ^{-1/4}\big ]\,\sigma ^{1/4}\rho ^{-1/4}\\&\quad =E\big [\rho ^{-1/4}\,\sigma ^{1/4}\sigma ^{-1/4}\rho ^{1/4}\,Y\,\rho ^{1/4}\sigma ^{-1/4}\sigma ^{1/4}\rho ^{-1/4}\big ]\\&\quad =E[Y]\,. \end{aligned}$$

The result follows after choosing $Y=\rho ^{-1/4}X\rho ^{-1/4}$. $\square $

Now we can proceed to the Proof of Lemma 2.

Proof of Lemma 2

We begin with proving that the property of strong ${\mathbb {L}}_2$ clustering of correlations is independent of the invariant state, thanks to Lemma 3. Indeed, if we choose $Y:= \Gamma _\sigma ^{-1/2}(X)$ and call $X':= \Gamma _{\sigma '}^{1/2}(Y)$, it is clear that

$$\begin{aligned} \left\| X \right\| _{{\mathbb {L}}_2(\sigma )}^2= \left\| Y \right\| _2^2 \; \; \text { and } \; \;\left\| Y \right\| _2^2 = \left\| X' \right\| _{{\mathbb {L}}_2(\sigma ')}^2. \end{aligned}$$

Therefore, we have the following chain of identities:

$$\begin{aligned}&\sup _{X\in {{\mathcal {N}}}} \, \frac{\mathrm{Cov}_{{\mathcal {M}},\,\sigma }(E_1[X],E_2[X])}{\Vert X\Vert _{{\mathbb {L}}_2(\sigma )}^2}\\&\quad = \sup _{X\in {{\mathcal {N}}}} \, \frac{\langle X,\,E_1\circ E_2[X]-E^{\mathcal {M}}[X] \rangle _{\sigma } }{\Vert X\Vert _{{\mathbb {L}}_2(\sigma )}^2}\\&\quad =\sup _{Y\in {{\mathcal {N}}}} \, \frac{\langle \Gamma _{\sigma }^{-1/2}(X),\,E_1\circ E_2[\Gamma _{\sigma }^{-1/2}(X)]-E^{\mathcal {M}}[\Gamma _{\sigma }^{-1/2}(X)] \rangle _{\sigma } }{\Vert Y\Vert _{2}^2}\\&\quad =\sup _{Y\in {{\mathcal {N}}}}\, \frac{\langle X,\,\Gamma _{\sigma }^{1/2}(E_1\circ E_2-E^{\mathcal {M}})[\Gamma _{\sigma }^{-1/2}(X)] \rangle _{\mathrm{HS}} }{\Vert Y\Vert _{2}^2}\\&\quad =\sup _{Y\in {{\mathcal {N}}}} \, \frac{\langle \Gamma _{\sigma '}^{-1/2}(X),\,E_1\circ E_2[\Gamma _{\sigma '}^{-1/2}(X)]-E^{\mathcal {M}}[\Gamma _{\sigma '}^{-1/2}(X)] \rangle _{\sigma '} }{\Vert Y\Vert _{2}^2}\\&\quad =\sup _{X'\in {{\mathcal {N}}}} \, \frac{\mathrm{Cov}_{{\mathcal {M}},\,\sigma }(E_1[X'],E_2[X'])}{\Vert X'\Vert _{{\mathbb {L}}_2(\sigma )}^2}\,, \end{aligned}$$

where we have used Lemma 3 in the fourth line. $\square $

In a different direction, we can also provide the Proof of Proposition 4.

Proof of Proposition 4

Point 1. is straigthforward so we focus on point 2. As already mentioned, strong $ {\mathbb {L}}_2$ clustering implies $\mathrm{cond}{\mathbb {L}_2}(c_2)$, so we only need to prove the other implication. Now assume that $\mathrm{cond}{\mathbb {L}_2}(c_2)$ holds with a constant $c_2$ and take $X\in {{\mathcal {D}}}({{\mathcal {N}}})$. We write $T=E_1\circ E_2-E^{\mathcal {M}}$. Remark that, according to the decomposition of ${\mathcal {M}}$ given in Eq. (3.6) and exploiting Eq. (3.16), T acts on X as:

$$\begin{aligned} T(X)=\sum _{i\in I_{\mathcal {M}}}\,\left( {\mathbb {1}}_{{{\mathcal {B}}}({{\mathcal {H}}}_i)}\otimes T^{(i)}\right) (P_i\,X\,P_i)\,, \end{aligned}$$

(B.4)

where $T^{(i)}$ acts on ${{\mathcal {B}}}({{\mathcal {K}}}_i)$ and where the $P_i$ are the orthogonal projections on ${{\mathcal {H}}}_i\otimes {{\mathcal {K}}}_i$.

Consider now the Hilbert-Schmidt decomposition of $P_iXP_i$ with respect to $({{\mathcal {B}}}({{\mathcal {H}}}_i),\langle \cdot , \cdot \rangle _{\sigma _i})$ and $({{\mathcal {B}}}({{\mathcal {K}}}_i),\langle \cdot , \cdot \rangle _{\tau _i})$:

$$\begin{aligned} P_i\,X\,P_i=\sum _\alpha \,f_\alpha ^{(i)}\otimes g_\alpha ^{(i)}\,. \end{aligned}$$

Thus we have

$$\begin{aligned} \Vert P_i\,X\,P_i\Vert _{{\mathbb {L}}_2(\sigma _i\otimes \tau _i)}^2=\sum _\alpha \,\Vert f_\alpha ^{(i)}\Vert _{{\mathbb {L}}_2(\sigma _i)}^2\,\Vert g_\alpha ^{(i)}\Vert _{{\mathbb {L}}_2(\tau _i)}^2\,, \end{aligned}$$

and therefore

$$\begin{aligned} \Vert T(X)\Vert _{{\mathbb {L}}_2(\sigma _i\otimes \tau _i)}^2&= \sum _{i\in I_{\mathcal {M}}}\,\left\| \left( {\mathbb {1}}_{{{\mathcal {B}}}({{\mathcal {H}}}_i)}\otimes T^{(i)}\right) (P_i\,X\,P_i)\right\| _{{\mathbb {L}}_2(\sigma _i\otimes \tau _i)}^2 \\&= \sum _{i\in I_{\mathcal {M}}}\,\left\| \sum _\alpha f_\alpha ^{(i)}\otimes T^{(i)}(g_\alpha ^{(i)})\right\| _{{\mathbb {L}}_2(\sigma _i\otimes \tau _i)}^2 \\&= \sum _{i\in I_{\mathcal {M}}}\,\sum _\alpha \Vert f_\alpha ^{(i)}\Vert _{{\mathbb {L}}_2(\sigma _i)}^2\,\Vert T^{(i)}(g_\alpha ^{(i)})\Vert _{{\mathbb {L}}_2(\tau _i)}^2 \\&\le c_2 \left\| \sum _{i\in I_{\mathcal {M}}}\,P_i\,X\,P_i\right\| _{{\mathbb {L}}_2(\sigma )}^2 \\&\le c_2 \Vert X\Vert _{{\mathbb {L}}_2(\sigma )}^2\,, \end{aligned}$$

where in the third line we use that $(f_\alpha ^{(i)})_\alpha $ is an orthogonal family for every $i\in I_{\mathcal {M}}$. This shows that

$$\begin{aligned} \Vert E_1\circ E_2-E^{\mathcal {M}}\,:\,L_2(\sigma )\rightarrow L_2(\sigma )\Vert \le c_2\,, \end{aligned}$$

which is equivalent to strong $L_2$ clustering. $\square $

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bardet, I., Capel, Á. & Rouzé, C. Approximate Tensorization of the Relative Entropy for Noncommuting Conditional Expectations. Ann. Henri Poincaré 23, 101–140 (2022). https://doi.org/10.1007/s00023-021-01088-3

Download citation

Received: 01 June 2020
Accepted: 14 July 2021
Published: 31 July 2021
Issue Date: January 2022
DOI: https://doi.org/10.1007/s00023-021-01088-3

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Approximate Tensorization of the Relative Entropy for Noncommuting Conditional Expectations

Abstract

Similar content being viewed by others

Probability structures in subspace lattice approach to foundations of quantum theory

Calculus for Non-Compatible Observables, Construction Through Conditional States

On natural density, orthomodular lattices, measure algebras and non-distributive $$L^p$$ spaces

1 Introduction

2 Notations and Definitions

2.1 Basic Notations

2.2 Entropic Quantities and \({\mathbb {L}}_p\) Spaces

2.3 Conditional Expectations

Definition 1

Proposition 1

2.4 Two Examples of Classes of Conditional Expectations

2.4.1 Conditional Expectations Generated by a Petz Recovery Map

2.4.2 Conditional Expectations Coming from Davies Semigroups

Theorem 1

3 Weak Approximate Tensorization of the Relative Entropy

Definition 2

Remark 1

3.1 A Technical Lemma

Lemma 1

Proof

Corollary 1

Proof

Remark 2

3.2 Approximate Tensorization via Noncommutative Change of Measure

Proposition 2

3.3 Approximate Tensorization via Pinching Map

Theorem 2

Proof

Remark 3

Proposition 3

3.4 Clustering of Correlations

Definition 3

Lemma 2

Remark 4

Proposition 4

4 Applications

4.1 Modified Logarithmic Sobolev Inequalities for Biased Bases

4.2 Tightened Entropic Uncertainty Relations

Example 1

Corollary 2

Proof

Corollary 3

5 Lattice Spin Systems with Commuting Hamiltonians

5.1 Davies Generators on Lattice Spin Systems

Proposition 5

Proof

5.2 Classical Hamiltonian Over Quantum Systems

Theorem 3

Proof

6 Outlook

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

A Conditional Expectations on Fixed-points of Markovian Evolution

1.1 A.1 Conditional Expectations Generated by a Petz Recovery Map

Example 2

Proposition 6

Proof of Proposition 6

Proposition 7

Proof

1.2 A.2 Davies Semigroups

Proposition 8

Proof

Proof of Theorem 1

B Proofs

1.1 B.1 Proof of Proposition 2

Proof of Proposition 2

1.2 B.2 Proof of Proposition 3

Proof of Proposition 3

1.3 B.3 Proofs of Lemma 2 and Proposition 4