Abstract
We extend three related results from the analysis of influences of Boolean functions to the quantum setting, namely the KKL theorem, Friedgut’s Junta theorem and Talagrand’s variance inequality for geometric influences. Our results are derived by a joint use of recently studied hypercontractivity and gradient estimates. These generic tools also allow us to derive generalizations of these results in a general von Neumann algebraic setting beyond the case of the quantum hypercube, including examples in infinite dimensions relevant to quantum information theory such as continuous variables quantum systems. Finally, we comment on the implications of our results as regards to noncommutative extensions of isoperimetric type inequalities, quantum circuit complexity lower bounds and the learnability of quantum observables.
Similar content being viewed by others
1 Introduction
Let \(\Omega _n=\{-1,1\}^n\) be the Boolean hypercube equipped with the uniform probability measure \(\mu _n\). For any \(f:\Omega _n\rightarrow \mathbb {R}\), we denote by \(\textrm{Var}(f)=\textrm{Var}_{\mu _n}(f)\) its variance, i.e. \(\textrm{Var}(f)=\mathbb {E}|f-\mathbb {E}f|^2\). For each \(1\le j\le n\), the influence of the j-th variable on f is given by
where \(f^{\oplus j}(x)=f(x^{\oplus j})\) and \(x^{\oplus j}\) denotes the vector in \(\Omega _n\) obtained by flipping the j-th variable, that is, for \(x=(x_1,\dots , x_n)\),
The notion of influences appears naturally in many contexts ranging from isoperimetric inequalities [KMS12, CEL12], threshold phenomena in random graphs [FK96], cryptography [LMN93], etc. For these reasons, the last three decades witnessed an extensive study of their properties, which led to many applications in theoretical computer science (hardness of approximation [DS05, Hs01] and learning theory [OS07]), percolation theory [BKS99], social choice theory [Mos12, BOL85] to cite a few.
Karpovsky [Kar76] proposed the sum of the influences (also called total influence),
as a measure of complexity of a function f. This first intuition was then made rigorous in [LMN93] and [Bop97] where tight circuit complexity lower bounds in terms of the total influence were derived for the complexity class \({\text {AC}}^0\) of constant depth circuits. A simple lower bound on \({\text {Inf}}f\) in terms of the variance can be derived from Poincaré inequality: For all \(f:\Omega _n\rightarrow {\mathbb {R}}\) one has [O’D14, Chapter 2]
Functions on the hypercubes \(\Omega _n\) that take only values in \(\{-1,1\}\) are of particular interest. These are the so-called Boolean functions and play important roles in social science, combinatorics, computer sciences and many other areas. See [dW08, O’D14] for more information. Note that the \(L^p\)-norms, \(1\le p<\infty \), of Boolean functions are always equal to 1, where the weighted \(L^p\)-norm of a function \(f:\Omega _n\rightarrow \mathbb {R}\) is defined as
A Boolean function \(f:\Omega _n\rightarrow \{-1,1\}\) is said to be balanced if \(\mathbb {E}f=0\). If f is a Boolean function, the influence of the j-th variable can further be expressed as
The Poincaré inequality (1.1) implies that there exists \(j\in \{1,\dots ,n\}\) such that \({\text {Inf}}_j f\ge 1/n\). Note that Poincaré inequality (1.1) can be tight, e.g. for balanced Boolean function \(f(x)=x_1.\) So it may happen that the total influence \(\approx \) variance. Is it possible that all the influences are small simultaneously, that is, \({\text {Inf}}_j(f)\approx \textrm{Var}(f)/n\) for all \(1\le j\le n\)? Quite surprisingly, the answer is negative; a celebrated result of Kahn, Kalai and Linial [KKL88] predicts that every balanced Boolean function has an influential variable. More precisely, Kahn, Kalai and Linial [KKL88] proved that for any balanced Boolean function f on \(\Omega _n\), there exists \(1\le j\le n\) such that
where \(C>0\) is some universal constant. So some variable has an influence at least \(\Omega (\log (n)/n)\), which is larger than the order 1/n deduced from Poincaré inequality.
This theorem of Kahn, Kalai and Linial (KKL in short) plays a fundamental role in Boolean analysis. It was further strengthened by Talagrand [Tal94] and Friedgut [Fri98] in different directions.
In his celebrated paper [Tal94], Talagrand proved that for all \(n\ge 1\) and \(f:\Omega _n\rightarrow \mathbb {R}\), we have for some universal \(C>0\) that
where \(D_j f(x):=\frac{1}{2}(f(x)-f(x^{\oplus j}))\). Note that if f is Boolean, then \(D_j f\) takes values only in \(\{-1,0,1\}\), so that \(\Vert D_j f \Vert _1=\Vert D_j f\Vert _2^2= {\text {Inf}}_j f\). Therefore, this inequality of Talagrand (1.4), as an improvement of Poincaré inequality (1.1), immediately implies the result of KKL. There are plenty of extensions of Talagrand’s inequality (1.4) [OW13a, OW13b, CEL12], which has become a central tool in theoretical computer science [O’D14]. Moreover, it provides a powerful tool to study sub-diffusive and super-concentration phenomena [BKS03, BKS99, Cha14, GS15, ADH17, Sos18, Tan20] ubiquitous to many models studied in modern probability theory (percolation, random matrices, spin glasses, etc.); see the review articles [CEL12, Led19] and references therein for more details.
Also related to the KKL theorem, Friedgut’s Junta theorem [Fri98] states that a Boolean function with a bounded total influence essentially depends on few coordinates. More precisely, a Boolean function \(f:\Omega _n\rightarrow \{-1,1\}\) is called a k-junta, for \(k\in \{1,\dots ,n\}\) independent of n, if it depends on at most k coordinates. When \(k=1\), the function is called a dictatorship. If f is a junta, it is an immediate consequence that the total influence does not depend on n, i.e. \({\text {Inf}}f=\mathcal {O}(1)\). Friedgut’s Junta theorem provides the following converse statement: for any Boolean function \(f:\Omega _n\rightarrow \{-1,1\}\) and \(\varepsilon >0\), there exists a k-junta \(g:\Omega _n\rightarrow \{-1,1\}\) such that
Since its discovery, Friedgut’s Junta theorem has found many applications in random graph theory and the learnability of monotone Boolean functions [OS07].
Judging from the range of applicability of these results, it is natural to consider their extensions to noncommutative or quantum settings. Partial results in this direction were obtained by Montanaro and Osborne [MO10a]. There, Boolean functions on the hypercube \(\Omega _n\) were replaced by quantum Boolean functions on n qubits, that is, operators \(A\in M_2(\mathbb {C})^{\otimes n}\) acting on the n-fold tensor product of \(\mathbb {C}^2\) with the additional conditions that \(A=A^*\) and \(A^2=\textbf{1}\). Here and in what follows, \(M_k(\mathbb {C})\) denotes the k-by-k complex matrix algebra. Then, the \(L^2\)-influence of A in j-th coordinate is defined as \({\text {Inf}}_j^2 A:= \Vert d_j A\Vert _2^2\), where we used \(d_j\) to denote the quantum analogue of the bit-flip map
with \(\mathbb { I}\) being the identity map over \(M_2(\mathbb {C})\), and replaced the normalized \(L^p\)-norm on \(\Omega _n\) by the normalized Schatten-p norm on \(M_2(\mathbb {C})^{\otimes n}\). The quantum influence has already found interesting applications to quantum complexity theory [BGJ+22]. In this framework, Montanaro and Osborne [MO10a, Proposition 11.1] proved a quantum analogue of Talagrand’s inequality (1.4). However, this does not yield a quantum KKL as in the classical setting since we do not have the identity \(\Vert d_j A\Vert _1=\Vert d_j A\Vert ^2_2\) for general quantum Boolean functions. In the worst case, we may even have \(\Vert d_j A\Vert _1=\Vert d_j A\Vert _2\) ( j is a bad influence according to [MO10a, Definition 11.2]) and thus (1.4) will not help anymore. For this reason, the problem of whether every balanced quantum Boolean function has an influential variable still remains open; see [MO10a] for some partial results and more discussions.
In fact, the observation that \(\Vert d_j A\Vert _1\ne \Vert d_j A\Vert ^2_2\) is not exclusive to the quantum setting, and also arises for instance when considering extensions of the setup of Boolean functions on the hypercubes to functions on smooth manifolds, after replacing the uniform distribution on \(\Omega _n\) by an appropriate finite measure, and the discrete derivatives \(D_j\) by the partial derivatives associated to the differential structure of the manifold. In this setting, analogues of the previous results were recently obtained for the \(L^1\)-influences \({\text {Inf}}^1_j A:=\Vert d_j A\Vert _1\), which is sometimes called geometric influence for its relation to isoperimetric inequalities [KMS12, CEL12, Aus16, Bou17].
In this paper, we propose to take the above considerations as a starting point for establishing quantum analogues of (1.1), (1.3), (1.4) and (1.5) based on the \(L^1\)-influences. Our first main result (Theorems 3.2 and 3.6) states that for any self-adjoint operator A on n qubits with \(\Vert A\Vert \le 1\) we have
for some universal \(C>0\), where \(\log ^+\) refers to the positive part of the logarithm. In particular, this suggests that every balanced quantum Boolean function has a variable that has geometric influence at least of the order \(\log (n) /n\). We also prove a quantum \(L^1\)-Poincaré inequality (Theorem 3.1): for any operator A on n qubits we have
Therefore our result provides an alternative answer to the quantum KKL conjecture [MO10a, Conjecture 3 of Section 12] in terms of geometric influences (Theorem 3.9). The inequality (1.6) is inspired by some results in the classical setting; see for example [KMS12, CEL12]. Since (1.6) will be our main focus, rather than (1.4), to distinguish them in the sequel, we shall refer to (1.6) as (\(L^1\)-)Talagrand’s inequality, and (1.4) as Talagrand’s \(L^1\)-\(L^2\) variance inequality as did in [Led19]. We also have a qubit isoperimetric type inequality and a stronger form of \(L^1\)-Poincaré (1.7); see Sect. 6.4 below.
Our second main result is a quantum analogue of Friedgut’s Junta theorem (Theorem 3.11 and Corollary 3.12): for any quantum Boolean function \(A\in M_2(\mathbb {C})^{\otimes n}\) and \(\varepsilon >0\) there exists another quantum Boolean function \(B\in M_2(\mathbb {C})^{\otimes n}\) that is supported on k subsystems such that
where \({\text {Inf}}^p(A):=\sum _{j=1}^n\,{\text {Inf}}_j^p(A)\) with \({\text {Inf}}^p_j(A)=\Vert d_j A\Vert _p^p\).
The proofs of Equations (1.6) and (1.8) make use of recent noncommutative generalizations of hypercontractive inequalities and gradient estimates [OZ99, MO10a, KT13, TPK14, CM17a, DR20, BDR20, WZ21, GR21, Bei21]. Moreover, the generality of these tools also allows us to further extend most of our results to the abstract von Neumann algebraic setting which contains both our previously stated results and their classical analogues previously established in [CEL12, Bou17], but also other extensions arising in noncommutative analysis and quantum information with discrete and continuous variables. As for their classical analogues, we expect our results to find many new applications to quantum information and quantum computation.
The rest of the paper is organized as follows: in Sect. 2, we recall useful definitions and results from the Fourier analysis on the quantum Boolean hypercubes including Poincaré inequality, hypercontractivity, intertwining and gradient estimates. Section 3 is devoted to the statement and proof of our main results, namely a quantum \(L^1\)-Poincaré inequality (Theorem 3.1), quantum Talagrand inequality (Theorem 3.2), and quantum KKL theorem (Theorem 3.9) and a quantum Friedgut’s Junta theorem (Theorem 3.11 and Corollary 3.12). These results are then extended to the general von Neumann algebraic setting in Sect. 4. Finally, examples and applications to quantum circuit complexity and quantum learning theory are provided in Sects. 5 and 6.
2 Quantum Boolean Analysis
Let us start by recapitulating the framework of quantum Boolean functions from [MO10a]. As a quantum analogue of functions on the Boolean hypercubes, i.e., functions of n bits, we will take observables on n qubits. In other words, our algebra of observables is \( M_2(\mathbb {C})^{\otimes n}\cong M_{2^n}(\mathbb {C})\) endowed with the operator norm \(\Vert \cdot \Vert \). In what follows, we denote by \({\text {tr}}\) the trace in \(M_2(\mathbb {C})^{\otimes n}\), and by \({\text {tr}}_T\) the partial trace with respect to any subset T of qubits. Following [MO10a, Definition 3.1], we say \(A\in M_2(\mathbb {C})^{\otimes n}\) is a quantum Boolean function if \(A=A^*\) and \(A^2=\textbf{1}\). Here and in what follows, \(\textbf{1}\) always denotes the identity operator. A quantum Boolean function A is balanced if \({\text {tr}}(A)=0\).
One pillar of analysis on the Boolean hypercube is that every function \(f:\Omega _n\rightarrow \mathbb {R}\) has the Fourier–Walsh expansion, i.e. can be expressed as a linear combination of characters. Our quantum analogues of the characters for 1 qubit are the Pauli matrices
Clearly, these are quantum Boolean functions, and they form a basis of \(M_2(\mathbb {C})\). For \(s=(s_1,\dots , s_n)\in \{0,1,2,3\}^n\), we put
These are again quantum Boolean functions, and form a basis of \(M_2(\mathbb {C})^{\otimes n}\). Accordingly, every \(A\in M_2(\mathbb {C})^{\otimes n}\) can uniquely be expressed as
where \({\widehat{A}}_s\in {\mathbb {C}}\) is the Fourier coefficient. Given \(s\in \{0,1,2,3\}^n\), we call the set of indices j such that \(s_j\ne 0\) the support of s, and denote it by \({\text {supp}}(s)\). Its cardinality is denoted by \(|{\text {supp}}(s)|\). Similarly, the support of A is defined by
and its cardinality is denoted by \(|{\text {supp}}(A)|\). In analogy with the classical setting, an arbitrary operator \(A\in M_2(\mathbb {C})^{\otimes n}\) is called a k-junta if \(|{\text {supp}}(A)|\le k\). As the Pauli matrices are orthonormal with respect to the normalized Hilbert–Schmidt inner product, the coefficients \({\widehat{A}}_s\) can be recovered by
Note that whenever A is self-adjoint, the coefficients \(\widehat{A}_s\) must be real. The quantum analogue of the bit-flip map is given by
Here \({\mathbb {I}}\) denotes the identity map on \(M_2(\mathbb {C})\). Note that \({\mathcal {L}}_0:={\mathbb {I}}-\frac{1}{2}\textrm{tr}\) satisfies \({\mathcal {L}}_0^2={\mathcal {L}}_0\), so that \(d_j^2=d_j\).
For \(p\ge 1\), we denote by \({\text {Inf}}^p_j(A):=\Vert d_j A\Vert _p^p\) the \(L^p\)-influence of j on the operator \(A\in M_2(\mathbb {C})^{\otimes n}\), and by \({\text {Inf}}^p (A):=\sum _{j=1}^{n}{\text {Inf}}_j^p(A)\) the associated total \(L^p\)-influence, where the normalized Schatten-p norm of an operator \(A\in M_2(\mathbb {C})^{\otimes n}\) is defined as (\(|A|:=(A^*A)^{1/2}\))
The \(L^1\)-influence is also called the geometric influence. For the \(L^2\)-influence we have
with \({\mathcal {L}}:=\sum _{j=1}^n d_j\). The operator \(\mathcal {L}\) is the generator of the tensor product of the quantum depolarizing semigroups \((P_t)_{t\ge 0}\) for the individual qubits:
It is a tracially symmetric quantum Markov semigroup, whose general properties are discussed in Sect. 4.
In the Fourier decomposition, we have the following convenient expressions for the \(L^2\)-influence:
and the semigroup \(P_t\):
In addition we need the following further facts:
Lemma 2.1
(Poincaré inequality, see Proposition 10.9 of [MO10a]). For all \(A\in M_2(\mathbb {C})^{\otimes n}\) such that \({\text {tr}}(A)=0\) and \(t\ge 0\), one has
This inequality is equivalent to
and is also equivalent to
Lemma 2.2
(Hypercontractivity, see Theorem 8.4 of [MO10a]). For all \(A\in M_2(\mathbb {C})^{\otimes n}\), \(t\ge 0\) and \(p=p(t)= 1+e^{-2t}\) one has
Lemma 2.3
(Intertwining). For all \(j\in \{1,\dots ,n\}\) and \(t\ge 0\) one has
Proof
Follows easily from the definitions of \(P_t\) and \(d_j\). \(\square \)
We denote by \(\Gamma : M_2(\mathbb {C})^{\otimes n}\rightarrow M_2(\mathbb {C})^{\otimes n}\) the carré du champ operator associated to \(P_t=e^{-t{\mathcal {L}}}\) which is defined via:
Lemma 2.4
(Gradient estimate [JZ15, WZ21]). For any \(A\in M_2(\mathbb {C})^{\otimes n}\) and all \(t\ge 0\),
We close this section by remarking that classical Boolean functions are special quantum Boolean functions. In fact, the Fourier–Walsh expansions of classical Boolean functions correspond to (2.1) when restricting \(s\in \{0,3\}^n\) .
3 Main Results for Quantum Boolean Functions
In this section we state and prove our main results in the restricted setting of the quantum Boolean cube.
3.1 A quantum \(L^1\)-Poincaré inequality
We start with the following \(L^1\)-Poincaré type inequality; see also [DPMRF23] for variations of this inequality and Sect. 6.4 for a stronger form.
Theorem 3.1
For all \(A\in M_2(\mathbb {C})^{\otimes n}\), one has
Proof
This follows from a simple use of the triangle inequality for the \(L^1\)-norm as well as monotonicity under the normalized partial trace. \(\square \)
3.2 A quantum \(L^1\)-Talagrand inequality
We first prove a quantum \(L^1\)-Talagrand inequality on quantum Boolean cubes that can be extended to more general von Neumann algebras; see Sect. 4. We will see later that on quantum Boolean cubes the estimates can be improved, so that we may deduce a sharp quantum KKL theorem for \(L^1\)-influences.
Theorem 3.2
For all \(A\in M_2(\mathbb {C})^{\otimes n}\) with \(\Vert A\Vert \le 1\) one has
for some universal \(C>0\).
Proof
Differentiating the function \(t\mapsto \Vert P_t(A)\Vert _2^2\) one gets
By intertwining (Lemma 2.3) and hypercontractivity (Lemma 2.2),
with \(p(t)=1+e^{-2t}\). By Hölder’s inequality,
For the term with the \(L^1\)-norm we use intertwining again and \(L^1\)-contractivity of \((P_t)_{t\ge 0}\) to get \(\Vert d_j P_t A\Vert _1\le \Vert d_j A\Vert _1\). For the term with \(\Vert \cdot \Vert \)-norm we use the bound derived from Lemma 3.4 below, which gives \(\Vert d_j P_t A\Vert \le (e^t-1)^{-1/2}\). Altogether,
As a consequence,
Since \(e^t-1\ge t\) and \(p(t)\ge 1\), we have
Choosing \(T=1\), we further show in Lemma 3.3 below that, given \(a=\Vert d_j A\Vert _1\),
for some universal constant \(C>0\). We finish by combining (3.3) and the bound (with \(T=1\))
derived from the Poincaré inequality (Lemma 2.1). \(\square \)
Lemma 3.3
There exists a universal \(C>0\) such that for \(\alpha >0\), \(a\ge 0\), \(p(t)=1+e^{-2\alpha t}\) and \(0\le r\le \min \{1,1/2\alpha \}\), we have
Proof
Note that \(r\le 1\) and \(1\le p(t)\le 2\), we have
Now we estimate the right hand side in two cases. If \(a\ge 1\), then \(a^{2/p(t)-1}\le a\) and thus
If \(a<1\), then \(a^{2/p(t)-1}\le a^{\alpha t/2}\) for \(0\le t\le r\). In fact, for \(t\in [0,r]\subset [0, 1/2\alpha ]\), we have
which is equivalent to
for \(\beta \in [0,2\alpha r]\subset [0,1]\). This is clear as \(\varphi (0)=0\) and \(\varphi '(\beta )=(3-\beta )e^\beta -1\ge 0\) for \(\beta \in [0,1]\). Hence
where \(x=\log (1/a)>0\). It remains to show that there exists a universal constant \(C>0\) such that for all \(x>0\),
In fact, the function
is continuous on \((0,\infty )\), and we have
and by L’Hôpital’s rule
Then (3.4) follows and the proof is complete. \(\square \)
It remains to prove the following technical lemma:
Lemma 3.4
Let \(n\ge 1\) and \((P_t)_{t\ge 0}\) be the quantum depolarizing semigroup on n-qubits defined in (2.4). Then for all \(t> 0\) and all \(A\in M_2(\mathbb {C})^{\otimes n}\) we have
In particular, for each \(1\le j\le n\),
Proof
By definition of \((P_t)_{t\ge 0}\):
where (1) follows from the differentiation of \(s\mapsto P_s(P_{t-s}(A^*)P_{t-s}(A))\), whereas (2) follows from gradient estimates Lemma 2.4. Now we claim that for all A and for each \(1\le j\le n\) we have
and thus
Let us first finish the proof of the lemma given (3.7). Applying (3.7) to \(P_t A\), we may proceed with the previous estimate as
which proves (3.5).
Now it remains to show (3.6). For this we decompose \(A=\sum _{k}A^k_1\otimes \cdots \otimes A^k_n\) into a sum of tensor products, where k in \(A^k_j=A^{(k)}_j\) is just an index, then
and
For any \(X,Y\in M_2(\mathbb {C})\), a direct computation shows
and
Thus
So (3.6) is equivalent to
which can be reformulated as
with \(T_j:={\mathbb {I}} ^{\otimes (j-1)}\otimes \frac{1}{2}{\text {tr}}\otimes {\mathbb {I}} ^{\otimes (n-j)}\). Now (3.8) follows from the Kadison–Schwarz inequality [Wol12, Chapter 5.2] and that \(\frac{1}{2}{\text {tr}}\) is unital completely positive (over \(M_2(\mathbb {C})\)). This finishes the proof of the claim (3.6) and thus the proof of the lemma. \(\square \)
Remark 3.5
Following the argument in [CEL12, Proof of Theorem 1], one can prove a quantum analogue of (1.4) using similar properties of quantum depolarizing semigroups. In fact, the proof of (1.4) does not even require strictly positive Ricci curvature lower bounds, i.e. Lemma 2.4 can be weakened. We will not discuss it here as (1.4) is not our main focus and a quantum analogue was already obtained in [MO10a].
The quantum Talagrand inequality Theorem 3.2 implies a quantum KKL for \(L^1\) influences (following a similar argument in Lemma 3.8 below): for balanced quantum Boolean A on n-qubits,
Recall that for classical Boolean functions the sharp order is \(\log (n)/n\), which can be captured by tribes functions [O’D14, Chapter 4]. In fact, the order \(\log (n)/n\) is also sharp for quantum Boolean functions, which can be seen from the following improved version of quantum Talagrand Theorem 3.2:
Theorem 3.6
For every \(p\in [1,2)\) there exists a constant \(C_p>0\) such that for every \(n\in {\mathbb {N}}\) and \(A\in M_2({\mathbb {C}})^{\otimes n}\) with \(\Vert A\Vert \le 1\) one has
where the constant can be chosen of order \(C_p\sim C/(2-p)\) as \(p\nearrow 2\). In particular, for \(p=1\):
Proof
Let \(T>0\) be such that \(p\le 1+e^{-2T}\). By the Poincaré inequality we have
By intertwining and hypercontractivity,
with \(p(t)=1+e^{-2t}\).
By interpolation and \(\Vert d_jA\Vert \le 2\Vert A\Vert \le 2\),
If \(\Vert d_j(A)\Vert _p^p\ge 1\), then \(\Vert d_j(A)\Vert _p^{p(2/p(t)-1)}\le \Vert d_j(A)\Vert _p^p\), so that
If \(\Vert d_j(A)\Vert _p^p< 1\), then \(\Vert d_j(A)\Vert _p^{p(2/p(t)-1)}\le \Vert d_j(A)\Vert _p^{pt/2}\) (recall that \(p(2/p(t)-1)\ge pt/2\) by the proof of Lemma 3.3) and thus
We claim that
Applying (3.9) to \(a=\Vert d_j (A)\Vert _p^{-p}\), we obtain
Now let us prove the claim (3.9) which we divide into two cases. When \(a\in [1,e]\), we have
which are nothing but
A direct computation shows that when \(a\in [1,e]\)
and
This proves (3.10) and thus the claim when \(a\in [1,e]\). When \(a\ge e\), we have \(2\log a\ge 1+\log a\), and
which proves the claim for \(a\in [1,e]\).
Noting that \(8T\le 16\max \left\{ \frac{3T}{2},2\right\} \), we thus just proved for all \(e^{-2T}\ge p-1\), one has
choosing \(T=-\frac{1}{2} \log (p-1)\), the above inequality becomes
with \(C_p=\frac{16\max \{-\frac{3}{4}\log (p-1),2\}}{2-p}\) which is of the order \(32/(2-p)\) as \(p\nearrow 2\). \(\square \)
Remark 3.7
The above Theorem 3.6 (when \(p=1\)) improves Theorem 3.2, since it gives the right order of quantum KKL as we shall see in the next. However, Theorem 3.2 can be easily extended to more general von Neumann algebras, which will be discussed in Theorem 4.3. The generalization of Theorem 3.6 is also possible but requires additional assumption which we will not discuss in the general von Neumann algebra setting.
3.3 A KKL theorem for quantum Boolean functions
Our quantum KKL theorem for geometric (\(L^1\)-)influences follows as a simple corollary of Theorem 3.6. First we need an elementary lemma.
Lemma 3.8
If \(n\in {\mathbb {N}}\), \(a_1,\dots ,a_n\ge 0\) and \(c>0\) such that
then
Proof
If \(\max _{1\le j\le n}a_j\ge 1/\sqrt{n}\), we are done, so we can assume \(a_j<1/\sqrt{n}\le 1\) for all \(j\in \{1,\dots ,n\}\). Then we have
\(\square \)
Theorem 3.9
For every \(1\le p<2\), there exists a constant \(C_p>0\) such that for any \(n\ge 1\) and any balanced quantum Boolean function \(A\in M_2(\mathbb {C})^{\otimes n}\)
Proof
Since \(\textrm{Var}(A)=1\) for any balanced quantum Boolean function, the result follows from Theorem 3.6 with the help of Lemma 3.8. \(\square \)
All combined, we have shown that every balanced quantum Boolean function has a geometrically influential variable. In fact, suppose that \(A\in M_2(\mathbb {C})^{\otimes n}\) is a balanced quantum Boolean function, then
According to \(L^1\)-Poincaré inequality (3.1),
One may wonder if \({\text {Inf}}_j^1(A)=\Vert d_j A\Vert _1\approx 1/n\) for all \(1\le j\le n\) is possible. However, our Theorem 3.9 for \(p=1\) indicates that this is not the case. There exists j such that \({\text {Inf}}^1_j (A)\ge C\log (n)/n\) for some \(C>0\).
Remark 3.10
In [MO10a, Conjecture 3 of Section 12], the authors have conjectured a similar KKL-type result for the quantum \(L^2\)-influences \({\text {Inf}}_j^2(A)\). While this influence coincides with the \(L^1\)-influence \({\text {Inf}}^1_j(A)\) when A is a classical Boolean function, this is not the case in the quantum setting. Hence, this conjecture in [MO10a] remains open to the best of our knowledge.
3.4 A Friedgut’s Junta theorem for quantum Boolean functions
We recall that a Boolean function \(g:\Omega _n\rightarrow \{-1,1\}\) is called a k-junta if it only depends on a set of at most \(k<n\) bits. In [Fri98], Friedgut showed that for any Boolean function \(f:\{-1,1\}^n\rightarrow \{-1,1\}\) and \(\varepsilon \in (0,1)\), f is \(\varepsilon \)-close in 2-norm to a \(2^{\mathcal {O}({\text {Inf}}^2 f/\varepsilon )}\)-junta, where
denotes the total \(L^2\)-influence of f, with \({\text {Inf}}_j^2(f):=\Vert D_j f\Vert _2^2\). More recently, Bouyrie [Bou17] proved an \(L^1\) version of Friedgut’s junta theorem, more adapted to continuous models, based on the proof techniques developed in [CEL12] (see also [Aus16] for a previous account of the result upon which the proof of [Bou17] relies). The next theorem constitutes a quantum generalization of the \(L^1\) Friedgut’s Junta theorem; see Corollary 3.12 followed. Recall that we define k-juntas for operators that are not necessarily Boolean.
Theorem 3.11
For any \(A\in M_2({\mathbb {C}})^{\otimes n}\) and any \(\varepsilon >0\) small enough, there exists a k-junta \(B\in M_2({\mathbb {C}})^{\otimes n}\) with \(\Vert A-B\Vert _2\le \varepsilon \) and
Moreover, B can be taken to be \(2^{-|T|}{\text {tr}}_T(A)\) for some set \(T\subset \{1,\dots ,n\}\) of \(n-k\) qubits.
Proof
Let \(d=\frac{2{\text {Inf}}^2(A)}{\varepsilon ^2}\). If \(d\le 1\), then
for any subset T of \(\{1,\dots ,n\}\) by the (non-primitive) Poincaré inequality for the tensor product of depolarizing semigroups restricted to the subset T (see [Bar17, Example 3.1]).
Let us now consider the case \(d>1\). Let
and \(B=2^{-|T|}{\text {tr}}_T(A)\). The matrix \(A-B\) has the Fourier decomposition
By Plancherel’s identity,
Let us treat both summands on the right side separately. For the first summand,
where we used formula (2.5) for \({\text {Inf}}^2\).
For the second summand,
for any \(t\ge 0\). Here we used (2.6) for the depolarizing semigroup.
Now take \(t=\log 2\). By intertwining (Lemma 2.3), hypercontractivity (Lemma 2.2) and interpolation,
Since \(d_j\) is a projection, \(\Vert d_j(A)\Vert _2\le \Vert A\Vert _2\). Moreover, by definition, for \(j\in T\)
Therefore,
where we used the elementary inequality \(x\le e^{x/2}\) for \(x\ge 0\) in the last step.
Altogether we have shown that \(\Vert A-B\Vert _2^2\le \varepsilon ^2\). Moreover, B is a k-junta with \(k=|T^c|\). Since
for every \(j\in T^c\), we have
Hence
\(\square \)
In the next corollary we restrict ourselves to quantum Boolean functions.
Corollary 3.12
For any quantum Boolean \(A\in M_2({\mathbb {C}})^{\otimes n}\) and any \(\varepsilon >0\) small enough there exists a quantum Boolean k-junta \(C\in M_2({\mathbb {C}})^{\otimes n}\) with \(\Vert A-C\Vert _2\le \varepsilon \) and
Proof
By Theorem 3.11 there exists a self-adjoint k-junta \(B\in M_2({\mathbb {C}})^{\otimes n}\) such that \(\Vert A-B\Vert _2\le \varepsilon \) and \(k\le 2^{\frac{30{\text {Inf}}^2(A)}{\varepsilon ^2}}{\text {Inf}}^1(A)^6/{\text {Inf}}^2(A)^5\). Let us now define \(C:={\text {sgn}}(B)\) as follows: Given the spectral decomposition \(B=\sum _i\lambda _i|\psi _i\rangle \langle \psi _i|\), \({\text {sgn}}(B)=\sum _{i}{\text {sgn}}(\lambda _i)|\psi _i\rangle \langle \psi _i|\), where the sign function \({\text {sgn}}\) is defined as
Note that \(|\lambda +{\text {sgn}}(\lambda )|\ge 1\). Then
and
Therefore,
where in (1) we have used that \(A^2=\textbf{1}\), whereas in (2) we used the fact that \(B=2^{-|T|}{\text {tr}}_T(A)\) for some set T of qubits, so that \(\Vert B\Vert \le \Vert A\Vert \le 1\). Moreover, we know the size of \(T^c\) from Theorem 3.11. The result then follows after rescaling of \(\varepsilon \) to \(\varepsilon /3\). \(\square \)
Remark 3.13
In the case of a classical Boolean function f, we know that \({\text {Inf}}f\equiv {\text {Inf}}^1 f={\text {Inf}}^2 f\) and the bound in Corollary 3.12,
simplifies as
We therefore recover the classical Friedgut’s Junta theorem.
Remark 3.14
In the classical setting, other junta-type theorems related to Fourier analysis of Boolean functions may be found in [FKN02, ADFS04, Bou02, KN06, KS02, DFKO06]. While extending these results to the present quantum setting is an interesting problem, their statements do not directly involve the notion of influence that is central to our study. This interesting direction of research will therefore be considered elsewhere.
4 Von Neumann Algebraic Generalizations
In this section, we generalize the main results from the previous section to the general von Neumann algebraic setting. Apart from technical challenges that arise from the fact that the underlying Hilbert space can be infinite-dimensional and the operators involved can be unbounded, most proofs run parallel to the ones for qubits once the appropriate assumptions are identified. As demonstrated in the next section, these hypotheses are satisfied for a number of interesting examples besides the qubit systems treated in Sect. 3.
We start recapitulating some basic von Neumann algebra theory. As a general reference, we refer to [Tak02, Tak03]. Let \(\mathcal {H}\) be a Hilbert space and \(B(\mathcal {H})\) the space of all bounded linear operators on \(\mathcal {H}\). The \(\sigma \)-weak topology on \(B(\mathcal {H})\) is the topology induced by the seminorms \(|{\text {tr}}(\,\cdot \,x)|\), where x runs over the set of all trace-class operators. A von Neumann algebra \({\mathcal {M}}\) on \(\mathcal {H}\) is a unital \(*\)-subalgebra of \(B(\mathcal {H})\) that is closed in the \(\sigma \)-weak topology. A linear functional on \({\mathcal {M}}\) is called normal if it is continuous with respect to the \(\sigma \)-weak topology. The set of all normal linear functionals on \({\mathcal {M}}\) is denoted by \({\mathcal {M}}_*\), and the obvious dual pairing between \({\mathcal {M}}\) and \({\mathcal {M}}_*\) establishes an isometric isomorphism between \({\mathcal {M}}\) and \(({\mathcal {M}}_*)^*\).
A state on \({\mathcal {M}}\) is a positive linear functional \(\varphi :{\mathcal {M}}\rightarrow \mathbb {C}\) such that \(\varphi (\textbf{1})=1\). A state is called faithful if \(\varphi (x^*x)=0\) implies \(x=0\). For a faithful normal state \(\varphi \) on \({\mathcal {M}}\) let \(\mathcal {H}_\varphi \) denote the completion of M with respect to the inner product
and let \(\Lambda _\varphi (x)\) denote the image of x inside \(\mathcal {H}_\varphi \). The GNS representation is defined by \(\pi _\varphi (x)\Lambda _\varphi (y)=\Lambda _\varphi (xy)\). The vector \(\Lambda _\varphi (\textbf{1})\) is a cyclic and separating vector for \(\pi _\varphi ({\mathcal {M}})\), which is denoted by \(\Omega _\varphi \). We routinely identify \({\mathcal {M}}\) with \(\pi _\varphi ({\mathcal {M}})\).
For the definition of the noncommutative \(L^p\) spaces, we need some basic modular theory. The operator
is a closable anti-linear operator on \(\mathcal {H}_\varphi \). Let S denote its closure and \(S=J\Delta ^{1/2}\) the polar decomposition of S. The operator J is an anti-unitary involution, called the modular conjugation, and \(\Delta =S^*S\) is called the modular operator.
The symmetric embedding \(i_2\) of \({\mathcal {M}}\) into \(\mathcal {H}_\varphi \) is given by \(i_2(x)=\Delta ^{1/4}\Lambda _\varphi (x)\) and the symmetric embedding \(i_1\) of \({\mathcal {M}}\) into \({\mathcal {M}}_*\) is uniquely determined by the relation
or in other words, \(i_1=i_2^*J i_2\) if we view J as an isomorphism between \(\mathcal {H}_\varphi \) and \({\overline{\mathcal {H}}}_\varphi \cong \mathcal {H}_\varphi ^*\).
Kosaki’s interpolation \(L^p\) spaces [Kos84] are defined as the complex interpolation space
for \(p\in (1,\infty )\). Thus we get embeddings \(i_p:{\mathcal {M}}\rightarrow L^p({\mathcal {M}},\varphi )\) for \(p\in (1,\infty )\) with
for all \(x\in {\mathcal {M}}\). In particular, \(L^2({\mathcal {M}},\varphi )\cong \mathcal {H}_\varphi \) isometrically, and the definition of \(i_2\) is consistent with the definition given before under this identification.
In the case \({\mathcal {M}}=M_n({\mathbb {C}})\), every state \(\varphi \) on \({\mathcal {M}}\) is of the form \(\varphi ={\text {tr}}(\,\cdot \,\sigma )\) for some density matrix \(\sigma \). The state \(\varphi \) is faithful if and only if \(\sigma \) is invertible. In this case \(L^p(M_n({\mathbb {C}}),\varphi )\) can be identified with \(M_n({\mathbb {C}})\) with the norm \({\text {tr}}(|\cdot |^p)^{1/p}\), and the embedding \(i_p\) is given by \(i_p(x)=\sigma ^{1/2p}x\sigma ^{1/2p}\). In particular, \(\Vert i_p(x)\Vert ={\text {tr}}(|\sigma ^{1/2p}x\sigma ^{1/2p}|^p)^{1/p}\), which is the expression for the \(L^p\) norm commonly used in quantum information theory.
A quantum Markov semigroup (QMS) on \({\mathcal {M}}\) is a family \((P_t)_{t\ge 0}\) of normal bounded linear operators on \({\mathcal {M}}\) such that
-
\(P_0=\textrm{id}_{{\mathcal {M}}}\), \(P_s P_t=P_{s+t}\) for \(s,t\ge 0\),
-
\(P_t(x)\rightarrow x\) as \(t\searrow 0\) in the \(\sigma \)-weak topology for every \(x\in {\mathcal {M}}\),
-
\(\sum _{j,k=1}^n y_j^*P_t(x_j^*x_k)y_k\ge 0\) for all \(x_1,\dots ,x_n,y_1,\dots ,y_n\in {\mathcal {M}}\) and \(t\ge 0\),
-
\(P_t(\textbf{1})=1\) for all \(t\ge 0\).
If \((P_t)_{t\ge 0}\) is a quantum Markov semigroup on \({\mathcal {M}}\), then \(P_t\) has a pre-adjoint \((P_t)_*:{\mathcal {M}}_*\rightarrow {\mathcal {M}}_*\) for every \(t\ge 0\), and \(((P_t)_*)_{t\ge 0}\) is a strongly continuous semigroup on \({\mathcal {M}}_*\). The QMS \((P_t)_{t\ge 0}\) is called KMS-symmetric with respect to \(\varphi \) if
for all \(x,y\in {\mathcal {M}}\) and \(t\ge 0\). In this case, for all \(p\in [1,\infty )\) and \(t\ge 0\) the operator
extends to a contraction \(P_t^{(p)}\) on \(L^p({\mathcal {M}},\varphi )\), and \((P_t^{(p)})_{t\ge 0}\) is a strongly continuous semigroup. In particular, \((P_t)_*=P_t^{(1)}\). Occasionally we also write \(P_t^{(\infty )}\) for \(P_t\).
The generator of \((P_t^{(p)})_{t\ge 0}\) is defined by
where the limit is taken in the norm topology if \(p\in [1,\infty )\) and in the \(\sigma \)-weak topology if \(p=\infty \). We also write \({\mathcal {L}}\) for \({\mathcal {L}}_\infty \). Note that there are differing sign conventions for the generator; with our convention, \({\mathcal {L}}_2\) is a positive self-adjoint operator on \(L^2({\mathcal {M}},\varphi )\).
We make the following assumption:
- (H0):
-
There exists a \(*\)-subalgebra \({\mathcal {A}}\) of \(D({\mathcal {L}})\) which is \(\sigma \)-weakly dense in \({\mathcal {M}}\) and invariant under \((P_t)_{t\ge 0}\).
We can then define the carré du champ operator as follows:
We write \(\Gamma (x)\) for \(\Gamma (x,x)\).
We will further use the following assumption:
- (H1):
-
Bakry–Émery gradient estimate: There exists \(K\in {\mathbb {R}}\) such that
$$\begin{aligned} \Gamma (P_t(x))\le e^{-2Kt}P_t(\Gamma (x)) \end{aligned}$$for all \(x\in {\mathcal {A}}\) and \(t\ge 0\).
To avoid case distinctions, the following notation will come in handy:
Further, we write \(K_-\) for the negative part of a real number K. The following result is an analog of Lemma 3.4.
Lemma 4.1
If \((P_t)_{t\ge 0}\) is a QMS satisfying (H0)–(H1), then
for all \(x\in {\mathcal {A}}\) and \(t\ge 0\).
In particular,
for all \(x\in {\mathcal {A}}\) and \(t\in \left[ 0,\frac{1}{2K_-}\right) \).
- (H2):
-
There exists a finite family of linear self-adjoint maps \(d_j:{\mathcal {A}}\rightarrow {\mathcal {M}}\), \(j\in {\mathcal {J}}\), such that
$$\begin{aligned} \langle i_2(x),i_2({\mathcal {L}}(x))\rangle =\sum _{j\in {\mathcal {J}}}\Vert i_2(d_j(x))\Vert ^2 \end{aligned}$$(H2-1)and a constant \(M>0\) such that
$$\begin{aligned} \max _{j\in {\mathcal {J}}}\Vert d_j(x)\Vert \le M\Vert \Gamma (x)\Vert ^{1/2} \end{aligned}$$(H2-2)for all \(x\in {\mathcal {A}}\).
Note that (H2-1) implies in particular that the series on the right side converges for all \(x\in {\mathcal {A}}\), and by polarization,
for all \(x,y\in {\mathcal {A}}\).
In this situation we define the p-influence of the j-th variable on x by \({\text {Inf}}^p_j(x)=\Vert i_p(d_j(x))\Vert ^p\) and the total influence of x by \({\text {Inf}}^p(x)=\sum _{j\in {\mathcal {J}}}{\text {Inf}}^p_j(x)\).
We say \((P_t)_{t\ge 0}\) is primitive if \(P_t(x)\rightarrow \varphi (x)\textbf{1}\) \(\sigma \)-weakly as \(t\rightarrow \infty \) for every \(x\in {\mathcal {M}}\).
Theorem 4.2
(\(L^1\)-Poincaré inequality) If \((P_t)_{t\ge 0}\) is a primitive KMS-symmetric QMS on \({\mathcal {M}}\) satisfying (H0)–(H2) with \(K>0\), then
for all \(x\in {\mathcal {A}}\).
Proof
By duality and \(\sigma \)-weak density of \({\mathcal {A}}\) in \({\mathcal {M}}\), we have
Since \((P_t)_{t\ge 0}\) is primitive,
Now by the consequence (4.1) of (H2-1),
By Lemma 4.1 and (H2-2),
All combined, we obtain the desired inequality. \(\square \)
To prove our general noncommutative version of the \(L^1\)-Talagrand inequality, we need some more assumptions on \((P_t)_{t\ge 0}\), which we collect in the following:
- (H3):
-
Poincaré inequality: There exists a constant \(\lambda >0\) such that
$$\begin{aligned} \lambda \Vert i_2(x-\varphi (x)\textbf{1})\Vert ^2\le \langle i_2(x),i_2({\mathcal {L}}(x))\rangle \end{aligned}$$for all \(x\in {\mathcal {A}}\).
- (H4):
-
Hypercontractivity: There exists a constant \(\alpha >0\) such that
$$\begin{aligned} \Vert i_2(P_t(x))\Vert \le \Vert i_p(x)\Vert \end{aligned}$$for all \(x\in {\mathcal {A}}\), \(t\ge 0\) and \(p=1+e^{-2\alpha t}\).
- (H5):
-
Intertwining: There exists a constant \(\mu \in {\mathbb {R}}\) such that
$$\begin{aligned} \Vert i_p(d_j(P_t(x)))\Vert \le e^{-\mu t}\Vert i_p(P_t(d_j(x)))\Vert \end{aligned}$$for all \(x\in {\mathcal {A}}\), \(j\in {\mathcal {J}}\), \(p\in [1,\infty ]\) and \(t\ge 0\).
In fact, it is well-known [OZ99] that hypercontractivity (H4) implies Poincaré inequality (H3).
The proof of the following theorem follows the argument given by Cordero–Erausquin and Ledoux [CEL12, Theorem 6] in the commutative case. We refer to the appendix for the details.
Theorem 4.3
(\(L^1\)-Talagrand inequality). If \((P_t)_{t\ge 0}\) is a KMS-symmetric QMS on \({\mathcal {M}}\) satisfying (H0)–(H5), then there exists a constant \(C>0\) depending only on the constants \(K,M,\alpha ,\lambda ,\mu \) such that
for all \(x\in {\mathcal {A}}\) with \(\Vert x\Vert \le 1\).
Again following [CEL12], we can also give a generalization of Talagrand’s inequality (1.4) in this setting.
Theorem 4.4
If \((P_t)\) is a KMS-symmetric QMS on \({\mathcal {M}}\) satisfying (H0), (H2)–(H5), then
for all \(x\in {\mathcal {A}}\).
Proof
By the Poincaré inequality (H3), we have
for \(T=1/2\lambda \).
Arguing as in the proof of Theorem 3.2, we get
By (H4) and (H5),
with \(p(t)=1+e^{-2\alpha t}\).
After the change of variables \(s=p(t)\) and application of Hölder’s inequality we get
From here, the claimed inequality follows from an elementary bound on the last integral (compare [CEL12, Theorem 1]). \(\square \)
Since the p-influences for different p do not coincide in the quantum setting, this version of Talagrand’s inequality does not imply a KKL bound. However, we still have the following weaker bound as consequence of Theorem 4.3. Again, the proof can be found in the appendix.
Theorem 4.5
If \((P_t)\) is a KMS-symmetric QMS on \({\mathcal {M}}\) satisfying (H0)–(H5) and the cardinality n of \({\mathcal {J}}\) is finite, then there exists \(C'>0\) depending only on the constants \(K,L,M,\alpha ,\lambda ,\mu \) such that
for all self-adjoint \(x\in {\mathcal {A}}\) with \(\Vert i_2(x)\Vert =1\), \(\Vert x\Vert \le 1\) and \(\varphi (x)=0\).
Remark 4.6
The sharpness of the bound derived in Theorem 4.5 in the present general context was shown in [KMS12].
To prove our generalized version of Friedgut’s junta theorem, we need one last assumption on \((P_t)\). For that purpose, if \({\mathcal {I}}\subset {\mathcal {J}}\), let \(E_{{\mathcal {I}}}\) denote the orthogonal projection onto \(\bigcap _{i\in {\mathcal {I}}}\overline{i_2(\ker d_i)}\) in \(L^2({\mathcal {M}},\varphi )\).
- (H6):
-
There exists a constant \(\nu >0\) such that
$$\begin{aligned} \nu \Vert i_2(x)-E_{{\mathcal {I}}}(i_2(x))\Vert ^2\le \sum _{i\in {\mathcal {I}}}\Vert i_2(d_i(x))\Vert ^2 \end{aligned}$$for every \(x\in {\mathcal {A}}\) and \({\mathcal {I}}\subset {\mathcal {J}}\).
If \((P_t)\) is primitive, then \(E_{{\mathcal {J}}}(i_2(x))=i_2(\varphi (x)\textbf{1})\). Thus (H6) is a strengthening of the Poincaré inequality from (H3) in the case of primitive QMS.
Lemma 4.7
If \((P_t)\) is a KMS-symmetric QMS on \({\mathcal {M}}\) satisfying (H0), (H2), (H4)–(H6), then for any \(x\in {\mathcal {A}}\), \(t,\eta >0\) and \({\mathcal {I}}\subset {\mathcal {J}}\) such that \({\text {Inf}}^1_i(x)\le \eta \) for all \(i\in {\mathcal {I}}\) one has
where \(q(t)=\frac{1-e^{-2\alpha t}}{1+e^{-2\alpha t}}\).
Proof
By (H6) we have
By (H4), (H5) and interpolation,
Therefore,
\(\square \)
Lemma 4.8
If A is a positive self-adjoint operator on a Hilbert space \({\mathcal {H}}\), then
for every \(\xi \in D(A^{1/2})\) and \(t\ge 0\).
Proof
This follows by the spectral theorem from the scalar inequality \((1-e^{-tx})^2\le tx\) for \(t,x\ge 0\). \(\square \)
A version of Friedgut’s junta theorem in this setting now reads as follows. Again, the proof can be found in the appendix.
Theorem 4.9
Let \((P_t)\) be a KMS-symmetric QMS on \({\mathcal {M}}\) satisfying (H0), (H2), (H4)–(H6). There exists a constant \(C>0\) depending only on \(\alpha \) and \(\nu \) such that for every \(x\in {\mathcal {A}}\) and \(0<\varepsilon \le 2/\nu \) there exists a set \({\mathcal {I}}\subset {\mathcal {J}}\) such that \(\Vert i_2(x)-E_{{\mathcal {I}}}(i_2(x))\Vert \le \varepsilon \) and
where \(\mu _-=-\mu \) if \(\mu <0\), and 0 otherwise.
5 Examples
5.1 Classical case
The results in [CEL12, Bou17] fit into our framework by choosing the commutative von Neumann algebras, i.e. \(({\mathcal {M}},\varphi )=L^\infty (X,\mu )\) with X a probability measure space.
5.2 Generalized depolarizing semigroups
We start with a simple weighted generalization of the depolarizing semigroup, also known as generalized depolarizing: given a full-rank state \(\omega \) over \(\mathbb {C}^{d}\),
We verify assumptions (H0)-(H5) for the semigroup \((e^{t\mathcal {L}_\omega })_{t\ge 0}\) . First of all, since we are in a finite dimensional case, (H0) is directly satisfied. (H1) was proved in [JZ15] with \(K=\frac{1}{2}\). With \(d_j(x)=\textrm{id}^{\otimes (j-1)}\otimes (\textrm{id}-{\text {tr}}(\omega \cdot )\textbf{1})\otimes \textrm{id}^{\otimes (n-j)}\) a direct computation shows
which settles (H2-1). Condition (H2-2) with \(M=\sqrt{2}\) follows as in Equation (3.7). Condition (H3) with \(\lambda =1\) is easy to check for \(n=1\), one for arbitrary n follows by tensorization. The best constant \(\alpha \) satisfying (H4) for any n has been shown in [BDR20, Theorems 24 & 25], whereas a lower bound on \(\alpha \) was found e.g. in [TPK14, Theorem 9]. A direct computation shows (H5) with \(\mu =1\).
5.3 Quantum Ornstein–Uhlenbeck semigroup
Next, we consider the generator of the so-called quantum Ornstein–Uhlenbeck semigroup [FRS94, CFL00, CS08]. The latter acts on the algebra \(B(\ell ^2(\mathbb {N}))\) of all bounded operators on the Hilbert space \(\ell ^2(\mathbb {N})\) of square-summable sequences. Denoting by a and \(a^*\) the annihilation and creation operators of the quantum harmonic oscillator, which are defined by their action on a given orthonormal basis \(\{|k\rangle \}_{k\in \mathbb {N}}\) of \(\mathcal {H}\equiv \ell ^2(\mathbb {N})\simeq L^2(\mathbb {R})\) as follows:
the generator of the quantum Ornstein–Uhlenbeck semigroup takes the following form at least on finite rank operators:
where \(\mu>\lambda >0\). Denoting \(\nu =\lambda ^2/\mu ^2\), it has a unique invariant state
Here we will use the notion of a Schwartz operator [KKW16]: an operator \(x\in B(L^2(\mathbb {R}))\) is called a Schwartz operator if for any indices \(\alpha ,\beta ,\alpha ',\beta '\in \mathbb {N}\),
where \(\mathfrak {S}(\mathbb {R})\) denotes the set of Schwartz functions over \(\mathbb {R}\), \(Q:(x\mapsto \psi (x))\mapsto (x\mapsto x\psi (x))\) is the so-called position operator and \(P:(x\mapsto \psi (x))\mapsto (x\mapsto -i\psi '(x))\) is the momentum operator. We denote by \(\mathfrak {S}(\mathcal {H})\) the algebra of Schwartz operators.
Proposition 5.1
The semigroup generated by \(\mathcal {L}\) and derivations \(d_a:=[a,\cdot ]\) and \(d_{a*}=[a^*,\cdot ]\) satisfy the conditions (H0)–(H5) with respect to the algebra \(\mathcal {A}\equiv \mathfrak {S}(\mathcal {H})\).
Proof
The set \(\mathfrak {S}(\mathcal {H})\) of Schwartz operators is a \(*\)-subalgebra of \(B(L^2(\mathbb {R}))\) [KKW16, Lemma 3.5]. Moreover for any \(p\ge 1\), the set \(\mathfrak {S}_0(\mathcal {H})\) of finite-rank Schwartz operators is dense in the space \(\mathcal {T}_p(\mathcal {H})\) of Schatten-p operators [KKW16, Lemma 2.5]. Therefore, since finite-rank operators are \(\sigma \)-weakly dense in \(B(\mathcal {H})\), this also holds for \(\mathfrak {S}(\mathcal {H})\). In order to show that \(\mathfrak {S}(\mathcal {H})\) is invariant with respect to the semigroup generated by \(\mathcal {L}\), we use tools from noncommutative Fourier analysis: given a trace-class operator x, its characteristic function is given by
where \(D(z):=e^{za^*-\bar{z}a}\), for \(z\in \mathbb {C}\), is the so-called one-mode displacement operator. By the quantum Plancherel identity, we have that for any two trace-class operators x, y [Hol11],
Moreover, the quantum Ornstein–Uhlenbeck semigroup can be represented by a family of quantum channels \(e^{t\mathcal {L}}\) modelling a quantum beam-splitter of transmissivity \(\eta =e^{-(\mu ^2-\lambda ^2)t}\) and with environment state \(\sigma _{\mu ,\nu }\) can be shown to induce the following action on characteristic functions:
with \(N=\frac{\lambda ^2}{\mu ^2-\lambda ^2}\). Therefore, for any \(\alpha ,\beta ,\alpha ',\beta '\in \mathbb {N}\) and normalized \(\psi ,\varphi \in \mathfrak {S}(\mathbb {R})\), denoting \(y:=|P^{\beta '}Q^{\alpha '}\varphi \rangle \langle P^\beta Q^\alpha \psi |\), we have
where we used that \(u\mapsto e^{-(2N+1)(1-\eta )|u^2|/2\eta }\chi _T(u/\sqrt{\eta })\) is a Schwartz function, see [KKW16, Proposition 3.18]. Finally, by [KKW16, Proposition 3.14], for any \(x\in \mathfrak {S}(\mathcal {H})\), \(\mathcal {L}(x)\) is closable with closure in \(\mathfrak {S}(\mathcal {H})\). Hence, (H0) is satisfied for the algebra \(\mathcal {A}\equiv \mathfrak {S}(\mathcal {H})\) . Property (H1) can be easily derived from the canonical commutation relation \([a,a^*]=\mathbb {I}\) and gives \(K=(\mu ^2-\lambda ^2)/2\) (see e.g. [CM17a]). Property (H2) is satisfied for the maps \(d_a:=[a,\cdot ]\) and \(d_{a^*}:=[a^*,\cdot ]\). The Poincaré inequality (H3) follows from the characterization of the spectrum of the generator \(\mathcal {L}\) established in [CFL00]. The hypercontractivity constant in (H4) was estimated in [CS08]. The intertwining relation of (H5) was found in [CM17a]. \(\square \)
5.4 Group von Neumann algebras
Let G be a countable discrete group with unit e, L(G) the group von Neumann algebra on \(\ell ^2(G)\) generated by \(\{\lambda _g,g\in G\}\) where \(\lambda \) is the left regular representation of G. We denote by \(\tau (x)=\langle x\delta _e,\delta _e\rangle \) the canonical tracial faithful state. Here and in what follows, \(\delta _g\) always denotes the function on G that takes value 1 at g and vanishes elsewhere.
A function \(\psi :G\rightarrow [0,\infty )\) is a conditionally negative definite (cnd) length function if \(\psi (e)=0\), \(\psi (g^{-1})=\psi (g)\) and
for every \(f:G\rightarrow \mathbb {C}\) with finite support such that \(\sum _{g\in G} f(g)=0\).
By Schoenberg’s Theorem (see for example [BO08, Theorem D.11]), to every cnd function one can associate a \(\tau \)-symmetric quantum Markov semigroup on L(G) given by
For a countable discrete group G, a 1-cocycle is a triple \((H,\pi ,b)\), where H is a real Hilbert space, \(\pi :G\rightarrow O(H)\) is an orthogonal representation, and \(b:G\rightarrow H\) satisfies the cocycle law: \(b(gh)=b(g)+\pi (g)b(h),g,h\in G.\) To any cnd function \(\psi \) on a countable discrete group G, one can associate with a 1-cocycle \((H,\pi ,b)\) such that \(\psi (g^{-1}h)=\Vert b(g)-b(h)\Vert ^2,g,h\in G\). See [BO08, Appendix D] for more information.
Fix an orthonormal basis \((e_j)_{j\in {\mathcal {J}}}\) of H. In case G is finite, the index set \({\mathcal {J}}\) can always be taken to be finite. Let \({\mathcal {A}}\) be the linear span of the operator \(\lambda _g\), \(g\in G\), and let
The space \({\mathcal {A}}\) is contained in the domain of the generator \({\mathcal {L}}\) of \((P_t)\) and \({\mathcal {L}}(\lambda _g)=\psi (g)\lambda _g\) for \(g\in G\). Moreover, \(\Gamma (\lambda _g,\lambda _h)=\langle b(g),b(h)\rangle \lambda _{g^{-1}h}\) for \(g,h\in G\).
Clearly, condition (H0) is satisfied. Condition (H1) is satisfied with \(K=0\) [WZ21, Example 3.14]. For condition (H2) note that if \(x=\sum _g f(g)\lambda _g\in {\mathcal {A}}\), then
In particular, \(d_j(x)^*d_j(x)\le \Gamma (x)\) for every \(j\in {\mathcal {J}}\). Moreover,
Thus condition (H2) holds with \(M=1\). Condition (H3) holds with the spectral gap \(\lambda =\inf _{g:\psi (g) >0} \psi (g)\) of \({\mathcal {L}}\). Since \(d_j P_t=P_t d_j\), condition (H5) is always satisfied with \(\mu =0\).
Condition (H4) is known to hold for certain discrete groups. For free groups, it is known that (H4) holds with \(\alpha =2\) [JPP+15, Theorem A]. We refer to [JPPP17] for more examples including triangular groups, finite cyclic groups \(\mathbb {Z}_N, N\ge 6\), infinite Coxeter groups etc. with \(0<\alpha <\infty \).
6 Applications
6.1 Influence and circuit complexity lower bounds
As mentioned in the introduction, Karpovsky [Kar76] was the first to propose the total influence, as a measure of complexity of a function f. This intuition was then made rigorous in [LMN93] and [Bop97] where tight circuit complexity lower bounds in terms of the total influence were derived for the complexity class \({\text {AC}}^0\) of constant depth circuits.
Similar results were recently derived in the quantum setting. For instance, [BGJ+22] show a direct link between the notion of \(L^2\)-influence and the complexity of quantum circuits. More precisely, they showed that for a quantum circuit U, that is a unitary matrix in \(M_2(\mathbb {C})^{\otimes n}\) [BGJ+22, Theorem 12]
where the \(L^2\)-circuit sensitivity \({\text {CiS}}^2(U)\) is defined as
and where \({\text {Cost}}(U)\) refers to the cost of the circuit and was introduced in a series of seminal papers by Nielsen and coauthors [Nie06, NDGD06a, NDGD06b, DN08] as a lower bound on the minimal number of one and two-qubit gates required from a given universal gate-set in order to synthesize the unitary U. More precisely, given traceless self-adjoint operators \(h_1,\dots ,h_m\) that are supported on 2 qubits and normalized as \(\Vert h_i\Vert =1\), the circuit cost of U with respect to \(h_1,\dots , h_m\) is defined as
where the infimum above is taken over all continuous functions \(r_j:[0,1]\rightarrow \mathbb {R}\) satisfying
and
where \(\mathcal {P}\) denotes the path-ordering operator. We start by providing a simple bound on the p-influences for \(p\in [1,2]\) (for convenience we may write \(\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)\) for \(\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)\otimes \textbf{1}\)):
Proposition 6.1
For any \(j\in \{1,\dots , n\}\), let \(N_j\subset \{1,\dots , m\}\) be the minimal set of qubits such that \(\frac{{\text {tr}}_j}{2}(U(\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)\otimes \textbf{1})U^*)=U(\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)\otimes \textbf{1})U^*\) for any \(O\in M_2(\mathbb {C})^{\otimes n}\). Then, for any self-adjoint matrix \(O\in M_2(\mathbb {C})^{\otimes n}\) with \(\Vert O\Vert _2\le 1\) and all \(p\in [1,2]\) we have
In the case \(p=2\) and denoting \(L:=\max _i\,|\{j:\,i\in N_j\}|\), we get
Proof
For \(p\in [1,2]\) and any \(O\in M_2(\mathbb {C})^{\otimes n}\) with \(\Vert O\Vert _2\le 1\), we have
where in the second inequality above we use that the partial trace \({\text {tr}}_j\) is a projection onto the algebra of operators supported on \(\{j\}^c\), and therefore minimized the distance to that subalgebra. The third inequality follows from the non-primitive Poincaré inequality from Equation (3.13). \(\square \)
Remark 6.2
The assumption \(\frac{{\text {tr}}_j}{2}(U(\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)\otimes \textbf{1})U^*)=U(\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)\otimes \textbf{1})U^*\) can be interpreted as a lightcone condition: let’s consider for simplicity n a unitary circuit in brickwork architecture of the form \(U=U^{\ell } U^{\ell -1}\dots U^1\), where for each j, \(U^{2j+1}=U_{1,2}^{2j+1}\otimes \dots \otimes U^{2j+1}_{n-1,n }\) and \(U^{2j}=U_{2,3}^{2j}\otimes \dots \otimes U^{2j}_{n-2,n-1 }\), where by \(U^j_{r,r+1}\) we mean a unitary with non-trivial support on qubits r and \(r+1\). Hence, for any set \(N_1=\{1,\dots , n_1\}\) and any observable \(O_{N_1^c}\) supported on \(N_1^c\),
is supported in the set \(\{n_1-\ell +1,\dots , n\}\). Hence, for \(n_1=\ell +1\), the condition holds. In other words, \(n_1\) scales linearly with the depth \(\ell \) of the circuit U. The above simple argument generalizes easily to higher dimensions and general local unitary circuits.
In the case when \(p=1\) we can bound the total \(L^1\) output influence in terms of the total \(L^1\) input influence.
Proposition 6.3
For any \(j\in \{1,\dots , n\}\), let \(N_j\subset \{1,\dots , m\}\) be the minimal set of qubits such that \(\frac{{\text {tr}}_j}{2}(U\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)U^*)=U\frac{{\text {tr}}_{N_j}}{2^{|N_j|}}(O)U^*\) for any \(O\in M_2(\mathbb {C})^{\otimes n}\) and denote \(L:=\max _i\,|\{j:\,i\in N_j\}|\). Then, for any such matrix \(O\in M_2(\mathbb {C})^{\otimes n}\)
Proof
For any \(O\in M_2(\mathbb {C})^{\otimes n}\),
where in the second inequality above we used the definition of \(N_j\) and that the partial trace \({\text {tr}}_j\) is contractive in \(\Vert \cdot \Vert _1\) norm. The third inequality follows from simple triangle inequality and monotonicity of the \(L^1\)-norm under partial traces. \(\square \)
Finally, we find a bound on the variation of the \(L^1\)-influence through a circuit U in terms of the cost of a unitary U. We define the \(L^1\)-circuit sensitivity of a unitary \(U\in M_2(\mathbb {C})^{\otimes n}\) as
Theorem 6.4
The \(L^1\)-circuit sensitivity of a unitary \(U\in M_2(\mathbb {C})^{\otimes n}\) is a lower bound on the circuit cost:
Proof
Our proof follows similar steps to those leading to [BGJ+22, Theorem 12] (see also [Eis21, MAVAV16]): we first show that, for a unitary \(U_t=e^{-itH}\), where H acts non-trivially on a set S of k qubits, and O with \(\Vert O\Vert _1=1\),
where we denoted \(O(t):=U_tOU_t^*\). Back to our original problem, we take a Trotter decomposition of U such that for arbitrary small \(\varepsilon >0\),
where \(V_N\) is defined as follows:
so that
Next, we define \(O_\eta =W_\eta O_{\eta -1}W_\eta ^*\) with \(O_0=O\). We have,
where the inequality follows from (6.1) for \(k=2\) and \(t=\frac{1}{N}\). Summing over \(\eta \), we get
Since the circuit cost is expressed as \({\text {Cost}}(U)=\inf _{(r_j)_j} \lim _{N\rightarrow \infty }\frac{1}{N}\sum _{\eta =1}^N\sum _{j=1}^m\Big |r_j\Big (\frac{\eta }{N}\Big )\Big |\), and since the influence of \(UOU^*\) can be arbirarily well approximated by that of \(V_NOV_N^*\) as \(N\rightarrow \infty \), the result follows. \(\square \)
Remark 6.5
Combined with our quantum Friedgut’s Junta theorem 3.11 the above results show that for any observable O with \({\text {Inf}}^1(O),{\text {Inf}}^2(O)=\mathcal {O}(1)\) and \(\Vert O\Vert _2=\mathcal {O}(1)\), and for any unitary U with \(L=\mathcal {O}(1)\), the output observable \(UOU^*\) can be well approximated by a k-junta with \(k=\mathcal {O}(1)\). Taking again the simple example constructed in Remark 6.2, we recover the simple fact that, for a 1-qubit Pauli matrix evolving according to a circuit of constant depth, the output observable will still be supported on a constant size region. While it would be interesting to find some non-trivial situations where our bounds still hold, we leave this question to future work.
6.2 Learning quantum Boolean functions
In this section, we use our quantum Friedgut’s Junta theorem 3.11 to provide an efficient algorithm for learning quantum Boolean functions. Our argument relies on the following quantum generalization of Goldreich–Levin theorem (see [MO10a, Theorem 7.6]):
Theorem 6.6
(quantum Goldreich–Levin). Given an oracle access to a unitary operator U on n qubits and its adjoint \(U^*\), and given \(\delta ,\gamma >0\), there is a \({\text {poly}}\big (n,\frac{1}{\gamma }\big )\log \big (\frac{1}{\delta }\big )\)-time algorithm which outputs a list \(L=\{s_1,\dots , s_m\}\) such that with probability \(1-\delta \): (1) if \(|\widehat{U}_s|\ge \gamma \) then \(s\in L\); and (2) for all \(s\in L\), \(|\widehat{U}_s|\ge \gamma /2\).
Once the quantum Goldreich–Levin algorithm has been used to output a list of Fourier coefficients, the following lemma, which is also taken from [MO10a], can be used to compute them:
Lemma 6.7
[MO10a, Lemma 7.4] For any quantum Boolean function A, and any \(s\in \{0,1,2,3\}^n\) it is possible to estimate \(\widehat{A}_s\) to within \(\pm \eta \) with probability \(1-\delta \) using \(\mathcal {O}\big (\frac{1}{\eta ^2}\log \big (\frac{1}{\delta }\big )\big )\) queries.
Combining Theorem 6.6, Lemma 6.7 with our Theorem 3.11, we directly arrive at the following result:
Proposition 6.8
(Learning quantum Boolean functions). Let \(A\in M_2(\mathbb {C})^{\otimes n}\) be a quantum Boolean function. Given oracle access to A, with probability \(1-\delta \), we can learn A to precision \(\varepsilon \) in \(L^2\) using \({\text {poly}}(n,4^k,\log \big (\frac{1}{\delta }\big ))\) queries to A, where
Proof
By Theorem 3.11, we have a quantum Boolean function \(B=2^{-|T|}{\text {tr}}_{T}(A)\in M_2(\mathbb {C})^{\otimes n}\) supported on a region \(T^c\) of k qubits such that \(\Vert A-B\Vert _2\le \varepsilon \), with \(k\le k(\varepsilon )\). Next, we use the quantum Goldreich–Levin algorithm of Theorem 6.6 in order to output a list \(L=\{s^{(1)},\dots , s^{(m)}\}\subseteq \{0,1,2,3\}^n\) of corresponding significant Fourier coefficients \(\widehat{A}_{s^{(j)}}\), \(1\le j\le m\). Denoting the operator \(A_L:=\sum _{s\in L} \widehat{A}_{s}\sigma _{s}\) as well as the set of strings \(s\in \{0,1,2,3\}^n\) with \(s_T=0\) as \(L'\), we get
Moreover, by Theorem 6.6, we have that with probability \(1-\delta \), if \(s\notin L\), then \(|\widehat{A}_s|\le \gamma \). Therefore we have that with that same high probability
It remains to evaluate the coefficients \(\widehat{A}_s\) for \(s\in L\). This can be done within precision \(\pm \eta \) with probability \((1-\delta )\) using \(\mathcal {O}\big (\frac{1}{\eta ^2}\log \big (\frac{1}{\delta }\big )\big )\) queries of A according to Lemma 6.7. Moreover, since A is a quantum Boolean function, there are at most \(\frac{4}{\gamma ^2}\) coefficients \(\widehat{A}_s\) such that \(|\widehat{A}_s|\ge \frac{\gamma }{2}\). Therefore, with high probability \(|L|\le \frac{4}{\gamma ^2}\). Choosing \(\gamma =\varepsilon 2^{-k}\) so that \(\Vert A-A_L\Vert _2= \mathcal {O}( \varepsilon )\), we need to evaluate \(4/\gamma ^2=\mathcal {O}(4^k)\) such coefficients. The result follows. \(\square \)
6.3 Learning quantum dynamics
Proposition 6.8 extends the domain of applicability of Proposition 41 in [MO10b] where the authors provided an efficient algorithm to learn the evolution of initially local observables under the dynamics generated by a local Hamiltonian. While the proof of [MO10b, Proposition 41] requires the Lieb–Robinson bounds in order to control the sets of sites of large influence in terms of the support of the initial observable and the light-cone of H, our argument has the advantage of not putting any geometric locality assumption of the quantum Boolean function A.
To further illustrate our result, we consider the following generalization of the setup of [MO10a, Proposition 41]: let \(\Lambda \) be a finite set of size \(|\Lambda |=n\) endowed with a metric \(d(\cdot ,\cdot )\). We suppose that there is a monotone increasing function g on \([0,\infty )\) and constants \(C,D>0\) such that
The constant D typically denotes the spatial dimension in the case of a regular lattice. We consider a quantum spin system on the point set \(\Lambda \) by assigning the Hilbert space \(\mathcal {H}_x\equiv \mathbb {C}^2\) to each site \(x\in \Lambda \). For any subset \(T \subseteq \Lambda \), the configuration space of spin states on T is given by the tensor product \(\mathcal {H}_T=\bigotimes _{x\in T}\mathcal {H}_x\), and the algebra \(\mathcal {A}_T:=B(\mathcal {H}_T)\) of observables on T acts on the Hilbert space \(\mathcal {H}_T\). We consider a Hamiltonian \(H_\Lambda =\sum _{X\subseteq \Lambda } h_X\), where \(h_X\in B(\mathcal {H}_X)\) is a local Hamiltonian, i.e. a self-adjoint operator on \(\mathcal {H}_X\), for each \(X\subset \Lambda \). In what follows, we denote the diameter of a set \(Z\subseteq \Lambda \) by \({\text {diam}}(Z):=\max \{d(x,y)|\,x,y\in Z\}\). We further assume the following requirements [MKN17, Assumption A]:
-
(i)
There is a decreasing function f(R) on \([0,\infty )\), such that
$$\begin{aligned} \max _{x\in \Lambda }\,\sum _{\begin{array}{c} Z\ni x\\ {\text {diam}}(Z)\ge R \end{array}}\Vert h_Z\Vert \le f(R)\,,\quad R\ge 0; \end{aligned}$$ -
(ii)
The following constant is independent of the system size n:
$$\begin{aligned} \mathcal {C}_0:=\max _{x\in \Lambda }\,\sum _{y\in \Lambda }\sum _{Z\ni x,y}\Vert h_Z\Vert <\infty \,. \end{aligned}$$
Strictly speaking, condition (ii) only makes sense when considering a family of Hamiltonians \(H_\Lambda \) defined on an increasing family of sets \(\Lambda \) all included in a countable set \(\Sigma \). We will however favour simplicity over rigour here.
By [MKN17, Theorem 2.1], for any two one-local Pauli operators \(\sigma _{s_i},\sigma _{s_j}\) with \(j\ne i\), and all \(R\ge 1\) we have that, given \(d_{ij}\equiv d(i,j)\)
for any \(t\ge 0\), where v and \(\mathcal {C}_2\) are positive constants independent of \(\Lambda ,t,R,i\) and j.
Proposition 6.9
With the above assumptions, we further assume that there is \(R\ge 1\) such that for all \(i\in \Lambda \), the constants
can be bounded by constants independent of the size n of the system. Then, with probability \(1-\delta \), we can learn the quantum Boolean functions \(e^{itH_\Lambda }\sigma _{s_i}e^{-itH_\Lambda }\) to precision \(\varepsilon \) in \(L^2\) using \({\text {poly}}(n,\exp (\exp (\varepsilon ^{-2}|\log (\varepsilon )|)),\log \big (\frac{1}{\delta }\big ))\) queries to \(e^{-itH_\Lambda }\) and \(e^{itH_\Lambda }\).
Proof
In view of the dependence of k on the influences in Proposition 6.8 on the influences, it is enough to control \({\text {Inf}}^1(e^{itH_\Lambda }\sigma _{s_i}e^{-itH_\Lambda })\) and \({\text {Inf}}^2(e^{itH_\Lambda }\sigma _{s_i}e^{-itH_\Lambda })\) independently of the size of the system. We clearly have for any \(A\in M_2(\mathbb {C})^{\otimes n}\) that \({\text {Inf}}^1(A)\le \sum _{j\in \Lambda } \Vert d_j A\Vert \) and \({\text {Inf}}^2(A)\le \sum _{j\in \Lambda } \Vert d_j A\Vert ^2\). Moreover, by the following well-known expression for the partial trace
where \(\sigma _{s_j}\) are Pauli matrices on site j, we have that
Therefore,
\(\square \)
Remark 6.10
Our result in Proposition 6.8 has the advantage that it does not assume in advance that A is (close to) a k-junta. This comes at the price that the dependence of the query complexity on the approximating parameter \(\varepsilon \) scales doubly exponentially with the latter. For the same reason, our dynamics learning method in Proposition 6.9 allows us to extend the class of Hamiltonians considered in [MO10a] to Hamiltonians satisfying a weaker power-law decay, at the cost of a much worse dependence on \(\varepsilon \). This dependency on \(\varepsilon \) is also not new in classical setting [BT96, OS07].
Remark 6.11
In a recent article [CNY23], the authors provide an algorithm for learning any unitary k-junta U with precision \(\varepsilon \) and high probability which uses \(\mathcal {O}\big (\frac{k}{\varepsilon }+\frac{4^k}{\varepsilon ^2}\big )\) queries to U (see Theorem 29), extending a previous quantum algorithm for learning classical k-juntas reported in [AcS07]. While the dependence on \(\varepsilon \) is much tighter than ours, the two results are incomparable, since we replaced the requirement that U is a k-junta by the weaker condition that it has influences \({\text {Inf}}^1 U, {\text {Inf}}^2 U=\mathcal {O}(1)\).
6.4 Quantum isoperimetric type inequalities
Closely related to the concentration of measure phenomenon and functional inequalities, isoperimetric inequalities provide powerful tools in the analysis of extremal sets and surface measures. Given a metric space (X, d) equipped with a Borel measure \(\mu \), the boundary measure of a Borel set A in X with respect to \(\mu \) is defined as [Led00, Led01, BGL14]
where we recall that \(A_r:=\{x\in X|\,d(x,A)<r\}\) is the (open) r-neighbourhood of A. The isoperimetric profile of \(\mu \) corresponds to the largest function \(I_\mu \) on \([0,\mu (X)]\) such that, for any Borel set \(A\subset X\) with \(\mu (A)<\infty \),
In the case of the canonical Gaussian measure \(\gamma \) on the Borel sets of \(\mathbb {R}^k\) with density \((2\pi )^{-k/2}e^{-|x|^2/2}\) with respect to the Lebesgue measure, with the usual Euclidean metric induced by the norm |x| [Led01, Theorem 2.5]:
where \(\Phi (t)=(2\pi )^{-1/2}\int _{-\infty }^te^{-x^2/2}dx\) is the distribution function of the canonical Gaussian measure in dimension one. Moreover, equality holds in (6.3) if and only if A is a half-space in \(\mathbb {R}^k\). Moreover, as \(a\rightarrow 0\), we have
Similar isoperimetric inequalities were also derived for hypercontractive, log-concave measures [BL96] (see also [Mil09, Mil10]). When \(k=1\), the boundary measure of a Borel set A can be expressed in terms of the geometric influence \(\Vert f_A'\Vert _1\) of a smooth approximation \(f_A\) of the characteristic function of A. In other words, \(\mu ^+(A)\approx {\text {Inf}}^1(f_A)\).
This observation allows us to generalize the notion of isoperimetric inequality in the context of smooth Riemannian manifolds to discrete settings. In the context of the classical Boolean hypercube \(\Omega _n\), the edge isoperimetric inequality states that for any m, among the m-element subsets of the discrete cube, the minimal edge boundary is attained by the set of m largest elements in the lexicographic order [Ber67, Har64, Har76, Lin64]. In particular, for any set \(A\subset \Omega _n\) of vertices
where we recall that \(\mu _n\) is the uniform probability measure on \(\Omega _n\), and \(f_A\) corresponds to the characteristic function of set A. Here, \(\partial A\) simply corresponds to the set of vertices in the complement of A that are adjacent to A. This inequality is moreover tight when \(|A|=2^d\) for some \(d\in \mathbb {N}\) (take for instance A to be the vertices of a d-dimensional subcube). We notice the similarity with (6.4) up to the change of power in the logarithmic factor.
Similarly, consider a finite graph \(G=(V,E)\) with set of vertices V and set of edges E with bounded degree d (i.e. each vertex has at most a fixed number d adjacent edges). The graph G is said to satisfy the linear isoperimetric inequality if
for some \(h>0\) and all subsets A of V such that \({\text {Card}}(A)\le \frac{1}{2}{\text {Card}}(V)\). The so-called Cheeger constant h of the graph can be related to the spectral gap \(\lambda \) of the graph Laplacian via Cheeger’s and Buser’s inequalities. Here, \({\text {Card}}(\partial A)\) plays the role of \(\mu ^+(A)\) and can be once again related to a notion of influence. The linear isoperimetric inequality can be understood as a weaker form of isoperimetry than the one derived for log-concave, hypercontractive measures, and hence only implies exponential concentration for the normalized counting measure on G. Moreover, one should not expect to recover the stronger Gaussian type isoperimetry in this setting, since the hypercontractivity constant for graph Laplacians is known to scale with the size of the graph [BT06].
Linear isoperimetric inequalities were also considered in the more general context of Markov chains over finite sample spaces. For instance, in the case of a continuous time Markov chain with transition rates Q(x, y) and unique reversible probability measure \(\pi \) with non-negative entropic Ricci curvature, [EF18] established that for any set A,
where \(\lambda \) is the spectral gap of Q, \(Q_*=\min \{Q(x,y):Q(x,y)>0\}\) and \(\pi ^+(\partial A)=\sum _{x\in A,y\in A^c}Q(x,y)\pi (x)\) denotes the perimeter measure of A. We also note that extensions of such inequalities in the quantum setting were obtained in [TKR+10].
Interestingly, such inequalities are well-known to be equivalent to an \(L^1\)-Poincaré inequality. It is then natural to ask whether one could recover the type of isoperimetry found for the Gaussian measure and uniform measure on the hypercube in other discrete and quantum settings by further assuming hypercontractivity of the (quantum) Markov chain. This is indeed the case, as we prove by a direct appeal to Talagrand’s inequality:
Theorem 6.12
(Qubit isoperimetric type inequality). For any projection \(P_A\) onto a subspace \(A\subset (\mathbb {C}^2)^{\otimes n}\),
for some universal constant C, where \(\tau (A){:=}2^{-n}{\text {tr}}(P_A)\).
Proof
As mentioned in [CEL12], this is a simple corollary of Talagrand’s inequality Theorem 3.2 after assuming that
for every \(j\in \{1,\dots ,n\}\), since otherwise the result directly holds. \(\square \)
Remark 6.13
Similar to the quantum KKL conjecture of Montanaro and Osborne, it is reasonable to conjecture the following \(L^2\) variant of (6.6)
We end this section by remarking the following \(L^1\)-Poincaré inequality that is stronger than Theorem 3.1. See [ILvHV18] for the discussions on the classical Boolean cubes.
Theorem 6.14
For all \(A\in M_2(\mathbb {C})^{\otimes n}\), one has
Remark 6.15
Using the inequality
for \(p=1/2\), we get
So (6.8) is stronger than (3.1) up to the multiplicative constant \(\pi \).
Proof of Theorem 6.14
Recall that we have proved (3.7)
So we have the second inequality of (6.8) by operator monotonicity of \(x\mapsto x^{1/2}\). Now let us prove the first inequality of (6.8). We denote by \((A,B)\mapsto \langle A,B\rangle :=2^{-n}{\text {tr}}(A^* B)\) the normalized Hilbert–Schmidt inner product. By semigroup interpolation (2.4) and duality,
Now recall that we have the following inequality:
To prove it, form the operators \(X:=\sum _{j=1}^{n}X_j\otimes \vert j\rangle \langle 1\vert \) and \(Y:=\sum _{j=1}^{n}Y_j\otimes \vert j\rangle \langle 1\vert \). Then Hölder’s inequality gives
which, together with
yields (6.9). Now apply (6.9) to \((X_j,Y_j)=(d_j A, d_j P_t B)\) to get
To conclude, we use Lemma 3.4 together with
\(\square \)
7 Discussions
We end this paper with the following discussions.
7.1 Equivalence between log-Sobolev and Talagrand’s inequalities
In Theorem 4.3, we derived a general noncommutative extension of Talagrand’s inequality. Our proof requires the joint use of the hypercontractivity inequality (H4) with the intertwining relation (H5). It is hence legitimate to ask whether, in return, such Talagrand-type inequalities imply hypercontractivity. This question was answered in the positive in the classical, continuous setting in [BH99, Proposition 1], and later on for discrete spaces in [Vö16]. It would be interesting to consider the similar problem in the quantum setting, which we leave to future work.
7.2 Learning low-degree quantum Boolean functions
An alternative notion of complexity than the support condition for k-juntas is that of the degree: a bounded function \(f:\Omega _n\rightarrow [-1,1]\) is said to have degree at most \(d\in \{1,\dots , n\}\) if for any string \(s\in \{-1,1\}^n\) with Hamming weight \(|s|>d\), the Fourier coefficient \(\widehat{f}(s)=0\). In particular, Boolean functions of degree at most d are \(d2^{d-1}\) juntas [NS94]. As a main tool for the result, the authors derived a simple lower bound on the degree of the function in terms of its total influence. This observation can be used in conjunction with the Goldreich–Levin algorithm in order to devise a learning algorithm which makes \({\text {poly}}(n)\) random queries to f. More efficient algorithms were proposed in the past decades [LMN93, IRR+21, Man94]. However all these algorithm have a query complexity scaling polynomially with n. In the recent article [EI22], the authors show that any low degree Boolean function can be approximated to \(\varepsilon \) precision in \(L^2\) with probability \(1-\delta \) from \(\mathcal {O}\big ({\text {poly}}\big (\frac{1}{\varepsilon },d\big )\log \big (\frac{n}{\delta }\big )\big )\) random queries to the function. While this result is incomparable to the ones we report in Sect. 6.2, it would be interesting to find a quantum extension of it. The result of [EI22] uses the so-called Bohnenblust–Hille inequalities. The study of Bohnenblust–Hille inequalities has a long history and these inequalities have found many applications in various problems. A Boolean analogue was known [DMoP19] and has led to interesting applications to learning theory [EI22]. Here we formulate and conjecture a quantum analogue of Bohnenblust–Hille inequality and explain why it is useful to learning problems in the quantum setting.
Conjecture 7.1
Fix \(d\ge 1\). Then there exists \(C_d>0\) depending only on d such that for all \(n\ge 1\) and all \(A\in M_2(\mathbb {C})^{\otimes n}\) of degree at most d i.e.
we have
where the degree |s| of a string s is defined as the number of components that are different from 0.
If Conjecture 7.1 holds, we expect that it can be used in a similar fashion as in [EI22] in order to devise a highly efficient algorithm for learning quantum Boolean functions of small degree in terms of query complexity.
In fact, this conjecture has been resolved after an earlier version of this paper was post out. It was first resolved by Huang, Chen and Preskill [HCP23]. Later on, another proof was found by Volberg and Zhang [VZ23].
Data availability statement
No data was generated as part of this work.
References
Atıcı, A., Servedio, R.A.: Quantum algorithms for learning and testing juntas. Quantum Inf. Process. 6(5), 323–348 (2007)
Alon, N., Dinur, I., Friedgut, E., Sudakov, B.: Graph products, Fourier analysis and spectral techniques. Geom. Funct. Anal. 14(5), 913–940 (2004)
Auffinger, A., Damron, M., Hanson, J.: 50 Years of First-Passage Percolation. University Lecture Series, vol. 68. American Mathematical Society, Providence (2017)
Austin, T.: On the failure of concentration for the \(\ell ^\infty \)-ball. Israel J. Math. 211(1), 221–238 (2016)
Bardet, I.: Estimating the decoherence time using non-commutative functional inequalities (2017). arXiv preprint arXiv:1710.01039
Beigi, S., Datta, N., Rouzé, C.: Quantum reverse hypercontractivity: its tensorization and application to strong converses. Commun. Math. Phys. 376(2), 753–794 (2020)
Beigi, S.: Improved quantum hypercontractivity inequality for the qubit depolarizing channel. J. Math. Phys. 62(12), 14 (2021)
Bernstein, A.J.: Maximally connected arrays on the \(n\)-cube. SIAM J. Appl. Math. 15, 1485–1489 (1967)
Bu, K., Garcia, R.J., Jaffe, A., Koh, D.E., Li, L.: Complexity of quantum circuits via sensitivity, magic, and coherence (2022). arXiv preprint arXiv:2204.12051
Bakry, D., Gentil, I., Ledoux, M.: Analysis and Geometry of Markov Diffusion Operators. Grundlehren der mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 348. Springer, Cham (2014)
Bobkov, S.G., Houdré, C.: A converse Gaussian Poincaré-type inequality for convex functions. Statist. Probab. Lett. 44(3), 281–290 (1999)
Benjamini, I., Kalai, G., Schramm, O.: Noise sensitivity of Boolean functions and applications to percolation. Inst. Hautes Études Sci. Publ. Math. 90, 5–43 (1999)
Benjamini, I., Kalai, G., Schramm, O.: First passage percolation has sublinear distance variance. Ann. Probab. 31(4), 1970–1978 (2003)
Bakry, D., Ledoux, M.: Lévy-Gromov’s isoperimetric inequality for an infinite-dimensional diffusion generator. Invent. Math. 123(2), 259–281 (1996)
Brown, N.P., Ozawa, N.: \(C^*\)-Algebras and Finite-Dimensional Approximations. Graduate Studies in Mathematics, vol. 68. American Mathematical Society, Providence (2008)
Ben-Or, M., Linial, N.: Collective coin flipping, robust voting schemes and minima of Banzhaf values. In: 26th Annual Symposium on Foundations of Computer Science (sfcs 1985), pp. 408–416. IEEE (1985)
Boppana, R.B.: The average sensitivity of bounded-depth circuits. Inform. Process. Lett. 63(5), 257–261 (1997)
Bourgain, J.: On the distributions of the Fourier spectrum of Boolean functions. Israel J. Math. 131, 269–276 (2002)
Bouyrie, R.: An unified approach to the junta theorem for discrete and continuous models (2017). arXiv preprint arXiv:1702.00753
Bshouty, N.H., Tamon, C.: On the Fourier spectrum of monotone functions. J. ACM 43(4), 747–770 (1996)
Bobkov, S.G., Tetali, P.: Modified logarithmic Sobolev inequalities in discrete settings. J. Theoret. Probab. 19(2), 289–336 (2006)
Cordero-Erausquin, D., Ledoux, M.: Hypercontractive measures, Talagrand’s inequality, and influences. In: Geometric Aspects of Functional Analysis, Volume 2050 of Lecture Notes in Mathematics, pp. 169–189. Springer, Heidelberg (2012)
Cipriani, F., Fagnola, F., Lindsay, J.M.: Spectral analysis and Feller property for quantum Ornstein–Uhlenbeck semigroups. Commun. Math. Phys. 210(1), 85–105 (2000)
Chatterjee, S.: Superconcentration and Related Topics. Springer Monographs in Mathematics, Springer, Cham (2014)
Carlen, E.A., Maas, J.: Gradient flow and entropy inequalities for quantum Markov semigroups with detailed balance. J. Funct. Anal. 273(5), 1810–1869 (2017)
Chen, T., Nadimpalli, S., Yuen, H.: Testing and learning quantum juntas nearly optimally. In: Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 1163–1185. SIAM, Philadelphia (2023)
Carbone, R., Sasso, E.: Hypercontractivity for a quantum Ornstein–Uhlenbeck semigroup. Probab. Theory Relat. Fields 140(3–4), 505–522 (2008)
Dinur, I., Friedgut, E., Kindler, G., O’Donnell, R.: On the Fourier tails of bounded functions over the discrete cube (extended abstract). In: STOC’06: Proceedings of the 38th Annual ACM Symposium on Theory of Computing, pp. 437–446. ACM, New York (2006)
Defant, A., Mastył o, M., Pérez, A.: On the Fourier spectrum of functions on Boolean cubes. Math. Ann. 374(1–2), 653–680 (2019)
Dowling, M.R., Nielsen, M.A.: The geometry of quantum computation. Quantum Inf. Comput. 8(10), 861–899 (2008)
De Giacomo, P., Milad, M., Cambyse, R., Daniel Stilck, F.: Limitations of variational quantum algorithms: a quantum optimal transport approach. PRX Quantum 4, 010309 (2023)
Datta, N., Rouzé, C.: Relating relative entropy, optimal transport and Fisher information: a quantum HWI inequality. Ann. Henri Poincaré 21(7), 2115–2150 (2020)
Dinur, I., Safra, S.: On the hardness of approximating minimum vertex cover. Ann. Math. (2) 162(1), 439–485 (2005)
de Wolf, R.: A brief introduction to Fourier analysis on the Boolean cube. Theory Comput. 1–20 (2008)
Erbar, M., Fathi, M.: Poincaré, modified logarithmic Sobolev and isoperimetric inequalities for Markov chains with non-negative Ricci curvature. J. Funct. Anal. 274(11), 3056–3089 (2018)
Eskenazis, A., Ivanisvili, P.: Learning low-degree functions from a logarithmic number of random queries. In: STOC ’22—Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, pp. 203–207. ACM, New York (2022)
Eisert, J.: Entangling power and quantum circuit complexity. Phys. Rev. Lett. 127(2), 5 (2021)
Friedgut, E., Kalai, G.: Every monotone graph property has a sharp threshold. Proc. Am. Math. Soc. 124(10), 2993–3002 (1996)
Friedgut, E., Kalai, G., Naor, A.: Boolean functions whose Fourier transform is concentrated on the first two levels. Adv. Appl. Math. 29(3), 427–437 (2002)
Friedgut, E.: Boolean functions with low average sensitivity depend on few coordinates. Combinatorica 18(1), 27–35 (1998)
Fagnola, F., Rebolledo, R., Saavedra, C.: Quantum flows associated to a class of laser master equations. J. Math. Phys. 1(35), 1–12 (1994)
Gao, L., Rouzé, C.: Ricci curvature of quantum channels on non-commutative transportation metric spaces (2021). arXiv preprint arXiv:2108.10609
Garban, C., Steif, J.E.: Noise Sensitivity of Boolean Functions and Percolation, vol. 5. Institute of Mathematical Statistics Textbooks, Cambridge University Press, New York (2015)
Harper, L.H.: Optimal assignments of numbers to vertices. J. Soc. Ind. Appl. Math. 12, 131–135 (1964)
Hart, S.: A note on the edges of the \(n\)-cube. Discrete Math. 14(2), 157–163 (1976)
Huang, H.-Y., Chen, S., Preskill, J.: Learning to predict arbitrary quantum processes. PRX Quantum 4(4), 040337 (2023)
Holevo, A.: Probabilistic and Statistical Aspects of Quantum Theory. Quaderni/Monographs, vol. 1, 2nd edn. Edizioni della Normale, Pisa (2011)
Håstad, J.: Some optimal inapproximability results. J. ACM 48(4), 798–859 (2001)
Ivanisvili, P., Li, D., van Handel, R., Volberg, A.: Improving constant in end-point Poincaré inequality on Hamming cube (2018). arXiv preprint arXiv:1811.05584
Iyer, S., Rao, A., Reis, V., Rothvoss, T., Yehudayoff, A.: Tight bounds on the Fourier growth of bounded functions on the hypercube (2021). arXiv preprint arXiv:2107.06309
Junge, M., Palazuelos, C., Parcet, J., Perrin, M., Ricard, É.: Hypercontractivity for free products. Ann. Sci. Éc. Norm. Supér. (4) 48(4), 861–889 (2015)
Junge, M., Palazuelos, C., Parcet, J., Perrin, M.: Hypercontractivity in group von Neumann algebras. Mem. Am. Math. Soc. 249(1183), xii+83 (2017)
Junge, M., Zeng, Q.: Noncommutative martingale deviation and Poincaré type inequalities with applications. Probab. Theory Relat. Fields 161(3–4), 449–507 (2015)
Karpovsky, M.G.: Finite Orthogonal Series in the Design of Digital Devices. Analysis, Synthesis and Optimization, Wiley, New York (1976)
Kahn, J., Kalai, G., Linial, N.: The influence of variables on Boolean functions. In: [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science, pp. 68–80 (1988)
Keyl, M., Kiukas, J., Werner, R.F.: Schwartz operators. Rev. Math. Phys. 28(03), 1630001 (2016)
Keller, N., Mossel, E., Sen, A.: Geometric influences. Ann. Probab. 40(3), 1135–1166 (2012)
Khot, S., Naor, A.: Nonembeddability theorems via Fourier analysis. Math. Ann. 334(4), 821–852 (2006)
Kosaki, H.: Applications of the complex interpolation method to a von Neumann algebra: noncommutative \(L^{p}\)-spaces. J. Funct. Anal. 56(1), 29–78 (1984)
Kindler, G., Safra, S.: Noise-resistant Boolean functions are juntas. Preprint 5(7), 19 (2002)
Kastoryano, M.J., Temme, K.: Quantum logarithmic Sobolev inequalities and rapid mixing. J. Math. Phys. 54(5), 052202 (2013)
Ledoux, M.: The geometry of Markov diffusion generators. Ann. Fac. Sci. Toulouse Math. (6) 9(2), 305–366 (2000)
Ledoux, M.: The Concentration of Measure Phenomenon. Mathematical Surveys and Monographs, vol. 89. American Mathematical Society, Providence (2001)
Ledoux, M.: Four Talagrand inequalities under the same umbrella (2019). arXiv preprint arXiv:1909.00363
Lindsey, J.H., II.: Assignment of numbers to vertices. Am. Math. Mon. 71, 508–516 (1964)
Linial, N., Mansour, Y., Nisan, N.: Constant depth circuits, Fourier transform, and learnability. J. Assoc. Comput. Mach. 40(3), 607–620 (1993)
Mansour, Y.: Learning Boolean functions via the Fourier transform. In: Theoretical Advances in Neural Computation and Learning, pp. 391–424. Springer (1994)
Mariën, M., Audenaert, K.M.R., Van Acoleyen, K., Verstraete, F.: Entanglement rates and the stability of the area law for the entanglement entropy. Commun. Math. Phys. 346(1), 35–73 (2016)
Milman, E.: On the role of convexity in isoperimetry, spectral gap and concentration. Invent. Math. 177(1), 1–43 (2009)
Milman, E.: Isoperimetric and concentration inequalities: equivalence under curvature lower bound. Duke Math. J. 154(2), 207–239 (2010)
Matsuta, T., Koma, T., Nakamura, S.: Improving the Lieb–Robinson bound for long-range interactions. Ann. Henri Poincaré 18(2), 519–528 (2017)
Montanaro, A., Osborne, T.J.: Quantum Boolean functions. Chic. J. Theor. Comput. Sci. 1, 1–45 (2010)
Montanaro, A., Osborne, T.J.: Quantum Boolean functions (2010). arXiv preprint arXiv:0810.2435
Mossel, E.: A quantitative Arrow theorem. Probab. Theory Relat. Fields 154(1–2), 49–88 (2012)
Nielsen, M.A., Dowling, M.R., Mile, G., Doherty, A.C.: Optimal control, geometry, and quantum computing. Phys. Rev. A (3) 73(6), 062323, 7 (2006)
Nielsen, M.A., Dowling, M.R., Mile, G., Doherty, A.C.: Quantum computation as geometry. Science 311(5764), 1133–1135 (2006)
Nielsen, M.A.: A geometric approach to quantum circuit lower bounds. Quantum Inf. Comput. 6(3), 213–262 (2006)
Nisan, N., Szegedy, M.: On the degree of Boolean functions as real polynomials. Comput. Complex. 4(4), 301–313 (1994)
O’Donnell, R.: Analysis of Boolean Functions. Cambridge University Press, New York (2014)
O’Donnell, R., Servedio, R.A.: Learning monotone decision trees in polynomial time. SIAM J. Comput. 37(3), 827–844 (2007)
O’Donnell, R., Wimmer, K.: KKL, Kruskal-Katona, and monotone nets. SIAM J. Comput. 42(6), 2375–2399 (2013)
O’Donnell, R., Wimmer, K.: Sharpness of KKL on Schreier graphs. Electron. Commun. Probab. 18, 12 (2013)
Olkiewicz, R., Zegarlinski, B.: Hypercontractivity in noncommutative \(L_p\) spaces. J. Funct. Anal. 161(1), 246–285 (1999)
Sosoe, P.: Fluctuations in first-passage percolation. In: Random Growth Models, Volume 75 of Proceedings of Symposium Applied Mathematics, pp. 69–93. Amer. Math. Soc., Providence (2018)
Takesaki, M.: Theory of Operator Algebras. I, volume 124 of Encyclopaedia of Mathematical Sciences. Springer, Berlin (2002) (Reprint of the first (1979) edition, Operator Algebras and Non-commutative Geometry, 5)
Takesaki, M.: Theory of Operator Algebras. II, Volume 125 of Encyclopaedia of Mathematical Sciences. Operator Algebras and Non-commutative Geometry, vol. 6. Springer, Berlin (2003)
Talagrand, M.: On Russo’s approximate zero-one law. Ann. Probab. 22(3), 1576–1587 (1994)
Tanguy, K.: Talagrand inequality at second order and application to Boolean analysis. J. Theoret. Probab. 33(2), 692–714 (2020)
Temme, K., Kastoryano, M.J., Ruskai, M.B., Wolf, M.M., Verstraete, F.: The \(\chi ^2\)-divergence and mixing times of quantum Markov processes. J. Math. Phys. 51(12), 122201, 19 (2010)
Temme, K., Pastawski, F., Kastoryano, M.J.: Hypercontractivity of quasi-free quantum semigroups. J. Phys. A 47(40), 405303, 27 (2014)
Volberg, A., Zhang, H.: Noncommutative Bohnenblust–Hille inequalities. Math. Ann. 1–20 (2023)
Völlering, F.: Talagrand’s inequality for interacting particle systems satisfying a log-Sobolev inequality. Ann. Inst. Henri Poincaré Probab. Stat. 52(1), 173–195 (2016)
Wolf, M.M.: Quantum Channels and Operations: Guided Tour. Lecture Notes, vol. 5 (2012). http://www-m5.ma.tum.de/foswiki/pubM
Wirth, M., Zhang, H.: Complete gradient estimates of quantum Markov semigroups. Commun. Math. Phys. 387(2), 761–791 (2021)
Acknowledgements
H.Z. is supported by the Lise Meitner fellowship, Austrian Science Fund (FWF) M3337. H.Z. would like to thank the American Institute of Mathematics and the AIM workshop Analysis on the hypercube with applications to quantum computing. He is also grateful to the organizers and other participants for creating an active atmosphere. The research of C.R. has been supported by ANR project QTraj (ANR-20-CE40-0024-01) of the French National Research Agency (ANR). C.R. acknowledges the support of the Munich Center for Quantum Sciences and Technology, as well as the Humboldt Foundation. C.R. would like to thank Amanda Young for fruitful discussion on the applications of Friedgut’s Junta theorem to learning quantum dynamics. The research of M.W. was funded by the Austrian Science Fund (FWF) under the Esprit Programme [ESP 156]. For the purpose of Open Access, the authors have applied a CC BY public copyright licence to any Author Accepted Manuscript (AAM) version arising from this submission. The authors want to thank Francisco Escudero Gutierrez and Hsin-Yuan Huang for helpful comments on an earlier version of the paper. They are grateful to the referees for the careful reading and helpful comments.
Funding
Open access funding provided by the Carolinas Consortium.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by E. Smith.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Proofs from Section 4
Proofs from Section 4
Proof of Lemma 4.1
Let \(F(s)=P_s(P_{t-s}(x)^*P_{t-s}(x))\). Since \(x\in {\mathcal {A}}\), the map F is \(\sigma \)-weakly differentiable on [0, t] and
Hence if \(\omega \in {\mathcal {M}}_*\) is positive, then
This implies the first inequality. The second inequality follows from the fact that \(P_t\) is unital and positive.
The last part is easy to see for \(K\ge 0\). If \(K< 0\) and \(0\le t\le -\frac{1}{2K}\), then
\(\square \)
Proof of Theorem 4.3
As a consequence of (H3),
Since \((P_t^{(2)})_{t\ge 0}\) is a symmetric contraction semigroup on \(L^2({\mathcal {M}},\varphi )\), the map \(t\mapsto \Vert i_2(P_t(x))\Vert ^2\) is differentiable and
where the second to last step used (H2-1).
By (H5) and (H4),
with \(p(t)=1+e^{-2\alpha t}\).
By interpolation,
To the first factor we apply (H5) again and the contractivity of \(P_t^{(1)}\) to get
To bound the second factor in (A.1) in the range \(\left[ 0,\frac{1}{2K_-}\right] \), we use Lemma 4.1 and (H2-2) to obtain
If we plug (A.2) and (A.3) into (A.1), we obtain
for \(0\le t\le 1/2K_-\). Therefore,
provided \(T\le 1 /K_-\). With \(T=\min \{2,1/K_-,1/\alpha \}\), the result follows from Lemma 3.3. \(\square \)
Proof of Theorem 4.5
By Theorem 4.3 there exists a constant \(C>0\) with dependence as indicated in the statement of this theorem such that
for all \(x\in {\mathcal {A}}\) with \(\Vert i_2(x)\Vert =1\), \(\Vert x\Vert \le 1\) and \(\varphi (x)=0\).
Now the conclusion follows from Lemma 3.8. \(\square \)
Proof of Theorem 4.9
The proof follows the same lines as the one of Theorem 3.11. Let us first consider the case \({\text {Inf}}^2(x)\ge 1\). Let \({\mathcal {I}}=\{j\in {\mathcal {J}}\mid {\text {Inf}}^1_j(x)\le \eta \}\) for some number \(\eta >0\). By the triangle inequality and the contractivity of \(E_{{\mathcal {I}}}\), we have
It follows from Lemmas 4.7 and 4.8 that
with \(q(t)=\frac{1-e^{-2\alpha t}}{1+e^{-2\alpha t}}\). For
we obtain \(\Vert i_2(x)-E_{{\mathcal {I}}}(i_2(x))\Vert \le \varepsilon \).
Moreover, by definition of \({\mathcal {I}}\), we have \({\text {Inf}}^1_j(x)>\eta \) for every \(j\in {\mathcal {J}}\setminus {\mathcal {I}}\). Thus
The elementary inequality \(r\le e^{r}-1\) for \(r\ge 0\) together with \({\text {Inf}}^2(x)\ge 1\) and \(\varepsilon \le 2/\nu \) implies
Let \(C=\frac{16}{\alpha }\left( \exp \left( \frac{\alpha }{2\nu ^2}\right) +1\right) \). Hence
Furthermore,
Altogether,
The bound in the case \({\text {Inf}}^2(x)<1\) follows by rescaling. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Rouzé, C., Wirth, M. & Zhang, H. Quantum Talagrand, KKL and Friedgut’s Theorems and the Learnability of Quantum Boolean Functions. Commun. Math. Phys. 405, 95 (2024). https://doi.org/10.1007/s00220-024-04981-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00220-024-04981-0