Abstract
Operator-valued Schatten spaces were introduced by G. Pisier as a noncommutative counterpart of vector-valued \(\ell _p\)-spaces. This family of operator spaces forms an interpolation scale which makes it a powerful and convenient tool in a variety of applications. In particular, as the norms coming from this family naturally appear in the definition of certain entropic quantities in quantum information theory (QIT), one may apply Pisier’s theory to establish some features of those quantities. Nevertheless, it could be quite challenging to follow the proofs of the main results of this theory from the existing literature. In this article, we attempt to fill this gap by presenting the underlying concepts and ideas of Pisier’s theory in a self-contained way which we hope to be more accessible, especially for the QIT community at large. Furthermore, we describe some applications of this theory in QIT. In particular, we prove a new uniform continuity bound for the quantum conditional Rényi entropy.
Similar content being viewed by others
Data availability
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
Notes
These are the discrete case of non-commutative vector valued \(L_p\)-spaces of [38].
This operator space structure is essential for the theory to have all the desired properties, such as duality.
The precise definition of \({\mathcal {S}}_p[{\mathcal {X}}]\) is given in Sect. 4.
This approach was also raised by Marco Tomamichel and Alexander McKinlay in a discussion with the first author.
\(\mathop {\mathrm {ess\,sup}}\limits |h|:=\inf \{a\in {\mathbb {R}}:|h(z)|\le a \ \text {for almost all} \ z\in \Omega \}\).
As a convention we set \(\frac{1}{\infty }=0\).
Equivalently, AXB can be understood as \((A\otimes I_{{\mathcal {H}}})X (B\otimes I_{{\mathcal {H}}})\) where \(I_{{\mathcal {H}}}\) is the identity operator acting on \({\mathcal {H}}\).
In view of Ruan’s theorem, these spaces are sometimes termed as abstract operator spaces as opposed to concrete operator spaces (see Definition 3.1).
Note that the dual of \({\mathcal {S}}_1({\mathcal {H}})\) is \(B({\mathcal {H}})\), which does not coincide with \({\mathcal {S}}_\infty ({\mathcal {H}})\) when \({\mathcal {H}}\) is not finite-dimensional.
For a generalization to infinite dimensions the reader is referred to [37].
Here is a construction for \(\Phi \): let \(\rho _{XYZ}\) be a purification of \(\rho _{XY}\), which is a purification of \(\rho _{X}\) as well. Then, by the equivalence of purifications [36, Exercise 2.81] we have \(\dim {\mathcal {H}}_{X'}\le \dim {\mathcal {H}}_{Y}\otimes {\mathcal {H}}_{Z}\) and there is an isometry \(V_{X'\rightarrow YZ}\) such that \((I\otimes V) \sigma _{XX'} (I\otimes V^\dagger ) = \rho _{XYZ}\). Now, define \(\Phi _{X'\rightarrow Y}\) by \(\Phi (\tau _{X'}) = \textrm{tr}_{Z} (V \tau _{X'} V^\dagger )\). From the definition we clearly have \({\mathcal {I}}\otimes \Phi (\sigma _{XX'}) = \rho _{XY}\).
The radial limit of F at \(e^{it}\in \partial {\mathbb {D}}\) equals \(\lim _{r\rightarrow 1^-} F(re^{it})\).
\(\Vert F(e^{it})\Vert \) is the norm of vector \(F(e^{it})\in {\mathcal {H}}\).
The density \(\frac{1}{\cosh (\pi s)}\) on the boundary of the strip appears here by carrying the uniform measure on the unit circle via the conformal map as explained above.
References
Alicki, R., Fannes, M.: Continuity of quantum conditional information. J. Phys. A Math. Gen. 37, L55 (2004). https://doi.org/10.1088/0305-4470/37/5/L01
Bardet, I., Junge, M., LaRacuente, N., Rouzé, C., França, D.S.: Group transference techniques for the estimation of the decoherence times and capacities of quantum Markov semigroups. IEEE Trans. Inf. Theory 67(5), 2878–2909 (2021). https://doi.org/10.1109/TIT.2021.3065452
Bardet, I., Rouzé, C.: Hypercontractivity and logarithmic Sobolev inequality for non-primitive quantum Markov semigroups and estimation of decoherence rates. Ann. Henri Poincaré 23, 3839–3903 (2022). https://doi.org/10.1007/s00023-022-01196-8
Beigi, S.: Sandwiched Rényi divergence satisfies data processing inequality. J. Math. Phys. 54(12), 122202 (2013). https://doi.org/10.1063/1.4838855
Beigi, S., Datta, N., Rouzé, C.: Quantum reverse hypercontractivity: its tensorization and application to strong converses. Commun. Math. Phys. 376(2), 753–794 (2020). https://doi.org/10.1007/s00220-020-03750-z
Beigi, S., King, C.: Hypercontractivity and the logarithmic Sobolev inequality for the completely bounded norm. J. Math. Phys. 57(1), 015206 (2016). https://doi.org/10.1063/1.4934729
Bergh, J., Löfström, J.: Interpolation Spaces: An Introduction. Springer, Berlin (1976). https://doi.org/10.1007/978-3-642-66451-9
Bhatia, R.: Positive Definite Matrices. Princeton University Press, Princeton (2007). https://doi.org/10.1515/9781400827787
Calderón, A.: Intermediate spaces and interpolation, the complex method. Stud. Math. 24, 113–190 (1964). https://doi.org/10.4064/sm-24-2-113-190
Delgosha, P., Beigi, S.: Impossibility of local state transformation via hypercontractivity. Commun. Math. Phys. 332, 449–476 (2014). https://doi.org/10.1007/s00220-014-2105-y
Devetak, I., Junge, M., King, C., Ruskai, M.B.: Multiplicativity of completely bounded \(p\)-norms implies a new additivity result. Commun. Math. Phys. 266, 37–63 (2006). https://doi.org/10.1007/s00220-006-0034-0
Devinatz, A.: The factorization of operator valued functions. Ann. Math. 73(2), 458–495 (1961). https://doi.org/10.2307/1970313
Dupuis, F.: Chain rules for quantum Rényi entropies. J. Math. Phys. 56(2), 022203 (2015). https://doi.org/10.1063/1.4907981
Dupuis, F., Berta, M., Wullschleger, J., Renner, R.: One-shot decoupling. Commun. Math. Phys. 328(1), 251–284 (2014). https://doi.org/10.1007/s00220-014-1990-4
Effros, E.G., Kishimoto, A.: Module maps and Hochschild–Johnson cohomology. Indiana Univ. Math. J. 36(2), 257–276 (1987). https://doi.org/10.1512/iumj.1987.36.36015
Frank, R.L., Lieb, E.H.: Monotonicity of a relative Rényi entropy. J. Math. Phys. 54(12), 122201 (2013). https://doi.org/10.1063/1.4838835
Gao, L., Junge, M., LaRacuente, N.: Capacity estimates via comparison with TRO channels. Commun. Math. Phys. 364(1), 83–121 (2018). https://doi.org/10.1007/s00220-018-3249-y
Helson, H.: Lectures on Invariant Subspaces. Academic Press, Orlando (1964). https://doi.org/10.1016/C2013-0-12454-3
Jabbour, M.G., Datta, N.: A tight uniform continuity bound for the Arimoto–Rényi conditional entropy and its extension to classical-quantum states. IEEE Trans. Inf. Theory 68(4), 2169–2181 (2022). https://doi.org/10.1109/TIT.2022.3142812
Junge, M.: Factorization Theory for Spaces of Operators. Habilitation Thesis, Kiel University (1996)
Junge, M., Palazuelos, C.: Channel capacities via \(p\)-summing norms. Adv. Math. 272, 350–398 (2015). https://doi.org/10.1016/j.aim.2014.12.005
Junge, M., Parcet, J.: Mixed-norm Inequalities and Operator Space \(L_p\) Embedding Theory, Memoirs of the American Mathematical Society 203 (2010). https://doi.org/10.1090/S0065-9266-09-00570-5
Kamath, S., Anantharam, V.: On non-interactive simulation of joint distributions. IEEE Trans. Inf. Theory 62(6), 3419–3435 (2016). https://doi.org/10.1109/TIT.2016.2553672
Kouba, O.: On the interpolation of injective or projective tensor products of Banach spaces. J. Funct. Anal. 96(1), 38–61 (1991). https://doi.org/10.1016/0022-1236(91)90072-D
Leditzky, F., Wilde, M.M., Datta, N.: Strong converse theorems using Rényi entropies. J. Math. Phys. 57(8), 082202 (2016). https://doi.org/10.1063/1.4960099
Liu, J., Courtade, T.A., Cuff, P., Verdú, S.: Smoothing Brascamp–Lieb inequalities and strong converses of coding theorems. IEEE Trans. Inf. Theory 66(2), 704–721 (2019). https://doi.org/10.1109/TIT.2019.2953151
Liu, J., van Handel, R., Verdú, S.: Beyond the blowing-up lemma: sharp converses via reverse hypercontractivity. In: 2017 IEEE International Symposium on Information Theory (ISIT), pp. 943–947 (2017). https://doi.org/10.1109/ISIT.2017.8006667
Lunardi, A.: Interpolation theory. Scuola Normale Superiore (2009). https://doi.org/10.1007/978-88-7642-638-4
Markus, A.S.: Introduction to the spectral theory of polynomial operator pencils. Am. Math. Soc. (2012). https://doi.org/10.1090/mmono/071
Marwah, A., Dupuis, F.: Uniform continuity bound for sandwiched Rényi conditional entropy. J. Math. Phys. 63(5), 052201 (2022). https://doi.org/10.1063/5.0088507
Masani, P., Wiener, N.: The prediction theory of multivariate stochastic processes. Acta Math. 98, 111–150 (1957). https://doi.org/10.1007/BF02404472
McKinlay, A., Tomamichel, M.: Decomposition rules for quantum Rényi mutual information with an application to information exclusion relations. J. Math. Phys. 61, 072202 (2020). https://doi.org/10.1063/1.5143862
Mojahedian, M.M., Beigi, S., Gohari, A., Yassaee, M.H., Aref, M.R.: A correlation measure based on vector-valued \(L_p\)-norms. IEEE Trans. Inf. Theory 65(12), 7985–8004 (2019). https://doi.org/10.1109/TIT.2019.2937099
Mosonyi, M., Ogawa, T.: Quantum hypothesis testing and the operational interpretation of the quantum Rényi relative entropies. Commun. Math. Phys. 334(3), 1617–1648 (2015). https://doi.org/10.1007/s00220-014-2248-x
Müller-Lennert, M., Dupuis, F., Szehr, O., Fehr, S., Tomamichel, M.: On quantum Rényi entropies: a new generalization and some properties. J. Math. Phys. 54, 122203 (2013). https://doi.org/10.1063/1.4838856
Nielsen, M., Chuang, I.: Quantum Computation and Quantum Information: 10th Anniversary Edition. Cambridge University Press, Cambridge (2010). https://doi.org/10.1017/CBO9780511976667
Pisier, G.: The Operator Hilbert Space OH, Complex Interpolation and Tensor Norms, Memoirs of the American Mathematical Society 122 (1996). https://doi.org/10.1090/memo/0585
Pisier, G.: Non-commutative Vector Valued \(L_p\)-spaces and Completely \(p\)-summing Maps. Astérisque 247 (1998). arXiv:math/9306206
Pisier, G.: Introduction to Operator Space Theory. Cambridge University Press, Cambridge (2003). https://doi.org/10.1017/CBO9781107360235
Raginsky, M., Sason, I.: Concentration of measure inequalities in information theory, communications, and coding. Found. Trends Commun. Inf. Theory 10(1–2), 1–246 (2013). https://doi.org/10.1561/0100000064
Repovš, D., Semenov, P.V.: Continuous Selections of Multivalued Mappings. Springer, Berlin (1998). https://doi.org/10.1007/978-94-017-1162-3
Rudin, W.: Real and Complex Analysis. McGraw-Hill, New York (1987)
Sharma, N.: Random coding exponents galore via decoupling. arXiv:1504.07075 (2015)
Stein, E.M., Shakarchi, R.: Complex Analysis. Princeton University Press, Princeton (2010)
van Erven, T., Harremoës, P.: Rényi divergence and Kullback–Leibler divergence. IEEE Trans. Inf. Theory 60(7), 3797–3820 (2014). https://doi.org/10.1109/TIT.2014.2320500
Wiener, N.: On the factorization of matrices. Commentarii Mathematici Helvetici 29, 97–111 (1955). https://doi.org/10.1007/BF02564273
Wilde, M.M.: Quantum Information Theory. Cambridge University Press, Cambridge (2013). https://doi.org/10.1017/CBO9781139525343
Wilde, M.M., Winter, A., Yang, D.: Strong converse for the classical capacity of entanglement-breaking and Hadamard channels via a sandwiched Rényi relative entropy. Commun. Math. Phys. 331, 593–622 (2014). https://doi.org/10.1007/s00220-014-2122-x
Winter, A.: Tight uniform continuity bounds for quantum entropies: conditional entropy, relative entropy distance and energy constraints. Commun. Math. Phys. 347(1), 291–313 (2016). https://doi.org/10.1007/s00220-016-2609-8
Acknowledgements
MMG is supported by a fund from MSRT. SB is thankful to Marco Tomamichel and Alexander McKinlay for discussions that motivated this work. The authors are also grateful to Paul Gondolf for notifying us that our bound in Theorem 5.2 diverges as \(\alpha \rightarrow 1\). Finally, the authors are thankful to anonymous referees whose comments greatly improved the presentation of the paper.
Funding
Funding was provided by the Ministry of Science Research and Technology.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A. Missing proofs from Sects. 3 and 4
Proof of Proposition 3.6
Part (i) is straightforward. We prove (ii) in the special case of \({\mathcal {H}}=\ell _2^d\); generalization to arbitrary \({\mathcal {H}}\) is similar.
We first show that \({\mathcal {S}}_\infty ^d = {\mathcal {C}}^d\otimes _h {\mathcal {R}}^d\). Recall that \({\mathcal {C}}^d\) is the space of column vectors, and \({\mathcal {R}}^d\) is the space of row vectors. Then, for \(v^c\in {\mathcal {C}}^d\) and \(w^r\in {\mathcal {R}}^d\), their multiplication \(v^cw^r\) makes sense and forms a \(d\times d\) matrix, i.e., an element of \({\mathcal {S}}_\infty ^d\). Define the linear map \(\phi : {\mathcal {C}}^d\otimes _h {\mathcal {R}}^d\rightarrow {\mathcal {S}}_\infty ^d\) by
We claim that \(\phi \) is a complete isometry between \( {\mathcal {C}}^d\otimes _h {\mathcal {R}}^d\) and \({\mathcal {S}}_\infty ^d\). To this end, we prove that \(\phi =\phi _1\) is an isometry; extending the proof for \(\phi _n\), the n-th amplification of \(\phi \), is immediate.
Let \(Z\in {\mathcal {C}}^d\otimes _h{\mathcal {R}}^d\). Then,
Put
Then,
On the other hand, \(Z=X\odot Y\) corresponds to \(\phi (Z) = \sum _i v^c_i w^r_i \). Therefore, \(\Vert Z\Vert _{ {\mathcal {C}}^d\otimes _h{\mathcal {R}}^d}\) equals the infimum of
subject to \(\phi (Z) = \sum _i v^c_i w^r_i \). Note that this is nothing but the operator norm of \(\phi (Z)\), i.e., \(\Vert \phi (Z)\Vert _{{\mathcal {S}}_\infty ^d}\).
Next, we show that \({\mathcal {S}}_1^d = {\mathcal {R}}^d\otimes _h {\mathcal {C}}^d\). As before, thinking of \({\mathcal {R}}^d\) and \({\mathcal {C}}^d\) as row and column vectors, we can define the linear map \(\psi : {\mathcal {R}}^d\otimes _h {\mathcal {C}}^d\rightarrow {\mathcal {S}}_1^d\) by
Again we only show that \(\psi \) is an isometry. To see this, let \(Z\in {\mathcal {R}}^d\otimes _h {\mathcal {C}}^d\). Then,
Put
Then,
and
On the other hand, \(Z=X\odot Y\) corresponds to \(\psi (Z) = \sum _i w^c_iv^r_i\). Therefore, \(\Vert Z\Vert _{ {\mathcal {R}}^d\otimes _h{\mathcal {C}}^d}\) equals the infimum of
subject to \(\psi (Z) = \sum _i w^c_iv^r_i\). An application of Hölder’s inequality shows that this is equal to \(\Vert \psi (Z)\Vert _{{\mathcal {S}}_1^d}\).
Part (iii) follows readily once we note that \(M_{1, d}({\mathcal {C}}^d)\) and \(M_{d, 1}({\mathcal {R}}^d)\) are naturally identified with the space of \(d\times d\) matrices, and by definition, their norms are the same as the norm of \({\mathcal {S}}_\infty ^d\). The isometries \(M_{1, d}({\mathcal {R}}^d) = M_{d, 1}({\mathcal {C}}^d) = {\mathcal {S}}_2^d\) are proven essentially in (A.1) and (A.2). \(\square \)
Proof of Proposition 4.3
(i) First note that by using the reiteration property (Proposition 3.3), we observe that \({\mathcal {R}}\big (1-\tfrac{1}{p_\theta }\big )=\Big [{\mathcal {R}}\big (1-\tfrac{1}{p_0}\big ),{\mathcal {R}}\big (1-\tfrac{1}{p_1}\big )\Big ]_\theta \) and \({\mathcal {R}}\big (\tfrac{1}{p_\theta }\big )=\Big [{\mathcal {R}}\big (\tfrac{1}{p_0}\big ),{\mathcal {R}}\big (\tfrac{1}{p_1}\big )\Big ]_\theta \). Then,
With Remark 3.5 in mind, (4.4) is obtained from (4.3) by taking \({\mathcal {X}}_i={\mathcal {S}}_{p_i}\) for \(i=0,1\).
(ii) Again, we prove the theorem in the finite-dimensional case, i.e., \({\mathcal {S}}_p^d[{\mathcal {X}}]^*={\mathcal {S}}^d_{p'}[{\mathcal {X}}^*]\). We know that for operator spaces \({\mathcal {Y}}_1, {\mathcal {Y}}_2\) which one of them is finite-dimensional we have \(({\mathcal {Y}}_1\otimes _h{\mathcal {Y}}_2)^* = {\mathcal {Y}}_1^*\otimes _h {\mathcal {Y}}_2^*\) completely isometrically [39, Corollary 5.8]. Using this result twice we obtain
completely isometrically. Next, we have
completely isometrically. Here, in the second equality we use the fact that for a compatible couple \(({\mathcal {Y}}_1, {\mathcal {Y}}_2)\) of finite-dimensional operator spaces we have \([{\mathcal {Y}}_1, {\mathcal {Y}}_2]_\theta ^* = [{\mathcal {Y}}_1^*, {\mathcal {Y}}_2^*]_\theta \) completely isometrically [39, Theorem 2.7.4]. We also for the third equality use \(({\mathcal {C}}^d)^*={\mathcal {R}}^d\) and \(({\mathcal {R}}^d)^*={\mathcal {C}}^d\) that are easy to prove [39, p. 41]. Then, we have
completely isometrically.
(iii) For \(p=\infty \) we have
The case \(p=1\) is similar. The general case is now obtained by interpolation:
The proof is complete. \(\square \)
Proof of Remark 4.6
(i) We need to show that for any \(\epsilon >0\) there is a factorization \(X={\hat{A}}{\hat{Y}}{\hat{A}}^*\) such that
Following Theorem 4.4, assume that \(X=AYB\) is such that \(\Vert A\Vert _{{\mathcal {S}}_{2p}}= \Vert B\Vert _{{\mathcal {S}}_{2p}}\) and
Define \({\hat{A}}:=[A ~ B^*]\) and
Then we have \(X={\hat{A}}{\hat{Y}}{\hat{A}}^*\), and
where in the third line we used \(\Vert A\Vert _{{\mathcal {S}}_{2p}}= \Vert B\Vert _{{\mathcal {S}}_{2p}}\).
(ii) Following [11], when X is positive, by Hölder’s inequality we have
This shows that the supremum in (4.6) can be restricted to the case of \(B=A^*\). Moreover, as argued in the proof of Theorem 4.5 we can apply polar decomposition to assume that A is positive.
To prove the assertion for (4.5) we follow similar steps as in the proof of Theorem 4.5. First, by part (i) of the remark, for any \(\epsilon >0\) there exists a factorization \(X=AYA^*\) such that
Moreover, once again applying the polar decomposition we can assume that \(A>0\). Let \({\hat{A}} = A^{\frac{p}{r}}\) and \({\hat{Y}} = A^{\frac{p}{q}}Y A^{\frac{p}{q}}\). Then, we have \(X= {\hat{A}} {\hat{Y}}{\hat{A}}^*\) and \(\Vert {\hat{A}}\Vert _{{\mathcal {S}}_{2r}}= \Vert {\hat{A}}^*\Vert _{{\mathcal {S}}_{2r}}= \Vert A\Vert _{{\mathcal {S}}_{2p}}^{\frac{p}{r}}\). Moreover, by Theorem 4.4 we have
Therefore,
The proof is complete. \(\square \)
Appendix B. Auxiliary lemmas in the proof of Theorem 3.7
1.1 B.1. A selection lemma
Suppose that \(\alpha \), \(\beta \) and \(\gamma \) are norms, respectively, on \(M_{d,n}\), \(M_{n,d'}\) and \(M_{d,d'}\) such that
Our goal here is to show that almost optimizing pairs (A, B) can be chosen to be measurable functions.
Lemma B.1
Let X(s) be a continuous function on \({\mathbb {R}}\) with values in \(M_{d,d'}\). Then for every \(\epsilon >0\) there exist measurable functions A(s), B(s) such that for every \(s\in {\mathbb {R}}\),
To prove this lemma we make use of a special case of the Kuratowski–Ryll-Nardzewski selection theorem [41]:
Theorem B.2
(Kuratowski–Ryll-Nardzewski selection theorem) Let \(cl ({\mathbb {R}}^n)\) be the set of all nonempty closed subsets of \({\mathbb {R}}^n\). Let \(\psi :{\mathbb {R}}\rightarrow cl ({\mathbb {R}}^n)\) be a multi-function such that the set \(\{s\in {\mathbb {R}}:\psi (s)\cap U\ne \emptyset \}\) is measurable for every closed subset U of \({\mathbb {R}}^n\). Then there is a measurable function \(f:{\mathbb {R}}\rightarrow {\mathbb {R}}^n\) such that \(f(s)\in \psi (s)\) for all \(s\in {\mathbb {R}}\).
Proof of Lemma B.1
Based on the above selection theorem we define \(\psi (s)\) to be the set of all almost optimizer pairs (A, B) satisfying
We need to show that \(\psi (\cdot )\) satisfies the hypotheses of the above selection theorem. It is straightforward to show that \(\psi (s)\) is non-empty and closed for every s. Thus, we need to verify that \(\{s\in {\mathbb {R}}:\psi (s)\cap U\ne \emptyset \}\) is measurable. To this end, consider the map
Since \(\Psi \) is continuous, the set
is closed. Now, if U is a closed set of pairs (A, B), then \({\mathcal {E}}\cap ({\mathbb {R}}\times U)\) is closed. Next, observe that
where \(\pi _1\) denotes the projection map onto the first component. Therefore, to see the measurability of \(\{s\in {\mathbb {R}}:\psi (s)\cap U\ne \emptyset \}\), it suffices to prove that \(\pi _1({\mathcal {E}}\cap ({\mathbb {R}}\times U))\) is measurable.
Note that any Euclidean space can be written as a countable union of compact sets. Then, since the intersection of a closed and a compact set is compact, the closed set \({\mathcal {E}}\cap ({\mathbb {R}}\times U)\) can be written as
where \(K_i\)’s are compact. On the other hand, since a continuous image of a compact set is compact, \(\pi (K_i)\)’s are compact too. This implies that
is a countable union of compact sets, and is therefore measurable. \(\square \)
1.2 B.2. A subharmonicity lemma
Let \({\mathbb {D}}\) be the unit disk in the complex plane and \(\partial {\mathbb {D}}\) be its boundary. It is well-known that having a sufficiently nice function f on \(\partial {\mathbb {D}}\), there is a unique harmonic function on \({\mathbb {D}}\) that matches f on the boundary. This harmonic function can be represented in terms of the Poisson kernel for \({\mathbb {D}}\), that is given by
for \(z=re^{i\theta }\in {\mathbb {D}},e^{it}\in \partial {\mathbb {D}}\). In particular, if \(f\in L_1(\partial {\mathbb {D}})\) and
then,
Recall that \({\mathbb {S}}:=\{z\in {\mathbb {C}}:0\le \text {Re}(z)\le 1\}\). The Poisson kernel for \({\mathbb {S}}\) may be obtained by carrying P over to \({\mathbb {S}}\) using the conformal map \(\phi :{\mathbb {D}}\rightarrow {\mathbb {S}}\) defined byFootnote 12
Indeed, for any holomorphic function \(f:{\mathbb {S}}\rightarrow {\mathbb {C}}\), the function \(f\circ \phi :{\mathbb {D}}\rightarrow {\mathbb {C}}\) is also holomorphic. Thus, using (B.1) we have
We note that \(\phi \) maps the boundary of \({\mathbb {D}}\) onto the boundary of \({\mathbb {S}}\). Then, letting \(\phi (e^{it})=b+is\) with \(b\in \{0,1\}\) and \(s\in {\mathbb {R}}\), and rewriting the integral in terms of s, we find that
Consequently, the Poisson kernel of \({\mathbb {S}}\) is given by the functions \(P_b(x+iy,s)\) for \(b=0,1\), where
We note that \(P_b(x+iy,s)\ge 0\) and it can be verified that
Lemma B.3
Let \(({\mathcal {X}}_0, {\mathcal {X}}_1)\) be a compatible couple and let \({\mathcal {X}}_\theta = [{\mathcal {X}}_0, {\mathcal {X}}_1]_\theta \) for \(0<\theta <1\).
-
(i)
Let \(f(z)\in {\mathscr {F}}({\mathcal {X}}_0,{\mathcal {X}}_1)\). Then we have
$$\begin{aligned} \log \Vert f(\theta )\Vert _{{\mathcal {X}}_\theta }\le \sum _{b=0}^1 \int _{-\infty }^{+\infty } P_b(\theta , s)\log \Vert f(b+is)\Vert _{{\mathcal {X}}_b}\textrm{d}s. \end{aligned}$$ -
(ii)
If \({\mathcal {X}}_0, {\mathcal {X}}_1\) are finite-dimensional (or more generally, reflexive), then the same inequality holds whenever f(z) is bounded on \({\mathbb {S}}\) and holomorphic in its interior (though not necessarily continuous up to the boundary).
The first part of this lemma is proven in [7, Lemma 4.3.2]. Here, we prove the second part for the sake of completeness.
Proof of (ii)
Let \({\mathcal {X}}^*_\theta \) be the Banach dual of \({\mathcal {X}}_\theta \). Then, there is \(\ell \in {\mathcal {X}}^*_\theta \) such that
Since \({\mathcal {X}}^*_\theta =[{\mathcal {X}}^*_0, {\mathcal {X}}^*_1]_\theta \), for any \(\epsilon >0\) there is g(z) such that \(g(\theta )=\ell \) and
Now define \(\psi (z):= \langle g(z), f(z)\rangle \). We note that \(\psi (z)\) is holomorphic and bounded. Then, by the subharmonicity of holomorphic functions we have
The desired inequality follows once we note that \(\epsilon >0\) is arbitrary. \(\square \)
Appendix C. Factorization of operator-valued functions
This section is dedicated to the Wiener–Masani theorem on factorization of operator-valued functions [12, 31, 46]. This theorem is used in the proof of Theorem 3.7, and is one of the main technical tools in the development of the theory of operator-valued Schatten spaces. Thus, we devote this section to the statement and proof of this theorem. We follow the approach of [18] for the proof, that is based on characterization of invariant subspaces of the shift operator. To this end, we first need a number of ingredients.
1.1 C.1. Hardy spaces
Following the notation of Sect. 2, let \({\mathbb {D}}\) be the unit disk in the complex plane and let \(\partial {\mathbb {D}}\) be its boundary, equipped with the normalized Lebesgue measure. Then, \(L_p= L_p(\partial {\mathbb {D}})\) denotes the Banach space of all measurable functions \(f:\partial {{\mathbb {D}}}\rightarrow {\mathbb {C}}\) such that the norm
is finite. We note that by Hölder’s inequality, we have \(L_q\subseteq L_p\) for \(p<q\).
Definition C.1
The Hardy space \(H_p\) is the closed subspace of \(L_p\) consisting of all functions \(f\in L_p\) for which there is a function F holomorphic in the interior of \({\mathbb {D}}\) whose radial limit coincides with f almost everywhere.Footnote 13
There is another description of \(H_p\) in terms Fourier coefficients and Fourier series that goes as follows. For each function \(f\in L_1\), let
and
be the Fourier coefficients and the Fourier series of f, respectively. Then, \(H_p\) is the subset of \(L_p\) consisting of all functions f whose Fourier coefficients \(a_n\) vanish for \(n<0\).
Define the operator \({\mathfrak {S}}\) by \(({\mathfrak {S}}f)(e^{it})=e^{it}f(e^{it})\). Consequently, for \(n\ge 1\) we have \(({\mathfrak {S}}^n f)(e^{it}) = e^{nit}f(e^{it})\). The operator \({\mathfrak {S}}\) is called the shift operator since it shifts the Fourier coefficients.
Definition C.2
A function \(f\in H_2\) is said to be outer if the linear span of the set \(\{{\mathfrak {S}}^n f:n\ge 0\}\) is dense in \(H_2\).
It is well-known that every outer function f can be written in the form
where \(\gamma \) is a complex number of modulus 1, and \(k(t)=\log |f(e^{it})|\) almost everywhere; see, e.g., [29, 42] for more information on outer functions.
1.2 C.2 Vector-valued Hardy spaces
Let \({\mathcal {H}}\) be a separable (usually finite-dimensional) Hilbert space. An \({\mathcal {H}}\)-valued function \(F:\partial {{\mathbb {D}}}\rightarrow {\mathcal {H}}\) is said to be measurable if for every \(v\in {\mathcal {H}}\) the complex function \(e^{it}\mapsto \langle v,F(e^{it})\rangle \) is measurable. Abusing the notation, let \(L_p({\mathcal {H}})=L_p(\partial {\mathbb {D}}, {\mathcal {H}})\) be the Banach space of all measurable functions \(F:\partial {{\mathbb {D}}}\rightarrow {\mathcal {H}}\) such that the norm
is finite.Footnote 14 It is worth noting that the above integral makes sense since the function \(e^{it}\mapsto \Vert F(e^{it})\Vert \), as the supremum of measurable functions, is itself measurable. The case \(p=2\) is of particular interest since \(L_2({\mathcal {H}})\) is a Hilbert space under the inner product
Let \(\{e_n:n=1,2,\cdots \}\) is an orthonormal basis for \({\mathcal {H}}\), and for \(F\in L_2({\mathcal {H}})\) define the coordinate functions by
Then, we have
where \(\Vert f_n\Vert _2\) is the norm of \(f_n\) in \(L_2=L_2(\partial {\mathbb {D}}, {\mathbb {C}})\).
For any \(F\in L_p({\mathcal {H}})\), one may define the Fourier expansion of F as
where \(\hat{a}_n\in {\mathcal {H}}\) is given by
We note that letting \({\mathcal {H}}={\mathbb {C}}\), we obtain the usual \(L_p=L_p(\partial {\mathbb {D}}, {\mathbb {C}})\) space with the usual Fourier transform.
Definition C.3
Let \({\mathcal {H}}\) be a (separable) Hilbert space and \(1\le p\le \infty \). The vector-valud Hardy space \(H_p({\mathcal {H}})\) is defined to be the closed subspace of \(L_p({\mathcal {H}})\) consisting of all functions F for which \(\hat{a}_n=0\), for all \(n<0\).
Similar to the usual complex-valued Hardy space, \(H_p({\mathcal {H}})\) can also be characterized in terms of holomorphic functions. We say that a vector-valued function \(\phi : {\mathbb {D}}\rightarrow {\mathcal {H}}\) is holomorphic if for every \(v\in {\mathcal {H}}\), the function \(z\mapsto \langle v,\phi (z)\rangle \) is holomorphic. Then, \(H_p({\mathcal {H}})\) is the space of functions \(F\in L_p({\mathcal {H}})\) for which there is a holomorphic function \(\phi : {\mathbb {D}}\rightarrow {\mathcal {H}}\) whose radial limit coincides with F almost everywhere, i.e.,
1.3 C.3 Invariant subspaces
The shift operator \({\mathfrak {S}}\) can also be defined on the space of vector-valued functions: \(({\mathfrak {S}}F)(e^{it})=e^{it}F(e^{it})\).
Definition C.4
A closed subspace \({\mathcal {M}}\) of \(L_2({\mathcal {H}})\) is said to be invariant if \({\mathfrak {S}}({\mathcal {M}})\subseteq {\mathcal {M}}\), where \({\mathfrak {S}}({\mathcal {M}}):=\{{\mathfrak {S}}F:\, F\in {\mathcal {M}}\}\). We say that \({\mathcal {M}}\) is doubly invariant if \({\mathfrak {S}}({\mathcal {M}})\subseteq {\mathcal {M}}\) and \({\mathfrak {S}}^{-1}({\mathcal {M}})\subseteq {\mathcal {M}}\). Also, \({\mathcal {M}}\) is said to be simply invariant if it is invariant but not doubly invariant.
We may also consider operator-valued (matrix-valued) functions. We say that such a function \(U(e^{it})\) is measurable if the function \(e^{it}\mapsto \langle v,U(e^{it}) w\rangle \) is measurable for all vectors v, w.
Lemma C.5
Let \({\mathcal {M}}\) be a simply invariant subspace of \(L_2({\mathcal {H}})\) that contains no nontrivial doubly invariant subspace. Then there exist an auxiliary Hilbert space \({\mathcal {K}}\) and a measurable operator-valued function U such that
-
(i)
\(U(e^{it}):{\mathcal {K}}\rightarrow {\mathcal {H}}\) is an isometry for any \(e^{it}\in \partial {{\mathbb {D}}}\).
-
(ii)
\({\mathcal {M}}\) consists of functions \(e^{it}\mapsto U(e^{it})F(e^{it})\) for \(F\in H_2({\mathcal {K}})\).
We note that vector-valued Hardy spaces \(H_2({\mathcal {K}})\) are simply invariant subspaces and do not contain any nontrivial doubly invariant subspace. The above lemma says that all such subspaces are essentially vector-valued Hardy spaces.
Proof
For each n, define \({\mathcal {M}}_n:={\mathfrak {S}}^n({\mathcal {M}})\). As \({\mathcal {M}}\) is invariant, we have
We note that
is doubly invariant, so by assumption \({\mathcal {M}}_\infty =0\).
For any \(n\ge 0\), let \({\mathcal {N}}_n\) be the orthogonal complement of \({\mathcal {M}}_{n+1}\) in \({\mathcal {M}}_n\) (with respect to the inner product of \(L_2({\mathcal {H}})\)). Thus, we have \({\mathcal {M}}_n={\mathcal {N}}_n\oplus {\mathcal {M}}_{n+1}\). Moreover, since \({\mathcal {M}}_\infty =0\), we have
Let \({\mathcal {B}}_0=\{E_1,E_2,\cdots \}\) be an orthonormal basis for \({\mathcal {N}}_0\), and note that \({\mathcal {B}}_n:={\mathfrak {S}}^n({\mathcal {B}}_0)=\{{\mathfrak {S}}^n E_k:k\ge 1\}\), for \(n\ge 1\), is a subset of \({\mathcal {M}}_n\subseteq {\mathcal {M}}_1\). Thus, \({\mathcal {B}}_0\) is orthogonal to \({\mathcal {B}}_n\) for all \(n\ge 1\). This means that for every j, k and \(n\ge 1\) we have
Moreover, as \({\mathcal {B}}_0\) is orthonormal, the same equation holds when \(n=0\) and \(j\ne k\). More generally, we observe that
This equation shows that all the nontrivial Fourier coefficients of the functions \(e^{it}\mapsto \big \langle E_j(e^{it}),E_k(e^{it})\big \rangle \) are zero, and hence it is almost everywhere identical to the constant function \(\delta _{jk}\).
We claim that for each n, the orthonormal set \({\mathcal {B}}_n\) is a basis for \({\mathcal {N}}_n\). To this end, we need to show that \({\mathcal {B}}_n\subseteq {\mathcal {N}}_n=\mathop {\textrm{span}}\limits {\mathcal {B}}_n\). Letting \(F\in {\mathcal {M}}_{n+1}\), by definition there is \(G\in {\mathcal {M}}\) such that \(F={\mathfrak {S}}^{n+1} G\). On the other hand, for \({\mathfrak {S}}^n E_j\in {\mathcal {B}}_n\) we have
where we used the fact that \({\mathfrak {S}}G\in {\mathcal {M}}_1\) and that \(E_j\in {\mathcal {N}}_0\). Thus, \({\mathfrak {S}}^nE_j\in {\mathcal {M}}_n\) and is orthogonal to \({\mathcal {M}}_{n+1}\), which means that \({\mathfrak {S}}^n E_j\in {\mathcal {N}}_n\). Therefore, \({\mathcal {B}}_n\subseteq {\mathcal {N}}_n\).
Next, let \(F\in {\mathcal {M}}_n\). Then, there is \(G\in {\mathcal {M}}\) such that \(F={\mathfrak {S}}^n G\). Since \(G\in {\mathcal {M}}={\mathcal {N}}_0\oplus {\mathcal {M}}_1\), there are \(G_0\in {\mathcal {N}}_0\) and \(G_1\in {\mathcal {M}}\) such that \(G=G_0+{\mathfrak {S}}G_1\). Therefore
By definition we have \({\mathfrak {S}}^{n+1}G_1\in {\mathcal {M}}_{n+1}\). Also, since \(G_0\in {\mathcal {N}}_0=\mathop {\textrm{span}}\limits {\mathcal {B}}_0\), we have \({\mathfrak {S}}^n G_0\in \mathop {\textrm{span}}\limits {\mathcal {B}}_n\). As a result, \(F\in \mathop {\textrm{span}}\limits {\mathcal {B}}_n\oplus {\mathcal {M}}_{n+1}\) for any \(F\in {\mathcal {M}}_n\). Thus, using \({\mathcal {B}}_n\subseteq {\mathcal {N}}_n\) proven above, and the definition of \({\mathcal {N}}_n\), we find that \({\mathcal {N}}_n=\mathop {\textrm{span}}\limits {\mathcal {B}}_n\).
Putting (C.3) and \({\mathcal {N}}_n=\mathop {\textrm{span}}\limits {\mathcal {B}}_n\) together, we find that every \(F\in {\mathcal {M}}\) can be expanded as
Now, let \({\mathcal {K}}\) be a Hilbert space isomorphic to \({\mathcal {N}}_0\) with orthonormal basis \(\{e_1,e_2,\cdots \}\). Every function in \(L_2({\mathcal {K}})\) can be written in the form \(\sum f_j e_j\) with \(f_j\in L_2\), \(j=1, 2,\ldots \), being coordinate functions (see (C.1)). Define \(\hat{U}:L_2({\mathcal {K}})\rightarrow L_2({\mathcal {H}})\) by
meaning that \(\hat{U}\Big (\sum f_j e_j\Big )(e^{it})=\sum f_j(e^{it}) E_j(e^{it})\). Then, \(\hat{U}\) is a well-defined linear operator. Moreover, since \(\langle E_j(e^{it}),E_k(e^{it})\rangle =\delta _{jk}\), almost everywhere, \(\hat{U}\) is an isometry.
For every \(e^{it}\) we define a linear map \(U(e^{it}):{\mathcal {K}}\rightarrow {\mathcal {H}}\) as follows. For every \(v\in {\mathcal {K}}\) let \({\hat{v}}\in L_2({\mathcal {K}})\) be the constant function \(\hat{v}(e^{it})=v\). Then, define
It is easy to verify that \(e^{it}\mapsto U(e^{it})\) is measurable. We claim that \(U(e^{it})\) is an isometry for almost every t. To show this, fix \(v, w\in {\mathcal {K}}\). Using the fact that \(\hat{U}\) is an isometry, for every n we obtain
On the other hand, the definition of \(\hat{U}\) implies that \(\hat{U}({\mathfrak {S}}^n \hat{w})={\mathfrak {S}}^n\hat{U}(\hat{w})\). Therefore,
Consequently, comparing to the previous equation, all the nontrivial Fourier coefficients of the function \(e^{it}\mapsto \langle \hat{U}\hat{v}(e^{it}),\hat{U}\hat{w}(e^{it})\rangle =\langle U(e^{it})v, U(e^{it})w\rangle \) vanish. This means that this function is constant almost everywhere. Thus, we have, almost everywhere,
where in the last line we use (C.5). This means that \(U(e^{it})\) is an isometry almost everywhere (hence one can modify \(U(e^{it})\) on a set of measure zero so that \(U(e^{it})\) is an isometry for any \(e^{it}\)).
We note that \(H_2({\mathcal {K}}) = \left\{ \sum f_je_j:\, f_j\in H_2, \Vert f_j\Vert _2^2<\infty \right\} \), and every \(f_j\in H_2\) can be written as \(f_j(e^{it})=\sum _{n\ge 0} a_{nj}e^{nit}\). Therefore, (C.4) yields
Finally, from the definition of U it follows that \(U(e^{it})e_j=(\hat{U}e_j)(e^{it})=E_j(e^{it})\). Hence, for every \(F=\sum _j f_je_j\in H_2({\mathcal {K}})\) we obtain
That is, \({\mathcal {M}}\) consists of functions of the form \(e^{it}\mapsto U(e^{it})F(e^{it})\) for \(F\in H_2({\mathcal {K}})\). \(\square \)
Remark C.6
In the above lemma, if \({\mathcal {H}}\) is finite-dimensional, then so is \({\mathcal {K}}\) since \(U(e^{it}):{\mathcal {K}}\rightarrow {\mathcal {H}}\) is an isometry almost everywhere.
1.4 C.4 Wiener–Masani factorization theorem
In the following we assume that \({\mathcal {H}}\) is finite-dimensional with orthonormal basis \(\{e_1, \ldots , e_d\}\). For every \(e^{it}\in \partial {\mathbb {D}}\) we assume that \(T(e^{it}):{\mathcal {H}}\rightarrow {\mathcal {H}}\) is a linear map such that \(e^{it}\mapsto T(e^{it})\) is an operator-valued measurable function. We further assume that entries of T (in the basis \(\{e_1, \ldots , e_d\}\)) are integrable, i.e., belong to \(L_1\).
Suppose that \(T(e^{it})\) is positive, and uniformly bounded almost everywhere, i.e., there exists a constant \(c>0\) such that
Let \(Q(e^{it})=\sqrt{T(e^{it})}\). We note that with the above assumptions Q can be written as the limit of measurable functions (consider the truncated Taylor expansion of \(x\mapsto \sqrt{x}\) at point c), and is measurable itself. We also note that Q can be considered as an element of \(B(L_2({\mathcal {H}}))\). Indeed, for every \(F\in L_2({\mathcal {H}})\) we have \((QF)(e^{it})=Q(e^{it})F(e^{it})\) and
Define
where \(Qe_j\in L_2({\mathcal {H}})\) is given by \((Qe_j)(e^{it}) = Q(e^{it})e_j\). Observe that by definition, \({\mathcal {M}}(T)\) is a closed and invariant subspace of \(L_2({\mathcal {H}})\). In fact, \({\mathcal {M}}(T)\) is the smallest invariant subspace of \(L_2({\mathcal {H}})\) generated by columns of Q.
Lemma C.7
Let \(T(e^{it})\) be a measurable operator-valued function as above, and \(\lambda (e^{it})\) be a positive measurable function such that
and
for some \(c>0\). Then \({\mathcal {M}}(T)\) does not contain any nontrivial doubly invariant subspace.
Proof
Observe that by (C.6) there is \(\psi \in H_2\) such that \(|\psi (e^{it})|=\sqrt{\lambda (e^{it})}\). To construct such a function we use the Poisson kernel of \({\mathbb {D}}\) to define
We note that \(g(e^{it})=\log \sqrt{\lambda (e^{it})}\) almost everywhere, and that g(z) is harmonic. Then, considering its harmonic conjugate, there is a holomorphic function \({\hat{g}}(z)\) with \(\Re {\hat{g}}(z) = g(z)\). Now, letting \(\psi (z) = e^{{\hat{g}}(z)}\), we have almost everywhere
We say that \(P\in H_2({\mathcal {H}})\) is a polynomial if its Fourier expansion (C.2) consists of finitely many \(n\ge 0\) terms, i.e., P is a polynomial in variable \(e^{it}\) with coefficients in \({\mathcal {H}}\). For such a polynomial \(P\in H_2({\mathcal {H}})\) we have \(\psi P\in H_2({\mathcal {H}})\) where \((\psi P)(z) = \psi (z)P(z)\). Define
Observe that \({\mathcal {N}}\subseteq H_2({\mathcal {H}})\) is an invariant subspace.
Next, consider the subspace \({\mathcal {W}}=\mathop {\textrm{span}}\limits \{Q P:\, P \text { polynomial}\}\subseteq L_2({\mathcal {H}})\) where as before \(Q=\sqrt{T}\). We note that by definition \(\overline{{\mathcal {W}}}= {\mathcal {M}}(T)\). Define U from \({\mathcal {W}}\) to \({\mathcal {N}}\) by \(U(Q P)=\psi P\). Since \(\sqrt{\lambda (e^{it})} I\le Q(e^{it})\) almost everywhere, we have
This means that \(\Vert U\Vert \le 1\), and U can be extended to \(U:{\mathcal {M}}(T)\rightarrow {\mathcal {N}}\) while \(\Vert U\Vert \le 1\).
We claim that U is one-to-one. To prove this, suppose that \(F\in {\mathcal {M}}(T)\) and \(UF=0\). Since \(F\in {\mathcal {M}}(T)\), there is a sequence of polynomials \(\{P_n:\, n\ge 1\}\subset H_2({\mathcal {H}})\) such that
and since \(QF=0\), we have
Define \(R_m=\{e^{it}:\lambda (e^{it})\ge 1/m\}\), and let \(J_m\) be the characteristic function of \(R_m\). We compute
Therefore
On the other hand, we have
where the penultimate inequality follows by ignoring a non-negative term and the Cauchy-Schwarz inequality, and the last inequality follows from \(Q(e^{it})\le \sqrt{c}I\). Taking the limit of both sides as \(n\rightarrow \infty \), using (C.7) and the non-negativity of the remaining term, we obtain
Now, the monotone convergence theorem implies \(\Vert F\Vert _2^2=\lim \limits _{m\rightarrow \infty } \Vert J_mF\Vert _2^2=0\). This means that \(F=0\). Thus, \(U:{\mathcal {M}}(T)\rightarrow {\mathcal {N}}\) is one-to-one, as claimed.
Finally, suppose that \({\mathcal {M}}'\subseteq {\mathcal {M}}(T)\) is a nontrivial doubly invariant subspace. Then \(\overline{U{\mathcal {M}}'}\subseteq {\mathcal {N}}\subseteq H_2({\mathcal {H}})\) is a doubly invariant subspace. But \(H_2({\mathcal {H}})\) does not contain nontrivial doubly invariant subspaces. This implies that \(\overline{U{\mathcal {M}}'}=0\). Hence, \({\mathcal {M}}'=0\) since U is one-to-one. \(\square \)
Similar to the scalar case (see Definition C.2), an operator-valued function A(z) is called outer if \(\{A(z)P(z):\, P(z)\in H_2({\mathcal {H}}) \text { polynomial}\}\) is dense in \(H_2({\mathcal {H}})\). Equivalently, A(z) is outer if
Lemma C.8
[29] Let A(z) be an outer operator-valued function in \(H_\infty ({\mathcal {H}})\). If \(A(e^{it})\) is invertible almost everywhere on \(\partial {{\mathbb {D}}}\), and
then A(z) is invertible for all z in the interior of \({\mathbb {D}}\).
Proof
Since \(\Vert A^{-1}(e^{it})\Vert ^{-1} \le \Vert A(e^{it})\Vert \) almost everywhere on \(\partial {{\mathbb {D}}}\) and \(A(z) \in H_\infty ({\mathcal {H}})\), it is evident that the function \({\tilde{\lambda }}(e^{it}):=\Vert A^{-1}(e^{it})\Vert ^{-1}\) lies in \(L_\infty \). On the other hand, it follows from (C.8) that \(\log {\tilde{\lambda }}(e^{it})\in L_1\). These facts imply that the outer function \(\lambda (z)\), defined by
lies in \(H_\infty \), and
Let h be an arbitrary but fixed vector in \({\mathcal {H}}\), and define \(\psi _h(e^{it})=\overline{\lambda (e^{it})} A^{-1}(e^{it})^*h\). Then, \(\psi _h \in L_2({\mathcal {H}})\) since \(\Vert \psi _h\Vert _{L_2({\mathcal {H}})} \le \Vert h\Vert _{\mathcal {H}}\) by (C.9).
Now let \(\Psi _h\) be the bounded linear functional on \(L_2({\mathcal {H}})\) corresponding to \(e^{-it}\psi _h(e^{it}) \in L_2({\mathcal {H}})\), i.e.,
We are going to show that \(\Psi _h\) vanishes on \(H_2({\mathcal {H}})\). To this end, note first that the last expression above is indeed a Fourier coefficient, with negative index, of the function \(\big \langle \psi _h(e^{it}), F(e^{it}) \big \rangle _{{\mathcal {H}}} \in H_2\). As a result, for every function \(u(z) \in H_2({\mathcal {H}})\) we have
where \(F(z)=A(z)u(z)\), since \(\big \langle \overline{\lambda (e^{it})}h, u(e^{it}) \big \rangle _{{\mathcal {H}}} \in H_2\). Thus, \(\Psi _h\) vanishes on \(A(z) H_2({\mathcal {H}})\), which in turn implies that \(\Psi _h\) vanishes on the entire \(H_2({\mathcal {H}})\) as by assumption A(z) is outer.
Let g be an arbitrary (but fixed) vector in \({\mathcal {H}}\), and \(z \notin {\mathbb {D}}\). Then, the vector-valued function \(F^{g}_z(e^{it}):=(e^{it}-z)^{-1}g\) is readily seen to be in \(H_2({\mathcal {H}})\). Hence, by the preceding paragraph \(\Psi _h(F^{g}_z)=0\), or,
Consequently, letting \(B(e^{it})=A^{-1}(e^{it})\), the function \(\lambda (e^{it})\big \langle h, A^{-1}(e^{it})g \big \rangle _{{\mathcal {H}}}\) extends to a function \(\lambda (z)\langle h, B(z)g \rangle _{{\mathcal {H}}}\) in \(H_\infty \) via Poisson integral. Finally, since \(\lambda (z)\) is outer, we have \(\lambda (z)\ne 0\) for all z in the interior of \({\mathbb {D}}\), and hence the function \(\langle h, B(z)g \rangle _{{\mathcal {H}}}\) is holomorphic as well. It is easily verified that indeed \(B(z)=A^{-1}(z)\) for all z in the interior of \({\mathbb {D}}\). \(\square \)
We can now state and prove the Wiener–Masani theorem.
Theorem C.9
(Wiener–Masani) Let \(T(e^{it})\) be a measurable operator-valued function as above, and \(\lambda (e^{it})\) be a positive measurable function such that
and
for some \(c>0\). Then there exists an outer operator-valued function \(A(z)\in H_\infty ({\mathcal {H}})\) such that A(z) is invertible for all z in the interior of \({\mathbb {D}}\), and
Proof
By Lemma C.7, \({\mathcal {M}}(T)\) does not contain any doubly invariant subspace. Therefore, by Lemma C.5, there exist a Hilbert space \({\mathcal {K}}\) and a function \(U(e^{it}):{\mathcal {K}}\rightarrow {\mathcal {H}}\), that is an isometry almost everywhere, such that \({\mathcal {M}}(T)=UH_2({\mathcal {K}})\).
Recall that \(Qe_j\in {\mathcal {M}}(T)\) for every basis element \(e_j\in {\mathcal {H}}\). Then, for each j, there is \(A_j\in H_2({\mathcal {K}})\) such that \(Qe_j=UA_j\). Define \(A(e^{it}): {\mathcal {H}}\rightarrow {\mathcal {K}}\) by \(A(e^{it})e_j=A_j(e^{it})\). Observe that A is a holomorphic operator-valued function since \(A_j\in H_2({\mathcal {K}})\). Moreover, using the fact that U is an isometry almost everywhere, we have
Now we show that \(A(e^{it})\) is outer. Let \(F\in H_2({\mathcal {K}})\) be arbitrary. Then, \(UF\in UH_2({\mathcal {K}})= {\mathcal {M}}(T)\), so for every \(n\ge 1\) there is a polynomial \(P_n\) such that \(\Vert QP_n- UF\Vert _2\le 1/n\). Next, using the fact that U is an isometry almost everywhere we have
This shows that \(\{AP: P \text { polynomial}\}\) is dense in \({\mathcal {H}}_2({\mathcal {K}})\) and A is outer.
It follows from (C.12) and our assumptions on \(T(e^{it})\) that \(A(e^{it})\) is invertible almost everywhere on \(\partial {{\mathbb {D}}}\), and \(\Vert A^{-1}(e^{it})\Vert ^2=\Vert T^{-1}(e^{it})\Vert \). On the other hand, by (C.11) we have
This, together with (C.10), yields that \(\log \Vert A^{-1}(e^{it})\Vert \in L_1\). Finally, Lemma C.8 is applied to conclude that A(z) is invertible for all z in the interior of \({\mathbb {D}}\). \(\square \)
The Wiener–Masani theorem can also be stated for operator-valued functions defined on the boundary of the strip \({\mathbb {S}}\). To this end, having measurable operator-valued functions \(T_b(s)\in B({\mathcal {H}})\), for \(b=0,1\), we may use the conformal map (B.2) to transform them into measurable operator-valued function on \(\partial {\mathbb {D}}\). Then, using the Wiener–Masani theorem, finding an appropriate holomorphic operator-valued function A, and pulling it back as a function on \({\mathbb {S}}\) through the conformal map, the desired factorization of \(T_b\), \(b=0,1\), is derived.
Corollary C.10
Let \(T_b(b+is)\), \(b=0,1\), be two measurable operator-valued functions, and \(\lambda _b(b+is)\) be two positive measurable functions such thatFootnote 15
and
for some \(c>0\). Then there exists a bounded holomorphic operator-valued function A(z) on \({\mathbb {S}}\) such that A(z) is invertible for all z in the interior of \({\mathbb {S}}\), and
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Beigi, S., Goodarzi, M.M. Operator-valued Schatten spaces and quantum entropies. Lett Math Phys 113, 91 (2023). https://doi.org/10.1007/s11005-023-01712-9
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11005-023-01712-9