One-Shot Randomized and Nonrandomized Partial Decoupling

Wakakuwa, Eyuri; Nakata, Yoshifumi

doi:10.1007/s00220-021-04136-5

One-Shot Randomized and Nonrandomized Partial Decoupling

Open access
Published: 16 July 2021

Volume 386, pages 589–649, (2021)
Cite this article

Download PDF

You have full access to this open access article

Communications in Mathematical Physics Aims and scope Submit manuscript

One-Shot Randomized and Nonrandomized Partial Decoupling

Download PDF

1164 Accesses
6 Citations
3 Altmetric
Explore all metrics

Abstract

We introduce a task that we call partial decoupling, in which a bipartite quantum state is transformed by a unitary operation on one of the two subsystems and then is subject to the action of a quantum channel. We assume that the subsystem is decomposed into a direct-sum-product form, which often appears in the context of quantum information theory. The unitary is chosen at random from the set of unitaries having a simple form under the decomposition. The goal of the task is to make the final state, for typical choices of the unitary, close to the averaged final state over the unitaries. We consider a one-shot scenario, and derive upper and lower bounds on the average distance between the two states. The bounds are represented simply in terms of smooth conditional entropies of quantum states involving the initial state, the channel and the decomposition. Thereby we provide generalizations of the one-shot decoupling theorem. The obtained result would lead to further development of the decoupling approaches in quantum information theory and fundamental physics.

Efficient methods for one-shot quantum communication

Article Open access 20 August 2022

Decoupling with Random Quantum Circuits

Article 18 September 2015

Reliability Function of Quantum Information Decoupling via the Sandwiched Rényi Divergence

Article 23 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Decoupling refers to the fact that we may destroy correlation between two quantum systems by applying an operation on one of the two subsystems. It has played significant roles in the development of quantum Shannon theory for a decade, particularly in proving the quantum capacity theorem [1], unifying various quantum coding theorems [2], analyzing a multipartite quantum communication task [3, 4] and in quantifying correlations in quantum states [5, 6]. It has also been applied to various fields of physics, such as the black hole information paradox [7], quantum many-body systems [8] and quantum thermodynamics [9, 10]. Dupuis et al. [11] provided one of the most general formulations of decoupling, which is often referred to as the decoupling theorem. The decoupling approach simplifies many problems of our interest, mostly due to the fact that any purification of a mixed quantum state is convertible to another reversibly [12].

All the above studies rely on the notion of random unitary, i.e., unitaries drawn at random from the set of all unitaries acting on the system, which leads to the full randomization over the whole Hilbert space. In various situations, however, the full randomization is a too strong demand. In the context of communication theory, for example, the full randomization leads to reliable transmission of quantum information, while we may be interested in sending classical information at the same time [13], for which the full randomization is more than necessary. In the context of quantum many-body physics, the random process caused by the complexity of dynamics is in general restricted by symmetry, and thus no randomization occurs among different values of conserved quantities. Hence, in order that the random-unitary-based method fits into broader context in quantum information theory and fundamental physics, it would be desirable to generalize the previous studies using the full-random unitary, to those based on random unitaries that are not fully random but with a proper structure.

As the first step toward this goal, we consider a scenario in which the unitaries take a simple form under the following direct-sum-product (DSP) decomposition of the Hilbert space:

$$\begin{aligned} {\mathcal {H}}=\bigoplus _{j=1}^J{\mathcal {H}}_j^l\otimes {\mathcal {H}}_j^r. \end{aligned}$$

(1)

Here, the superscripts l and r stand for “left” and “right”, respectively, and j is the index of the diagonal subspaces. This decomposition often appears in the context of quantum information theory, such as information-preserving structure [14, 15], the Koashi-Imoto decomposition [16], data compression of quantum mixed-state source [17], quantum Markov chains [18, 19] and simultaneous transmission of classical and quantum information [13]. Also, quantum systems with symmetry are represented by the Hilbert spaces decomposed into this form (see e.g. [20]), in which case j is the label of irreducible representations of a compact group G, ${\mathcal {H}}_j^l$ is the representation space and ${\mathcal {H}}_j^r$ is the multiplicity space for each j.

In this paper, we introduce and analyze a task that we call partial decoupling. We consider a scenario in which a bipartite quantum state $\Psi $ on system AR is subject to a unitary operation U on A, followed by the action of a quantum channel (CP map) ${\mathcal {T}}:A\rightarrow E$. The unitary is assumed to be chosen at random, not from the set of all unitaries on A, but from the subset of unitaries that take a simple form under the DSP decomposition. Thus, partial decoupling is a generalization of the decoupling theorem [11] that incorporates the DSP decomposition. Along the similar line as [11], we analyze how close the final state ${\mathcal {T}}^{A \rightarrow E} (U^A \Psi ^{AR} U^{\dagger A})$ is, on average over the unitaries, to the averaged final state ${\mathbb {E}}_{U} [ {\mathcal {T}}^{A \rightarrow E}(U^A ( \Psi ^{AR} ) U^{\dagger A})]$.

The main result in this paper is that we derive upper and lower bounds on the average distance between the final state and the averaged one. The bounds are represented in terms of the smooth conditional entropies of quantum states involving the initial state, the channel and the decomposition. For a particular case where $J=1$ and $\dim {{\mathcal {H}}_j^{A_l}}=1$, the obtained formulae are equivalent to those given by the decoupling theorem [11].

The result in this paper is applicable for generalizing any problems within the scope of the decoupling theorem by incorporating the DSP structure. Some of the applications are investigated in our papers [21,22,23,24].

In Refs. [21,22,23], we investigate communication tasks between two parties in which the information to be transmitted has both classical and quantum components. In this case, the Hilbert space ${\mathcal {H}}_j^l$ in (1) is assumed to be a one-dimensional space ${\mathbb {C}}$, and ${\mathcal {H}}_j^l$ to be the spaces with the same dimension for all j:

$$\begin{aligned} {\mathcal {H}}=\bigoplus _{j=1}^J {\mathcal {H}}_j^r, \ \ \ \ \ \ \dim {\mathcal {H}}_j^r = \dim {\mathcal {H}}_{j'}^r \ (\forall j, j'). \end{aligned}$$

(2)

Here, $j \in [1, J]$ and $\mathcal{H}_j^r$ correspond to the degrees of freedom related to classical and quantum components of the information to be transmitted, respectively. We investigate the tasks of channel coding in [21, 22] and source coding in [23] in the one-shot regime. Based on the result in this paper, we obtain general trade-off relations among the resources of classical communication, quantum communication and entanglement for those tasks.

In Ref. [24], we apply the result of partial decoupling to investigate the information paradox of quantum black holes with symmetry. Our analysis is based on the framework of Hayden–Preskill model [7], where a decoupling technique is used under the postulate that the internal dynamics of the system is given by a fully random unitary. This postulate should be modified when the system has symmetry since the dynamics cannot be fully random due to a conserved quantity. By letting j be the labeling of the conserved quantity, the internal dynamics randomizes only the multiplicity spaces $\{ {\mathcal {H}}_j^r\}$ and should be in the form of

$$\begin{aligned} U=\bigoplus _{j=1}^J I_j^l \otimes U_j^r, \end{aligned}$$

(3)

where $I_j^l$ is the identity on ${\mathcal {H}}_j^l$ and $U_j^r$ is a random unitary on ${\mathcal {H}}_j^r$. Hence, this case is also in the scope of partial decoupling with a DSP decomposition given by the symmetry. Similarly, all physical phenomena investigated based on decoupling [7,8,9,10,11] can be lifted up by partial decoupling to the situation with symmetry. We think that further significant implications on various topics will be obtained beyond these examples.

This paper is organized as follows. In Sect. 2, we introduce notations and definitions. In Sect. 3, we present formulations of the problem and the main results. Before we prove our main results, we provide discussions about implementations of our protocols by quantum circuits in Sect. 4. Section 5 describes the structure of the proofs of the main results, and provides lemmas that will be used in the proofs. The detailed proofs of the main theorems are provided in Sects. 6–8. Conclusions are given in Sect. 9. Some technical lemmas and proofs are provided in Appendices.

2 Preliminaries

We summarize notations and definitions that will be used throughout this paper. See also Appendix H for the list of notations.

2.1 Notations

We denote the set of linear operators and that of Hermitian operators on a Hilbert space ${\mathcal {H}}$ by ${\mathcal {L}}({\mathcal {H}})$ and $\mathrm{Her}({\mathcal {H}})$, respectively. For positive semidefinite operators, density operators and sub-normalized density operators, we use the following notations, respectively:

$$\begin{aligned} {\mathcal {P}}({\mathcal {H}})&= \{\rho \in \mathrm{Her}({\mathcal {H}}) : \rho \ge 0 \}, \end{aligned}$$

(4)

$$\begin{aligned} {\mathcal {S}}_=({\mathcal {H}})&= \{\rho \in {\mathcal {P}}({\mathcal {H}}) : {\mathrm {Tr}}[\rho ]=1 \}, \end{aligned}$$

(5)

$$\begin{aligned} {\mathcal {S}}_{\le }({\mathcal {H}})&= \{\rho \in {\mathcal {P}}({\mathcal {H}}) : {\mathrm {Tr}}[\rho ] \le 1 \}. \end{aligned}$$

(6)

A Hilbert space associated with a quantum system A is denoted by ${{\mathcal {H}}}^A$, and its dimension is denoted by $d_A$. A system composed of two subsystems A and B is denoted by AB. When M and N are linear operators on ${{\mathcal {H}}}^A$ and ${{\mathcal {H}}}^B$, respectively, we denote $M\otimes N$ as $M^A\otimes N^B$ for clarity. In the case of pure states, we often abbreviate $|\psi \rangle ^A\otimes |\phi \rangle ^B$ as $|\psi \rangle ^A|\phi \rangle ^B$. For $\rho ^{AB} \in {\mathcal {L}}({\mathcal {H}}^{AB})$, $\rho ^{A}$ represents $\mathrm{Tr}_B[\rho ^{AB}]$. We denote $|\psi \rangle \!\langle \psi |$ simply by $\psi $. The maximally entangled state between A and $A'$, where ${\mathcal {H}}^{A} \cong {\mathcal {H}}^{A'}$, is denoted by ${|\Phi \rangle }^{AA'}$ or $\Phi ^{AA'}$. The identity operator is denoted by I. We denote $(M^A\otimes I^B){|\psi \rangle }^{AB}$ as $M^A{|\psi \rangle }^{AB}$, and $(M^A\otimes I^B)\rho ^{AB}(M^A\otimes I^B)^{\dagger }$ as $M^A\rho ^{AB}M^{A\dagger }$.

When ${{\mathcal {E}}}$ is a supermap from ${\mathcal {L}}({\mathcal {H}}^{A})$ to ${\mathcal {L}}({\mathcal {H}}^{B})$, we denote it by ${\mathcal {E}}^{A \rightarrow B}$. When $A = B$, we use ${\mathcal {E}}^{A}$ for short. We also denote $({{\mathcal {E}}}^{A \rightarrow B} \otimes \mathrm{id}^C)(\rho ^{AC})$ by ${{\mathcal {E}}}^{A \rightarrow B} (\rho ^{AC})$. The set of linear completely-positive (CP) supermaps from A to B is denoted by ${\mathcal {C}}{\mathcal {P}}(A\rightarrow B)$, and the subset of trace non-increasing (resp. trace preserving) ones by ${\mathcal {C}}{\mathcal {P}}_\le (A\rightarrow B)$ (resp. ${\mathcal {C}}{\mathcal {P}}_=(A\rightarrow B)$). When a supermap is given by a conjugation of a unitary $U^A$ or an isometry $W^{A \rightarrow B}$, we especially denote it by its calligraphic font such as

$$\begin{aligned} {\mathcal {U}}^{A}(X^A):= (U^{A }) X^A (U^{A })^{\dagger }, \quad {\mathcal {W}}^{A \rightarrow B}(X^A):= (W^{A \rightarrow B}) X^A (W^{A \rightarrow B})^{\dagger }. \end{aligned}$$

(7)

Let A be a quantum system such that the associated Hilbert space ${\mathcal {H}}^A$ is decomposed into the DSP form as

$$\begin{aligned} {\mathcal {H}}^A=\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}. \end{aligned}$$

(8)

For the dimension of each subspace, we introduce the following notation:

$$\begin{aligned} l_j:=\dim {{\mathcal {H}}_j^{A_l}},\quad r_j:=\dim {{\mathcal {H}}_j^{A_r}}. \end{aligned}$$

(9)

We denote by $\Pi _j^A$ the projection onto a subspace ${\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}\subseteq {\mathcal {H}}^A$ for each j. For any quantum system R and any $X\in {\mathcal {L}}({\mathcal {H}}^A\otimes {\mathcal {H}}^R)$, we introduce a notation

$$\begin{aligned} X_{jk}^{AR}:=\Pi _j^AX^{AR}\Pi _k^A, \end{aligned}$$

(10)

which leads to $X^{AR}=\sum _{j,k=1}^JX_{jk}^{AR}$.

2.2 Norms and distances

For a linear operator X, the trace norm is defined as $|\! | X |\! |_1 = {\mathrm {Tr}}[ \sqrt{X^{\dagger }X}]$, and the Hilbert-Schmidt norm as $|\! | X |\! |_2 = \sqrt{{\mathrm {Tr}}[ X^{\dagger }X]}$. The trace distance between two unnormalized states $\rho ,\rho '\in {\mathcal {P}}({\mathcal {H}})$ is defined by $\Vert \rho -\rho '\Vert _1$. For subnormalized states $\rho ,\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})$, the generalized fidelity and the purified distance are defined by

$$\begin{aligned} {\bar{F}}(\rho ,\rho ') := \Vert \sqrt{\rho }\sqrt{\rho '}\Vert _1 + \sqrt{(1-\mathrm{Tr}[\rho ])(1-\mathrm{Tr}[\rho '])}, \quad P(\rho ,\rho ') := \sqrt{1-{\bar{F}}(\rho ,\rho ')^2}, \end{aligned}$$

(11)

respectively [25]. The epsilon ball of a subnormalized state $\rho \in {\mathcal {S}}_\le ({\mathcal {H}})$ is defined by

$$\begin{aligned} {\mathcal {B}}^\epsilon (\rho ):=\{\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})|\,P(\rho ,\rho ')\le \epsilon \}. \end{aligned}$$

(12)

For a linear superoperator ${\mathcal {E}}^{A \rightarrow B}$, we define the DSP norm by

$$\begin{aligned} \Vert {\mathcal {E}}^{A \rightarrow B} \Vert _{\mathrm{DSP}}:= \sup _{C,\,\xi } \Vert {\mathcal {E}}^{A \rightarrow B}(\xi ^{AC}) \Vert _1, \end{aligned}$$

(13)

where the supremum is taken over all finite dimensional quantum systems C and all subnormalized states $\xi \in {\mathcal {S}}_\le ({\mathcal {H}}^{AC})$ such that the reduced state on A is decomposed in the form of

$$\begin{aligned} \xi ^A=\bigoplus _{j=1}^Jq_j \varpi _j^{A_l}\otimes \pi _j^{A_r}. \end{aligned}$$

(14)

Here, $\{q_j\}_{j=1}^J$ is a probability distribution, $\{\varpi _j\}_{j=1}^J$ is a set of subnormalized states on ${\mathcal {H}}_j^{A_l}$ and $\pi _j^{A_r}$ is the maximally mixed state on ${\mathcal {H}}_j^{A_r}$. The epsilon ball of linear CP maps with respect to the DSP norm is defined by

$$\begin{aligned} {\mathcal {B}}_{\mathrm{DSP}}^\epsilon ({\mathcal {E}}):= \{{\mathcal {E}}'\in {\mathcal {C}}{\mathcal {P}}_\le (A\rightarrow B)\,|\,\Vert {\mathcal {E}}'-{\mathcal {E}}\Vert _{\mathrm{DSP}}\le \epsilon \}. \end{aligned}$$

(15)

For quantum systems V, W, a linear operator $X\in {\mathcal {L}}({\mathcal {H}}^{VW})$ and a subnormalized state $\varsigma \in {\mathcal {S}}_\le ({\mathcal {H}}^W)$, we introduce the following notation:

$$\begin{aligned} |\!| X^{VW} |\!|_{2,\varsigma ^W}:=|\!| (\varsigma ^W)^{-1/4} X^{VW} (\varsigma ^W)^{-1/4} |\!|_{2}. \end{aligned}$$

(16)

This includes the case where V is a trivial (one-dimensional) system, in which case $X^{VW}=X^W$. We omit the superscript W for $\varsigma $ when there is no fear of confusion.

2.3 One-shot entropies

For any subnormalized state $\rho \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})$ and normalized state $\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^{B})$, define

$$\begin{aligned} H_{\mathrm {min}}(A|B)_{\rho |\varsigma }&:= \sup \{ \lambda \in {\mathbb {R}}| 2^{-\lambda } I^A \otimes \varsigma ^B \ge \rho ^{AB} \}, \end{aligned}$$

(17)

$$\begin{aligned} H_{\mathrm {max}}(A|B)_{\rho |\varsigma }&:= \log {\Vert \sqrt{\rho ^{AB}}\sqrt{I^A\otimes \varsigma ^B}\Vert _1^2}, \end{aligned}$$

(18)

$$\begin{aligned} H_2(A|B)_{\rho |\varsigma }&:= - \log {\mathrm {Tr}}\bigl [ \bigl ( (\varsigma ^B)^{-1/4} \rho ^{AB} (\varsigma ^B)^{-1/4} \bigr )^2 \bigr ]. \end{aligned}$$

(19)

The conditional min-, max- and collision entropies (see e.g. [26]) are defined by

$$\begin{aligned} H_{\mathrm{min}}(A|B)_{\rho }&:= \sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_{\mathrm{min}}(A|B)_{\rho |\varsigma }, \end{aligned}$$

(20)

$$\begin{aligned} H_{\mathrm{max}}(A|B)_{\rho }&:= \sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_{\mathrm{max}}(A|B)_{\rho |\varsigma }, \end{aligned}$$

(21)

$$\begin{aligned} H_2(A|B)_{\rho }&:= \sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_2(A|B)_{\rho |\varsigma }, \end{aligned}$$

(22)

respectively. The smoothed versions are of the key importance when we are interested in the one-shot scenario. We particularly use the smooth conditional min- and max-entropies:

$$\begin{aligned} H_{\mathrm{min}}^\epsilon (A|B)_{\rho }&:= \sup _{{\hat{\rho }}^{AB} \in {\mathcal {B}}^\epsilon (\rho )}H_{\mathrm{min}}(A|B)_{{{\hat{\rho }}}}, \end{aligned}$$

(23)

$$\begin{aligned} H_{\mathrm{max}}^\epsilon (A|B)_{\rho }&:= \inf _{{\hat{\rho }}^{AB} \in {\mathcal {B}}^\epsilon (\rho )}H_{\mathrm{max}}(A|B)_{{{\hat{\rho }}}} \end{aligned}$$

(24)

for $\epsilon \ge 0$. Note that Expressions (17)–(22) can be generalized to the case where $\rho \in {\mathcal {P}}({\mathcal {H}})$.

2.4 Choi–Jamiołkowski representation

Let ${\mathcal {T}}^{A \rightarrow B}$ be a linear supermap from ${\mathcal {L}}({\mathcal {H}}^A)$ to ${\mathcal {L}}({\mathcal {H}}^B)$, and let $\Phi ^{AA'}$ be the maximally entangled state between A and $A'$. A linear operator ${\mathfrak {J}}({\mathcal {T}}^{A \rightarrow B})\in {\mathcal {L}}({\mathcal {H}}^{AB})$ defined by ${\mathfrak {J}}({\mathcal {T}}^{A \rightarrow B}) := {\mathcal {T}}^{A' \rightarrow B}(\Phi ^{AA'})$ is called the Choi–Jamiołkowski representation of ${\mathcal {T}}$ [27, 28]. The representation is an isomorphism. The inverse map is given by, for an operator $X^{AB} \in {\mathcal {L}}({\mathcal {H}}^{AB})$,

$$\begin{aligned} {\mathfrak {J}}^{-1}(X^{AB}) (\varsigma ^A) = d_A {\mathrm {Tr}}_A \bigl [ (\varsigma ^{A^T} \otimes I^B) X^{AB} \bigr ], \end{aligned}$$

(25)

where $A^T$ denotes the transposition of A with respect to the Schmidt basis of $\Phi ^{AA'}$. When ${\mathcal {T}}$ is completely positive, then ${\mathfrak {J}}({\mathcal {T}}^{A \rightarrow B})$ is an unnormalized state on AB and is called the Choi–Jamiołkowski state of ${\mathcal {T}}$.

Note that the Choi–Jamiołkowski representation depends on the choice of the maximally entangled state $\Phi ^{AA'}$, i.e., the Schmidt basis thereof. When ${\mathcal {H}}^{A}$ is decomposed into the DSP form as (8), the isomorphic space ${\mathcal {H}}^{A'}$ is decomposed into the same form. In the rest of this paper, we fix the maximally entangled state $\Phi ^{AA'}$, which is decomposed as

$$\begin{aligned} {|\Phi \rangle }^{AA'}=\bigoplus _{j=1}^J\sqrt{\frac{l_jr_j}{d_A}}|\Phi _j^l\rangle ^{A_lA_l'}|\Phi _j^r\rangle ^{A_rA_r'}, \end{aligned}$$

(26)

where $\Phi _j^l$ and $\Phi _j^r$ are fixed maximally entangled states on ${\mathcal {H}}^{A_l}_j\otimes {\mathcal {H}}^{A_l'}_j$ and ${\mathcal {H}}^{A_r}_j\otimes {\mathcal {H}}^{A_r'}_j$, respectively.

2.5 Random unitaries

Random unitaries play a crucial role in the analyses of one-shot decoupling. By using them, it can be shown that there exists at least one unitary that achieves the desired task. In particular, the Haar measure on the unitary group is often used. The Haar measure $\mathsf{H}$ on the unitary group is the unique unitarily invariant provability measure, often called uniform distribution of the unitary group. When a random unitary U is chosen uniformly at random with respect to the Haar measure, it is referred to as a Haar random unitary and is denoted by $U \sim \mathsf{H}$.

The most important property of the Haar measure is the left- and right-unitary invariance: for a Haar random unitary $U \sim \mathsf{H}$ and any unitary V, the random unitaries VU and UV are both distributed uniformly with respect to the Haar measure. This property combined with the Schur–Weyl duality enables us to explicitly study the averages of many functions on the unitary group over the Haar measure. In the following, the average of a function f(U) on the unitary group over the Haar measure is denoted by ${\mathbb {E}}_{U \sim \mathsf{H}} [f]$.

In this paper, however, we are interested in the case where the Hilbert space is decomposed into the DSP form: ${\mathcal {H}}^A=\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}$, and mainly consider the unitaries that act non-trivially only on $\{ {\mathcal {H}}_j^{A_r} \}_{j=1}^{J}$ such as the untiary in the form of $\bigoplus _{j=1}^J I_j^{A_l}\otimes U_j^{A_r}$, where $U_j^{A_r}$ is a unitary on ${\mathcal {H}}_j^{A_r}$. In this case, we can naturally introduce a product $\mathsf{H}_{\times }$ of the Haar measures by

$$\begin{aligned} \mathsf{H}_{\times } = \mathsf{H}_{1} \times \cdots \times \mathsf{H}_{J}, \end{aligned}$$

(27)

where $\mathsf{H}_{j}$ is the Haar measure on the unitary group on ${\mathcal {H}}_j^{A_r}$ for any j. Hence, when we write $U \sim \mathsf{H}_{\times }$ below, it means that U is in the form of $\bigoplus _{j=1}^J I_j^{A_l}\otimes U_j^{A_r}$ and $U_j^{A_r} \sim \mathsf{H}_j$.

3 Main Results

We consider two scenarios in which a bipartite quantum state $\Psi ^{AR}$ is transformed by a unitary operation on A and then is subject to the action of a quantum channel (linear CP map) ${\mathcal {T}}^{A\rightarrow E}$. The unitary is chosen at uniformly random from the set of unitaries that take a simple form under the DSP decomposition (1).

In the first scenario, which we call non-randomized partial decoupling, the unitaries are such that they completely randomize the space ${\mathcal {H}}_j^{A_r}$ for each j, while having no effect on j or the space ${\mathcal {H}}_j^{A_l}$. This scenario may find applications when complex quantum many-body systems are investigated based on the decoupling approach, in which case the DSP decomposition is, for instance, induced by the symmetry the system has. In the second scenario, which we refer to as randomized partial decoupling, we assume that $\mathrm{dim}{\mathcal {H}}_j^{A_l}=1$ and that $\mathrm{dim}{\mathcal {H}}_j^{A_r}$ does not depend on j. The unitaries do not only completely randomize the space ${\mathcal {H}}^{A_r}$, but also randomly permute j. This scenario may fit to the communication problems. For instance, one of the applications may be classical-quantum hybrid communicational tasks, where the division of the classical and quantum information leads to the DSP decomposition.

For both scenarios, our concern is how close the final state is, after the action of the unitary and the quantum channel, to the averaged final state over all unitaries. It should be noted that the averaged final state is in the form of a block-wise decoupled state in general. This is in contrast to the decoupling theorem, in which the averaged final state is a fully decoupled state.

3.1 Non-randomized partial decoupling

Let us consider the situation where U has the DSP form: $U:=\bigoplus _{j=1}^J I_j^{A_l} \otimes U_j^{A_r}$. For any state $\Psi ^{AR}$, the averaged state obtained after the action of the random unitary $U \sim \mathsf{H}_{\times }$ is given by

$$\begin{aligned} \Psi _{\mathrm{av}}^{AR} :={\mathbb {E}}_{U \sim \mathsf{H}_{\times }} [ U^A ( \Psi ^{AR} ) U^{\dagger A}] = \bigoplus _{j=1}^J \Psi _{jj}^{A_lR}\otimes \pi _j^{A_r}. \end{aligned}$$

(28)

Here, $\pi _j^{A_r}$ is the maximally mixed state on ${\mathcal {H}}_j^{A_r}$, and $\Psi _{jj}^{A_lR}$ is an unnormalized state on ${\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}^R$ defined by

$$\begin{aligned} \Psi _{jj}^{A_lR}:=\mathrm{Tr}_{A_r}[\Psi _{jj}^{AR}]=\mathrm{Tr}_{A_r}[\Pi _j^A\Psi ^{AR}\Pi _j^A]. \end{aligned}$$

(29)

Our interest is on the average distance between the state ${\mathcal {T}}^{A \rightarrow E} (U^A \Psi ^{AR} U^{\dagger A}) $ and the averaged state ${\mathcal {T}}^{A \rightarrow E}(\Psi _{\mathrm{av}}^{AR})$ over all $U \sim \mathsf{H}_{\times }$.

For expressing the upper bound on the average distance, we introduce a quantum system $A^*$ represented by a Hilbert space

$$\begin{aligned} {\mathcal {H}}^{A^*}:=\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}_j^{{\bar{A}}_r}, \end{aligned}$$

(30)

and a linear operator $F^{A{\bar{A}}\rightarrow A^*}: {\mathcal {H}}^A\otimes {\mathcal {H}}^{{\bar{A}}} \rightarrow {\mathcal {H}}^{A^*}$ defined by

$$\begin{aligned}&F^{A{\bar{A}}\rightarrow A^*}:= \bigoplus _{j=1}^J \sqrt{\frac{d_Al_j}{r_j}} \langle \Phi _j^l|^{A_l{\bar{A}}_l}(\Pi _j^{A} \otimes \Pi _j^{{\bar{A}}}), \end{aligned}$$

(31)

where ${\mathcal {H}}_j^{{\bar{A}}_l}\cong {\mathcal {H}}_j^{A_l}$, ${\mathcal {H}}_j^{{\bar{A}}_r}\cong {\mathcal {H}}_j^{A_r}$ and ${\mathcal {H}}^{{\bar{A}}}\cong {\mathcal {H}}^{A}$.

The following is our first main theorem about the upper bound:

Theorem 1

(Main result 1: One-shot non-randomized partial decoupling.) For any $\epsilon ,\mu \ge 0$, any subnormalized state $\Psi ^{AR} \in {\mathcal {S}}_\le ({\mathcal {H}}^{AR})$ and any linear CP map ${\mathcal {T}}^{A \rightarrow E}$, it holds that

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( \Psi ^{AR} ) - {\mathcal {T}}^{A \rightarrow E}(\Psi _{\mathrm{av}}^{AR}) \right\| _1 \right] \nonumber \\&\quad \le 2^{-\frac{1}{2} H_{\mathrm{min}}^{\epsilon ,\mu }(A^*|RE)_{{\Lambda }(\Psi ,{\mathcal {T}})}}+2(\epsilon \Vert {\mathcal {T}}\Vert _{\mathrm{DSP}}+\mu +\epsilon \mu ). \end{aligned}$$

(32)

Here, $H_{\mathrm{min}}^{\epsilon ,\mu }(A^*|RE)_{{\Lambda }(\Psi ,{\mathcal {T}})}$ is the smooth conditional min-entropy for an unnormalized state ${\Lambda }(\Psi ,{\mathcal {T}})$, defined by $F(\Psi ^{AR}\otimes \tau ^{{\bar{A}}E})F^\dagger $ with $\tau ^{AE} = {\mathfrak {J}}({\mathcal {T}}^{A \rightarrow E})$ being the Choi–Jamiołkowski representation of ${\mathcal {T}}^{A \rightarrow E}$. It is explicitly given by

$$\begin{aligned} H_{\mathrm{min}}^{\epsilon ,\mu }(A^*|RE)_{{\Lambda }(\Psi ,{\mathcal {T}})} :=\sup _{\Psi '\in {\mathcal {B}}^\epsilon (\Psi )}\sup _{{\mathcal {T}}'\in {\mathcal {B}}_{\mathrm{DSP}}^{\mu }({\mathcal {T}})} H_{\mathrm{min}}(A^*|RE)_{{\Lambda }(\Psi ',{\mathcal {T}}')}, \end{aligned}$$

(33)

where ${\mathcal {B}}_{\mathrm{DSP}}^{\mu }({\mathcal {T}})$ is the set of $\mu $-neighbourhoods of ${\mathcal {T}}$, defined by (15).

In the literature of chaotic quantum many-body systems, it is often assumed that the dynamics is approximated well by a random unitary channel, which is sometimes called scrambling [7, 29, 30]. Despite the fact that a number of novel research topics have been opened based on the idea of scrambling, some of which are using the decoupling approach [7, 9, 10], symmetry of the physical systems has rarely been taken into account properly. When the system has symmetry, the associated Hilbert space is naturally decomposed into a DSP form as

$$\begin{aligned} {\mathcal {H}}^A=\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}, \end{aligned}$$

(34)

where j is the label of irreducible representations of a compact group of the symmetry, ${\mathcal {H}}_j^{A_l}$ is the irreducible representation and ${\mathcal {H}}_j^{A_r}$ corresponds to the multiplicity for each j. Due to the conservation law, the scrambling dynamics in the system should be compatible with this decomposition and should be in the form of $U^A=\bigoplus _{j=1}^J I_j^{A_l}\otimes U_j^{A_r}$. Hence, Theorem 1 is applicable to the study of complex physics in chaotic quantum many-body systems with symmetry.

Theorem 1 reduces to a simpler form when the symmetry is abelian. In this case, all the irreducible representation one-dimensional, i.e., $\dim {\mathcal {H}}_j^{A_l} =1$. The averaged output state is explicitly calculated to be

$$\begin{aligned} {\mathcal {T}}^{A \rightarrow E}(\Psi _{\mathrm{av}}^{AR})= \bigoplus _{j=1}^J \frac{d_A}{r_j} ({\mathrm {Tr}}_A[ \Pi _j^{A} \Psi ^{AR} ]) \otimes ({\mathrm {Tr}}_{{\bar{A}}}[\Pi _j^{{\bar{A}}} \tau ^{{\bar{A}}E}]). \end{aligned}$$

(35)

The operator $F^{A {\bar{A}} \rightarrow A^*}$ in (31) reduces to a direct sum of operators that are proportional to projectors, and the operator $\Lambda (\Psi , \mathcal{T}) \in \mathcal{S}_{\le }(\mathcal{H}^{A^*RE})$ in Theorem 1 reduces to

$$\begin{aligned} \Lambda (\Psi , \mathcal{T})= \bigoplus _{j, j'=1}^J \frac{d_A}{\sqrt{r_j r_{j'}}} (\Pi _j^{A} \Psi ^{AR} \Pi _{j'}^{A}) \otimes (\Pi _j^{{\bar{A}}} \tau ^{{\bar{A}}E} \Pi _{j'}^{{\bar{A}}}). \end{aligned}$$

(36)

Theorem 1 implies that, if the smooth conditional min-entropy of the unnormalized state $\Lambda (\Psi , \mathcal{T})$ is sufficiently large, the final state ${\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( \Psi ^{AR} )$ is close to ${\mathcal {T}}^{A \rightarrow E}(\Psi _{\mathrm{av}}^{AR})$.

3.2 Randomized partial decoupling

Next we assume that

$$\begin{aligned} \dim {\mathcal {H}}_j^l=1,\quad \dim {\mathcal {H}}_j^r=r \quad (j=1,\ldots , J). \end{aligned}$$

(37)

The Hilbert space ${\mathcal {H}}^A=\oplus _{j=1}^J{{\mathcal {H}}}_j^{A_r}$ is then isomorphic to a tensor product Hilbert space ${\mathcal {H}}^{A_c} \otimes {\mathcal {H}}^{A_r}$, i.e., $A\cong A_cA_r$. Here, ${{\mathcal {H}}}^{A_c}$ is a J-dimensional Hilbert space with a fixed orthonormal basis $\{|j\rangle \}_{j=1}^J$, and ${{\mathcal {H}}}^{A_r}$ is an r-dimensional Hilbert space. We consider a random unitary U on system A of the form

$$\begin{aligned} U:=\sum _{j=1}^J{{|j\rangle }\!{\langle j|}}^{A_c} \otimes U_j^{A_r}, \end{aligned}$$

(38)

which we also denote by $U \sim \mathsf{H}_{\times }$. In addition, let ${\mathbb {P}}$ be the permutation group on $[1,\ldots ,J]$, and $\mathsf{P}$ be the uniform distribution on ${\mathbb {P}}$. We define a unitary $G_\sigma $ for any $\sigma \in {\mathbb {P}}$ by

$$\begin{aligned} G_\sigma :=\sum _{j=1}^J{{|\sigma (j)\rangle }\!{\langle j|}}^{A_c} \otimes I^{A_r}. \end{aligned}$$

(39)

We denote the supermap given by conjugation of $G_\sigma $ by the calligraphic font as ${\mathcal {G}}_\sigma (\cdot )=G_\sigma (\cdot )G_\sigma ^\dagger $. For the initial state, we use the notion of classically coherent states, defined as follows:

Definition 2

(classically coherent states [31]) Let $K_1$ and $K_2$ be d-dimensional quantum systems with fixed orthonormal bases $\{|k_1\rangle \}_{k_1=1}^d$ and $\{|k_2\rangle \}_{k_2=1}^d$, respectively, and let W be a quantum system. An unnormalized state $\varrho \in {\mathcal {P}}({\mathcal {H}}^{K_1K_2W})$ is said to be classically coherent in $K_1K_2$ if it satisfies $\varrho {|k\rangle }^{K_1}{|k'\rangle }^{K_2}=0$ for any $k\ne k'$, or equivalently, if $\varrho $ is in the form of

$$\begin{aligned} \varrho ^{K_1K_2W}=\sum _{k,k'=1}^d{{|k\rangle }\!{\langle k'|}}^{K_1}\otimes {{|k\rangle }\!{\langle k'|}}^{K_2}\otimes \varrho _{kk'}^W, \end{aligned}$$

(40)

where $\varrho _{kk'}\in {\mathcal {L}}({\mathcal {H}}^{W})$ for each k and $k'$.

We now provide our second main result:

Theorem 3

(Main result 2: One-shot randomized partial decoupling.) Let $\epsilon ,\mu \ge 0, R\cong R_cR_r, \Psi ^{AR}$ be a subnormalized state that is classically coherent in $A_cR_c$, and ${\mathcal {T}}^{A \rightarrow E}$ be a linear CP map such that the Choi–Jamiołkowski representation $\tau ^{AE} = {\mathfrak {J}}({\mathcal {T}}^{A\rightarrow E})$ satisfies $\mathrm{Tr}[\tau ]\le 1$. It holds that

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }, \sigma \sim \mathsf{P} } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) -{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} ) \right\| _1 \right] \nonumber \\&\quad \!\le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_I} +\beta (A_r)\cdot 2^{-\frac{1}{2}H_{I\!I}} +4(\epsilon +\mu +\epsilon \mu ), \end{aligned}$$

(41)

where $\Psi _{\mathrm{av}}^{AR}:={\mathbb {E}}_{U \sim \mathsf{H}_{\times }} [ {\mathcal {U}}^A ( \Psi ^{AR} ) ]$. The function $\alpha (J)$ is 0 for $J=1$ and $\frac{1}{J-1}$ for $J\ge 2$, and $\beta (A_r)$ is 0 for $\mathrm{dim}{\mathcal {H}}^{A_r}=1$ and 1 for $\mathrm{dim}{\mathcal {H}}^{A_r}\ge 2$. The exponents $H_I$ and $H_{I\!I}$ are given by

$$\begin{aligned} H_I= H_{\mathrm{min}}^\epsilon (A|R)_{\Psi }-H_{\mathrm{max}}^\mu (A|B)_{{\mathcal {C}}(\tau )}, \quad H_{I\!I}= H_{\mathrm{min}}^\epsilon (A|R)_{{\mathcal {C}}(\Psi )}-H_{\mathrm{max}}^\mu (A_r|BA_c)_{{\mathcal {C}}(\tau )}. \end{aligned}$$

(42)

Here, ${\mathcal {C}}$ is the completely dephasing channel on $A_c$ with respect to the basis $\{|j\rangle \}_{j=1}^J$, and $\tau ^{AB} = {\mathfrak {J}}({\mathcal {T}}^{A\rightarrow B})$ is the Choi–Jamiołkowski representation of the complementary channel ${\mathcal {T}}^{A\rightarrow B}$ of ${\mathcal {T}}^{A\rightarrow E}$.

Note that, since the subnormalized state $\Psi ^{AR}$ is classically coherent in $A_cR_c$, the averaged state $\Psi _{\mathrm{av}}^{AR}$ is explicitly given by

$$\begin{aligned} \Psi _{\mathrm{av}}^{AR}=\sum _{j=1}^J{{|j\rangle }\!{\langle j|}}^{A_c}\otimes \pi ^{A_r}\otimes \Psi _{jj}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}. \end{aligned}$$

(43)

Small error for one-shot randomized partial decoupling implies that the third party having the purifying system of the final state may recover both classical and quantum parts of correlation in $\Psi ^{AR}$. Thus, it will be applicable, e.g., for analyzing simultaneous transmission of classical and quantum information in the presence of quantum side information. In this context, $H_I$ in the above expression quantifies how well the total correlation in $\Psi ^{AR}$ can be transmitted by the channel ${\mathcal {T}}^{A\rightarrow B}$, whereas $H_{I\!I}$ for only quantum part thereof (see [21,22,23]).

3.3 A converse bound

So far, we have presented achievabilities of non-randomized and randomized partial decoupling. At this point, we do not know whether the obtained bounds are “sufficiently tight”. To address this question, we prove a converse bound for partial decoupling. We assume the following two conditions for the converse:

Converse Condition 1:: $\dim {\mathcal {H}}_j^l=1,\quad \dim {\mathcal {H}}_j^r=r \quad (j=1,\ldots , J)$.
Converse Condition 2:: The initial (normalized) state $\Psi ^{AR}$ is classically coherent in $A_cR_c, \text {where } R\cong R_cR_r$.

Throughout the paper, we refer to the conditions as CC1 and CC2, respectively. The two conditions are always satisfied in the case of randomized partial decoupling, but not necessarily satisfied in the case of non-randomize one. Consequently, the converse bound we prove below is directly applicable to randomized partial decoupling, but is not applicable to non-randomized partial decoupling in general.

The converse bound is stated by the following theorem.

Theorem 4

(Main result 3: Converse for partial decoupling.) Suppose that CC1 and CC2 are satisfied. Let $|\Psi \rangle ^{ARD}$ be a purification of a normalized state $\Psi ^{AR}\in {\mathcal {S}}_=({\mathcal {H}}^{AR})$, which is classically coherent in $A_cR_c$ due to CC2, and ${\mathcal {T}}^{A \rightarrow E}$ be a trace preserving CP map with the complementary channel ${\mathcal {T}}^{A \rightarrow B}$. Suppose that, for $\delta >0$, there exists a normalized state $\Omega ^{ER}:=\sum _{j=1}^J\varsigma _j^E\otimes \Psi _{jj}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}$, where $\{\varsigma _j\}_{j=1}^J$ are normalized states on E, such that

$$\begin{aligned} \left\| {\mathcal {T}}^{A \rightarrow E} ( \Psi ^{AR} ) -\Omega ^{ER} \right\| _1 \le \delta . \end{aligned}$$

(44)

Then, for any $\upsilon \in [0,1/2)$ and $\iota \in (0,1]$, it holds that

$$\begin{aligned} \!\! H_{\mathrm {min}}^{\lambda }(A|R)_\Psi -H_{\mathrm {min}}^{\upsilon }(RD|B)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )}+\log {J}&\ge \log {\iota }, \end{aligned}$$

(45)

$$\begin{aligned} \!\! H_{\mathrm {min}}^{\lambda '}(A|R)_{{\mathcal {C}}(\Psi )} -H_{\mathrm {min}}^{\upsilon }(R_rD|BR_c)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )}&\ge \log {\iota }+\log {(1-2\upsilon )}, \end{aligned}$$

(46)

where ${\mathcal {C}}$ is the completely dephasing channel on $A_c$, and the smoothing parameters $\lambda $ and $\lambda '$ are defined by

$$\begin{aligned}&\lambda := 2\sqrt{\iota +4\sqrt{20\upsilon +2\delta }} +\sqrt{2\sqrt{20\upsilon +2\delta }}+2\sqrt{2\delta } +2\sqrt{20\upsilon +2\delta } +3\upsilon , \end{aligned}$$

(47)

$$\begin{aligned}&\lambda ':= \upsilon +\sqrt{4\sqrt{\iota +2x}+2\sqrt{x}+(4\sqrt{\iota +8}+24) x } \end{aligned}$$

(48)

and $x:=\sqrt{2}\root 4 \of {24\upsilon +2\delta }$.

Note that, when a quantum channel ${\mathcal {T}}^{A\rightarrow E}$ achieves partial decoupling for a state $\Psi ^{AR}$ within a small error, it follows from the decomposition of $\Psi _{\mathrm{av}}$ (see (43)) that

$$\begin{aligned} {\mathcal {T}}^{A\rightarrow E}(\Psi ^{AR})\approx {\mathcal {T}}^{A\rightarrow E}(\Psi _{\mathrm{av}}^{AR})=\sum _{j=1}^J{\hat{\tau }}_j^E\otimes \Psi _{jj}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}, \end{aligned}$$

(49)

where ${\hat{\tau }}_j^E:={\mathcal {T}}^{A\rightarrow E}({{|j\rangle }\!{\langle j|}}^{A_c}\otimes \pi ^{A_r})\in \mathcal {S}_=({\mathcal {H}}^E)$. This is in the same form as the assumption of Theorem 4.

Let us compare the direct part of randomized partial decoupling (Theorem 3) and the converse bound presented above. In the case of $J\ge 2$, the first term in the R.H.S. of the achievability bound (41) is calculated to be

$$\begin{aligned} -2\log {\left( \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_I}\right) } = H_{\mathrm{min}}^\epsilon (A|R)_{\Psi }-H_{\mathrm{max}}^\mu (A|B)_{{\mathcal {C}}(\tau )}+\log {(J-1)}. \end{aligned}$$

(50)

On the other hand, the converse bound (45) yields

$$\begin{aligned} H_{\mathrm{min}}^{\lambda }(A|R)_\Psi -H_{\mathrm{min}}^\mu (A|B)_{{\mathcal {C}}(\psi )}+\log {J} \ge \log {\iota }, \end{aligned}$$

(51)

where $\psi ^{AB}:={\mathcal {T}}^{A'\rightarrow B}(\Psi _p^{AA'})$, with $|\Psi _p\rangle ^{AA'}$ being a purification of $\Psi ^A$ and ${\mathcal {H}}^A\cong {\mathcal {H}}^{A'}$. Note that there exists a linear isometry from $A'$ to RD that maps $|\Psi _p\rangle $ to $|\Psi \rangle $ [12], and that the conditional max entropy is invariant under local isometry (see Lemma 21 below). A similar argument also applies to the second term in (41) and (46). Thus, when $\Psi ^A$ is the maximally mixed state, in which case $|\Psi _p\rangle ^{AA'}=|\Phi \rangle ^{AA'}$ and thus $\psi =\tau $, the gap between the two bounds is only due to the difference in values of smoothing parameters and types of conditional entropies. By the fully quantum asymptotic equipartition property [32], this gap vanishes in the limit of infinitely many copies. From this viewpoint, we conclude that the achievability bound of randomized partial decoupling and the converse bound are sufficiently tight.

3.4 Reduction to the existing results

We briefly show that the existing results on one-shot decoupling [11] and dequantization [31] are obtained from Theorems 1, 3 and 4 as corollaries, up to changes in smoothing parameters. Thus, our results are indeed generalizations of these two tasks.

First, by letting $J=1$ in Theorem 3, we obtain the achievability of one-shot decoupling:

Corollary 5

(Achievability for one-shot decoupling: Theorem 3.1 in [11]) Let $\epsilon ,\mu \ge 0$, $\Psi ^{AR}$ be a subnormalized state, and ${\mathcal {T}}^{A \rightarrow E}$ be a linear CP map such that the Choi–Jamiołkowski representation $\tau ^{AE} = {\mathfrak {J}}({\mathcal {T}}^{A\rightarrow E})$ satisfies $\mathrm{Tr}[\tau ]\le 1$. Let $U\sim \mathsf{H}$ be the Haar random unitary on ${\mathcal {H}}^A$. Then, it holds that

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}} \left[ \left\| {\mathcal {T}}^{A \rightarrow E}\circ {\mathcal {U}}^A ( \Psi ^{AR} ) -\tau ^E\otimes \Psi ^{R} \right\| _1 \right] \nonumber \\&\quad \le 2^{-\frac{1}{2}[H_{\mathrm{min}}^\epsilon (A|R)_{\Psi }+H_{\mathrm{min}}^\mu (A|E)_{\tau }]} +4(\epsilon +\mu +\epsilon \mu ). \end{aligned}$$

(52)

Note that the duality of the conditional min and max entropies ([25]: see also Lemma 24 in Sect. 5.2.2) implies $H_{\mathrm{min}}^\mu (A|E)_{\tau }=-H_{\mathrm{max}}^\mu (A|B)_{\tau }$, with $\tau ^{AB} = {\mathfrak {J}}({\mathcal {T}}^{A\rightarrow B})$ being the Choi–Jamiołkowski representation of the complementary channel ${\mathcal {T}}^{A\rightarrow B}$ of ${\mathcal {T}}^{A\rightarrow E}$. A similar bound is also obtained by letting $J=1$ and $\mathrm{dim}{\mathcal {H}}_j^{A_l}=1$ in Theorem 1. A converse bound for one-shot decoupling is obtained by letting $J=1$ in Theorem 4, and by using the duality of the conditional entropies, as follows:

Corollary 6

(Converse for one-shot decoupling: Theorem 4.1 in [11] ) Consider a normalized state $\Psi ^{AR}\in {\mathcal {S}}_=({\mathcal {H}}^{AR})$ and a trace preserving CP map ${\mathcal {T}}^{A \rightarrow E}$. Suppose that, for $\delta >0$, there exists a normalized state $\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^E)$, such that $ \Vert {\mathcal {T}}^{A \rightarrow E} ( \Psi ^{AR} ) -\varsigma ^E\otimes \Psi ^R \Vert _1 \le \delta $. Then, for any $\upsilon \in [0,1/2)$ and $\iota \in (0,1]$, it holds that

$$\begin{aligned}&\!\! H_{\mathrm{min}}^{\lambda }(A|R)_\Psi +H_{\mathrm{max}}^{\upsilon }(A|E)_{{\mathcal {T}}^{A'\rightarrow E}(\Psi _p^{AA'})} \ge \log {\iota }, \end{aligned}$$

(53)

where $|\Psi _p\rangle ^{AA'}$ is a purification of $\Psi ^A$, ${\mathcal {H}}^A\cong {\mathcal {H}}^{A'}$, and the smoothing parameter $\lambda $ is defined by (47).

Next, we consider the opposite extreme for Theorem 3, i.e., we consider the case where $\mathrm{dim}{\mathcal {H}}^{A_r}=1$. This case yields the dequantizing theorem:

Corollary 7

(Achievability for dequantization: Theorem 3.1 in [31]) Let A be a quantum system with a fixed basis $\{|j\rangle \}_{j=1}^{d_A}$, ${\mathcal {H}}^R\cong {\mathcal {H}}^A$ and $\epsilon ,\mu \ge 0$. Consider a subnormalized state $\Psi ^{AR}$ that is classically coherent in AR, and a linear CP map ${\mathcal {T}}^{A \rightarrow E}$ such that the Choi–Jamiołkowski representation $\tau ^{AE} = {\mathfrak {J}}({\mathcal {T}}^{A\rightarrow E})$ satisfies $\mathrm{Tr}[\tau ]\le 1$. Let $\sigma $ be the random permutation on $[1,\ldots ,d_A]$ with the associated unitary $G_\sigma :=\sum _{j=1}^{d_A}{{|\sigma (j)\rangle }\!{\langle j|}}$. Then, it holds that

$$\begin{aligned}&{\mathbb {E}}_{\sigma \sim \mathsf{P} } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A ( \Psi ^{AR} ) -{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {C}}^A( \Psi ^{AR} ) \right\| _1 \right] \nonumber \\&\quad \le \frac{1}{\sqrt{d_A-1}}\cdot 2^{-\frac{1}{2}[H_{\mathrm{min}}^\epsilon (A|R)_{\Psi }-H_{\mathrm{max}}^\mu (A|B)_{{\mathcal {C}}(\tau )}]} +4(\epsilon +\mu +\epsilon \mu ), \end{aligned}$$

(54)

where ${\mathcal {C}}$ is the completely dephasing channel on A with respect to the basis $\{|j\rangle \}_{j=1}^J$, and $\tau ^{AB} = {\mathfrak {J}}({\mathcal {T}}^{A\rightarrow B})$ is the Choi–Jamiołkowski representation of the complementary channel ${\mathcal {T}}^{A\rightarrow B}$ of ${\mathcal {T}}^{A\rightarrow E}$.

In the same extreme, Theorem 4 provides a converse bound for dequantization, which has not been known so far:

Corollary 8

(Converse for dequantization.) Consider the same setting as in Corollary 7, and assume that $\Psi ^{AR}$ is normalized, and that ${\mathcal {T}}^{A \rightarrow E}$ is trace preserving. Let $|\Psi \rangle ^{ARD}$ be a purification of $\Psi ^{AR}$. Suppose that, for $\delta >0$, there exists a normalized state $\Omega ^{ER}:=\sum _{j=1}^Jp_j\varsigma _j^E\otimes {{|j\rangle }\!{\langle j|}}^{R}$, where $\{p_j,\varsigma _j\}_{j=1}^J$ is an ensemble of normalized states on E, such that $ \Vert {\mathcal {T}}^{A \rightarrow E} ( \Psi ^{AR} ) -\Omega ^{ER} \Vert _1 \le \delta $. Then, for any $\upsilon \in [0,1/2)$ and $\iota \in (0,1]$, it holds that

$$\begin{aligned}&\!\! H_{\mathrm{min}}^{\lambda }(A|R)_\Psi -H_{\mathrm{min}}^{\upsilon }(RD|B)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )}+\log {J} \ge \log {\iota }, \end{aligned}$$

(55)

where the smoothing parameter $\lambda $ is defined by (47).

4 Implementing the Random Unitary with the DSP Form

Before we proceed to the proofs, we here briefly discuss how the random unitaries $U \sim \mathsf{H}_{\times }$ that respect the DSP form can be implemented by quantum circuits. Since Haar random unitaries are in general hard to implement, unitary t-designs, mimicking the t-th statistical moments of the Haar measure on average [33,34,35], have been exploited in many cases. Since the decoupling method makes use of the second statistical moments of the Haar measure, we could use the unitary 2-designs instead of the Haar measure for our tasks. Although a number of efficient implementations of unitary 2-designs have been discovered [33,34,35,36,37,38,39,40,41], and it is also shown that decoupling can be achieved using unitaries less random than unitary 2-designs [42, 43], we here need unitary designs in a given DSP form, which we refer to as the DSP unitary designs. Thus, we cannot directly use the existing constructions, posing a new problem about efficient implementations of DSP unitary designs. Although this problem is out of the scope in this paper, we will briefly discuss possible directions toward the solution.

One possible way is to simply modify the constructions of unitary designs known so far. This could be done by regarding each Hilbert space ${\mathcal {H}}^{A_r}_j$, on which each random unitary $U_j^{A_r} \sim \mathsf{H}_j$ acts, as the Hilbert space of “virtual” qubits. The complexity of the implementation, i.e. the number of quantum gates, is then determined by how complicated the unitary is that transforms the basis in each ${\mathcal {H}}^{A_r}_j$ into the standard basis of the virtual qubits. Another way is to use the implementation of designs on one qudit [44], where it was shown that alternate applications of random diagonal unitaries in two complementary bases achieves unitary designs. This implementation would be suited in quantum many-body systems because we can choose two natural bases, position and momentum bases, and just repeat switching random potentials in those bases under the condition that the potentials satisfy the DSP form. Finally, when the symmetry-induced DSP form is our concern, unitary designs with symmetry may possibly be implementable by applying random quantum gates that respects the symmetry.

In any case, the implementations of DSP unitary designs, or the symmetric unitary designs, and their efficiency are left fully open. Further analyses are desired.

5 Structure of the Proof

In the rest of the paper, we prove the three main theorems, Theorems 1, 3 and 4 in Sects. 6, 7 and 8, respectively. For the sake of clarity, we sketch the outline of the proofs in Sect. 5.1 (see also Fig. 1). We then list useful lemmas in Sect. 5.2. See also Appendix H for the list of notations used in the proofs.

5.1 Key lemmas and the structure of the proofs

For the achievability statements (Theorems 1 and 3), the key technical lemma is the twisted twirling, which can be seen as a generalization of the twirling method often used in quantum information science. See Appendix A for the proof.

Lemma 9

(Twisted Twirling) Let ${\mathcal {H}}_j^{A_r}$ be a $r_j$-dimensional subspace of ${\mathcal {H}}^{A_r}$, and $\Pi _j^{A_r}$ be the projector onto ${\mathcal {H}}_j^{A_r}\subset {\mathcal {H}}^{A_r}$ for each of $j=1,\ldots ,J$. Let ${\mathbb {I}}^{A_rA_r'}$ be $I^{A_r} \otimes I^{A_r'}$, and ${\mathbb {F}}^{A_rA_r'} \in {\mathcal {L}}({\mathcal {H}}^{A_rA_r'})$ be the swap operator defined by $\sum _{a,b} |a\rangle \langle b|^{A_r} \otimes |b\rangle \langle a|^{A_r'}$ for any orthonormal basis $\{ {|a\rangle } \}$ in ${\mathcal {H}}^{A_r}$ and ${\mathcal {H}}^{A_r'}$. In addition, let ${\mathbb {I}}_{jk}^{A_rA_r'}$ and ${\mathbb {F}}_{jk}^{A_rA_r'}$ be $\Pi _j^{A_r} \otimes \Pi _k^{A_r'}$ and $( \Pi _j^{A_r} \otimes \Pi _k^{A_r'}){\mathbb {F}}^{A_rA_r'}$, respectively. For any $M^{A_rA_r'BB'}\in {\mathcal {L}}({\mathcal {H}}^{A_rA_r'BB'})$, define

$$\begin{aligned} M^{BB'}_{{\mathbb {I}},jk}:={\mathrm {Tr}}_{A_rA_r'}[{\mathbb {I}}_{jk}^{A_rA_r'}M^{A_rA_r'BB'}], \quad M^{BB'}_{{\mathbb {F}},kj}:={\mathrm {Tr}}_{A_rA_r'}[{\mathbb {F}}_{kj}^{A_rA_r'}M^{A_rA_r'BB'}]. \end{aligned}$$

(56)

Then, it holds that, for $j \ne k$,

$$\begin{aligned}&{\mathbb {E}}_{U_j \sim \mathsf{H}_j,U_k \sim \mathsf{H}_k} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_j^{A_r} \otimes U_k^{A_r'})^{\dagger } \bigr ] = \frac{{\mathbb {I}}_{jk}^{A_rA_r'}}{r_jr_k} \otimes M_{{\mathbb {I}},jk}^{BB'}, \end{aligned}$$

(57)

$$\begin{aligned}&{\mathbb {E}}_{U_j \sim \mathsf{H}_j,U_k \sim \mathsf{H}_k} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_k^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ] = \frac{{\mathbb {F}}_{jk}^{A_rA_r'}}{r_jr_k} \otimes M^{BB'}_{{\mathbb {F}},kj}. \end{aligned}$$

(58)

Moreover,

$$\begin{aligned}&{\mathbb {E}}_{U_j \sim \mathsf{H}_j} \bigl [ (U_j^{A_r} \otimes U_j^{A_r'}) M^{A_rA_r'BB'} (U_j^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ]\nonumber \\&\quad =\frac{1}{r_j (r_j^2-1)} \left[ (r_j{\mathbb {I}}_{jj}^{A_rA_r'}- {\mathbb {F}}_{jj}^{A_rA_r'})\otimes M_{{\mathbb {I}},jj}^{BB'} + (r_j{\mathbb {F}}_{jj}^{A_rA_r'} -{\mathbb {I}}_{jj}^{A_rA_r'})\otimes M^{BB'}_{{\mathbb {F}},jj} \right] .\nonumber \\ \end{aligned}$$

(59)

Otherwise, ${\mathbb {E}}_{U_j,U_k,U_m,U_n} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_m^{A_r} \otimes U_n^{A_r'})^{\dagger } \bigr ]=0.$

The twisted twirling enables us to show the following lemma (see Appendix B).

Lemma 10

For any $\varsigma ^{ER} \in {\mathcal {S}}_=({\mathcal {H}}^{ER})$ and any $X\in \mathrm{Her}({\mathcal {H}}^{AR})$ such that $X_{jj}^{A_lR}=0$, the following inequality holds for any possible permutation $\sigma \in {\mathbb {P}}$:

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_{\sigma ^{-1}}^A \circ {\mathcal {U}}^A ( X^{AR} )\right\| _{2,\varsigma ^{ER}}^2\right] \nonumber \\&\quad \le \sum _{j,k=1}^J \frac{d_A^2}{r_j r_k} \left\| {\mathrm {Tr}}_{A_l}\left[ X_{\sigma (j)\sigma (k)}^{A_l^T A_r R}\tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| ^2_{2, \varsigma ^{ER}}. \end{aligned}$$

(60)

Here, $A_l^T$ denotes the transposition of $A_l$ with respect to the Schmidt basis of the maximally entangled state $|\Phi _j^l\rangle ^{A_lA_l'}$ in (26), and the norm in the R.H.S. is defined by (16).

Based on this lemma, we can prove the non-smoothed versions of Theorems 1 and 3 in Sects. 6.1 and 7.1, respectively.

To complete the proofs of Theorems 1 and 3, smoothing the statements is needed, which is done in Sects. 6.2 and 7.2 based on the following lemma proven in Appendix C.

Lemma 11

Consider arbitrary unnormalized states $\Psi ^{AR},{\hat{\Psi }}^{AR}\in {\mathcal {P}}({\mathcal {H}}^{AR})$ and arbitrary CP maps ${\mathcal {T}},\hat{{\mathcal {T}}}:A\rightarrow E$. Let ${\mathcal {D}}_+^{A \rightarrow E}$ and ${\mathcal {D}}_-^{A \rightarrow E}$ be arbitrary CP maps such that ${\mathcal {T}}-\hat{{\mathcal {T}}}={\mathcal {D}}_+-{\mathcal {D}}_-$. Let $\delta _+^{AR}$ and $\delta _-^{AR}$ be linear operators on ${\mathcal {H}}^A\otimes {\mathcal {H}}^{R}$, such that

$$\begin{aligned} \delta _+^{AR}\ge 0,\quad \delta _-^{AR}\ge 0, \quad \mathrm{supp}[\delta _+^{AR}]\perp \mathrm{supp}[\delta _-^{AR}] \end{aligned}$$

(61)

and that

$$\begin{aligned} {\hat{\Psi }}^{AR} -\Psi ^{AR}=\delta _+^{AR}-\delta _-^{AR}. \end{aligned}$$

(62)

The following inequality holds for any possible permutation $\sigma \in {\mathbb {P}}$ and for both ${\Psi }_*={\Psi }_{\mathrm{av}}$ and ${\Psi }_*={\mathcal {C}}^A(\Psi )$:

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }}\left[ \left\| {{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - {\Psi }_*^{AR} )\right\| _1 \right] \nonumber \\&\quad \le {\mathbb {E}}_{U \sim \mathsf{H}_{\times }}\left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\hat{\Psi }}^{AR} - {\hat{\Psi }}_*^{AR} )\right\| _1 \right] \nonumber \\&\qquad +2 \, {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} )] \nonumber \\&\qquad +2 \,{\mathbb {E}}_{U \sim \mathsf{H}_{\times }}{\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (\delta _+^{AR}+\delta _-^{AR})]. \end{aligned}$$

(63)

Here, ${\hat{\Psi }}_*={\mathbb {E}}_{U\sim \mathsf{H}_\times }[{\mathcal {U}}^A({\hat{\Psi }}^{AR})]$ for ${\Psi }_*={\Psi }_{\mathrm{av}}$ and ${\hat{\Psi }}_*={\mathcal {C}}^A({\hat{\Psi }})$ for ${\Psi }_*={\mathcal {C}}^A(\Psi )$.

The converse statements are proved independently in Sect. 8.

When we prove the one-shot randomized decoupling theorem (Theorem 3) and the converse (Theorem 4), we first put the following two working assumptions:

WA 1:

$E\cong E_cE_r$, where $E_c$ is a quantum system of dimension J.

WA 2:

The CP map ${\mathcal {T}}^{A \rightarrow E}$ is decomposed into

$$\begin{aligned} {\mathcal {T}}^{A \rightarrow E}(X)=\sum _{j,k=1}^J{{|j\rangle }\!{\langle k|}}^{E_c}\otimes {\mathcal {T}}_{jk}^{A_r \rightarrow E_r}(X_{jk}), \end{aligned}$$

(64)

in which ${\mathcal {T}}_{jk}$ is a linear supermap from ${\mathcal {L}}({\mathcal {H}}^{A_r})$ to ${\mathcal {L}}({\mathcal {H}}^{E_r})$ defined by ${\mathcal {T}}_{jk}(\zeta )={\mathcal {T}}({{|j\rangle }\!{\langle k|}}\otimes \zeta )$ for each j, k.

These assumptions are finally dropped in Sects. 7.3 and 8.3 using the following lemma (see Appendix D for a proof).

Lemma 12

Let ${\mathcal {T}}^{A\rightarrow E}$ be a linear CP map that does not necessarily satisfies WA 1 and WA 2. By introducing a quantum system $E_c$ with dimension J, define an isometry $Y^{A_c \rightarrow A_c E_c}:=\sum _{j}{|jj\rangle }^{A_cE_c}{\langle j|}^{A_c}$, and a linear map $\check{{\mathcal {T}}}^{A \rightarrow EE_c}$ by ${\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {Y}}^{A_c \rightarrow A_c E_c}$. Then, $\check{{\mathcal {T}}}^{A \rightarrow EE_c}$ is a linear CP map and, for any $\Psi ^{AR}$ that is classically coherent in $A_cR_c$, the following equalities hold:

$$\begin{aligned} \left\| \check{{\mathcal {T}}}^{A \rightarrow EE_c} ( {\Psi }^{AR} - \Psi _{\mathrm {av}}^{AR} )\right\| _1&= \left\| {\mathcal {T}}^{A \rightarrow E} ( {\Psi }^{AR} - \Psi _{\mathrm {av}}^{AR} )\right\| _1, \end{aligned}$$

(65)

$$\begin{aligned} \left\| \check{{\mathcal {T}}}^{A \rightarrow EE_c} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm {av}}^{AR} )\right\| _1&= \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm {av}}^{AR} )\right\| _1. \end{aligned}$$

(66)

5.2 List of useful lemmas

We here provide several useful lemmas, some of which are in common with those in the proof of the one-shot decoupling theorem [11]. Proofs of Lemmas 16–20 and 29 –35 will be provided in Appendix E.

5.2.1 Properties of norms and distances

Lemma 13

(Lemma 3.6 in [11]) For any $\xi ^{AB} \in \mathrm{Her}({\mathcal {H}}^{AB})$, $|\!|\xi ^{AB}|\!|_2 \le \sqrt{d_A}|\!|\xi ^B|\!|_2$.

Lemma 14

(Lemma 3.7 in [11]) For any $X\in \mathrm{Her}({\mathcal {H}})$ and $\gamma \in {\mathcal {P}}({\mathcal {H}})$, it holds that

$$\begin{aligned} \left\| X\right\| _1 \le \sqrt{\mathrm{Tr}[\gamma ]} \left\| X\right\| _{2, \gamma } = \sqrt{\mathrm{Tr}[\gamma ]\cdot \mathrm{Tr}[(\gamma ^{-1/4}X\gamma ^{-1/4})^2]}. \end{aligned}$$

(67)

Lemma 15

(Sec. II in [25]) The purified distance defined by (11) satisfies the following properties:

1.
triangle inequality: For any $\rho ,\varsigma ,\tau \in {\mathcal {S}}_\le ({\mathcal {H}})$, it holds that $P(\rho ,\varsigma )\le P(\rho ,\tau )+P(\tau ,\varsigma )$.
2.
monotonicity: For any $\rho ,\varsigma \in {\mathcal {S}}_\le ({\mathcal {H}})$ and trace-nonincreasing CP map ${\mathcal {E}}$, it holds that $P(\rho ,\varsigma )\ge P({\mathcal {E}}(\rho ),{\mathcal {E}}(\varsigma ))$.
3.
Uhlmann’s theorem: For any $\rho ,\varsigma \in {\mathcal {S}}_\le ({\mathcal {H}})$ and any purification $|\varphi _\rho \rangle \in {\mathcal {H}}\otimes {\mathcal {H}}'$ of $\rho $, where ${\mathcal {H}}'\cong {\mathcal {H}}$, there exists a purification $|\varphi _\varsigma \rangle \in {\mathcal {H}}\otimes {\mathcal {H}}'$ of $\varsigma $ such that $P(\rho ,\varsigma )=P(\varphi _\rho ,\varphi _\varsigma )$.

Lemma 16

The purified distance defined by (11) satisfies the following properties:

1.
pure states: For any subnormalized pure state $|\psi \rangle \in {\mathcal {H}}$ and any normalized pure state $|\phi \rangle \in {\mathcal {H}}$, $P(\psi ,\phi )=\sqrt{1-|\langle \psi |\phi \rangle |^2}$.
2.
relation to the trace distance: For any $\rho ,\varsigma \in {\mathcal {S}}_\le ({\mathcal {H}})$, $ \frac{1}{2}\Vert \rho -\varsigma \Vert _1 \le P(\rho ,\varsigma ) \le \sqrt{2\Vert \rho -\varsigma \Vert _1}$.
3.
inequality for subnormalized pure states: For any subnormalized pure states ${|\psi \rangle },{|\phi \rangle }\in {\mathcal {H}}$, $P(\psi ,\phi ) \le \sqrt{1-|{\left\langle \psi |\phi \right\rangle }|^2} + \sqrt{1-{\left\langle \phi |\phi \right\rangle }}$.

Lemma 17

Let $\{p_k\}_k$ be a normalized probability distribution, $\{\rho _k\}_k$ be a set of normalized states on AB, and $\{{\hat{\rho }}_k\}_k$ be that of subnormalized ones. For $\rho ^{ABK}:=\sum _kp_k\rho _k^{AB}\otimes {{|k\rangle }\!{\langle k|}}^K$ and ${\hat{\rho }}^{ABK}:=\sum _kp_k{\hat{\rho }}_k^{AB}\otimes {{|k\rangle }\!{\langle k|}}^K$, the purified distance satisfies

$$\begin{aligned} P(\rho ^{ABK},{\hat{\rho }}^{ABK})\le \sqrt{2\sum _kp_kP(\rho _k^{AB},{\hat{\rho }}_k^{AB})}. \end{aligned}$$

(68)

Lemma 18

Let $\{p_k\}_k$ and $\{q_k\}_k$ be subnormalized probability distributions, and $\{\rho _k\}_k$ and $\{\varsigma _k\}_k$ be sets of normalized states on A. For $\rho ^{AK}:=\sum _kp_k\rho _k^{A}\otimes {{|k\rangle }\!{\langle k|}}^K$ and $\varsigma ^{AK}:=\sum _kq_k\varsigma _k^{A}\otimes {{|k\rangle }\!{\langle k|}}^K$, it holds that

$$\begin{aligned} \left| \sum _kp_k\left\| \rho _k-\varsigma _k\right\| _1 - \left\| \rho ^{AK}-\varsigma ^{AK}\right\| _1 \right| \le \sum _k|p_k-q_k| \le \left\| \rho ^{AK}-\varsigma ^{AK}\right\| _1. \end{aligned}$$

(69)

Lemma 19

The DSP norm defined by (13) satisfies the triangle inequality, i.e., for any superoperators ${\mathcal {E}}$ and ${\mathcal {F}}$ from ${\mathcal {L}}({\mathcal {H}}^A)$ to ${\mathcal {L}}({\mathcal {H}}^B)$, $\Vert {\mathcal {E}}+{\mathcal {F}}\Vert _{\mathrm{DSP}} \le \Vert {\mathcal {E}}\Vert _{\mathrm{DSP}} + \Vert {\mathcal {F}}\Vert _{\mathrm{DSP}}$.

Lemma 20

Let $\{\Pi _j\}_j$ be a set of orthogonal projectors on ${\mathcal {H}}$ such that $\sum _j\Pi _j=I$. For any $\varrho \in {\mathcal {P}}({\mathcal {H}})$, $\left\| \varrho \right\| _2^2=\sum _{j,k}\left\| \Pi _j\varrho \Pi _k\right\| _2^2$.

5.2.2 Properties of conditional entropies

Lemma 21

(Corollary of Lemma 13 in [25]) For any $\epsilon \ge 0$, $\rho ^{AB} \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})$ and any linear isometry $V:A\rightarrow C$, $H_{\mathrm{min}}^\epsilon (A|B)_\rho =H_{\mathrm{min}}^\epsilon (C|B)_{{\mathcal {V}}(\rho )}$.

Lemma 22

(Corollary of Lemma 15 in [25]) For any $\epsilon \ge 0$, $\rho ^{AB} \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})$ and any linear isometry $W:B\rightarrow D$, $H_{\mathrm{max}}^\epsilon (A|B)_\rho =H_{\mathrm{max}}^\epsilon (A|D)_{{\mathcal {W}}(\rho )}$.

Lemma 23

(Lemma A.1 in [11]) For any $\rho ^{AB} \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})$ and $\varsigma ^{B} \in {\mathcal {S}}_=({\mathcal {H}}^{B})$, it holds that

$$\begin{aligned} H_2(A|B)_{\rho |\varsigma } \ge H_{\mathrm{min}}(A|B)_{\rho |\varsigma },\; H_2(A|B)_{\rho } \ge H_{\mathrm{min}}(A|B)_{\rho }. \end{aligned}$$

(70)

Lemma 24

(Definition 14, Equality (6) and Lemma 16 in [25]) For any subnormalized pure state $|\psi \rangle $ on system ABC, and for any $\epsilon >0$, $H_{\mathrm{max}}^\epsilon (A|B)_\psi = - H_{\mathrm{min}}^\epsilon (A|C)_\psi $.

Lemma 25

(Lemma B.2 in [11]) Let $\psi ^{ABC}\in {\mathcal {S}}_\le ({\mathcal {H}}^{ABC})$ be a subnormalized pure state. For any full-rank state $\varsigma ^B\in {\mathcal {S}}_=({\mathcal {H}}^B)$, it holds that $\psi ^{ABC}\le Z^{AB}\otimes I^C$, where

$$\begin{aligned} Z^{AB}:= 2^{\frac{1}{2}H_{\mathrm{max}}(A|B)_{\psi |\varsigma }}\cdot (\varsigma ^B)^{-\frac{1}{2}} \sqrt{ (\varsigma ^B)^{\frac{1}{2}} \psi ^{AB} (\varsigma ^B)^{\frac{1}{2}} } (\varsigma ^B)^{-\frac{1}{2}}. \end{aligned}$$

(71)

Lemma 26

(Lemma A.5 in [11]) For any state $\rho ^{ABK} \in {\mathcal {S}}_=({\mathcal {H}}^{ABK})$ in the form of

$$\begin{aligned} \rho ^{ABK}=\sum _kp_k\rho _k^{AB}\otimes {{|k\rangle }\!{\langle k|}}^K, \end{aligned}$$

(72)

where $\rho _k \in {\mathcal {S}}_=({\mathcal {H}}^{AB})$, $\langle k|k'\rangle =\delta _{k,k'}$ and $\{p_k\}_k$ is a normalized probability distribution, it holds that

$$\begin{aligned}&H_{\mathrm{min}}(A|BK)_\rho =-\log \left( \sum _kp_k\cdot 2^{-H_{\mathrm{min}}(A|B)_{\rho _k}}\right) , \end{aligned}$$

(73)

$$\begin{aligned}&H_{\mathrm{max}}(A|BK)_\rho =\log \left( \sum _kp_k\cdot 2^{H_{\mathrm{max}}(A|B)_{\rho _k}}\right) . \end{aligned}$$

(74)

(It is straightforward to show that the above equalities also hold for $\rho ^{ABK} \in {\mathcal {S}}_\le ({\mathcal {H}}^{ABK})$ and $\rho _k \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})$, by noting that $H_{\mathrm{min}}(A|BK)_\rho =H_{\mathrm{min}}(A|BK)_{\rho /\mathrm{Tr}[\rho ]}-\log {\mathrm{Tr}[\rho ]}$ and that $H_{\mathrm{max}}(A|BK)_\rho =H_{\mathrm{max}}(A|BK)_{\rho /\mathrm{Tr}[\rho ]}+\log {\mathrm{Tr}[\rho ]}$.)

Lemma 27

(Lemma A.7 in [11]) For any state $\rho ^{ABK_1K_2} \in {\mathcal {S}}_\le ({\mathcal {H}}^{ABK_1K_2})$ in the form of

$$\begin{aligned} \rho ^{ABK_1K_2}=\sum _kp_k\rho _k^{AB}\otimes {{|k\rangle }\!{\langle k|}}^{K_1}\otimes {{|k\rangle }\!{\langle k|}}^{K_2}, \end{aligned}$$

(75)

where the notations are the same as in Lemma 26, and for any $\epsilon \ge 0$ it holds that

$$\begin{aligned} H_{\mathrm{min}}^\epsilon (AK_1|BK_2)_\rho =H_{\mathrm{min}}^\epsilon (A|BK_2)_\rho . \end{aligned}$$

(76)

(Note that, although Lemma A.7 in [11] assumes that $\rho ^{ABK_1K_2}$ is normalized, the condition is not used in the proof thereof.)

Lemma 28

(Lemma A.1 in [31]) Let $\rho \in {\mathcal {S}}_\le ({\mathcal {H}}^{K_1K_2AB})$ be a subnormalized state that is classically coherent in $K_1K_2$. For any $\epsilon \ge 0$, there exists ${\hat{\rho }}\in {\mathcal {B}}^\epsilon (\rho )$ that is classically coherent in $K_1K_2$, and $\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^{K_2B})$ that is decomposed as $\varsigma =\sum _k{{|k\rangle }\!{\langle k|}}^{K_2}\otimes \varsigma _k^B$, such that

$$\begin{aligned} H_{\mathrm{min}}^\epsilon (K_1A|K_2B)_\rho =H_{\mathrm{min}}(K_1A|K_2B)_{{{\hat{\rho }}}} =H_{\mathrm{min}}(K_1A|K_2B)_{{\hat{\rho }}|\varsigma }. \end{aligned}$$

(77)

Lemma 29

In the same setting as in Lemma 27, it holds that

$$\begin{aligned} H_{\mathrm{max}}^\epsilon (AK_1|BK_2)_\rho =H_{\mathrm{max}}^\epsilon (A|BK_2)_\rho . \end{aligned}$$

(78)

Lemma 30

Let $\rho \in {\mathcal {S}}_\le ({\mathcal {H}}^{K_1K_2AB})$ be a subnormalized state that is classically coherent in $K_1K_2$. For any $\epsilon \ge 0$, there exists ${\hat{\rho }}\in {\mathcal {B}}^\epsilon (\rho )$ that is classically coherent in $K_1K_2$, such that

$$\begin{aligned} H_{\mathrm{max}}^\epsilon (K_1A|K_2B)_\rho =H_{\mathrm{max}}(K_1A|K_2B)_{{{\hat{\rho }}}}. \end{aligned}$$

(79)

If $\rho $ is also diagonal in $K_1K_2$ (i.e., if $\rho $ is in the form of (75)), there exists ${\hat{\rho }}$, satisfying the above conditions, that is diagonal in $K_1K_2$.

Lemma 31

Consider the same setting as in Lemma 26. For any $\{\epsilon _k\}_k$ such that $\epsilon _k\ge 0$, it holds that

$$\begin{aligned} H_{\mathrm{min}}^{\sqrt{2\varepsilon }}(A|BK)_\rho \ge -\log \left( \sum _kp_k\cdot 2^{-H_{\mathrm{min}}^{\epsilon _k}(A|B)_{\rho _k}}\right) , \end{aligned}$$

(80)

where $\varepsilon :=\sum _kp_k\epsilon _k$.

5.2.3 Other technical lemmas

Lemma 32

Consider two linear operators $X,Y:{\mathcal {H}}^A\rightarrow {\mathcal {H}}^B$ and assume that $A\cong A'$, $B\cong B'$. Let ${|\Phi \rangle }^{AA'}$ and ${|\Phi \rangle }^{BB'}$ be maximally entangled states between A and $A'$, and B and $B'$, respectively. Then, $\mathrm{Tr}[X^TY]=\sqrt{d_Ad_B}{\langle \Phi |}^{BB'}(X\otimes Y){|\Phi \rangle }^{AA'}$, where $d_A:=\dim {\mathcal {H}}^A$, $d_B:=\dim {\mathcal {H}}^B$ and the transposition is taken with respect to the Schmidt bases of ${|\Phi \rangle }^{AA'}$ and ${|\Phi \rangle }^{BB'}$.

Lemma 33

If $\varrho ^2$ is classically coherent in XY for a positive semidefinite operator $\varrho \in {\mathcal {P}}({\mathcal {H}}^{AXY})$, so is $\varrho $.

Lemma 34

Let $\pi $ be the maximally mixed state on system A, and let ${\mathcal {C}}$ be the completely dephasing operation on A with respect to a fixed basis $\{|i\rangle \}_{i=1}^{d_A}$. For any $\rho \in {\mathcal {P}}({\mathcal {H}}^{AB})$, it holds that

$$\begin{aligned}&\left\| \rho ^{AR}-\pi ^A\otimes \rho ^{R}\right\| _2^2 \le \left\| \rho ^{AR}\right\| _2^2, \end{aligned}$$

(81)

$$\begin{aligned}&\left\| \rho ^{AR}-{\mathcal {C}}^A(\rho ^{AR})\right\| _2^2 \le \left\| \rho ^{AR}\right\| _2^2. \end{aligned}$$

(82)

Lemma 35

For subnormalized pure states ${|\psi \rangle },{|\phi \rangle }\in {\mathcal {H}}$ and a real number $c>0$, suppose that there exists a normalized pure state ${|e\rangle }\in {\mathcal {H}}$ that satisfies ${\left\langle e|\psi \right\rangle }\ge c$ and ${\left\langle e|\phi \right\rangle }\ge c$. Then, $|{\left\langle \psi |\phi \right\rangle }|\ge 2c^2-1$.

Lemma 36

(Lemma 35 in [45]) Let $c\in (0,\infty )$ be a constant, $f:[0,c]\rightarrow {{\mathbb {R}}}$ be a monotonically nondecreasing function that satisfies $f(c)<\infty $, and $\{p_k\}_{k\in {{\mathbb {K}}}}$ be a probability distribution on a countable set ${{\mathbb {K}}}$. Suppose $\epsilon _k\,(k\in {{\mathbb {K}}})$ satisfies $\epsilon _k\in [0,c]$, and $\sum _{k\in {{\mathbb {K}}}}p_k\epsilon _k\le \epsilon $ for a given $\epsilon \in (0,c^2]$. Then we have

$$\begin{aligned} \sum _{k\in {{\mathbb {K}}}}p_kf(\epsilon _k)\le f(\sqrt{\epsilon })+f( c)\cdot \sqrt{\epsilon }. \end{aligned}$$

(83)

6 Proof of the Non-randomized Partial Decoupling (Theorem 1)

We now prove the non-randomized partial decoupling (Theorem 1). As sketched in Sect. 5.1, we proceed the proof in two steps: showing the non-smoothed version in Sect. 6.1, and then smoothing it in Sect. 6.2.

6.1 Proof of the non-smoothed non-randomized partial decoupling

The non-smoothed version of Theorem 1 is given by

$$\begin{aligned} {\mathbb {E}}_{U \sim \mathsf{H}_{\times }} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( \Psi ^{AR} ) - {\mathcal {T}}^{A \rightarrow E}(\Psi _{\mathrm{av}}^{AR}) \right\| _1 \right] \le 2^{-\frac{1}{2} H_{\mathrm{min}}(A^*|RE)_{\Lambda (\Psi ,{\mathcal {T}})}}, \end{aligned}$$

(84)

where $\Psi _{\mathrm{av}}^{AR} = \bigoplus _{j=1}^J \Psi _{jj}^{A_lR}\otimes \pi _j^{A_r}$. Note that, due to the definition of the conditional collision entropy (19), (22) and its relation to the conditional min-entropy (see Lemma 23), we have

$$\begin{aligned} \left\| {\Lambda }(\Psi ,{\mathcal {T}}) \right\| ^2_{2, \varsigma ^{ER}} = 2^{-\frac{1}{2}H_2(A^*|RE)_{{\Lambda }(\Psi ,{\mathcal {T}})}} \le 2^{-\frac{1}{2}H_{\mathrm{min}}(A^*|RE)_{{\Lambda }(\Psi ,{\mathcal {T}})}} \end{aligned}$$

(85)

for a proper choice of $\varsigma ^{ER}\in {\mathcal {S}}_=({\mathcal {H}}^{ER})$. In addition, it holds that

$$\begin{aligned} \left\| {\Lambda }(\Psi ,{\mathcal {T}}) \right\| ^2_{2, \varsigma ^{ER}} = \sum _{j,k=1}^J \frac{d_A^2}{r_j r_k} \left\| {\mathrm {Tr}}_{A_l} \left[ \Psi _{jk}^{A_l^T A_r R} \tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| ^2_{2, \varsigma }. \end{aligned}$$

(86)

We first show this relation.

Let $\Pi _j^{A^*}$ be the projection onto a subspace ${\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}_j^{{\bar{A}}_r}\subset {\mathcal {H}}^{A^*}$ for each j. Due to the definition of $F^{A{\bar{A}}\rightarrow A^*}$ given by (31), it holds that

$$\begin{aligned} \Pi _j^{A^*}F^{A{\bar{A}}\rightarrow A^*} =\sqrt{\frac{d_Al_j}{r_j}} \langle \Phi _j^l|^{A_l{\bar{A}}_l}\left( \Pi _j^{A} \otimes \Pi _j^{{\bar{A}}}\right) . \end{aligned}$$

(87)

Using the property of the Hilbert–Schmidt norm (Lemma 20), we have

$$\begin{aligned} \left\| {\Lambda }(\Psi ,{\mathcal {T}})\right\| _{2,\varsigma }^2&= \left\| (\varsigma ^{ER})^{-1/4}{\Lambda }(\Psi ,{\mathcal {T}})(\varsigma ^{ER})^{-1/4}\right\| _2^2 \nonumber \\&= \sum _{j,k=1}^J \left\| \left( \Pi _j^{A^*}\otimes (\varsigma ^{ER})^{-1/4}\right) {\Lambda }_\varsigma (\Psi ,{\mathcal {T}}) \left( \Pi _k^{A^*}\otimes (\varsigma ^{ER})^{-1/4}\right) \right\| _2^2 \nonumber \\&= \sum _{j,k=1}^J \left\| \Pi _j^{A^*} {\Lambda }(\Psi ,{\mathcal {T}}) \Pi _k^{A^*}\right\| _{2,\varsigma }^2. \end{aligned}$$

(88)

Using Eq. (87) and the explicit form of ${\Lambda }(\Psi ,{\mathcal {T}})$, i.e. ${\Lambda }(\Psi ,{\mathcal {T}}):=F(\Psi ^{AR}\otimes \tau ^{{\bar{A}}E})F^\dagger $, each term in the summand is given by

$$\begin{aligned} \Pi _j^{A^*} {\Lambda }(\Psi ,{\mathcal {T}}) \Pi _k^{A^*}&= (\Pi _j^{A^*}\!F^{A{\bar{A}}\rightarrow A^*}) (\Psi ^{AR}\otimes \tau ^{{\bar{A}}E}) (\Pi _k^{A^*}\!F^{A{\bar{A}}\rightarrow A^*})^\dagger \nonumber \\&= \frac{d_A}{\sqrt{r_jr_k}}\cdot \sqrt{l_jl_k} \langle \Phi _j^l|^{A_l{\bar{A}}_l}(\Pi _j^{A}\Psi ^{AR}\Pi _k^{A} \otimes \Pi _j^{{\bar{A}}}\tau ^{{\bar{A}}E}\Pi _k^{{\bar{A}}})|\Phi _k^l\rangle ^{A_l{\bar{A}}_l} \nonumber \\&= \frac{d_A}{\sqrt{r_jr_k}}\cdot \sqrt{l_jl_k} \langle \Phi _j^l|^{A_l{\bar{A}}_l}(\Psi _{jk}^{A_lA_rR} \otimes \tau _{jk}^{{\bar{A}}_l{\bar{A}}_rE})|\Phi _k^l\rangle ^{A_l{\bar{A}}_l} \nonumber \\&= \frac{d_A}{\sqrt{r_jr_k}} \mathrm{Tr}_{A_l}\left[ \Psi _{jk}^{A_l^TA_rR} \tau _{jk}^{{A}_l{\bar{A}}_rE}\right] , \end{aligned}$$

(89)

where the last line follows from Lemma 32. Thus, we obtain (86).

From Eqs. (85) and (86), it suffices to prove that

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( \Psi ^{AR} ) - {\mathcal {T}}^{A \rightarrow E}(\Psi _{\mathrm{av}}^{AR}) \right\| _1 \right] \nonumber \\&\quad \le \sum _{j,k=1}^J \frac{d_A^2}{r_j r_k} \left\| {\mathrm {Tr}}_{A_l} \left[ \Psi _{jk}^{A_l^T A_r R} \tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| ^2_{2, \varsigma } \end{aligned}$$

(90)

for any $\varsigma ^{ER}\in {\mathcal {S}}_=({\mathcal {H}}^{ER})$. In the following, we denote the L.H.S. of Ineq. (90) by $\kappa $. Due to Lemma 14, for any $\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^{ER})$, we have

$$\begin{aligned} \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm{av}}^{AR} )\right\| _1 \le \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm{av}}^{AR} )\right\| _{2,\varsigma ^{ER}}. \end{aligned}$$

(91)

Using this and Jensen’s inequality, we obtain

$$\begin{aligned} \kappa ^2 \le {\mathbb {E}}_{U \sim \mathsf{H}_{\times }}\left[ |\! | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( \Psi ^{AR} ) - {\mathcal {T}}^{A \rightarrow E} (\Psi _{\mathrm{av}}^{AR} ) |\! |_{2, \varsigma }^2\right] . \end{aligned}$$

(92)

Noting that $\Psi _{jj}^{A_lR} = \mathrm{Tr}_{A_r}[\Psi _{jj}^{A_lA_rR}] = \mathrm{Tr}_{A_r}[\Psi _{\mathrm{av}, jj}^{A_lR}\otimes \pi _j^{A_r}]=\Psi _{\mathrm{av}, jj}^{A_lR}$, we can apply Lemma 10 for $X^{AR}=\Psi ^{AR}-\Psi _{\mathrm{av}}^{AR}$ and $\sigma =\mathrm{id}$. This yields

$$\begin{aligned} \kappa ^2&\le \sum _{j,k =1}^J \frac{d_A^2}{r_j r_k} \left\| {\mathrm {Tr}}_{A_l} \left[ \left( \Psi _{jk}^{A_l^T A_r R} - \Psi _{\mathrm {av}, jk}^{A_l^T A_r R} \right) \tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| ^2_{2, \varsigma } \nonumber \\ {}&= \sum _{j =1}^J \frac{d_A^2}{r_j^2} \left\| {\mathrm {Tr}}_{A_l} \left[ \left( \Psi _{jj}^{A_l^T A_r R} - \Psi _{jj}^{A_l^T R}\otimes \pi _{jj}^{A_r}\right) \tau ^{A_l {\bar{A}}_r E}_{jj} \right] \right\| ^2_{2, \varsigma } \nonumber \\ {}&\quad + \sum _{j\ne k} \frac{d_A^2}{r_j r_k} \left\| {\mathrm {Tr}}_{A_l} \left[ \Psi _{jk}^{A_l^T A_r R} \tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| ^2_{2, \varsigma }, \end{aligned}$$

(93)

where the second line follows from the fact that $\Psi _{\mathrm{av},jk}^{A_lA_rR}=0$ for $j\ne k$. To calculate the first term in (93), note that

$$\begin{aligned} {\mathrm {Tr}}_{A_l} [ \Psi _{jj}^{A_l^T A_r R} \tau ^{A_l {\bar{A}}_r E}_{jj} ] \in {\mathcal {P}}({\mathcal {H}}^{A_r{\bar{A}}_rRE}) \end{aligned}$$

(94)

and that

$$\begin{aligned} {\mathrm {Tr}}_{A_l} \left[ \left( \Psi _{jj}^{A_l^T R}\otimes \pi _{jj}^{A_r}\right) \tau ^{A_l {\bar{A}}_r E}_{jj} \right] = {\mathrm {Tr}}_{A_lA_r} [ \Psi _{jj}^{A_l^T A_r R} \tau ^{A_l {\bar{A}}_r E}_{jj} ] \otimes \pi _{jj}^{A_r}. \end{aligned}$$

(95)

Thus, we simply apply Lemma 34 to obtain

$$\begin{aligned} \left\| {\mathrm {Tr}}_{A_l} \left[ \left( \Psi _{jj}^{A_l^T A_r R} - \Psi _{jj}^{A_l^T R}\otimes \pi _{jj}^{A_r}\right) \tau ^{A_l {\bar{A}}_r E}_{jj} \right] \right\| ^2_{2, \varsigma } \le \left\| {\mathrm {Tr}}_{A_l} \left[ \Psi _{jj}^{A_l^T A_r R} \tau ^{A_l {\bar{A}}_r E}_{jj} \right] \right\| ^2_{2, \varsigma } \end{aligned}$$

(96)

for each j. Substituting this to (93), we arrive at Ineq. (90). $\square $

6.2 Proof of the smoothed non-randomized partial decoupling

We now smoothen the conditional min-entropy to complete the proof of Theorem 1. To this end, fix ${\hat{\Psi }}\in {\mathcal {B}}^\epsilon (\Psi )$ and $\hat{{\mathcal {T}}}\in {\mathcal {B}}_{\mathrm{DSP}}^\mu ({\mathcal {T}})$ so that

$$\begin{aligned} H_{\mathrm{min}}^{\epsilon ,\mu }(A^*|RE)_{\Lambda (\Psi ,{\mathcal {T}})}=H_{\mathrm{min}}(A^*|RE)_{\Lambda ({\hat{\Psi }},\hat{{\mathcal {T}}})}. \end{aligned}$$

(97)

Let $|\Psi _{p,\mathrm{av}}\rangle ^{AA'}$ be a purification of $\Psi _{\mathrm{av}}^A$. Noting that $\Psi _{\mathrm{av}}$ is decomposed in the form of (28), by properly choosing a DSP decomposition for $A'$, it holds that

$$\begin{aligned} (\Pi _j^A\otimes \Pi _k^{A'})|\Psi _{p,\mathrm{av}}\rangle ^{AA'}= \delta _{jk}\sqrt{q_j}|\varpi _j\rangle ^{A_lA_l'}|\Phi _j^r\rangle ^{A_rA_r'}, \end{aligned}$$

(98)

where $q_j:={\mathrm {Tr}}{\Psi _{jj}}$ and $\varpi _j$ is a purification of $\Psi _{jj}^{A_l}/q_j$ for each j. Let $\Delta _+^{A'E}$ and $\Delta _-^{A'E}$ be linear operators on ${\mathcal {H}}^E\otimes {\mathcal {H}}^{A'}$ such that $\Delta _+^{A'E}\ge 0,\; \Delta _-^{A'E}\ge 0, \; \mathrm{supp}[\Delta _+^{A'E}]\perp \mathrm{supp}[\Delta _-^{A'E}]$ and that

$$\begin{aligned} {\mathcal {T}}^{A \rightarrow E}(\Psi _{p,\mathrm{av}}^{AA'}) -\hat{{\mathcal {T}}}^{A \rightarrow E}(\Psi _{p,\mathrm{av}}^{AA'})=\Delta _{+}^{A'E}-\Delta _{-}^{A'E}. \end{aligned}$$

(99)

In addition, let ${\mathcal {D}}_+^{A\rightarrow E}$ and ${\mathcal {D}}_-^{A\rightarrow E}$ be superoperators such that

$$\begin{aligned} {\mathcal {D}}_{+}^{A\rightarrow E}(\Psi _{p,\mathrm{av}}^{AA'})=\Delta _{+}^{A'E}, \quad {\mathcal {D}}_{-}^{A\rightarrow E}(\Psi _{p,\mathrm{av}}^{AA'})=\Delta _{-}^{A'E}, \end{aligned}$$

(100)

which yields ${\mathcal {T}}-\hat{{\mathcal {T}}}={\mathcal {D}}_{+}-{\mathcal {D}}_{-}$. Note that, in general, it does not necessarily imply that ${\mathcal {D}}_{+}={\mathcal {T}}$ and ${\mathcal {D}}_{-}=\hat{{\mathcal {T}}}$.

We now apply Lemma 11 for the case where $\sigma =\mathrm{id}$. To obtain the explicit forms, we compute

$$\begin{aligned} {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) ( \Psi _{\mathrm{av}}^{AR} )]&= {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) ( \Psi _{\mathrm{av}}^{A} )] \nonumber \\&={\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) ( \Psi _{p,\mathrm{av}}^{AA'} )] \nonumber \\&={\mathrm {Tr}}[\Delta _{+}^{A'E}+\Delta _{-}^{A'E}] \nonumber \\&=\left\| \Delta _{+}^{A'E}-\Delta _{-}^{A'E}\right\| _1 \nonumber \\&=\left\| {\mathcal {T}}^{A \rightarrow E}(\Psi _{p,\mathrm{av}}^{AA'}) -\hat{{\mathcal {T}}}^{A \rightarrow E}(\Psi _{p,\mathrm{av}}^{AA'})\right\| _1 \nonumber \\&\le \left\| {\mathcal {T}}^{A \rightarrow E}-\hat{{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm{DSP}}\le \mu , \end{aligned}$$

(101)

where we have used the properties of $\Psi _{p,\mathrm{av}}^{AA'}$, $\Delta _{\pm }^{A'E}$, and ${\mathcal {D}}_{\pm }^{A \rightarrow E}$ described above. The last line follows from the definition of the DSP norm. Furthermore, introducing a notation $\bar{{\mathcal {U}}}(\cdot ):={\mathbb {E}}_{U \sim \mathsf{H}_{\times }}[\,{\mathcal {U}}(\cdot )]$, we also have (see Lemma 11 for the definition and properties of $\delta _{\pm }^{AR}$)

$$\begin{aligned}&{\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ \bar{{\mathcal {U}}}^A (\delta _+^{AR}+\delta _-^{AR})]\nonumber \\&\quad =\left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ \bar{{\mathcal {U}}}^A (\delta _+^{AR})\right\| _1+\left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ \bar{{\mathcal {U}}}^A (\delta _-^{AR}) \right\| _1\nonumber \\&\quad ={\mathrm {Tr}}{[\delta _+^{AR}]}\cdot \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\bar{{\mathcal {U}}}}^A (\delta _+^{AR}/{\mathrm {Tr}}{[\delta _+^{AR}]})\right\| _1\nonumber \\&\qquad +{\mathrm {Tr}}{[\delta _-^{AR}]}\cdot \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ \bar{{\mathcal {U}}}^A (\delta _-^{AR}/{\mathrm {Tr}}{[\delta _-^{AR}]})\right\| _1\nonumber \\&\quad \le ({\mathrm {Tr}}{[\delta _+^{AR}]}+{\mathrm {Tr}}{[\delta _-^{AR}]})\cdot \left\| \hat{{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm{DSP}}\nonumber \\&\quad =\left\| \delta _+^{AR} -\delta _-^{AR}\right\| _1\cdot \left\| \hat{{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm{DSP}} \nonumber \\&\quad =\left\| {\hat{\Psi }}^{AR} -\Psi ^{AR}\right\| _1\cdot \left\| \hat{{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm{DSP}} \nonumber \\&\quad \le \left\| {\hat{\Psi }}^{AR} \!-\!\Psi ^{AR}\right\| _1\cdot \left( \left\| {{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm{DSP}}\!+\!\left\| \hat{{\mathcal {T}}}^{A \rightarrow E}\!-\!{{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm{DSP}}\right) \nonumber \\&\quad \le \epsilon \left\| {{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm{DSP}}+\epsilon \mu , \end{aligned}$$

(102)

where the fourth line follows from the definition of the DSP norm (13), and the seventh line from the triangle inequality for the DSP norm (Lemma 19). Applying the non-smoothed version of the non-randomized partial decoupling (Ineq. (84)) to a state ${\hat{\Psi }}$ and a CP map $\hat{{\mathcal {T}}}$, we have

$$\begin{aligned} {\mathbb {E}}_{U \sim \mathsf {H}_{\times }} \bigl [ \bigl |\! \bigl | \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( {\hat{\Psi }}^{AR} ) - {{\mathcal {T}}}^{A \rightarrow E}({\hat{\Psi }}_{\mathrm {av}}^{AR}) \bigr |\! \bigr |_1 \bigr ] \le 2^{-\frac{1}{2} H_{\mathrm {min}}(A^*|RE)_{\Lambda ({\hat{\Psi }},\hat{{\mathcal {T}}})}}. \end{aligned}$$

(103)

All together, Ineq. (63) in Lemma 11 leads to

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf {H}_{\times }}\left[ \left\| {{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {U}}^A ( {\Psi }^{AR} ) - {{\mathcal {T}}}^{A \rightarrow E} ( \Psi _{\mathrm {av}}^{AR} )\right\| _1 \right] \nonumber \\ {}&\quad \le 2^{-\frac{1}{2} H_{\mathrm {min}}(A^*|RE)_{\Lambda ({\hat{\Psi }},\hat{{\mathcal {T}}})}} +2 \left( \mu +\epsilon \left\| {{\mathcal {T}}}^{A \rightarrow E}\right\| _{\mathrm {DSP}}+\epsilon \mu \right) , \end{aligned}$$

(104)

which, together with (97), concludes the proof of Theorem 1. $\square $

7 Proof of the Randomized Partial Decoupling (Theorem 3)

We here show Theorem 3. We first put the following two assumptions, which simplify the proof:

WA 1:

$E\cong E_cE_r$, where $E_c$ is a quantum system of dimension J with a fixed orthonormal basis ${\{|j\rangle \}^{J}_{j=1}}$.

WA 2:

The CP map ${\mathcal {T}}^{A \rightarrow E}$ is decomposed into

$$\begin{aligned} {\mathcal {T}}^{A \rightarrow E}(X)=\sum _{j,k=1}^J{{|j\rangle }\!{\langle k|}}^{E_c}\otimes {\mathcal {T}}_{jk}^{A_r \rightarrow E_r}(X_{jk}), \end{aligned}$$

(105)

in which ${\mathcal {T}}_{jk}$ is a linear supermap from ${\mathcal {L}}({\mathcal {H}}^{A_r})$ to ${\mathcal {L}}({\mathcal {H}}^{E_r})$ defined by ${\mathcal {T}}_{jk}(\zeta )={\mathcal {T}}({{|j\rangle }\!{\langle k|}}\otimes \zeta )$ for each j, k.

We show the non-smoothed version in Sect. 7.1 and the smoothed version in Sect. 7.2. The above assumptions are then dropped in Sect. 7.3.

7.1 Proof of the non-smoothed randomized partial decoupling under WA 1 and WA 2

Under the assumptions WA 1 and WA 2, the non-smoothed version of the randomized partial decoupling is given by

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }, \sigma \sim \mathsf{P}} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) -{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} ) \right\| _1 \right] \nonumber \\&\quad \le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}(A|R)_{\Psi }-\frac{1}{2}H_{\mathrm{min}}(A|E)_{\tau }} +\beta (A_r)\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}(A|R)_{{\mathcal {C}}(\Psi )}-\frac{1}{2}H_{\mathrm{min}}(A|E)_{{\mathcal {C}}(\tau )}}.\nonumber \\ \end{aligned}$$

(106)

Note that, as we will describe in Sect. 7.3 for general cases, the min entropies $H_{\mathrm{min}}(A|E)_{\tau }$ and $H_{\mathrm{min}}(A|E)_{{\mathcal {C}}(\tau )}$ are equal to the max entropies $-H_{\mathrm{max}}(A|B)_{{\mathcal {C}}(\tau )}$ and $-H_{\mathrm{max}}(A_r|BA_c)_{{\mathcal {C}}(\tau )}$, respectively, due to the duality of the conditional entropies for pure states (Lemma 24). The proof of this inequality will be divided into three steps.

7.1.1 Upper bound on the average trace norm

To prove Ineq. (106), we first introduce the following lemma that relates the average trace norm of an operator ${\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( X^{AR})$ to the average Hilbert–Schmidt norm.

Lemma 37

Let $X^{AR}$ be an arbitrary Hermitian operator such that $X^{AR}=\sum _{j,k=1}^J{{|j\rangle }\!{\langle k|}}^{A_c}\otimes X_{jk}^{A_rR_r}\otimes {{|j\rangle }\!{\langle k|}}^{R_c}$, and let $\zeta \in {\mathcal {S}}_=({\mathcal {H}}^{E})$ and $\xi \in {\mathcal {S}}_=({\mathcal {H}}^{R})$ be arbitrary states that are decomposed as $\zeta ^{E}\!=\!\sum _j{{|j\rangle }\!{\langle j|}}^{E_c}\!\otimes \zeta _j^{E_r}$, $\xi ^{R}\!=\!\sum _j{{|j\rangle }\!{\langle j|}}^{R_c}\!\otimes \xi _j^{R_r}$, respectively. Then it holds that

$$\begin{aligned} {\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( X^{AR}) \bigr |\! \bigr |_1 \bigr ] \le \frac{1}{\sqrt{J}}\!\cdot \!\sqrt{{\mathbb {E}}_{\sigma ,U } \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( X^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2}\,,\! \end{aligned}$$

(107)

where the norm in the R.H.S. is defined by (16).

It should be noted that Lemma 37 provides a stronger inequality than that obtained simply using Lemma 14.

Proof

We exploit techniques developed in [31]. Recall that U is in the form of $\sum _{j=1}^J{{|j\rangle }\!{\langle j|}}^{A_c} \otimes U_j^{A_r}$, and $G_\sigma $ is defined by $G_\sigma :=\sum _{j=1}^J{{|\sigma (j)\rangle }\!{\langle j|}}^{A_c} \otimes I^{A_r}$ for any $\sigma \in {\mathbb {P}}$.

We define a subnormalized state $\gamma _\sigma \in {\mathcal {S}}_\le ({\mathcal {H}}^{ER})$ for each $\sigma $ by $\gamma _\sigma ^{ER}:=\sum _{j=1}^J{{|\sigma (j)\rangle }\!{\langle \sigma (j)|}}^{E_c}\otimes \zeta _{\sigma (j)}^{E_r}\otimes \xi _j^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}$. Further, by letting P be a quantum system with an orthonormal basis $\{|\sigma \rangle \}_{\sigma \in {\mathbb {P}}}$, we define a subnromalized state $\gamma \in {\mathcal {S}}_\le ({\mathcal {H}}^{PER})$ by

$$\begin{aligned} \gamma ^{PER}:=\frac{1}{|{{\mathbb {P}}}|}\sum _{\sigma \in {{\mathbb {P}}}}{{|\sigma \rangle }\!{\langle \sigma |}}^P\otimes \gamma _\sigma ^{ER}. \end{aligned}$$

(108)

Using Lemma 14 and Jensen’s inequality, we obtain

$$\begin{aligned}&{\mathbb {E}}_{\sigma } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( X^{AR}) \bigr |\! \bigr |_1 \bigr ] \nonumber \\&\quad =\left\| {\mathbb {E}}_{\sigma } \bigl [ {{|\sigma \rangle }\!{\langle \sigma |}}^P\otimes {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (X^{AR}) \bigr ]\right\| _1 \nonumber \\&\quad \le \sqrt{\mathrm{Tr}[\gamma ]}\cdot \left\| {\mathbb {E}}_{\sigma } \bigl [ {{|\sigma \rangle }\!{\langle \sigma |}}^P\otimes {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (X^{AR} ) \bigr ] \right\| _{2,\gamma ^{PER}}\nonumber \\&\quad = \sqrt{\mathrm{Tr}[\gamma ]}\cdot {\mathbb {E}}_{\sigma } \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (X^{AR} ) \right\| _{2,\gamma ^{ER}_{\sigma }}\nonumber \\&\quad = \sqrt{\mathrm{Tr}[\gamma ]}\cdot {\mathbb {E}}_{\sigma } \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (X^{AR} ) \right\| _{2,\zeta ^E \otimes \xi ^R}. \end{aligned}$$

(109)

In the last line, we used the following relation:

$$\begin{aligned}&(\gamma ^{ER}_{\sigma })^{-1/4} \bigl [ {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (X^{AR} ) \bigr ] (\gamma ^{ER}_{\sigma })^{-1/4}\nonumber \\&\quad = (\zeta ^E \otimes \xi ^R)^{-1/4} \bigl [ {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (X^{AR} ) \bigr ] (\zeta ^E \otimes \xi ^R)^{-1/4}, \end{aligned}$$

(110)

which can be observed from the fact that, due to the decomposition of ${\mathcal {T}}^{A\rightarrow E}$ from WA 2,

$$\begin{aligned}&{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( X^{AR} ) \nonumber \\&\quad =\sum _{j,k=1}^J{{|\sigma (j)\rangle }\!{\langle \sigma (k)|}}^{E_c}\otimes {\mathcal {T}}_{\sigma (j)\sigma (k)}^{A_r \rightarrow E_r} ( U_j^{A_r}X_{jk}^{A_rR_r}U_k^{\dagger A_r}) \otimes {{|j\rangle }\!{\langle k|}}^{R_c}. \end{aligned}$$

(111)

Due to the fact that

$$\begin{aligned} \frac{1}{|{{\mathbb {P}}}|}\sum _{\sigma \in {{\mathbb {P}}}}\mathrm{Tr}[\zeta _{\sigma (j)}^{E_r}]=\frac{1}{J}\sum _{j'=1}^J\mathrm{Tr}[\zeta _{j'}^{E_r}] \end{aligned}$$

(112)

for all j, we obtain

$$\begin{aligned} \mathrm{Tr}[\gamma ]&=\frac{1}{|{{\mathbb {P}}}|}\sum _{\sigma \in {{\mathbb {P}}}}\sum _{j=1}^J\mathrm{Tr}[\zeta _{\sigma (j)}^{E_r}]\mathrm{Tr}[\xi _j^{R_r}] \nonumber \\&=\sum _{j=1}^J\left( \frac{1}{|{{\mathbb {P}}}|}\sum _{\sigma \in {{\mathbb {P}}}}\mathrm{Tr}[\zeta _{\sigma (j)}^{E_r}]\right) \mathrm{Tr}[\xi _j^{R_r}] \nonumber \\&=\frac{1}{J}\sum _{j'=1}^J\mathrm{Tr}[\zeta _{j'}^{E_r}]\cdot \sum _{j=1}^J\mathrm{Tr}[\xi _{j}^{R_r}] \nonumber \\&=\frac{1}{J}\mathrm{Tr}[\zeta ^E]\cdot \mathrm{Tr}[\xi ^R] =\frac{1}{J}. \end{aligned}$$

(113)

Substituting this to (109), and by using Jensen’s inequality, we arrive at the desired result. $\square $

7.1.2 Generalization of the dequantizing theorem

Our second step to prove the non-smoothed randomized partial decoupling is to generalize the non-smoothed version of the dequantizing theorem (Proposition 3.5 in [31]).

Lemma 38

In the same setting as in Theorem 3, it holds that

$$\begin{aligned} {\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} -\Psi _{\mathrm{dp}}^{AR} ) \bigr |\! \bigr |_1 \bigr ] \le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}(A|R)_{\Psi }-\frac{1}{2}H_{\mathrm{min}}(A|E)_{\tau }}, \end{aligned}$$

(114)

where we have defined $\Psi _{\mathrm{dp}}^{AR}:={\mathcal {C}}^A(\Psi ^{AR})=\sum _{j=1}^J{{|j\rangle }\!{\langle j|}}^{A_c}\otimes \Psi _{jj}^{A_rR}$.

Note that $\alpha (J)$ is 0 for $J=1$ and $\frac{1}{J-1}$ for $J\ge 2$.

Proof

Since $\Psi ^{AR}$ and $\Psi _{\mathrm{av}}^{AR}$ are classically coherent in $A_cR_c$ by assumption, we can apply Lemma 37 for $X^{AR}=\Psi ^{AR}-\Psi _{\mathrm{dp}}^{AR}$ to obtain

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm{dp}}^{AR}) \right\| _1 \right] \nonumber \\&\quad \le \frac{1}{\sqrt{J}}\cdot \sqrt{{\mathbb {E}}_{\sigma ,U} \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm{dp}}^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2}\,. \end{aligned}$$

(115)

Noting that $\Psi _{jj}^{AR}-\Psi _{\mathrm{dp},jj}^{AR}=0$, we can also apply Lemma 10 under the assumption that $A_l$ is a one-dimensional system, $r_j=r$ and $\varsigma ^{ER}=\zeta ^E\otimes \xi ^R$. Then, we obtain, for any $\sigma \in {\mathbb {P}}$,

$$\begin{aligned}&{\mathbb {E}}_{U} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_{\sigma ^{-1}}^A \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm {dp}}^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2\right] \nonumber \\ {}&\quad \le \frac{d_A^2}{r^2} \sum _{j\ne k} \bigl | \! \bigl | \Psi _{\sigma (j)\sigma (k)}^{ A_r R} \otimes \tau ^{{\bar{A}}_r E}_{jk} \bigr | \! \bigr |^2_{2,\,\zeta ^E\otimes \xi ^R} \nonumber \\ {}&\quad = J^2 \sum _{j\ne k} \bigl | \! \bigl | \Psi _{\sigma (j)\sigma (k)}^{ A_r R}\bigr | \! \bigr |^2_{2,\,\xi ^R}\cdot \bigl | \! \bigl | \tau ^{A_r E}_{jk} \bigr | \! \bigr |^2_{2,\,\zeta ^E}, \end{aligned}$$

(116)

where we have used $d_A=rJ$ in the last line. Taking the case of $J=1$ into account, and noting that ${\mathbb {E}}_{\sigma }[g(\sigma )]={\mathbb {E}}_{\sigma }[g(\sigma ^{-1})]$ for any function g, it follows that

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm{dp}}^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2\right] \nonumber \\&\quad ={\mathbb {E}}_{\sigma ,U } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_{\sigma ^{-1}}^A \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm{dp}}^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2\right] \nonumber \\&\quad \le J^2 \sum _{ j\ne k} {\mathbb {E}}_{\sigma }\left[ \bigl | \! \bigl |\Psi _{\sigma (j)\sigma (k)}^{ A_r R} \bigr | \! \bigr |^2_{2,\xi ^R}\right] \cdot \bigl | \! \bigl | \tau ^{A_r E}_{jk} \bigr | \! \bigr |^2_{2,\zeta ^E}\nonumber \\&\quad =J\alpha (J) \sum _{j'\ne k'}\bigl | \! \bigl |\Psi _{j'k'}^{ A_r R} \bigr | \! \bigr |^2_{2,\xi ^R} \cdot \sum _{j\ne k} \bigl | \! \bigl | \tau ^{A_r E}_{jk} \bigr | \! \bigr |^2_{2,\zeta ^E}\nonumber \\&\quad =J\alpha (J) \left\| \sum _{j'\ne k'}{{|j'\rangle }\!{\langle k'|}}^{A_c}\otimes \Psi ^{ A_r R}_{j'k'} \right\| _{2,\xi ^R}^2 \cdot \left\| \sum _{j\ne k}{{|j\rangle }\!{\langle k|}}^{A_c}\otimes \tau ^{ A_r E}_{jk}\right\| _{2,\zeta ^E}^2 \nonumber \\&\quad = J\alpha (J) \left\| \Psi ^{AR}-\Psi _{\mathrm{dp}}^{AR}\right\| _{2,\xi ^R}^2 \cdot \left\| \tau ^{AE}-\tau _{\mathrm{dp}}^{AE}\right\| _{2,\zeta ^E}^2 \nonumber \\&\quad \le J\alpha (J) \left\| \Psi ^{AR}\right\| _{2,\xi ^R}^2 \cdot \left\| \tau ^{AE}\right\| _{2,\zeta ^E}^2 \nonumber \\&\quad = J\alpha (J) \cdot 2^{-H_2(A|R)_{\Psi |\xi }-H_2(A|E)_{\tau |\zeta }}. \end{aligned}$$

(117)

Here, we have used the definitions $\Psi _{\mathrm{dp}}^{AR}:={\mathcal {C}}^A(\Psi ^{AR})$ and $\tau _{\mathrm{dp}}^{AE}:={\mathcal {C}}^A(\tau ^{AE})$ in the sixth line, and Lemma 34 in the seventh line. Due the relation between the conditional collision entropy and the conditional min-entropy (Lemma 23), it is further bounded from above by $2^{-H_{\mathrm{min}}(A|R)_{\Psi |\xi }-H_{\mathrm{min}}(A|E)_{\tau |\zeta }}$.

Finally, we use the property of the the conditional min-entropy (Lemma 28). There exist normalized states $\xi $ and $\zeta $ in the form of

$$\begin{aligned} \xi ^{R} =\!\sum _j{{|j\rangle }\!{\langle j|}}^{R_c}\!\otimes \xi _j^{R_r}, \quad \zeta ^{E}&=\!\sum _j{{|j\rangle }\!{\langle j|}}^{E_c}\!\otimes \zeta _j^{E_r}, \end{aligned}$$

(118)

such that $H_{\mathrm{min}}(A|R)_{\Psi |\xi }=H_{\mathrm{min}}(A|R)_{\Psi }$ and $H_{\mathrm{min}}(A|E)_{\tau |\zeta }=H_{\mathrm{min}}(A|E)_{\tau }$. Thus, we obtain

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR}-\Psi _{\mathrm{dp}}^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2\right] \le J\alpha (J) \cdot 2^{-H_{\mathrm{min}}(A|R)_{\Psi }-H_{\mathrm{min}}(A|E)_{\tau }}, \end{aligned}$$

(119)

which, together with Ineq. (115), complete the proof of Lemma 38. $\square $

7.1.3 Proof of the non-smoothed randomized partial decoupling

We now prove the non-smoothed randomized partial decoupling, i.e.,

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }, \sigma \sim \mathsf{P}} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) -{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} ) \right\| _1 \right] \nonumber \\&\quad \le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}(A|R)_{\Psi }-\frac{1}{2}H_{\mathrm{min}}(A|E)_{\tau }} +\beta (A_r)\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}(A|R)_{{\mathcal {C}}(\Psi )}-\frac{1}{2}H_{\mathrm{min}}(A|E)_{{\mathcal {C}}(\tau )}}, \nonumber \\ \end{aligned}$$

(120)

under the assumptions WA 1 and WA 2. Note that $\beta (A_r)$ is 0 for $\mathrm{dim}{\mathcal {H}}^{A_r}=1$ and 1 for $\mathrm{dim}{\mathcal {H}}^{A_r}\ge 2$. By the triangle inequality, we have

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) -{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} ) \bigr |\! \bigr |_1 \bigr ]\nonumber \\&\quad \le {\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} - \Psi _{\mathrm{dp}}^{AR} ) \bigr |\! \bigr |_1 \bigr ] \nonumber \\&\qquad + {\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi _{\mathrm{dp}}^{AR} - \Psi _{\mathrm{av}}^{AR} ) \bigr |\! \bigr |_1 \bigr ], \end{aligned}$$

(121)

where we have used the fact that the unitary invariance of the Haar measure implies ${\mathcal {U}}^A(\Psi _{\mathrm{av}}^{AR})=\Psi _{\mathrm{av}}^{AR}$ for any unitary U. The first term is bounded by simply using Lemma 38.

To bound the second term in (121), we use Lemma 37, leading to

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR}_{\mathrm{dp}}-\Psi _{\mathrm{av}}^{AR}) \bigr |\! \bigr |_1 \bigr ]\nonumber \\&\quad \le \frac{1}{\sqrt{J}}\cdot \sqrt{{\mathbb {E}}_{\sigma ,U } \left\| {\mathcal {T}}^{A \rightarrow E}\! \circ \!{\mathcal {G}}_\sigma ^A \! \circ \! {\mathcal {U}}^A ( \Psi ^{AR}_{\mathrm{dp}}\!-\!\Psi _{\mathrm{av}}^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2}\,. \end{aligned}$$

(122)

Since $\Psi _{\mathrm{dp},jj}^{R}=\Psi _{\mathrm{av},jj}^{R}$ by definition, we can apply Lemma 10 for $X^{AR}=\Psi ^{AR}_{\mathrm{dp}}-\Psi _{\mathrm{av}}^{AR}$. Noting that $\Psi _{\mathrm{dp},jk}^{ A_r R}=\Psi _{\mathrm{av},jk}^{ A_r R}=0$ for $j\ne k$, this yields

$$\begin{aligned}&{\mathbb {E}}_{U} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi _{\mathrm{dp}}^{AR}-\Psi _{\mathrm{av}}^{AR} )\right\| _{2,\zeta ^E\otimes \xi ^R}^2\right] \nonumber \\&\quad \le \frac{d_A^2}{r^2} \sum _{j =1}^J \left\| \Psi _{\sigma (j)\sigma (j)}^{ A_r R} \otimes \tau ^{ {\bar{A}}_r E}_{jj} \right\| ^2_{2, \zeta ^E\otimes \xi ^R} \nonumber \\&\quad =J^2 \sum _{j =1}^J \bigl | \! \bigl |\Psi _{\sigma (j)\sigma (j)}^{ A_r R} \bigr | \! \bigr |^2_{2,\xi ^R} \cdot \bigl | \! \bigl | \tau ^{A_r E}_{jj} \bigr | \! \bigr |^2_{2,\zeta ^E}. \end{aligned}$$

(123)

Thus, similarly to the derivation around Eq. (117), we obtain

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR}_{\mathrm{dp}}-\Psi _{\mathrm{av}}^{AR} )\right\| _{2,\,\zeta ^E\otimes \xi ^R}^2\right] \nonumber \\&\quad \le J\cdot 2^{-H_{\mathrm{min}}(A|R)_{\Psi _{\mathrm{dp}}}-H_{\mathrm{min}}(A|E)_{\tau _{\mathrm{dp}}}}. \end{aligned}$$

(124)

Substituting this into Ineq. (122), and noting that $\Psi _{\mathrm{dp}}^{AR} - \Psi _{\mathrm{av}}^{AR}=0$ if $\mathrm{dim}{\mathcal {H}}^{A_r}=1$, we obtain an upper bound on the second term of the R.H.S. in Ineq. (121).

All together, we obtain Ineq. (120) as desired. $\square $

7.2 Proof of the randomized partial decoupling under the conditions WA 1 and WA 2

We now show, under the conditions WA 1 and WA 2, the randomized partial decoupling:

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }, \sigma \sim \mathsf{P} } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) -{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} ) \right\| _1 \right] \nonumber \\&\quad \!\le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}{\tilde{H}}_I} +\beta (A_r)\cdot 2^{-\frac{1}{2}{\tilde{H}}_{I\!I}} +4(\epsilon \cdot \mathrm{Tr}[\tau ]+\mu +\epsilon \mu ), \end{aligned}$$

(125)

where $\Psi _{\mathrm{av}}^{AR}:={\mathbb {E}}_{U \sim \mathsf{H}_{\times }} [ {\mathcal {U}}^A ( \Psi ^{AR} ) ]$. The function $\alpha (J)$ is 0 for $J=1$ and $\frac{1}{J-1}$ for $J\ge 2$, and $\beta (A_r)$ is 0 for $\mathrm{dim}{\mathcal {H}}^{A_r}=1$ and 1 for $\mathrm{dim}{\mathcal {H}}^{A_r}\ge 2$. The exponents ${\tilde{H}}_I$ and ${\tilde{H}}_{I\!I}$ are given by

$$\begin{aligned} {\tilde{H}}_I= H_{\mathrm{min}}^\epsilon (A|R)_{\Psi } + H_{\mathrm{min}}^\mu (A|E)_{\tau }, \quad {\tilde{H}}_{I\!I}= H_{\mathrm{min}}^\epsilon (A|R)_{{\mathcal {C}}(\Psi )} + H_{\mathrm{min}}^\mu (A|E)_{{\mathcal {C}}(\tau )}. \end{aligned}$$

(126)

Note that, the duality of the conditional smooth entropies for pure states (Lemma 24) implies $H_{\mathrm{min}}^\mu (A|E)_{\tau }=-H_{\mathrm{max}}^\mu (A|B)_{{\mathcal {C}}(\tau )}$ and $H_{\mathrm{min}}^\mu (A|E)_{{\mathcal {C}}(\tau )}=-H_{\mathrm{max}}^\mu (A_r|BA_c)_{{\mathcal {C}}(\tau )}$ (see Sect. 7.3 for the detail).

To prove the statement, we again start with the triangle inequaltiy: By the triangle inequality, we have

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) -{\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} ) \bigr |\! \bigr |_1 \bigr ]\nonumber \\&\quad \le {\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} - \Psi _{\mathrm{dp}}^{AR} ) \bigr |\! \bigr |_1 \bigr ] \nonumber \\&\qquad + {\mathbb {E}}_{\sigma ,U } \bigl [ \bigl |\! \bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi _{\mathrm{dp}}^{AR} - \Psi _{\mathrm{av}}^{AR} ) \bigr |\! \bigr |_1 \bigr ]. \end{aligned}$$

(127)

Below, we derive upper bounds on the two terms in the R.H.S. separately.

For an upper bound on the first term, fix ${\hat{\Psi }}\in {\mathcal {B}}^\epsilon (\Psi )$ and ${\hat{\tau }}\in {\mathcal {B}}^\mu (\tau )$ so that we have $H_{\mathrm{min}}(A|R)_{{{\hat{\Psi }}}}=H_{\mathrm{min}}^\epsilon (A|R)_{\Psi }$ and $H_{\mathrm{min}}(A|E)_{{{\hat{\tau }}}}=H_{\mathrm{min}}^\epsilon (A|E)_{\tau }$. Let $\Delta _+^{A'E}$ and $\Delta _-^{A'E}$ be linear operators on ${\mathcal {H}}^{A'}\otimes {\mathcal {H}}^E$ such that

$$\begin{aligned} \Delta _+^{A'E}\ge 0,\; \Delta _-^{A'E}\ge 0, \; \mathrm{supp}[\Delta _+^{A'E}]\perp \mathrm{supp}[\Delta _-^{A'E}] \end{aligned}$$

(128)

and that

$$\begin{aligned} \tau ^{A'E} -{\hat{\tau }}^{A'E}=\Delta _{+}^{A'E}-\Delta _{-}^{A'E}. \end{aligned}$$

(129)

Let ${\mathcal {D}}_+^{A\rightarrow E}$ and ${\mathcal {D}}_-^{A\rightarrow E}$ be superoperators such that

$$\begin{aligned} {\mathcal {D}}_{+}^{A\rightarrow E}(\Phi ^{AA'})=\Delta _{+}^{A'E}, \quad {\mathcal {D}}_{-}^{A\rightarrow E}(\Phi ^{AA'})=\Delta _{-}^{A'E}, \end{aligned}$$

(130)

which yields ${\mathcal {T}}-\hat{{\mathcal {T}}}={\mathcal {D}}_+-{\mathcal {D}}_-$. From Lemma 11, the CP map $\hat{{\mathcal {T}}}^{A \rightarrow E}$ having the Choi–Jamiołkowski state ${\hat{\tau }}^{AE}$ satisfies

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \left[ \left\| {{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{dp}}^{AR} )\right\| _1 \right] \nonumber \\&\quad \le {\mathbb {E}}_{\sigma ,U } \left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\hat{\Psi }}^{AR} - {\hat{\Psi }}_{\mathrm{dp}}^{AR} )\right\| _1 \right] \nonumber \\&\qquad +2\, {\mathbb {E}}_{\sigma } \left[ {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} )] \right] \nonumber \\&\qquad +2\,{\mathbb {E}}_{\sigma ,U} \left[ {\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (\delta _+^{AR}+\delta _-^{AR})] \right] . \end{aligned}$$

(131)

Due to Lemma 38, the first term in the R.H.S. of the above inequality is bounded as

$$\begin{aligned} {\mathbb {E}}_{\sigma ,U } \left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\hat{\Psi }}^{AR} - {\hat{\Psi }}_{\mathrm{dp}}^{AR} )\right\| _1 \right] \le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}(A|R)_{{{\hat{\Psi }}}}-\frac{1}{2}H_{\mathrm{min}}(A|E)_{{{\hat{\tau }}}}}. \end{aligned}$$

(132)

Similarly to (101) and (102), using (128) and (129), it turns out that the second and the third terms are bounded from above by

$$\begin{aligned} {\mathbb {E}}_{\sigma } \left[ {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} )] \right] \le \mu \end{aligned}$$

(133)

and

$$\begin{aligned} {\mathbb {E}}_{\sigma ,U} \left[ {\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (\delta _+^{AR}+\delta _-^{AR})] \right] \le \epsilon \cdot {\mathrm {Tr}}[\tau ]+\epsilon \mu , \end{aligned}$$

(134)

respectively. Hence, we obtain

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \left[ \left\| {{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{dp}}^{AR} )\right\| _1 \right] \nonumber \\&\quad \le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}^\epsilon (A|R)_{\Psi }-\frac{1}{2}H_{\mathrm{min}}^\mu (A|E)_{\tau }} + 2( \epsilon \cdot {\mathrm {Tr}}[\tau ]+\mu +\epsilon \mu ). \end{aligned}$$

(135)

In the same way, we also have

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi _{\mathrm{dp}}^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1 \right] \\&\quad \le \beta (A_r)\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}^\epsilon (A|R)_{\Psi _{\mathrm{dp}}}-\frac{1}{2}H_{\mathrm{min}}^\mu (A|E)_{\tau _{\mathrm{dp}}}} + 2(\epsilon \cdot {\mathrm {Tr}}[\tau ]+\mu +\epsilon \mu ). \end{aligned}$$

Substituting these inequalities into Eq. (127), we obtain the desired result (Ineq. (125)).

$\square $

7.3 Dropping working assumptions WA 1 and WA 2

We now drop the working assumptions WA 1 and WA 2, and show that Theorem 3 holds in general. To remind the working assumptions, we write them down here again:

WA 1:

$E\cong E_cE_r$, where $E_c$ is a quantum system of dimension J

WA 2:

The CP map ${\mathcal {T}}^{A \rightarrow E}$ is decomposed into

$$\begin{aligned} {\mathcal {T}}^{A \rightarrow E}(X)=\sum _{j,k=1}^J{{|j\rangle }\!{\langle k|}}^{E_c}\otimes {\mathcal {T}}_{jk}^{A_r \rightarrow E_r}(X_{jk}), \end{aligned}$$

(136)

in which ${\mathcal {T}}_{jk}$ is a linear supermap from ${\mathcal {L}}({\mathcal {H}}^{A_r})$ to ${\mathcal {L}}({\mathcal {H}}^{E_r})$ defined by ${\mathcal {T}}_{jk}(\zeta )={\mathcal {T}}({{|j\rangle }\!{\langle k|}}\otimes \zeta )$ for each j, k,

To drop these assumptions, we use Lemma 12. Using the linear isometry $Y^{A_c\rightarrow A_cE_c}$, given by $Y=\sum _{j}{|jj\rangle }^{A_cE_c}{\langle j|}^{A_c}$, we define a new CP map $\check{{\mathcal {T}}}^{A \rightarrow EE_c}$ by ${\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {Y}}^{A_c \rightarrow A_c E_c}$. Lemma 12 states that

$$\begin{aligned} \left\| \check{{\mathcal {T}}}^{A \rightarrow EE_c} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1&= \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1. \end{aligned}$$

(137)

Let $\check{\tau }^{AEE_c}$ be the Choi–Jamiołkowski state of ${\check{{\mathcal {T}}}}^{A \rightarrow EE_c}$, i.e., $\check{\tau }^{AEE_c}:={\mathfrak {J}}(\check{{\mathcal {T}}}^{A \rightarrow EE_c})$. We denote by $|\tau \rangle ^{ABE}$ a purification of $\tau ^{AE}$ such that the reduced state $\tau ^{AB}$ is equal to ${\mathfrak {J}}({\mathcal {T}}^{A\rightarrow B})$, where ${\mathcal {T}}^{A\rightarrow B}$ is the complementary map of ${\mathcal {T}}^{A\rightarrow E}$. Then, it is clear that $\check{\tau }^{AEE_c}={\mathcal {Y}}(\tau ^{AE})$, which implies that a purification $|\check{\tau }\rangle ^{ABEE_c}$ of $\check{\tau }^{AEE_c}$ is given by $|\check{\tau }\rangle ^{ABEE_c}=Y|\tau \rangle ^{ABE}$. It is also straightforward to verify that $\check{\tau }^{AB}={\mathcal {C}}(\tau ^{AB})$.

The new CP map $\check{{\mathcal {T}}}^{A \rightarrow EE_c}$ clearly satisfies WA 1 and WA 2. Hence, using Eq. (137) and achievability of the randomized partial decoupling under those assumptions (Ineq. (125)), we obtain

$$\begin{aligned}&{\mathbb {E}}_{\sigma ,U } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1 \right] \nonumber \\&\quad = {\mathbb {E}}_{\sigma ,U } \left[ \left\| \check{{\mathcal {T}}}^{A \rightarrow EE_c} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1 \right] \nonumber \\&\quad \le \sqrt{\alpha (J)}\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}^\epsilon (A|R)_{\Psi }-\frac{1}{2}H_{\mathrm{min}}^{\mu }(A|EE_c)_{\check{\tau }}} \nonumber \\&\qquad +\beta (A_r)\cdot 2^{-\frac{1}{2}H_{\mathrm{min}}^\epsilon (A|R)_{{\mathcal {C}}(\Psi )}-\frac{1}{2}H_{\mathrm{min}}^{\mu }(A|EE_c)_{{\mathcal {C}}(\check{\tau })}} + 4( \epsilon \cdot {\mathrm {Tr}}[\check{\tau }]+\mu +\epsilon \mu ). \end{aligned}$$

(138)

Due to the duality of conditional smooth entropies (Lemma 24), we have

$$\begin{aligned} H_{\mathrm{min}}^{\mu }(A|EE_c)_{\check{\tau }} = -H_{\mathrm{max}}^{\mu }(A|B)_{\check{\tau }} = -H_{\mathrm{max}}^{\mu }(A|B)_{{\mathcal {C}}(\tau )}. \end{aligned}$$

(139)

Using the property of the conditional smooth entropy for classical-quantum states (Lemma 27), and noting that $\check{\tau }^{AEE_c}$ is classically coherent in $A_cE_c$, we also have

$$\begin{aligned} H_{\mathrm{min}}^{\mu }(A|EE_c)_{{\mathcal {C}}(\check{\tau })} = H_{\mathrm{min}}^{\mu }(A_r|EE_c)_{\check{\tau }} = -H_{\mathrm{max}}^{\mu }(A_r|BA_c)_{\check{\tau }} = -H_{\mathrm{max}}^{\mu }(A_r|BA_c)_{{\mathcal {C}}(\tau )}. \end{aligned}$$

(140)

Substituting these into (138), and noting that ${\mathrm {Tr}}[\check{\tau }]={\mathrm {Tr}}[\tau ]\le 1$ by assumption, we obtain Theorem 3. $\square $

8 Proof of the Converse

We provide the proof of Theorem 4 under Converse Conditions 1 and 2, which are

CC 1:: $\dim {\mathcal {H}}_j^l=1,\quad \dim {\mathcal {H}}_j^r=r \quad (j=1,\ldots , J)$.
CC 2:: The initial (normalized) state $\Psi ^{AR}$ is classically coherent in $A_cR_c$.

The proof proceeds along the similar line as the proof of the converse part of the one-shot decoupling theorem (see Section 4 in [11]). Suppose that there exists a normalized state $\Omega ^{ER}:=\sum _{j=1}^J\varsigma _j^E\otimes \Psi _{jj}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}$, where $\{\varsigma _j\}_{j=1}^J$ are normalized states on E, such that, for $\delta >0$,

$$\begin{aligned} \left\| {\mathcal {T}}^{A \rightarrow E} ( \Psi ^{AR} ) -\Omega ^{ER} \right\| _1 \le \delta . \end{aligned}$$

(141)

We separately prove that, in this case, the following inequalities hold for any $\upsilon \in [0,1/2)$ and $\iota \in (0,1]$:

$$\begin{aligned}&H_{\mathrm{min}}^{\lambda }(A|R)_\Psi + H_{\mathrm{max}}^{\upsilon }(RD|E)_{{\mathcal {T}}(\Psi )} +\log {J} \ge \log {\iota }, \end{aligned}$$

(142)

$$\begin{aligned}&H_{\mathrm{min}}^{\lambda '}(A|R)_{{\mathcal {C}}(\Psi )} + H_{\mathrm{max}}^{\upsilon }(RD|E)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )} \ge \log {\iota }+\log {(1-2\upsilon )}. \end{aligned}$$

(143)

Here, $\lambda $ and $\lambda '$ are given by

$$\begin{aligned}&\lambda := 2\sqrt{\iota +4\sqrt{20\upsilon +2\delta }} +\sqrt{2\sqrt{20\upsilon +2\delta }}+2\sqrt{2\delta } +2\sqrt{20\upsilon +2\delta } +3\upsilon , \end{aligned}$$

(144)

$$\begin{aligned}&\lambda ':= \upsilon +\sqrt{4\sqrt{\iota +2x}+2\sqrt{x}+(4\sqrt{\iota +8}+24) x} \end{aligned}$$

(145)

and $x:=\sqrt{2}\root 4 \of {24\upsilon +2\delta }$.

First, we prove these relations based on the working assumptions WA 1 and WA 2 in Sects. 8.1 and 8.2. We complete the proof of Theorem 4 by dropping these assumptions in Sect. 8.3.

8.1 Proof of Ineq. (142) under WA 1 and WA 2

To prove Ineq. (142), we introduce the following notations:

${|\Psi \rangle }^{ARD}$: A purification of $\Psi ^{AR}$.
$V^{A\rightarrow BE}$: A Stinespring dilation of ${\mathcal {T}}^{A\rightarrow E}$.
${|\Theta \rangle }^{BERD}$: A pure state on BERD defined by ${|\Theta \rangle }:=V{|\Psi \rangle }$.
${|\theta \rangle }^{BERD}$: A subnormalized pure state on BERD such that
$$\begin{aligned} H_{\mathrm{max}}(RD|E)_\theta =H_{\mathrm{max}}^{\upsilon }(RD|E)_{\Theta }, \quad P(\theta ^{BERD},\Theta ^{BERD})\le \upsilon , \end{aligned}$$
(146)
which is classically coherent in $E_cR_c$.

Note that the existence of ${|\theta \rangle }$ satisfying the above condition follows from Lemma 30 about the property of the conditional max-entropy for classically coherent states. From the definition of the conditional max-entropy, and from the definitions of $\theta $ and $\Theta $, we have

$$\begin{aligned} H_{\mathrm{max}}(RD|E)_{\theta |\theta } \le H_{\mathrm{max}}(RD|E)_{\theta } = H_{\mathrm{max}}^{\upsilon }(RD|E)_{\Theta } = H_{\mathrm{max}}^{\upsilon }(RD|E)_{{\mathcal {T}}(\Psi )}. \end{aligned}$$

(147)

The proof of Ineq. (142) proceeds as follows. First, we prove that for any $X\in {\mathcal {P}}({\mathcal {H}}^{ER})$, we can construct a subnormalized pure state ${|\theta _X\rangle }^{BERD}$ from $\theta $ and X such that

$$\begin{aligned} \theta ^{BER}_X \le \frac{ 2^{H_{\mathrm{max}}(RD|E)_{\theta |\theta }} }{\iota } \cdot I^B \otimes X^{ER}. \end{aligned}$$

(148)

Second, we prove that if $X^{ER}$ satisfies certain conditions, the $\theta _X$ satisfies

$$\begin{aligned} H_{\mathrm{min}}(BE|R)_{\theta _X} \le H_{\mathrm{min}}^{\lambda }(A|R)_{\Psi }. \end{aligned}$$

(149)

Third, we prove that for a proper choice of $X^{ER}$ satisfying the conditions for (149), Ineq. (148) implies

$$\begin{aligned} H_{\mathrm{min}}(BE|R)_{\theta _X} +H_{\mathrm{max}}(RD|E)_{\theta |\theta } +\log {J} \ge \log {\iota }. \end{aligned}$$

(150)

Combining (147), (149) and (150), we arrive at (142).

Before we start, we remark that the partial decoupling condition (141) is used in the proof of (149), particularly when we evaluate the smoothing parameter $\lambda $.

8.1.1 Proof of Ineq. (148)

Define $ Y^{ERD}:= 2^{-\frac{1}{2}H_{\mathrm{max}}(RD|E)_{\theta |\theta }} \cdot (\theta ^E)^{-\frac{1}{2}} \sqrt{ (\theta ^E)^{\frac{1}{2}} \theta ^{ERD}(\theta ^E)^{\frac{1}{2}} } (\theta ^E)^{-\frac{1}{2}}. $ Due to Lemma 25, it holds that $ \theta ^{BERD} \le 2^{H_{\mathrm{max}}(RD|E)_{\theta |\theta }} \cdot I^B \otimes Y^{ERD} $ and thus

$$\begin{aligned} \theta ^{BER} \le 2^{H_{\mathrm{max}}(RD|E)_{\theta |\theta }} \cdot I^B \otimes Y^{ER}. \end{aligned}$$

(151)

Let $X\in {\mathcal {P}}({\mathcal {H}}^{ER})$ be an arbitrary positive semidefinite operator, and define

$$\begin{aligned} \Gamma _X^{ER}&:= \sqrt{1-\iota }\cdot (X^{ER})^{\frac{1}{2}}((1-\iota )\cdot X^{ER}+\iota \cdot Y^{ER})^{-\frac{1}{2}} \end{aligned}$$

(152)

and ${|\theta _X\rangle }^{BERD}:=\Gamma _X^{ER}{|\theta \rangle }^{BERD}$. From (151), $X\ge 0$ and the assumption that $\iota \le 1$, it follows that

$$\begin{aligned} \theta ^{BER} \le \frac{ 2^{H_{\mathrm{max}}(RD|E)_{\theta |\theta }} }{\iota } \cdot I^B \otimes ((1-\iota )\cdot X^{ER}+\iota \cdot Y^{ER}), \end{aligned}$$

(153)

and consequently,

$$\begin{aligned} \theta ^{BER}_X&= \Gamma _X^{ER}\theta ^{BER}\Gamma _X^{\dagger ER} \le \frac{ (1-\iota )\cdot 2^{H_{\mathrm{max}}(RD|E)_{\theta |\theta }} }{\iota } \cdot I^B \otimes X^{ER} \nonumber \\&\le \frac{ 2^{H_{\mathrm{max}}(RD|E)_{\theta |\theta }} }{\iota } \cdot I^B \otimes X^{ER}. \end{aligned}$$

(154)

8.1.2 Proof of Ineq. (149)

Define a subnormalized probability distribution $\bigl \{q_k :=\Vert {\langle k|}^{R_c}{|\theta \rangle }\Vert _1^2 \bigr \}_{k=1}^J$, and normalized pure states ${|\theta _k\rangle }^{E_rR_r}$ by ${|\theta _k\rangle }^{E_rR_r}:=q_k^{-1/2}{\langle k|}^{E_c}{\langle k|}^{R_c}{|\theta \rangle }$ for k such that $q_k>0$. Let $\omega \in {\mathcal {S}}_\le ({\mathcal {H}}^{ER})$ be a subnormalized state defined by

$$\begin{aligned} \omega ^{ER}&:=\sum _{k:q_k>0}q_k{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \theta _k^{E_r}\otimes \theta _k^{R_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}, \end{aligned}$$

(155)

where $\theta _k^{E_r}$ and $\theta _k^{R_r}$ are reduced states of ${|\theta _k\rangle }$ on $E_r$ and $R_r$, respectively. Consider an arbitrary $X\in {\mathcal {P}}({\mathcal {H}}^{ER})$ so that

$$\begin{aligned}{}[(X^{ER})^{-\frac{1}{2}},\omega ^{ER}]=0 \end{aligned}$$

(156)

and

$$\begin{aligned} (\theta ^E)^{-\frac{1}{2}}(X^{ER})^{-\frac{1}{2}}\omega ^{ER}(X^{ER})^{-\frac{1}{2}}(\theta ^E)^{-\frac{1}{2}} =\sum _{k:q_k>0}{{|k\rangle }\!{\langle k|}}^{E_c}\otimes I_k^{E_r}\otimes I_k^{R_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}. \end{aligned}$$

(157)

As we prove in Appendix F, for any such X, the state ${|\theta _X\rangle }$ is a subnormalized pure state, and the partial decoupling condition (141) implies

$$\begin{aligned} P(\theta ^{BER}_X,\Theta ^{BER}) \le \lambda , \end{aligned}$$

(158)

where $\lambda $ is defined by (144). Due to the definition of $\Theta $ and the invariance of min-entropy under local isometry (Lemma 21), we obtain

$$\begin{aligned} H_{\mathrm{min}}(BE|R)_{\theta _X} \le H_{\mathrm{min}}^{\lambda }(BE|R)_{\Theta } =H_{\mathrm{min}}^{\lambda }(A|R)_{\Psi }. \end{aligned}$$

(159)

8.1.3 Proof of Ineq. (150)

We choose a proper $X^{ER}$ satisfying Conditions (156) and (157), and prove Ineq. (150) from (148). Define a normalized state

$$\begin{aligned} {\hat{\theta }}^R:=\frac{1}{J'}\sum _{k:q_k>0}{{|k\rangle }\!{\langle k|}}^{R_c}\otimes \theta _k^{R_r} \end{aligned}$$

(160)

where $J':=|\{k|1\le k\le J,\,q_k>0\}|$, and $X^{ER}:=J'\cdot I^E\otimes {\hat{\theta }}^R$. Noting that $\theta $ is classically coherent in $E_cR_c$, it is straightforward to verify that

$$\begin{aligned} (X^{ER})^{-\frac{1}{2}} = \sum _{k:q_k>0}I^E\otimes {{|k\rangle }\!{\langle k|}}^{R_c}\otimes (\theta _k^{R_r})^{-\frac{1}{2}}, \quad (\theta ^E)^{-\frac{1}{2}} = \sum _{k:q_k>0} q_k^{-\frac{1}{2}} {{|k\rangle }\!{\langle k|}}^{E_c}\otimes (\theta _k^{E_r})^{-\frac{1}{2}}. \end{aligned}$$

(161)

Consequently, $X^{ER}$ satisfies Conditions (156) and (157).

Using Ineq. (148), we have

$$\begin{aligned} \theta ^{BER}_X \le \frac{ J'\cdot 2^{H_{\mathrm{max}}(RD|E)_{\theta |\theta }} }{\iota } I^{BE} \otimes {\hat{\theta }}^R, \end{aligned}$$

(162)

which implies, together from the definition of the conditional min-entropy and $J'\le J$, that

$$\begin{aligned} H_{\mathrm{min}}(BE|R)_{\theta _X} +H_{\mathrm{max}}(RD|E)_{\theta |\theta } +\log {J} \ge \log {\iota }. \end{aligned}$$

(163)

8.2 Proof of Ineq. (143) under WA 1 and WA 2

We prove (143), that is,

$$\begin{aligned} H_{\mathrm{min}}^{\lambda '}(A|R)_{{\mathcal {C}}(\Psi )} + H_{\mathrm{max}}^{\upsilon }(RD|E)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )} \ge \log {\iota }+\log {(1-2\upsilon )}, \end{aligned}$$

(164)

under the assumptions WA 1 and WA 2. To show this, we introduce the following notations:

${|\Psi \rangle }^{ARD}$: A purification of $\Psi ^{AR}$, in the same way as in the previous subsection.
${\mathcal {T}}_{{\mathcal {C}}}^{A\rightarrow E}$: A trace preserving CP map defined by ${\mathcal {T}}_{{\mathcal {C}}}^{A\rightarrow E}:={\mathcal {T}}^{A\rightarrow E}\circ {\mathcal {C}}^A$.
$\Theta _{{\mathcal {C}}}^{ERD}$: A normalized state on ERD defined by $\Theta _{{\mathcal {C}}}^{ERD}:={\mathcal {T}}^{A\rightarrow E}\circ {\mathcal {C}}^A(\Psi ^{ARD})$.
$\theta _{{\mathcal {C}}}^{ERD}$: A subnormalized state on ERD such that $H_{\mathrm{max}}(RD|E)_{\theta _{{\mathcal {C}}}}=H_{\mathrm{max}}^{\upsilon }(RD|E)_{\Theta _{{\mathcal {C}}}}$ and $P(\theta _{{\mathcal {C}}},\Theta _{{\mathcal {C}}})\le \upsilon $, which is classically coherent and diagonal in $E_cR_c$.
${\hat{\theta }}_{{\mathcal {C}}}^{ERD}$: A normalized state on ERD defined by ${\hat{\theta }}_{{\mathcal {C}}}^{ERD}:=\theta _{{\mathcal {C}}}^{ERD}/\mathrm{Tr}[{\theta _{{\mathcal {C}}}}]$.

The assumptions WA 1 and WA 2 imply that $\Theta _{{\mathcal {C}}}^{ERD}$ is classically coherent and diagonal in $E_cR_c$. Thus, the existence of $\theta _{{\mathcal {C}}}$ satisfying the above condition follows from Lemma 30. By definition, we have

$$\begin{aligned} H_{\mathrm{max}}^{\upsilon }(RD|E)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )} = H_{\mathrm{max}}(RD|E)_{\theta _{{\mathcal {C}}}} = H_{\mathrm{max}}(RD|E)_{{\hat{\theta }}_{{\mathcal {C}}}} + \log \mathrm{Tr}[\theta _{{\mathcal {C}}}]. \end{aligned}$$

(165)

The proof of Ineq. (164) proceeds as follows. First, we introduce a quantum state ${\hat{\Psi }}^{ARD}$ and a quantum channel $\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}$, such that $\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{ARD})$ is close to the state ${\mathcal {T}}_{{\mathcal {C}}}^{A\rightarrow E}(\Psi ^{ARD})$. Second, we apply the converse inequality (142) to the channel $\hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}$ and the state ${\hat{\Psi }}_{k}^{A_rR_r}$, which are obtained by restricting $\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}$ and ${\hat{\Psi }}^{ARD}$ to the k-th subspace. The obtained inequalities are then averaged over all k. Finally, by using the properties of the smooth entropies, we obtain Ineq. (164).

To explicitly define ${\hat{\Psi }}^{ARD}$ and $\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}$, observe that, since $\Theta _{{\mathcal {C}}}$ is a normalized state, we have

$$\begin{aligned} P(\Theta _{{\mathcal {C}}}^{RD},{\hat{\theta }}_{{\mathcal {C}}}^{RD}) \le P(\Theta _{{\mathcal {C}}}^{ERD},{\hat{\theta }}_{{\mathcal {C}}}^{ERD}) \le P(\Theta _{{\mathcal {C}}}^{ERD},\theta _{{\mathcal {C}}}^{ERD}) \le \upsilon . \end{aligned}$$

(166)

Thus, due to Uhlmann’s theorem, and noting that $\Theta _{{\mathcal {C}}}^{RD}=\Psi ^{RD}$, there exists a normalized pure state $|{\hat{\Psi }}\rangle ^{ARD}$ such that $P(\Psi ^{ARD},{\hat{\Psi }}^{ARD})\le \upsilon $ and ${\hat{\Psi }}^{RD}={\hat{\theta }}_{{\mathcal {C}}}^{RD}$. It follows from the latter equality that there exists a trace preserving CP map $\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}$ satisfying ${\hat{\theta }}_{{\mathcal {C}}}^{ERD}=\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{ARD})$.

8.2.1 Block-wise application of the converse inequality (142)

Define a normalized probability distribution $\{r_k:=\Vert {\langle k|}^{R_c}{|{\hat{\Psi }}\rangle }\Vert _1^2\}_{k=1}^J$, and let $ |{\hat{\Psi }}_{k}\rangle ^{A_rR_rD}:=r_k^{-1/2}{\langle k|}^{E_c}{\langle k|}^{R_c}{|{\hat{\Psi }}\rangle } $ for k such that $r_k>0$. Since ${\hat{\Psi }}$ is classically coherent in $E_cR_c$, the ${\hat{\Psi }}_{k}$ are normalized states. Define also a CP map $\hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}$ by

$$\begin{aligned} \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\tau ) = {{|k\rangle }\!{\langle k|}}^{E_c}\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({{|k\rangle }\!{\langle k|}}^{A_c}\otimes \tau ^{A_r}){{|k\rangle }\!{\langle k|}}^{E_c}, \end{aligned}$$

(167)

which is trace preserving due to the assumptions WA1 and WA2. We apply the converse inequality (142) for ${\hat{\Psi }}_{k}$ and $\hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}$ for each k, by letting $J=1$. We particularly choose $\upsilon =0$, in which case Ineq. (142) leads to

$$\begin{aligned} H_{\mathrm{min}}^{\lambda _k}(A_r|R_r)_{{\hat{\Psi }}_{k}} + H_{\mathrm{max}}(R_rD|E_r)_{\hat{{\mathcal {T}}}_{{\mathcal {C}},k}({\hat{\Psi }}_{k})} \ge \log {\iota }. \end{aligned}$$

(168)

The smoothing parameter $\lambda _k$ is given by

$$\begin{aligned} \lambda _k:=2\sqrt{\iota +4\sqrt{2\delta _k}} +\sqrt{2\sqrt{2\delta _k}}+4\sqrt{2\delta _k}, \quad \delta _k:= \left\| \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_r}) - \varsigma _k^E\otimes {\hat{\Psi }}_{k}^{R_r} \right\| _1. \end{aligned}$$

(169)

A simple calculation yields

$$\begin{aligned} - \log { \left( \sum _kr_k\cdot 2^{-H_{\mathrm{min}}^{\lambda _k}(A_r|R_r)_{{\hat{\Psi }}_{k}}} \right) }&\ge - \log { \left( \sum _kr_k\cdot 2^{ H_{\mathrm{max}}(R_rD|E_r)_{\hat{{\mathcal {T}}}_{{\mathcal {C}},k}({\hat{\Psi }}_{k})} } \right) } + \log {\iota }. \end{aligned}$$

(170)

8.2.2 Calculation of averaged entropies

Using the fact that ${\hat{\theta }}_{{\mathcal {C}}}$ is classically coherent and diagonal in $E_cR_c$, it is straightforward to verify that $ {\hat{\theta }}_{{\mathcal {C}}}^{ERD}=\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{ARD})=\sum _kr_k\hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_rD})\otimes {{|k\rangle }\!{\langle k|}}^{R_c} $. Thus, by using the property of the smooth conditional entropies (Lemmas 26 and 31 ) and $P(\Psi ,{\hat{\Psi }})\le \upsilon $, both sides of Ineq. (170) are calculated to be

$$\begin{aligned}&- \log { \left( \sum _kr_k\cdot 2^{ H_{\mathrm{max}}(R_rD|E_r)_{\hat{{\mathcal {T}}}_{{\mathcal {C}},k}({\hat{\Psi }}_{k})} } \right) } = -H_{\mathrm{max}}(RD|E)_{{\hat{\theta }}_{{\mathcal {C}}}}, \end{aligned}$$

(171)

$$\begin{aligned}&- \log { \left( \sum _kr_k\cdot 2^{-H_{\mathrm{min}}^{\lambda _k}(A_r|R_r)_{{\hat{\Psi }}_{k}}} \right) } \le H_{\mathrm{min}}^{\sqrt{2{\bar{\lambda }}}}(A|R)_{{\mathcal {C}}({\hat{\Psi }})} \le H_{\mathrm{min}}^{\upsilon +\sqrt{2{\bar{\lambda }}}}(A|R)_{{\mathcal {C}}(\Psi )}, \end{aligned}$$

(172)

where ${\bar{\lambda }}:=\sum _kr_k\lambda _k$. Combining these all together with Eq. (165), we obtain

$$\begin{aligned} H_{\mathrm{min}}^{\upsilon +\sqrt{2{\bar{\lambda }}}}(A|R)_{{\mathcal {C}}(\Psi )} + H_{\mathrm{max}}^{\upsilon }(RD|E)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )} \ge \log {\iota } + \log \mathrm{Tr}[\theta _{{\mathcal {C}}}]. \end{aligned}$$

(173)

As we prove in Appendix G, the partial decoupling condition (141) implies

$$\begin{aligned} {\bar{\lambda }}\le \lambda (\iota ,\sqrt{2}\root 4 \of {24\upsilon +2\delta })+\lambda (\iota ,4)\cdot \sqrt{2}\root 4 \of {24\upsilon +2\delta }, \end{aligned}$$

(174)

where $\lambda (\iota ,x):=2\sqrt{\iota +2x}+\sqrt{x}+2x$. A simple calculation then yields

$$\begin{aligned} \upsilon +\sqrt{2{\bar{\lambda }}} \le \upsilon +\sqrt{4\sqrt{\iota +2x}+2\sqrt{x}+(4\sqrt{\iota +8}+24) x}, \end{aligned}$$

(175)

whose right-hand side is exactly $\lambda '$ given in (145). In addition, noting that $\Theta _{{\mathcal {C}}}$ is normalized, and by using the relation between the purified distance and the trace distance (Property 2 in Lemma 16), the last term in the R.H.S. of (173) is calculated to be

$$\begin{aligned} \mathrm{Tr}[\theta _{{\mathcal {C}}}]\ge \Vert \Theta _{{\mathcal {C}}}\Vert _1-\left\| \theta _{{\mathcal {C}}}-\Theta _{{\mathcal {C}}}\right\| _1 \ge 1-2P(\theta _{{\mathcal {C}}},\Theta _{{\mathcal {C}}}) \ge 1-2\upsilon . \end{aligned}$$

(176)

Combining these all together, we arrive at

$$\begin{aligned} H_{\mathrm{min}}^{\lambda '}(A|R)_{{\mathcal {C}}(\Psi )} + H_{\mathrm{max}}^{\upsilon }(RD|E)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )} \ge \log {\iota } + \log {(1-2\upsilon )}. \end{aligned}$$

(177)

8.3 Dropping the working assumptions WA 1 and WA 2

We here show that the working assumptions WA 1 and WA 2 can be dropped. The proof is based on Lemma 12. Since the CP map $\check{{\mathcal {T}}}^{A \rightarrow E E_c}$, defined in Lemma 12, satisfy both conditions, it satisfies Ineq. (142), which is

$$\begin{aligned}&H_{\mathrm{min}}^{\lambda }(A|R)_\Psi + H_{\mathrm{max}}^{\upsilon }(RD|EE_c)_{\check{{\mathcal {T}}}(\Psi )} +\log {J} \ge \log {\iota }. \end{aligned}$$

(178)

Let $V^{A\rightarrow BE}$ be a Stinespring dilation of ${\mathcal {T}}^{A\rightarrow E}$, and let $Z^{R_c\rightarrow R_cE_c}$ be a linear isometry defined by $Z:=\sum _{j}{|jj\rangle }^{R_cE_c}{\langle j|}^{R_c}$. A purification $|\vartheta \rangle ^{BRDEE_c}$ of $\check{{\mathcal {T}}}^{A\rightarrow EE_c}(\Psi ^{ARD})$ is given by $|\vartheta \rangle ^{BRDEE_c}=(V^{A\rightarrow BE}\otimes Z^{R_c\rightarrow R_cE_c})|\Psi \rangle ^{ARD}$, and satisfies $\vartheta ^{BRD}={\mathcal {T}}^{A\rightarrow B}\circ {\mathcal {C}}^A(\Psi ^{ARD})$. Hence, due to the duality for the conditional smooth entropy (Lemma 24), it holds that

$$\begin{aligned} H_{\mathrm{max}}^{\upsilon }(RD|EE_c)_{\check{{\mathcal {T}}}(\Psi )} = H_{\mathrm{max}}^{\upsilon }(RD|EE_c)_{\vartheta } = -H_{\mathrm{min}}^{\upsilon }(RD|B)_{\vartheta } = -H_{\mathrm{min}}^{\upsilon }(RD|B)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )}. \end{aligned}$$

(179)

Combining this with (178), we conclude

$$\begin{aligned} H_{\mathrm{min}}^{\lambda }(A|R)_\Psi -H_{\mathrm{min}}^{\upsilon }(RD|B)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )}+\log {J} \ge \log {\iota }. \end{aligned}$$

(180)

The map $\check{{\mathcal {T}}}^{A \rightarrow E E_c}$ also satisfies Ineq. (143):

$$\begin{aligned} H_{\mathrm{min}}^{\lambda '}(A|R)_{{\mathcal {C}}(\Psi )} + H_{\mathrm{max}}^{\upsilon }(RD|EE_c)_{\check{{\mathcal {T}}}\circ {\mathcal {C}}(\Psi )} \ge \log {\iota }+\log {(1-2\upsilon )}. \end{aligned}$$

(181)

Similarly to (179) and (140), by using the property of the conditional max entropy for classical-quantum states (Lemma 29), we have

$$\begin{aligned}&H_{\mathrm{max}}^{\upsilon }(RD|EE_c)_{\check{{\mathcal {T}}}\circ {\mathcal {C}}(\Psi )} = H_{\mathrm{max}}^{\upsilon }(R_rD|EE_c)_{\check{{\mathcal {T}}}\circ {\mathcal {C}}(\Psi )} = H_{\mathrm{max}}^{\upsilon }(R_rD|EE_c)_{\check{{\mathcal {T}}}(\Psi )} \nonumber \\&\quad = H_{\mathrm{max}}^{\upsilon }(R_rD|EE_c)_{\vartheta } = -H_{\mathrm{min}}^{\upsilon }(R_rD|BR_c)_{\vartheta } = -H_{\mathrm{min}}^{\upsilon }(R_rD|BR_c)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )}, \end{aligned}$$

(182)

which leads to

$$\begin{aligned} H_{\mathrm{min}}^{\lambda '}(A|R)_{{\mathcal {C}}(\Psi )} -H_{\mathrm{min}}^{\upsilon }(R_rD|BR_c)_{{\mathcal {T}}\circ {\mathcal {C}}(\Psi )} \ge \log {\iota }+\log {(1-2\upsilon )}. \end{aligned}$$

(183)

This concludes the proof of Theorem 4 for any trace preserving CP map ${\mathcal {T}}^{A \rightarrow E}$. $\square $

9 Conclusion

In this paper, we have proposed and analyzed a task that we call partial decoupling. We have presented two different formulations of partial decoupling, and derived lower and upper bounds on how precisely partial decoupling can be achieved. The bounds are represented in terms of the smooth conditional entropies of quantum states involving the initial state, the channel and the decomposition of the Hilbert space. Thereby we provided a generalization of the decoupling theorem in the version of [11], by incorporating the direct-sum-product decomposition of the Hilbert space. Applications of our result to quantum communication tasks and black hole information paradox are provided in Refs. [21,22,23] and [24], respectively. A future direction is to apply the result to various scenarios that have been analyzed in terms of the decoupling theorem, such as relative thermalization [10] and area laws [8] in the foundation of statistical mechanics.

References

Hayden, P., Horodecki, M., Winter, A., Yard, J.: A decoupling approach to the quantum capacity. Open. Syst. Inf. Dyn. 15, 7 (2008)
Article MathSciNet Google Scholar
Abeyesinghe, A., Devetak, I., Hayden, P., Winter, A.: The mother of all protocols: Restructuring quantum information’s family tree. Proc. R. Soc. A 465, 2537 (2009)
Article ADS MathSciNet Google Scholar
Horodecki, M., Oppenheim, J., Winter, A.: Partial quantum information. Nature 436, 673–676 (2005)
Article ADS Google Scholar
Horodecki, M., Oppenheim, J., Winter, A.: Quantum state merging and negative information. Commun. Math. Phys. 269, 107–136 (2007)
Article ADS MathSciNet Google Scholar
Groisman, B., Popescu, S., Winter, A.: Quantum, classical, and total amount of correlations in a quantum state. Phys. Rev. A 72(3), 032317 (2005)
Article ADS MathSciNet Google Scholar
Berta, M., Brandão, F.G.S.L., Majenz, C., Wilde, M.M.: Conditional decoupling of quantum information. Phys. Rev. Lett. 121(4), 040504 (2018)
Article ADS MathSciNet Google Scholar
Hayden, P., Preskill, J.: Black holes as mirrors: quantum information in random subsystems. J. High Energy Phys. 2007(09), 120 (2007)
Article MathSciNet Google Scholar
Brandão, F.G.S.L., Horodecki, M.: Exponential decay of correlations implies area law. Commun. Math. Phys. 333(2), 761–798 (2015)
Article ADS MathSciNet Google Scholar
del Rio, L., Aberg, J., Renner, R., Dahlsten, O., Vedral, V.: The thermodynamic meaning of negative entropy. Nature 474(7349), 61–63 (2011)
Article Google Scholar
del Rio, L., Hutter, A., Renner, R., Wehner, S.: Relative thermalization. Phys. Rev. E 94(2), 022104 (2016)
ADS Google Scholar
Dupuis, F., Berta, M., Wullschleger, J., Renner, R.: One-shot decoupling. Commun. Math. Phys. 328, 251 (2014)
Article ADS MathSciNet Google Scholar
Uhlmann, A.: The “transition probability” in the state space of a $c^*$-algebra. Rep. Math. Phys. 9(2), 273–279 (1976)
Devetak, I., Shor, P.W.: The capacity of a quantum channel for simultaneous transmission of classical and quantum information. Commun. Math. Phys. 256(2), 287–303 (2005)
Article ADS MathSciNet Google Scholar
Kohout, R.B., Ng, H.K., Poulin, D., Viola, L.: Information-preserving structures: a general framework for quantum zero-error information. Phys. Rev. A 82, 062306 (2010)
Article ADS Google Scholar
Kohout, R.B., Ng, H.K., Poulin, D., Viola, L.: Characterizing the structure of preserved information in quantum processes. Phys. Rev. Lett. 100, 030501 (2008)
Article Google Scholar
Koashi, M., Imoto, N.: Operations that do not disturb partially known quantum states. Phys. Rev. A 66, 022318 (2002)
Article ADS MathSciNet Google Scholar
Koashi, M., Imoto, N.: Compressibility of quantum mixed-state signals. Phys. Rev. Lett. 87, 017902 (2001)
Article ADS Google Scholar
Hayden, P., Jozsa, R., Petz, D., Winter, A.: Structure of states which satisfy strong subadditivity of quantum entropy with equality. Commun. Math. Phys. 246, 359–374 (2004)
Article ADS MathSciNet Google Scholar
Wakakuwa, E., Soeda, A., Murao, M.: Markovianizing cost of tripartite quantum states. IEEE Trans. Inf. Theory 63(2), 1280–1298 (2017)
Article MathSciNet Google Scholar
Bartlett, S.D., Rudolph, T., Spekkens, R.W.: Reference frames, superselection rules, and quantum information. Rev. Mod. Phys. 79, 555 (2007)
Article ADS MathSciNet Google Scholar
Wakakuwa, E., Nakata, Y.: Randomized partial decoupling unifies one-shot quantum channel capacities. arXiv:2004.12593 (2020)
Nakata, Y., Wakakuwa, E., Yamasaki, H.: One-shot quantum error correction of classical and quantum information: towards demonstration of quantum channel coding. arXiv:2011.00668 (2020)
Wakakuwa, E., Nakata, Y., Hsieh, M.-H.: One-shot hybrid state redistribution. arXiv:2006.12059 (2020)
Nakata, Y., Wakakuwa, E., Koashi, M.: Black holes as clouded mirrors: the Hayden–Preskill protocol with symmetry. arXiv:2007.00895 (2020)
Tomamichel, M., Colbeck, R., Renner, R.: Duality between smooth min-and max-entropies. IEEE Trans. Inf. Theory 56(9), 4674–4681 (2010)
Article MathSciNet Google Scholar
Tomamichel, M.: Quantum Information Processing with Finite Resources. Springer Briefs in Mathematical Physics (2016)
Jamiołkowski, A.: Linear transformations which preserve trace and positive semidefiniteness of operators. Rep. Math. Phys. 3, 275 (1972)
Article ADS MathSciNet Google Scholar
Choi, M.D.: Completely positive linear maps on complex matrices. Linear Algebra Appl. 10, 285 (1975)
Article MathSciNet Google Scholar
Sekino, Y., Susskind, L.: Fast scramblers. J. High Energy Phys. 2008(10), 065 (2008)
Article MathSciNet Google Scholar
Lashkari, N., Stanford, D., Hastings, M., Osborne, T., Hayden, P.: Towards the fast scrambling conjecture. J. High Energy Phys. 4, 2013 (2013)
MathSciNet MATH Google Scholar
Dupuis, F., Szehr, O., Tomamichel, M.: A decoupling approach to classical data transmission over quantum channels. IEEE Trans. Inf. Theory 60(3), 1562–1572 (2014)
Article MathSciNet Google Scholar
Tomamichel, M., Colbeck, R., Renner, R.: A fully quantum asymptotic equipartition property. IEEE Trans. Inf. Theory 55(12), 5840–5847 (2009)
Article MathSciNet Google Scholar
DiVincenzo, D.P., Leung, D.W., Terhal, B.M.: Quantum data hiding. IEEE Trans. Inf. Theory 48, 580 (2002)
Article MathSciNet Google Scholar
Dankert, C., Cleve, R., Emerson, J., Livine, E.: Exact and approximate unitary 2-designs and their application to fidelity estimation. Phys. Rev. A 80, 012304 (2009)
Article ADS Google Scholar
Gross, D., Audenaert, K., Eisert, J.: Evenly distributed unitaries: on the structure of unitary designs. J. Math. Phys. 48(5), 052104 (2007)
Article ADS MathSciNet Google Scholar
Brown, W.G., Weinstein, Y.S., Viola, L.: Quantum pseudorandomness from cluster-state quantum computation. Phys. Rev. A 77(4), 040303(R) (2008)
Article ADS Google Scholar
Weinstein, Y.S., Brown, W.G., Viola, L.: Parameters of pseudorandom quantum circuits. Phys. Rev. A 78(5), 052332 (2008)
Article ADS Google Scholar
Harrow, A.W., Low, R.A.: Random quantum circuits are approximate 2-designs. Commun. Math. Phys. 291, 257 (2009)
Article ADS MathSciNet Google Scholar
Diniz, I.T., Jonathan, D.: Comment on “Random quantum circuits are approximate 2-designs.” Commun. Math. Phys. 304, 281 (2011)
Cleve, R., Leung, D., Liu, L., Wang, C.: Near-linear constructions of exact unitary 2-designs. Quantum Inf. Comput. 16(9 & 10), 0721–0756 (2016)
MathSciNet Google Scholar
Nakata, Y., Hirche, C., Morgan, C., Winter, A.: Unitary $2$-designs from random $X$- and $Z$-diagonal unitaries. arXiv:1502.07514 (2015)
Brown, W., Fawzi, O.: Decoupling with random quantum circuits. Commun. Math. Phys. 340, 867 (2015)
Article ADS MathSciNet Google Scholar
Nakata, Y., Hirche, C., Morgan, C., Winter, A.: Decoupling with random diagonal unitaries. arXiv:1509.05155 (2015)
Nakata, Y., Hirche, C., Koashi, M., Winter, A.: Efficient quantum pseudorandomness with nearly time-independent Hamiltonian dynamics. Phys. Rev. X 7(2), 021006 (2017)
Google Scholar
Wakakuwa, E., Soeda, A., Murao, M.: A coding theorem for bipartite unitaries in distributed quantum computation. IEEE Trans. Inf. Theory 63(8), 5372–5403 (2017)
Article MathSciNet Google Scholar
Goodman, R., Wallach, N.R.: Representations and Invariants of the Classical Groups. Cambridge University Press, Cambridge (1999)
MATH Google Scholar
Fumio, H.: Matrix analysis: matrix monotone functions, matrix means, and majorization. Int. Inf. Sci. 16(2), 139–248 (2010)
MathSciNet MATH Google Scholar
Tomamichel, M.: A framework for non-asymptotic quantum information theory. PhD thesis, ETH Zurich (2012). arXiv:1203.2142

Download references

Acknowledgements

This work was supported by JST CREST, Grant Number JPMJCR1671 as well as by JST, PRESTO Grant Number JPMJPR1865, Japan, and by JSPS KAKENHI, Grant Number 18J01329.

Author information

Authors and Affiliations

Department of Communication Engineering and Informatics, Graduate School of Informatics and Engineering, The University of Electro-Communications, Tokyo, 182-8585, Japan
Eyuri Wakakuwa
Photon Science Center, Graduate School of Engineering, The University of Tokyo, Bunkyo-ku, Tokyo, 113-8656, Japan
Yoshifumi Nakata
Yukawa Institute for Theoretical Physics, Kyoto university, Kitashirakawa Oiwakecho, Sakyo-ku, Kyoto, 606-8502, Japan
Yoshifumi Nakata
JST, PRESTO, 4-1-8 Honcho, Kawaguchi, Saitama, 332-0012, Japan
Yoshifumi Nakata

Authors

Eyuri Wakakuwa
View author publications
You can also search for this author in PubMed Google Scholar
Yoshifumi Nakata
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Eyuri Wakakuwa.

Additional information

Communicated by H.-T. Yau.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Proof of the twisted twirling

We here provide the proof of the twisted twirling (Lemma 9). The statement is as follows: let ${\mathcal {H}}_j^{A_r}$ be a subspace of ${\mathcal {H}}^{A_r}$ of dimension $r_j$, and $\Pi _j^{A_r}$ be the projector onto ${\mathcal {H}}_j^{A_r}\subset {\mathcal {H}}^{A_r}$ for $j=1,\ldots ,J$. Let ${\mathbb {I}}^{A_rA_r'}$ be $I^{A_r} \otimes I^{A_r'}$, and ${\mathbb {F}}^{A_rA_r'} \in {\mathcal {L}}({\mathcal {H}}^{A_rA_r'})$ be the swap operator defined by $\sum _{a,b} |a\rangle \langle b|^{A_r} \otimes |b\rangle \langle a|^{A_r'}$ for any orthonormal basis $\{ {|a\rangle } \}$ in ${\mathcal {H}}^{A_r}$ and ${\mathcal {H}}^{A_r'}$. Further, let ${\mathbb {I}}_{jk}^{A_rA_r'}$ and ${\mathbb {F}}_{jk}^{A_rA_r'}$ be $\Pi _j^{A_r} \otimes \Pi _k^{A_r'}$ and $( \Pi _j^{A_r} \otimes \Pi _k^{A_r'}){\mathbb {F}}^{A_rA_r'}$, respectively. For any $M^{A_rA_r'BB'}\in {\mathcal {L}}({\mathcal {H}}^{A_rA_r'BB'})$, define

$$\begin{aligned} M^{BB'}_{{\mathbb {I}},jk}:={\mathrm {Tr}}_{A_rA_r'}[{\mathbb {I}}_{jk}^{A_rA_r'}M^{A_rA_r'BB'}], \quad M^{BB'}_{{\mathbb {F}},kj}:={\mathrm {Tr}}_{A_rA_r'}[{\mathbb {F}}_{kj}^{A_rA_r'}M^{A_rA_r'BB'}]. \end{aligned}$$

(A1)

Then, it holds that, for $j \ne k$,

$$\begin{aligned}&{\mathbb {E}}_{U_j \sim \mathsf{H}_j,U_k \sim \mathsf{H}_k} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_j^{A_r} \otimes U_k^{A_r'})^{\dagger } \bigr ] = \frac{{\mathbb {I}}_{jk}^{A_rA_r'}}{r_jr_k} \otimes M_{{\mathbb {I}},jk}^{BB'}, \end{aligned}$$

(A2)

$$\begin{aligned}&{\mathbb {E}}_{U_j \sim \mathsf{H}_j,U_k \sim \mathsf{H}_k} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_k^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ] = \frac{{\mathbb {F}}_{jk}^{A_rA_r'}}{r_jr_k} \otimes M^{BB'}_{{\mathbb {F}},kj}. \end{aligned}$$

(A3)

Moreover,

$$\begin{aligned}&{\mathbb {E}}_{U_j \sim \mathsf{H}_j} \bigl [ (U_j^{A_r} \otimes U_j^{A_r'}) M^{A_rA_r'BB'} (U_j^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ]\nonumber \\&\quad =\frac{1}{r_j (r_j^2-1)} \left[ (r_j{\mathbb {I}}_{jj}^{A_rA_r'}- {\mathbb {F}}_{jj}^{A_rA_r'})\otimes M_{{\mathbb {I}},jj}^{BB'} + (r_j{\mathbb {F}}_{jj}^{A_rA_r'} -{\mathbb {I}}_{jj}^{A_rA_r'})\otimes M^{BB'}_{{\mathbb {F}},jj} \right] .\nonumber \\ \end{aligned}$$

(A4)

Otherwise, ${\mathbb {E}}_{U_j,U_k,U_m,U_n} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_m^{A_r} \otimes U_n^{A_r'})^{\dagger } \bigr ]=0$.

Proof

The equation ${\mathbb {E}}_{U_j,U_k,U_m,U_n} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_m^{A_r} \otimes U_n^{A_r'})^{\dagger } \bigr ]=0$ for $i \ne j \ne k \ne l$ trivially follows from the fact that the random unitaries $\{ U_j \}_j$ are independent and that ${\mathbb {E}}_{U_j \sim \mathsf{H}_j}[U_j]=0$.

Let us consider the case where $j \ne k$ and prove Eqs. (A2) and (A3). Note that any $X^{A_rB}\in {\mathcal {L}}({\mathcal {H}}^{A_rB})$ is decomposed into $X^{A_rB}=\sum _{p,q}X_p^{A_r}\otimes X_q^B$, where $X_p^{A_r} \in {\mathcal {L}}({\mathcal {H}}^{A_r})$ and $X_q^B \in {\mathcal {L}}({\mathcal {H}}^B)$. Using the fact that

$$\begin{aligned} {\mathbb {E}}_{U_j \sim \mathsf{H}_j}[U_j^{A_r} X_p^{A_r} U_j^{A \dagger }] = \frac{{\mathrm {Tr}}[\Pi _j^{A_r}X_p^{A_r}]}{r_j} \Pi _j^{A_r} \end{aligned}$$

(A5)

for any $X_p^{A_r}\in {\mathcal {L}}({\mathcal {H}}^{A_r})$, which follows from the Schur–Weyl duality [46], we have

$$\begin{aligned} {\mathbb {E}}_{U_j \sim \mathsf{H}_j}[ U_j^{A_r} X^{A_rB} U_j^{A \dagger }]&= \sum _{p,q} {\mathbb {E}}_{U_j \sim \mathsf{H}_j}[ U_j^{A_r} X_p^{A_r} U_j^{A \dagger } ] \otimes X_q^B \nonumber \\&= \frac{\Pi _j^{A_r}}{r_j} \otimes \sum _{p,q} {\mathrm {Tr}}[\Pi _j^{A_r}X_p^{A_r}] X_q^B \nonumber \\&= \frac{\Pi _j^{A_r}}{r_j} \otimes {\mathrm {Tr}}_{A_r}[\Pi _j^{A_r}X^{A_rB}]. \end{aligned}$$

(A6)

Using this equality twice for j and k, we obtain Eq. (A2). It also leads to Eq. (A3) as follows:

$$\begin{aligned}&{\mathbb {E}}_{U_j,U_k } \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} (U_k^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ] \nonumber \\&\quad = {\mathbb {E}}_{U_j,U_k} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} {\mathbb {F}}^{A_rA_r'}(U_j^{A_r} \otimes U_k^{A_r'})^{\dagger } \bigr ] {\mathbb {F}}^{A_rA_r'} \nonumber \\&\quad = {\mathbb {E}}_{U_j,U_k} \bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) M^{A_rA_r'BB'} {\mathbb {F}}_{kj}^{A_rA_r'}(U_j^{A_r} \otimes U_k^{A_r'})^{\dagger } \bigr ] {\mathbb {F}}_{jk}^{A_rA_r'} \nonumber \\&\quad = \frac{{\mathbb {F}}_{jk}^{A_rA_r'}}{r_jr_k} \otimes {\mathrm {Tr}}_{A_rA_r'} [{\mathbb {I}}_{jk}^{A_rA_r'} M^{A_rA_r'BB'} {\mathbb {F}}_{kj}^{A_rA_r'} ]\nonumber \\&\quad = \frac{{\mathbb {F}}_{jk}^{A_rA_r'}}{r_jr_k} \otimes {\mathbb {M}}_{{\mathbb {F}},kj}^{BB'}. \end{aligned}$$

(A7)

Here, we have used relations

$$\begin{aligned} {\mathbb {F}}_{kj}^{A_rA_r'}&= (\Pi _k^{A_r}\otimes \Pi _j^{A_r'}){\mathbb {F}}^{A_rA_r'} = {\mathbb {F}}^{A_rA_r'}(\Pi _j^{A_r}\otimes \Pi _k^{A_r'}), \\ {\mathbb {F}}_{kj}^{A_rA_r'}{\mathbb {I}}_{jk}^{A_rA_r'}&= {\mathbb {F}}_{kj}^{A_rA_r'}(\Pi _j^{A_r}\otimes \Pi _k^{A_r'}) = {\mathbb {F}}_{kj}^{A_rA_r'}, \end{aligned}$$

and used Eq. (A2) in the last line.

We finally show Eq. (A4). Consider the operator ${\mathbb {E}}_{U_j \sim \mathsf{H}_j} \bigl [ (U_j^{A_r} \otimes U_j^{A_r'}) {{|p\rangle }\!{\langle q|}}^{A_r} \otimes {{|s\rangle }\!{\langle t|}}^{A_r'} (U_j^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ]$. Since this commutes with $V^{\otimes 2}$ ($\forall V \in {\mathbb {U}}(r_j)$), we obtain from the Schur–Weyl duality [46] that

$$\begin{aligned} {\mathbb {E}}_{U_j \sim \mathsf{H}_j} \bigl [ (U_j^{A_r} \otimes U_j^{A_r'}) {{|p\rangle }\!{\langle q|}}^{A_r} \otimes {{|s\rangle }\!{\langle t|}}^{A_r'} (U_j^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ] = \alpha _{pqst} {\mathbb {I}}_{jj}^{A_rA_r'} + \beta _{pqst} {\mathbb {F}}_{jj}^{A_rA_r'}, \end{aligned}$$

(A8)

where $\alpha _{pqst}$ and $\beta _{pqst}$ are determined by

$$\begin{aligned} \delta _{pq} \delta _{st} = \alpha _{pqst} r_j^2 + \beta _{pqst} r_j, \quad \delta _{pt} \delta _{qs} = \alpha _{pqst} r_j + \beta _{pqst} r_j^2. \end{aligned}$$

(A9)

Note that the first equation is obtained by taking the trace of Eq. (A8), and the second is by calculating the expectation of ${\mathbb {F}}^{A_rA_r'}$ by both sides in Eq. (A8). Solving these equalities, we obtain

$$\begin{aligned}&{\mathbb {E}}_{U_j \sim \mathsf{H}_j} \bigl [ (U_j^{A_r} \otimes U_j^{A_r'}) {{|p\rangle }\!{\langle q|}}^{A_r} \otimes {{|s\rangle }\!{\langle t|}}^{A_r'} (U_j^{A_r} \otimes U_j^{A_r'})^{\dagger } \bigr ] \nonumber \\&\quad = \frac{1}{r_j(r_j^2-1)} \left( (\delta _{pq} \delta _{st} r_j - \delta _{pt}\delta _{qs} ) {\mathbb {I}}_{jj}^{A_rA_r'} + (\delta _{pt} \delta _{qs} r_j - \delta _{pq}\delta _{st} ) {\mathbb {F}}_{jj}^{A_rA_r'} \right) , \end{aligned}$$

(A10)

from which the equation (A4) is obtained after a straightforward calculation. $\square $

Appendix B: Proof of Lemma 10

We prove Lemma 10 based on the twisted twirling (Lemma 9) and the swap trick, a commonly used method in the context of decoupling given as follows:

Lemma 39

(Swap trick (see e.g. [11])) Let $X^A$ and $Y^A$ be linear operators on ${\mathcal {H}}^A$, and ${\mathbb {F}}^{AA'}$ be the swap operator between ${\mathcal {H}}^A$ and ${\mathcal {H}}^{A'}$ defined by $\sum _{i,j} {{|i\rangle }\!{\langle j|}}^A \otimes {{|j\rangle }\!{\langle i|}}^{A'}$, where $\{ {|i\rangle }\}$ is any basis of ${\mathcal {H}}^A$ and ${\mathcal {H}}^{A'} \cong {\mathcal {H}}^{A}$. Then, ${\mathrm {Tr}}[X^AY^A] = {\mathrm {Tr}}[(X^A \otimes Y^{A'}) {\mathbb {F}}^{AA'}]$.

For simplicity of notations in the proof, we embed a Hilbert space that has the DSP form to the tensor product of three Hilbert spaces. We explain the notation for this embedding in Subsection B1 and then show Lemma 10 in Subsection B2.

1.1 1. Embedding of the Hilbert space

Let A be a quantum system described by a finite dimensional Hilbert space ${\mathcal {H}}^A$, which is decomposed in the form of

$$\begin{aligned} {\mathcal {H}}^A=\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}. \end{aligned}$$

(B1)

The dimension of each subspace is denoted by $l_j:=\dim {\mathcal {H}}_j^{A_l}$, $r_j:=\dim {\mathcal {H}}_j^{A_r}$. Let ${\mathcal {H}}^{A_c}$, ${\mathcal {H}}^{A_l}$ and ${\mathcal {H}}^{A_r}$ be Hilbert spaces such that

$$\begin{aligned} \dim {\mathcal {H}}^{A_c}=J,\;\dim {\mathcal {H}}^{A_l}=\max _{1\le j\le J}l_j,\;{\mathcal {H}}^{A_r}=\max _{1\le j\le J}r_j, \end{aligned}$$

(B2)

and fix linear isometries $W_{j}^{A_l}:{{\mathcal {H}}}_j^{A_l} \rightarrow {{\mathcal {H}}}^{A_l}$, $W _{j}^{A_r}:{{\mathcal {H}}}_j^{A_r}\rightarrow {{\mathcal {H}}}^{A_r}$ for each j. We introduce the following linear isometry, by which the Hilbert space ${\mathcal {H}}^A$ is embedded into ${\mathcal {H}}^{A_c} \otimes {\mathcal {H}}^{A_l} \otimes {\mathcal {H}}^{A_r}$:

$$\begin{aligned} W^{A \rightarrow A_cA_lA_r}:=\sum _{j=1}^J {|j\rangle }^{A_c} \otimes ( W _{j}^{A_l}\otimes W _{j}^{A_r})\Pi _j. \end{aligned}$$

(B3)

Here, $\Pi _j$ is the projection onto a subspace ${{\mathcal {H}}}_j^{A_l}\otimes {{\mathcal {H}}}_j^{A_r}\subset {{\mathcal {H}}}^A$, and $\{{|j\rangle }\}_{j=1}^J$ is a fixed orthonormal basis of ${\mathcal {H}}^{A_c}$. The W is indeed an isometry, because

$$\begin{aligned} (W^{A \rightarrow A_cA_lA_r})^{\dagger } W^{A \rightarrow A_cA_lA_r} =I^A. \end{aligned}$$

(B4)

Noting that ${\mathcal {H}}_j^{A_l}=\mathrm{img} W _{j}^{A_l} \subset {{\mathcal {H}}}^{A_l}$ and ${\mathcal {H}}_j^{A_r}=\mathrm{img} W _{j}^{A_r}\subset {{\mathcal {H}}}^{A_r}$, we have

$$\begin{aligned} \mathrm{img} (W^{A \rightarrow A_cA_lA_r}) =\bigoplus _{j=1}^J{{\mathcal {H}}}_j^{A_c}\otimes {{\mathcal {H}}}_j^{A_l} \otimes {{\mathcal {H}}}_j^{A_r} \subset {\mathcal {H}}^{A_c} \otimes {\mathcal {H}}^{A_l} \otimes {\mathcal {H}}^{A_r}, \end{aligned}$$

(B5)

where ${{\mathcal {H}}}_j^{A_c}\subset {{\mathcal {H}}}^{A_c}$ is a one-dimensional subspace spanned by ${|j\rangle }$ for each j. Denoting the projection onto ${{\mathcal {H}}}_j^{A_l}\subset {{\mathcal {H}}}^{A_l}$ by $\Pi _j^{A_l}\in {\mathcal {L}}({\mathcal {H}}^{A_l})$ and one onto ${{\mathcal {H}}}_j^{A_r}\subset {{\mathcal {H}}}^{A_r}$ by $\Pi _j^{A_r}\in {\mathcal {L}}({\mathcal {H}}^{A_r})$, we also have

$$\begin{aligned} W _{j}^{A_l}(W _{j}^{A_l})^\dagger =\Pi _j^{A_l}, \quad W _{j}^{A_r}(W _{j}^{A_r})^\dagger =\Pi _j^{A_r} \end{aligned}$$

(B6)

and thus

$$\begin{aligned} (W^{A \rightarrow A_cA_lA_r}) (W^{A \rightarrow A_cA_lA_r})^\dagger =\sum _{j=1}^J{{|j\rangle }\!{\langle j|}}^{A_c} \otimes \Pi _j^{A_c} \otimes \Pi _j^{A_r}. \end{aligned}$$

(B7)

Let R be another quantum system represented by a finite dimensional Hilbert space ${\mathcal {H}}^R$. Any $X^{A R} \in {\mathcal {L}}({\mathcal {H}}^{AR})$ is decomposed by $ W ^{A \rightarrow A_cA_lA_r}$ in the form of

$$\begin{aligned} {\mathcal {W}}^{A \rightarrow A_cA_lA_r} (X^{AR})=\sum _{j,k\in {\mathcal {J}}}{{|j\rangle }\!{\langle k|}}^{A_c}\otimes {\tilde{X}}_{jk}^{A_l A_r R}, \end{aligned}$$

(B8)

where

$$\begin{aligned} {\tilde{X}}_{jk}^{A_l A_r R}&:={\langle j|}^{A_c} {\mathcal {W}}^{A \rightarrow A_cA_lA_r}(X^{AR}) {|k\rangle }^{A_c} =(W ^{A_l}_j \otimes W ^{A_r}_j) \Pi _j X^{AR} \Pi _k ( W ^{A_l}_k \otimes W ^{A_r}_k)^{\dagger }. \end{aligned}$$

(B9)

Conversely, any $Y^{A_cA_lA_r} \in {\mathcal {L}}({\mathcal {H}}^{A_c}\otimes {\mathcal {H}}^{A_l}\otimes {\mathcal {H}}^{A_r})$ such that $\mathrm{supp}(Y^{A_c A_l A_r}) \subset \mathrm{img} ( W ^{A \rightarrow A_c A_l A_r})$, is mapped to $({\mathcal {W}}^{A \rightarrow A_c A_l A_r})^\dagger (Y^{A_c A_l A_r} ) \in {\mathcal {L}}({\mathcal {H}}^A)$. Note that ${\tilde{X}}_{jk}$ is related to $X_{jk}$ defined by (10) as ${{|j\rangle }\!{\langle k|}}^{A_c}\otimes {\tilde{X}}_{jk}^{A_l A_r R}= {\mathcal {W}}^{A \rightarrow A_cA_lA_r} (X_{jk}^{AR})$. In the following, we denote ${\tilde{X}}_{jk}^{A_l A_r R}$ by $X_{jk}^{A_l A_r R}$ for simplicity of notations.

Let $A'$ be a quantum system such that ${\mathcal {H}}^A\cong {\mathcal {H}}^{A'}$. It is straightforward to verify that the fixed maximally entangled state $|\Phi \rangle $ defined by (26) is decomposed by W as

$$\begin{aligned} (W^{A\rightarrow A_cA_lA_r}\otimes W^{A'\rightarrow A_c'A_l'A_r'})|\Phi \rangle ^{AA'} =\sum _{j=1}^J\sqrt{\frac{l_jr_j}{d_A}}{|j\rangle }^{A_c}{|j\rangle }^{A_c'}|\Phi _j^l\rangle ^{A_lA_l'}|\Phi _j^r\rangle ^{A_rA_r'}, \end{aligned}$$

(B10)

where $|\Phi _j^l\rangle \in {\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_l'}$ and $|\Phi _j^r\rangle \in {\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}_j^{A_r'}$ are fixed maximally entangled states of rank $l_j$ and $r_j$, respectively.

1.2 2. Proof of Lemma 10

We now prove Lemma 10. The statement is given as follows: for any $\varsigma ^{ER} \in {\mathcal {S}}_=({\mathcal {H}}^{ER})$ and any $X\in \mathrm{Her}({\mathcal {H}}^{AR})$ such that $X_{jj}^{A_lR}=0$, the following inequality holds for any possible permutation $\sigma \in {\mathbb {P}}$:

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }} \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_{\sigma ^{-1}}^A \circ {\mathcal {U}}^A ( X^{AR} )\right\| _{2,\varsigma ^{ER}}^2\right] \nonumber \\&\quad \le \sum _{j,k=1}^J \frac{d_A^2}{r_j r_k} \left\| {\mathrm {Tr}}_{A_l}\left[ X_{\sigma (j)\sigma (k)}^{A_l^T A_r R}\tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| ^2_{2, \varsigma ^{ER}} . \end{aligned}$$

(B11)

Here, $A_l^T$ denotes the transposition of $A_l$ with respect to the Schmidt basis of the fixed maximally entangled state used to define the Choi–Jamiołkowski representation $\tau ^{AE}$ of ${\mathcal {T}}^{A \rightarrow E}$.

Proof

Introducing a notation ${\mathbb {F}}^{RE,R'E'}_{\varsigma }:= ( (\varsigma ^{ER})^{\otimes 2})^{-1/4}({\mathbb {F}}^{RR'} \otimes {\mathbb {F}}^{EE'}) ( (\varsigma ^{ER})^{\otimes 2} )^{-1/4}$, we have

$$\begin{aligned}&\bigl |\!\bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^{\dagger A} \circ {\mathcal {U}}^{\dagger A} (X^{AR} ) \bigr |\!\bigr |_{2,\varsigma ^{ER}}^2 \nonumber \\&\quad = {\mathrm {Tr}}\biggl [ \biggl ( (\varsigma ^{ER})^{-\frac{1}{4}} {\mathcal {T}}^{A \rightarrow E}\! \circ {\mathcal {G}}_\sigma ^{\dagger A} \!\circ {\mathcal {U}}^{\dagger A} ( X^{AR} ) (\varsigma ^{ER})^{-\frac{1}{4}} \biggr )^2 \biggr ] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \biggl ( (\varsigma ^{ER})^{-\frac{1}{4}} {\mathcal {T}}^{A \rightarrow E} \!\circ {\mathcal {G}}_\sigma ^{\dagger A} \!\circ {\mathcal {U}}^{\dagger A} (X^{AR} ) (\varsigma ^{ER})^{-\frac{1}{4}} \biggr )^{\otimes 2} \right. \bigl ( {\mathbb {F}}^{RR'} \otimes {\mathbb {F}}^{EE'} \bigr ) \biggr ] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \left( {\mathcal {T}}^{A \rightarrow E} \!\circ {\mathcal {G}}_\sigma ^{\dagger A} \!\circ {\mathcal {U}}^{\dagger A} (X^{AR} ) \right) ^{\otimes 2} {\mathbb {F}}^{RE,R'E'}_{\varsigma } \right] \nonumber \\&\quad = {\mathrm {Tr}}\bigl [ \bigl ( X^{AR} \bigr )^{\otimes 2} \bigl [ ( {\mathcal {G}}_\sigma ^A\circ {\mathcal {U}}^A \circ {\mathcal {T}}^{* E \rightarrow A })^{\otimes 2}({\mathbb {F}}^{RE,R'E'}_{\varsigma })\bigr ] \bigr ]. \end{aligned}$$

(B12)

Thus, using the fact that ${\mathcal {G}}_{\sigma ^{-1}}={\mathcal {G}}_{\sigma }^\dagger $ and that ${\mathbb {E}}_{U \sim \mathsf{H}_{\times }}[f(U)]={\mathbb {E}}_{U \sim \mathsf{H}_{\times }}[f(U^\dagger )]$ for any function f, we have

$$\begin{aligned}&{\mathbb {E}}_{U } \left[ \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_{\sigma ^{-1}}^A \circ {\mathcal {U}}^A ( X^{AR} )\right\| _{2,\varsigma ^{ER}}^2\right] \nonumber \\&\quad ={\mathbb {E}}_{U} \biggr [\bigl |\!\bigl | {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^{\dagger A} \circ {\mathcal {U}}^{\dagger A} (X^{AR} ) \bigr |\!\bigr |_{2,\varsigma ^{ER}}^2 \biggl ] \nonumber \\&\quad = {\mathbb {E}}_{U } {\mathrm {Tr}}\bigl [ \bigl ( X^{AR} \bigr )^{\otimes 2} \bigl [ ( {\mathcal {G}}_\sigma ^A\circ {\mathcal {U}}^A \circ {\mathcal {T}}^{* E \rightarrow A })^{\otimes 2}({\mathbb {F}}^{RE,R'E'}_{\varsigma })\bigr ] \bigr ]\nonumber \\&\quad = {\mathrm {Tr}}\bigl [ \bigl ( X^{AR} \bigr )^{\otimes 2} {\mathbb {E}}_{U }\bigl [ ( {\mathcal {G}}_\sigma ^A\circ {\mathcal {U}}^A \circ {\mathcal {T}}^{* E \rightarrow A })^{\otimes 2}({\mathbb {F}}^{RE,R'E'}_{\varsigma })\bigr ] \bigr ] \nonumber \\&\quad ={\mathrm {Tr}}[ (X^{AR})^{\otimes 2} \Xi _\sigma ^{AA'RR'}], \end{aligned}$$

(B13)

where we have defined $\Xi _\sigma ^{AA'RR'}:= {\mathbb {E}}_{U \sim \mathsf{H}_{\times }} [ ( {\mathcal {G}}_\sigma ^A\circ {\mathcal {U}}^A \circ {\mathcal {T}}^{* E \rightarrow A })^{\otimes 2}({\mathbb {F}}^{RE,R'E'}_{\varsigma }) ]$.

We first embed the operator $\Xi _\sigma ^{AA'RR'}$ into the space $A_c A_l A_r R$ and $A'_c A'_l A'_r R'$. We introduce the following notations for the embedded map and the embedded operators:

$$\begin{aligned}&{\mathcal {T}}^{A_cA_lA_r \rightarrow E} : ={\mathcal {T}}^{A \rightarrow E} \circ ({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\dagger }, \quad \tau ^{A_cA_lA_rE} : ={\mathcal {W}}^{A \rightarrow A_cA_lA_r} (\tau ^{AE}), \end{aligned}$$

(B14)

$$\begin{aligned}&\Upsilon _\varsigma := ({\mathcal {T}}^{* E\rightarrow A_cA_lA_r})^{\otimes 2}({\mathbb {F}}^{RE,R'E'}_{\varsigma }), \nonumber \\&\Upsilon _{\varsigma ,jkmn}^{A_lA_rR A_l'A_r'R'}:=({\langle j|}^{A_c}\otimes {\langle k|}^{A_c'})\Upsilon _\varsigma ({|m\rangle }^{A_c}\otimes {|n\rangle }^{A_c'}). \end{aligned}$$

(B15)

Using these notations, the operator $\Xi _\sigma ^{AA'RR'}$ is embedded to be

$$\begin{aligned}&({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\otimes 2}(\Xi _\sigma ^{AA'RR'})\\&\quad = \sum _{j,k,m,n =1}^J \bigl [ {{|\sigma (j)\rangle }\!{\langle \sigma (m)|}}^{A_c}\otimes {{|\sigma (k)\rangle }\!{\langle \sigma (n)|}}^{A_c'}\bigr ] \\&\qquad \otimes {\mathbb {E}}_{U \sim \mathsf{H}_{\times }}\bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) \Upsilon _{\varsigma ,jkmn}^{A_lA_rR A_l'A_r'R'} (U_m^{\dagger A_r} \otimes U_n^{\dagger A_r'}) \bigr ]. \end{aligned}$$

Due to Lemma 9, the terms in the summation remain non-zero only in the following three cases: (i) $J\ge 2$ and $(j,k)=(m,n)$ ($j \ne k$), (ii) $J\ge 2$ and $(j,k)=(n,m)$ ($j \ne k$), and (iii) $j=k=m=n$. In the following, we assume that $J\ge 2$, and separately investigate the three cases using Lemma 9. Our concern is then $\Xi _{\sigma ,\mathrm{(i)}}$, $\Xi _{\sigma ,\mathrm{(ii)}}$ and $\Xi _{\sigma ,\mathrm{(iii)}}$ such that

$$\begin{aligned}&({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\otimes 2}(\Xi _{\sigma ,\mathrm{(i)}}) \nonumber \\&\quad = \sum _{j,k =1}^J \bigl [ {{|\sigma (j)\rangle }\!{\langle \sigma (j)|}}^{A_c}\otimes {{|\sigma (k)\rangle }\!{\langle \sigma (k)|}}^{A_c'}\bigr ]\nonumber \\&\qquad \otimes {\mathbb {E}}_{U }\bigl [ (U_j^{A_r} \otimes U_k^{A_r'}) \Upsilon _{\varsigma ,jkjk}^{A_lA_rR A_l'A_r'R'} (U_j^{\dagger A_r} \otimes U_k^{\dagger A_r'}) \bigr ], \end{aligned}$$

(B16)

$$\begin{aligned}&({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\otimes 2}(\Xi _{\sigma ,\mathrm{(ii)}}) \nonumber \\&\quad = \sum _{j,k =1}^J \bigl [ {{|\sigma (j)\rangle }\!{\langle \sigma (k)|}}^{A_c}\otimes {{|\sigma (k)\rangle }\!{\langle \sigma (j)|}}^{A_c'}\bigr ]\nonumber \\&\qquad \otimes {\mathbb {E}}_{U }\bigl [ ( U_j^{A_r} \otimes U_k^{A_r'}) \Upsilon _{\varsigma ,jkkj}^{A_lA_rR A_l'A_r'R'} (U_k^{\dagger A_r} \otimes U_j^{\dagger A_r'}) \bigr ], \end{aligned}$$

(B17)

$$\begin{aligned}&({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\otimes 2}(\Xi _{\sigma ,\mathrm{(iii)}}) \nonumber \\&\quad = \sum _{j =1}^J \bigl [ {{|\sigma (j)\rangle }\!{\langle \sigma (j)|}}^{A_c}\otimes {{|\sigma (j)\rangle }\!{\langle \sigma (j)|}}^{A_c'}\bigr ]\nonumber \\&\qquad \otimes {\mathbb {E}}_{U }\bigl [ (U_j^{A_r} \otimes U_j^{A_r'}) \Upsilon _{\varsigma ,jjjj}^{A_lA_rR A_l'A_r'R'} (U_j^{\dagger A_r} \otimes U_j^{\dagger A_r'}) \bigr ]. \end{aligned}$$

(B18)

Note that $\Xi _\sigma =\Xi _{\sigma ,\mathrm{(i)}}+\Xi _{\sigma ,\mathrm{(ii)}}+\Xi _{\sigma ,\mathrm{(iii)}}$.

In the case (i), from Lemma 9, we have

$$\begin{aligned}&({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\otimes 2}(\Xi ^{AA'RR'}_{\sigma ,\mathrm{(i)}}) = \nonumber \\&\qquad \sum _{j\ne k}\frac{1}{r_jr_k}{{|\sigma (j)\rangle }\!{\langle \sigma (j)|}}^{A_c}\otimes {{|\sigma (k)\rangle }\!{\langle \sigma (k)|}}^{A_c'} \otimes {\mathbb {I}}_{jk}^{A_rA_r'} \otimes \Xi ^{A_lR A_l'R'}_{\mathrm{(i)},jk}, \end{aligned}$$

(B19)

where $\Xi ^{A_lR A_l'R'}_{\mathrm{(i)},jk} = {\mathrm {Tr}}_{A_rA_r'} \bigl [ {\mathbb {I}}_{jk}^{A_rA_r'} \Upsilon _{\varsigma ,jkjk}^{A_lA_rR A_l'A_r'R'} \bigr ]$. It follows that

$$\begin{aligned}&{\mathrm {Tr}}\left[ \left( X^{A_lA_rR}_{\sigma (j)\sigma (j)} \otimes X^{A_l'A'_r R'}_{\sigma (k)\sigma (k)} \right) \left( {\mathbb {I}}_{jk}^{A_rA_r'} \otimes \Xi ^{A_lRA_l'R'}_{\mathrm{(i)},jk}\right) \right] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \left( X^{A_lR}_{\sigma (j)\sigma (j)} \otimes X^{A_l' R'}_{\sigma (k)\sigma (k)} \right) \Xi ^{A_lRA_l'R'}_{\mathrm{(i)},jk} \right] , \end{aligned}$$

(B20)

and consequently, from the condition for X, i.e. $X_{jj}^{A_lR}=0$, that ${\mathrm {Tr}}[ (X^{AR})^{\otimes 2} \Xi ^{AA'RR'}_{\sigma ,\mathrm{(i)}} ] =0$.

Let us next consider the case (ii), where $(j,k)=(n,m)$ ($j\ne k)$. This case yields

$$\begin{aligned}&({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\otimes 2}(\Xi ^{AA'RR'}_{\sigma ,\mathrm{(ii)}}) \nonumber \\&\quad = \sum _{j\ne k}\frac{1}{r_jr_k}{{|\sigma (j)\rangle }\!{\langle \sigma (k)|}}^{A_c}\otimes {{|\sigma (k)\rangle }\!{\langle \sigma (j)|}}^{A_c'} \otimes {\mathbb {F}}_{jk}^{A_r A_r'} \otimes \Xi ^{A_lR A_l'R'}_{\mathrm{(ii)},jk}, \end{aligned}$$

(B21)

where $\Xi ^{A_lR A_l'R'}_{\mathrm{(ii)},jk} = {\mathrm {Tr}}_{A_rA_r'} \bigl [ \Upsilon _{\varsigma ,jkkj}^{A_lA_rR A_l'A_r'R'} {\mathbb {F}}_{kj}^{A_r A_r'}\bigr ]$. Denoting the $A_r$ part of $\Upsilon $ and ${\mathcal {T}}^*$ by ${\bar{A}}_r$, we have

$$\begin{aligned}&{\mathrm {Tr}}\left[ \left( X^{A_lA_rR}_{\sigma (k)\sigma (j)} \otimes X^{A_l'A'_r R'}_{\sigma (j)\sigma (k)} \right) \left( {\mathbb {F}}_{jk}^{A_r A_r'} \otimes \Xi ^{A_lRA_l'R'}_{\mathrm{(ii)},jk}\right) \right] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \left( X^{A_lA_rR}_{\sigma (k)\sigma (j)} \otimes X^{A_l'A'_r R'}_{\sigma (j)\sigma (k)}\otimes {\mathbb {F}}_{kj}^{{\bar{A}}_r{\bar{A}}_r'} \right) \left( {\mathbb {F}}_{jk}^{A_r A_r'} \otimes \Upsilon _{\varsigma ,jkkj}^{A_l{\bar{A}}_rR A_l'{\bar{A}}_r'R'}\right) \right] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \left( {{|k\rangle }\!{\langle j|}}^{A_c}\!\otimes \! {{|j\rangle }\!{\langle k|}}^{A_c'}\!\otimes \! X^{A_lA_rR}_{\sigma (k)\sigma (j)} \!\otimes \! X^{A_l'A_r'R'}_{\sigma (j)\sigma (k)} \!\otimes \! {\mathbb {F}}_{kj}^{{\bar{A}}_r{\bar{A}}_r'}\right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jk}^{A_r A_r'} \otimes ({\mathcal {T}}^{* E\rightarrow A_cA_l{\bar{A}}_r})^{\otimes 2}({\mathbb {F}}^{RE,R'E'}_{\varsigma })\right) \right] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \left( ({\mathcal {T}}^{A_cA_l{\bar{A}}_r \rightarrow E} )^{\otimes 2} \left( {{|k\rangle }\!{\langle j|}}^{A_c} \!\otimes \! {{|j\rangle }\!{\langle k|}}^{A_c'} \otimes X^{A_lA_rR}_{\sigma (k)\sigma (j)} \!\otimes \! X^{A_l'A_r'R'}_{\sigma (j)\sigma (k)} \!\otimes \! {\mathbb {F}}_{kj}^{{\bar{A}}_r{\bar{A}}_r'}\right) \right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jk}^{A_r A_r'} \otimes {\mathbb {F}}^{RE,R'E'}_{\varsigma }\right) \right] \nonumber \\&\quad =d_A^2 {\mathrm {Tr}}\left[ \left( \left( {{|j\rangle }\!{\langle k|}}^{A_c}\!\otimes \! {{|k\rangle }\!{\langle j|}}^{A_c'}\!\otimes \! X^{A_l^TA_rR}_{\sigma (k)\sigma (j)} \!\otimes \! X^{A_l'^TA_r'R'}_{\sigma (j)\sigma (k)} \!\otimes \! {\mathbb {F}}_{jk}^{{\bar{A}}_r{\bar{A}}_r'} \right) (\tau ^{A_cA_l {\bar{A}}_r E} )^{\otimes 2} \right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jk}^{A_r A_r'} \!\otimes \!{\mathbb {F}}^{RE,R'E'}_{\varsigma } \right) \right] \nonumber \\&\quad =d_A^2 {\mathrm {Tr}}\left[ \left( X_{\sigma (k)\sigma (j)}^{A_l^T A_r R} \otimes X_{\sigma (j)\sigma (k)}^{A_l'^T A'_r R'} \right) \left( \tau _{kj}^{A_l {\bar{A}}_r E} \otimes \tau _{jk}^{A_l' {\bar{A}}'_r E'} \right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jk}^{A_r A_r'} \otimes {\mathbb {F}}_{jk}^{{\bar{A}}_r {\bar{A}}_r'} \otimes {\mathbb {F}}^{RE,R'E'}_{\varsigma } \right) \right] \nonumber \\&\quad =d_A^2{\mathrm {Tr}}\left[ \left( {\mathrm {Tr}}_{A_l} \left[ X_{\sigma (k)\sigma (j)}^{A_l^T A_r R}\tau _{kj}^{A_l {\bar{A}}_r E}\right] \otimes {\mathrm {Tr}}_{A_l'}\left[ X_{\sigma (j)\sigma (k)}^{A_l'^T A'_r R'}\tau _{jk}^{A_l' {\bar{A}}'_r E'}\right] \right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jk}^{A_r A_r'} \otimes {\mathbb {F}}_{jk}^{{\bar{A}}_r {\bar{A}}_r'} \otimes {\mathbb {F}}^{RE,R'E'}_{\varsigma } \right) \right] \nonumber \\&\quad =d_A^2 \left\| {\mathrm {Tr}}_{A_l}\left[ X^{A_l ^TA_r R}_{\sigma (j)\sigma (k)} \tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| _{2,\varsigma ^{ER}}^2, \end{aligned}$$

(B22)

where the fourth line follows from the Choi–Jamiołkowski correspondence (25) and the last line from the swap trick (Lemma 39). Hence we obtain

$$\begin{aligned}&{\mathrm {Tr}}[ (X^{AR})^{\otimes 2} \Xi ^{AA'RR'}_{\sigma ,\mathrm{(ii)}} ] \nonumber \\&\quad = \sum _{j\ne k} \frac{d_{A}^{2}}{r_{j} r_{k}} \left\| {\mathrm {Tr}}_{A_l} \left[ X^{A_{l}^{T}A_{r} R}_{\sigma (j)\sigma (k)} \tau ^{A_l {\bar{A}}_{r} E}_{jk} \right] \right\| _{2,\varsigma ^{ER}}^2. \end{aligned}$$

(B23)

Finally, we investigate the case (iii). Lemma 9 leads to

$$\begin{aligned}&({\mathcal {W}}^{A \rightarrow A_cA_lA_r})^{\otimes 2}(\Xi ^{AA'RR'}_{\sigma ,\mathrm{(iii)}}) \nonumber \\&\quad = \sum _{j=1}^J\!\frac{1}{r_j(r_j^2-1)}{{|\sigma (j)\rangle }\!{\langle \sigma (j)|}}^{A_c}\!\otimes \!{{|\sigma (j)\rangle }\!{\langle \sigma (j)|}}^{A_c'}\!\otimes \!\Xi _{\mathrm{(iii)},jj}^{A_lRA_l'R'}\! , \end{aligned}$$

(B24)

where

$$\begin{aligned}&\Xi _{\mathrm{(iii)},jj}^{A_lRA_l'R'}:= \left[ r_j{\mathbb {I}}_{jj}^{A_rA_r'}\otimes \Xi _{\mathrm{(i)},jj}^{A_lRA_l'R'} - {\mathbb {I}}_{jj}^{A_rA_r'}\otimes \Xi ^{A_lRA_l'R'}_{\mathrm{(ii)},jj}\right. \nonumber \\&\quad \left. +r_j{\mathbb {F}}_{jj}^{A_rA_r'}\otimes \Xi _{\mathrm{(ii)},jj}^{A_lRA_l'R'} - {\mathbb {F}}_{jj}^{A_rA_r'}\otimes \Xi ^{A_lRA_l'R'}_{\mathrm{(i)},jj} \right] . \end{aligned}$$

(B25)

Similarly to (B20) and (B22), we have

$$\begin{aligned} {\mathrm {Tr}}\left[ \left( X^{A_lA_rR}_{\sigma (j)\sigma (j)} \otimes X^{A_l'A'_r R'}_{\sigma (j)\sigma (j)} \right) \left( {\mathbb {I}}_{jj}^{A_r A_r'} \otimes \Xi ^{A_lRA_l'R'}_{\mathrm{(ii)},jj}\right) \right] = 0 \end{aligned}$$

(B26)

and

$$\begin{aligned}&{\mathrm {Tr}}\left[ \left( X^{A_lA_rR}_{\sigma (j)\sigma (j)} \otimes X^{A_l'A'_r R'}_{\sigma (j)\sigma (j)} \right) \left( {\mathbb {F}}_{jj}^{A_r A_r'} \otimes \Xi ^{A_lRA_l'R'}_{\mathrm{(i)},jj} \right) \right] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \left( {{|j\rangle }\!{\langle j|}}^{A_c} \!\otimes \! {{|j\rangle }\!{\langle j|}}^{A_c'} \!\otimes \! X^{A_lA_rR}_{\sigma (j)\sigma (j)} \!\otimes \! X^{A_l'A_r'R'}_{\sigma (j)\sigma (j)} \!\otimes \! {\mathbb {I}}_{jj}^{{\bar{A}}_r{\bar{A}}_r'}\right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jj}^{A_r A_r'} \otimes ({\mathcal {T}}^{* E\rightarrow A_cA_l{\bar{A}}_r})^{\otimes 2} \left( {\mathbb {F}}^{RE,R'E'}_{\varsigma }\right) \right) \right] \nonumber \\&\quad = {\mathrm {Tr}}\left[ \left( ({\mathcal {T}}^{A_cA_l{\bar{A}}_r \rightarrow E})^{\otimes 2} \left( {{|j\rangle }\!{\langle j|}}^{A_c} \otimes {{|j\rangle }\!{\langle j|}}^{A_c'} \otimes X^{A_lA_rR}_{\sigma (j)\sigma (j)} \otimes X^{A_l'A_r'R'}_{\sigma (j)\sigma (j)} \otimes {\mathbb {I}}_{jj}^{{\bar{A}}_r{\bar{A}}_r'}\right) \right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jj}^{A_r A_r'} \otimes {\mathbb {F}}^{RE,R'E'}_{\varsigma }\right) \right] \nonumber \\&\quad =d_A^2 {\mathrm {Tr}}\left[ \left( X_{\sigma (j)\sigma (j)}^{A_l ^TA_r R} \otimes X_{\sigma (j)\sigma (j)}^{A_l'^T A'_r R'}\right) \left( \tau _{jj}^{A_l {\bar{A}}_r E} \otimes \tau _{jj}^{A_l' {\bar{A}}'_r E'}\right) \right. \nonumber \\&\qquad \left. \left( {\mathbb {F}}_{jj}^{A_r A_r'} \otimes {\mathbb {I}}_{jj}^{{\bar{A}}_r {\bar{A}}_r'} \otimes {\mathbb {F}}^{RE,R'E'}_{\varsigma } \right) \right] \nonumber \\&\quad =d_A^2{\mathrm {Tr}}\left[ \left( {\mathrm {Tr}}_{A_l}\!\left[ X_{\sigma (j)\sigma (j)}^{A_l^T A_r R}\tau _{j}^{A_l E}\right] \!\otimes \! {\mathrm {Tr}}_{A_l'}\!\left[ X_{\sigma (j)\sigma (j)}^{A_l'^T A'_r R'}\tau _{jj}^{A_l' E'}\right] \right) \left( {\mathbb {F}}_{jj}^{A_r A_r'} \otimes {\mathbb {F}}^{RE,R'E'}_{\varsigma } \right) \right] \nonumber \\&\quad =d_A^2\left\| {\mathrm {Tr}}_{A_l} \left[ X^{A_l^T A_r R}_{\sigma (j)\sigma (j)} \tau ^{A_l E}_{jj} \right] \right\| _{2,\varsigma ^{ER}}^2. \end{aligned}$$

(B27)

Combining this with (B20), (B22) and (B25), we obtain

$$\begin{aligned}&{\mathrm {Tr}}\left[ (X_{\sigma (j)\sigma (j)}^{A_lR})^{\otimes 2} \Xi ^{A_lA_l'RR'}_{\mathrm{(iii)},jj} \right] \\&\quad =d_A^2 r_j \left\| {\mathrm {Tr}}_{A_l}\left[ X^{A_l^T A_r R}_{\sigma (j)\sigma (j)}\tau ^{A_l {\bar{A}}_r E}_{jj} \right] \right\| _{2,\varsigma }^2 -d_A^2 \left\| {\mathrm {Tr}}_{A_l}\left[ X^{A_l^T A_r R}_{\sigma (j)\sigma (j)}\tau ^{A_l E}_{jj} \right] \right\| _{2,\varsigma }^2. \end{aligned}$$

Noting that ${\mathrm {Tr}}_{A_l}[ X^{A_l^T A_r R}_{\sigma (j)\sigma (j)}\tau ^{A_l {\bar{A}}_r E}_{jj}]$ is a Hermitian operator for each j, and by using the property of the Hilbert–Schmidt norm (see Lemma 13), the above equality leads to

$$\begin{aligned} {\mathrm {Tr}}\left[ (X_{\sigma (j)\sigma (j)}^{A_lR})^{\otimes 2} \Xi ^{A_lA_l'RR'}_{\mathrm{(iii)},jj} \right] \le d_A^2 \left( r_j -\frac{1}{r_j}\right) \left\| {\mathrm {Tr}}_{A_l} \left[ X^{A_l^T A_r R}_{\sigma (j)\sigma (j)}\tau ^{A_l {\bar{A}}_r E}_{jj} \right] \right\| _{2,\varsigma ^{ER}}^2. \end{aligned}$$

(B28)

Combining this with (B24), we have

$$\begin{aligned} {\mathrm {Tr}}[ (X^{AR})^{\otimes 2} \Xi ^{AA'RR'}_{\sigma ,\mathrm{(iii)}} ]&=\sum _{j=1}^J \frac{1}{r_j(r_j^2-1)} {\mathrm {Tr}}\left[ (X_{\sigma (j)\sigma (j)}^{A_lR})^{\otimes 2} \Xi ^{A_lA_l'RR'}_{\mathrm{(iii)},jj} \right] \nonumber \\&\le \sum _{j=1}^J \frac{d_A^2}{r_j^2} \left\| {\mathrm {Tr}}_{A_l} \left[ X^{A_l^T A_r R}_{\sigma (j)\sigma (j)}\tau ^{A_l {\bar{A}}_r E}_{jj} \right] \right\| _{2,\varsigma ^{ER}}^2. \end{aligned}$$

(B29)

Since $\Xi _\sigma =\Xi _{\sigma ,\mathrm{(i)}}+\Xi _{\sigma ,\mathrm{(ii)}}+\Xi _{\sigma ,\mathrm{(iii)}}$, we can thus obtain from these evaluations that

$$\begin{aligned} {\mathrm {Tr}}[ (X^{AR})^{\otimes 2} \Xi _\sigma ^{AA'RR'}] \le \sum _{j,k =1}^J \frac{d_A^2}{r_j r_k} \left\| {\mathrm {Tr}}_{A_l}\left[ X_{\sigma (j)\sigma (k)}^{A_l^T A_r R}\tau ^{A_l {\bar{A}}_r E}_{jk} \right] \right\| ^2_{2, \varsigma ^{ER}} \end{aligned}$$

(B30)

for any $\varsigma ^{ER} \in {\mathcal {S}}_=({\mathcal {H}}^{ER})$ and $\sigma \in {\mathbb {P}}$. Combining this with Eq. (B13) concludes the proof. $\square $

Appendix C: Proof of Lemma 11

We prove Lemma 11. We start with recalling the statement: Consider arbitrary unnormalized states $\Psi ^{AR},{\hat{\Psi }}^{AR}\in {\mathcal {P}}({\mathcal {H}}^{AR})$ and arbitrary CP maps ${\mathcal {T}},\hat{{\mathcal {T}}}:A\rightarrow E$. Let ${\mathcal {D}}_+^{A \rightarrow E}$ and ${\mathcal {D}}_-^{A \rightarrow E}$ be arbitrary CP maps such that ${\mathcal {T}}-\hat{{\mathcal {T}}}={\mathcal {D}}_+-{\mathcal {D}}_-$. Let $\delta _+^{AR}$ and $\delta _-^{AR}$ be linear operators on ${\mathcal {H}}^A\otimes {\mathcal {H}}^{R}$, such that

$$\begin{aligned} \delta _+^{AR}\ge 0,\quad \delta _-^{AR}\ge 0, \quad \mathrm{supp}[\delta _+^{AR}]\perp \mathrm{supp}[\delta _-^{AR}] \end{aligned}$$

(C1)

and that

$$\begin{aligned} {\hat{\Psi }}^{AR} -\Psi ^{AR}=\delta _+^{AR}-\delta _-^{AR}. \end{aligned}$$

(C2)

The following inequality holds for any possible permutation $\sigma \in {\mathbb {P}}$ and for both ${\Psi }_*={\Psi }_{\mathrm{av}}$ and ${\Psi }_*={\mathcal {C}}^A(\Psi )$:

$$\begin{aligned}&{\mathbb {E}}_{U \sim \mathsf{H}_{\times }}\left[ \left\| {{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - {\Psi }_*^{AR} )\right\| _1 \right] \nonumber \\&\quad \le {\mathbb {E}}_{U \sim \mathsf{H}_{\times }}\left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\hat{\Psi }}^{AR} - {\hat{\Psi }}_*^{AR} )\right\| _1 \right] \nonumber \\&\qquad +2 \, {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} )] \nonumber \\&\qquad +2 \,{\mathbb {E}}_{U \sim \mathsf{H}_{\times }}{\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (\delta _+^{AR}+\delta _-^{AR})]. \end{aligned}$$

(C3)

Here, ${\hat{\Psi }}_*={\mathbb {E}}_{U\sim \mathsf{H}_\times }[{\mathcal {U}}^A({\hat{\Psi }}^{AR})]$ for ${\Psi }_*={\Psi }_{\mathrm{av}}$ and ${\hat{\Psi }}_*={\mathcal {C}}^A({\hat{\Psi }})$ for ${\Psi }_*={\mathcal {C}}^A(\Psi )$.

Proof

By a recursive application of the triangle inequality, we have

$$\begin{aligned}&\left\| {{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - {\Psi }_*^{AR} )\right\| _1\nonumber \\&\quad \le \left\| ({{\mathcal {T}}}^{A \rightarrow E} - \hat{{\mathcal {T}}}^{A \rightarrow E} )\circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A ( \Psi ^{AR} )\right\| _1+\left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} - {\hat{\Psi }}^{AR} )\right\| _1\nonumber \\&\qquad +\left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\hat{\Psi }}^{AR} - {\hat{\Psi }}_*^{AR} )\right\| _1 +\left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\hat{\Psi }}_*^{AR} - \Psi _*^{AR} )\right\| _1\nonumber \\&\qquad +\left\| (\hat{{\mathcal {T}}}^{A \rightarrow E} - {{\mathcal {T}}}^{A \rightarrow E}) \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }_*^{AR} )\right\| _1. \end{aligned}$$

(C4)

The expectation value of the first term is bounded as

$$\begin{aligned}&{\mathbb {E}}_{U} \left[ \left\| ({{\mathcal {T}}}^{A \rightarrow E} - \hat{{\mathcal {T}}}^{A \rightarrow E})\circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) \right\| _1 \right] \nonumber \\&\quad = {\mathbb {E}}_{U } \left[ \left\| ({\mathcal {D}}_+^{A \rightarrow E} - {\mathcal {D}}_-^{A \rightarrow E})\circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} ) \right\| _1 \right] \nonumber \\&\quad \le {\mathbb {E}}_{U } \left[ \left\| {\mathcal {D}}_+^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi ^{AR} )\right\| _1 \right] +{\mathbb {E}}_{U } \left[ \left\| {\mathcal {D}}_-^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A ( \Psi ^{AR} )\right\| _1 \right] \nonumber \\&\quad = {\mathbb {E}}_{U } \left[ {\mathrm {Tr}}[{\mathcal {D}}_+^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A ( \Psi ^{AR} )] \right] + {\mathbb {E}}_{U } \left[ {\mathrm {Tr}}[ {\mathcal {D}}_-^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A ( \Psi ^{AR} )] \right] \nonumber \\&\quad = {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} )]. \end{aligned}$$

(C5)

In the same way, the expectation value of the last term is bounded as

$$\begin{aligned} {\mathbb {E}}_{U} \left[ \left\| (\hat{{\mathcal {T}}}^{A \rightarrow E} - {{\mathcal {T}}}^{A \rightarrow E} )\circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( \Psi _*^{AR} ) \right\| _1 \right] \le {\mathrm {Tr}}[({\mathcal {D}}_+^{A \rightarrow E}+ {\mathcal {D}}_-^{A \rightarrow E}) \circ {\mathcal {G}}_\sigma ^A ( \Psi _{\mathrm{av}}^{AR} )]. \end{aligned}$$

(C6)

For the second term, we have

$$\begin{aligned}&{\mathbb {E}}_{U} \left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A ( {\Psi }^{AR} - {\hat{\Psi }}^{AR} )\right\| _1 \right] \nonumber \\&\quad ={\mathbb {E}}_{U } \left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (\delta _+^{AR}-\delta _-^{AR}) \right\| _1 \right] \nonumber \\&\quad \le {\mathbb {E}}_{U } \left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A (\delta _+^{AR})\right\| _1\right] +{\mathbb {E}}_{U } \left[ \left\| {{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A (\delta _-^{AR}) \right\| _1 \right] \nonumber \\&\quad ={\mathbb {E}}_{U } \left[ {\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A (\delta _+^{AR})] \right] + {\mathbb {E}}_{U } \left[ {\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {{\mathcal {U}}}^A (\delta _-^{AR})] \right] \nonumber \\&\quad ={\mathbb {E}}_{U} \left[ {\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (\delta _+^{AR}+\delta _-^{AR})]\right] . \end{aligned}$$

(C7)

Similarly, the expectation value of the fourth term is bounded as

$$\begin{aligned} {\mathbb {E}}_{U }\left[ \left\| \hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\hat{\Psi }}_*^{AR} - {\Psi }_*^{AR} )\right\| _1 \right] \le {\mathbb {E}}_{U}\left[ {\mathrm {Tr}}[\hat{{\mathcal {T}}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A (\delta _+^{AR}+\delta _-^{AR})]\right] . \end{aligned}$$

(C8)

Combining these all together, we obtain (C3). $\square $

Appendix D: Proof of Lemma 12

We prove Lemma 12, the statement of which is as follows: let ${\mathcal {T}}^{A\rightarrow E}$ be a CP map, and introduce a quantum system $E_c$ with dimension J. Define an isometry $Y:=\sum _{j}{|jj\rangle }^{A_cE_c}{\langle j|}^{A_c}$, and a linear supermap $\check{{\mathcal {T}}}^{A \rightarrow EE_c}$ by ${\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {Y}}^{A_c \rightarrow A_c E_c}$. Then, $\check{{\mathcal {T}}}^{A \rightarrow EE_c}$ is a CP map and, for any $\Psi ^{AR}$ that is classically coherent in $A_cR_c$, it holds that

$$\begin{aligned}&\left\| \check{{\mathcal {T}}}^{A \rightarrow EE_c} ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1 = \left\| {\mathcal {T}}^{A \rightarrow E} ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1, \end{aligned}$$

(D1)

$$\begin{aligned}&\left\| \check{{\mathcal {T}}}^{A \rightarrow EE_c} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1 = \left\| {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{av}}^{AR} )\right\| _1. \end{aligned}$$

(D2)

Proof

Define $Z^{R_c\rightarrow R_cE_c}$ by $Z:=\sum _{j}{|jj\rangle }^{R_cE_c}{\langle j|}^{R_c}$. Since $\Psi ^{AR}$ is classically coherent in $A_cR_c$ and the averaged state is given by $\Psi _{\mathrm{av}}^{AR}=\sum _{j=1}^J{{|j\rangle }\!{\langle j|}}^{A_c}\otimes \pi ^{A_r}\otimes \Psi _{jj}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}$, we have

$$\begin{aligned} \check{{\mathcal {T}}}^{A \rightarrow EE_c} ( {\Psi }^{AR} - \Psi _{\mathrm{ex}}^{AR} ) = {\mathcal {T}}^{A \rightarrow E} \otimes {\mathcal {Z}}^{R_c\rightarrow R_cE_c}( {\Psi }^{AR} - \Psi _{\mathrm{ex}}^{AR} ) \end{aligned}$$

(D3)

and

$$\begin{aligned} \check{{\mathcal {T}}}^{A \rightarrow EE_c} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A ( {\Psi }^{AR} - \Psi _{\mathrm{ex}}^{AR} ) = {\mathcal {T}}^{A \rightarrow E} \circ {\mathcal {G}}_\sigma ^A \circ {\mathcal {U}}^A \otimes {\mathcal {Z}}^{R_c\rightarrow R_cE_c}( {\Psi }^{AR} - \Psi _{\mathrm{ex}}^{AR} ). \end{aligned}$$

(D4)

Therefore, due to the invariance of the the trace distance under linear isometry, we obtain (D1) and (D2).$\square $

Appendix E: Proof of Lemmas 16–20 and 29 –35

Proof of Lemma 16

Property 1 immediately follows from the definition of the purified distance.

To show Property 2, note that for any $\rho ,\varsigma \in {\mathcal {S}}_\le ({\mathcal {H}})$, we have (see Lemma 6 in [25])

$$\begin{aligned} {\bar{D}}(\rho ,\varsigma ) \le P(\rho ,\varsigma ) \le \sqrt{2{\bar{D}}(\rho ,\varsigma )}, \end{aligned}$$

(E1)

where ${\bar{D}}$ is the generalized the trace distance defined by

$$\begin{aligned} {\bar{D}}(\rho ,\varsigma ):=\frac{1}{2}\Vert \rho -\varsigma \Vert _1+\frac{1}{2}|\mathrm{Tr}[\rho ]-\mathrm{Tr}[\varsigma ]|. \end{aligned}$$

(E2)

Noting that the second term in the above expression is no greater than the first term, we conclude the proof.

For Property 3, define $\lambda _\phi :={\left\langle \phi |\phi \right\rangle }$ and consider a normalized pure state ${|\phi _{\mathrm{n}}\rangle }:=\lambda _\phi ^{-1/2}{|\phi \rangle }$. Due to the triangle inequality and the first statement of this lemma, we have

$$\begin{aligned} P(\psi ,\phi )&\le P(\psi ,\phi _{\mathrm{n}}) + P(\phi _{\mathrm{n}},\phi ) \nonumber \\&=\sqrt{1-|{\left\langle \psi |\phi _{\mathrm{n}}\right\rangle }|^2}+\sqrt{1-|{\left\langle \phi _{\mathrm{n}}|\phi \right\rangle }|^2} \nonumber \\&=\sqrt{1-\lambda _\phi ^{-1}|{\left\langle \psi |\phi \right\rangle }|^2}+\sqrt{1-\lambda _\phi ^{-1}|{\left\langle \phi |\phi \right\rangle }|^2}\nonumber \\&\le \sqrt{1-|{\left\langle \psi |\phi \right\rangle }|^2}+\sqrt{1-\lambda _\phi }, \end{aligned}$$

(E3)

which completes the proof. $\square $

Proof of Lemma 17

Since $\rho ^{ABK}$ and $\rho _k^{AB}$ are normalized, the purified distances are given by

$$\begin{aligned}&P(\rho ^{ABK},{\hat{\rho }}^{ABK})=\sqrt{1-\Vert \sqrt{\rho ^{ABK}}\sqrt{{\hat{\rho }}^{ABK}}\Vert _1^2},&\delta _k:=P(\rho _k,{\hat{\rho }}_k)=\sqrt{1-\Vert \sqrt{\rho _k}\sqrt{{\hat{\rho }}_k}\Vert _1^2}. \end{aligned}$$

(E4)

The latter equality leads to

$$\begin{aligned} \sum _kp_k\Vert \sqrt{\rho _k}\sqrt{{\hat{\rho }}_k}\Vert _1=\sum _kp_k\sqrt{1-\delta _k^2}\ge \sum _kp_k(1-\delta _k)=1-\sum _kp_k\delta _k. \end{aligned}$$

(E5)

In addition, a simple calculation yields $\Vert \sqrt{\rho ^{ABK}}\sqrt{{\hat{\rho }}^{ABK}}\Vert _1=\sum _kp_k\Vert \sqrt{\rho _k}\sqrt{{\hat{\rho }}_k}\Vert _1$. Combining these relations with the first one in (E4), and by using $\sqrt{1-(1-x)^2}\le \sqrt{2x}$, we obtain the desired result. $\square $

Proof of Lemma 18

Define $\varsigma '^{AK}:=\sum _kp_k\varsigma _k^{A}\otimes {{|k\rangle }\!{\langle k|}}^K$. By the triangle inequality, we have

$$\begin{aligned}&\left\| \rho ^{AK}-\varsigma ^{AK}\right\| _1 \le \left\| \rho ^{AK}-\varsigma '^{AK}\right\| _1 + \left\| \varsigma '^{AK}-\varsigma ^{AK}\right\| _1 \nonumber \\&\quad = \sum _kp_k\left\| \rho _k-\varsigma _k\right\| _1 + \sum _k|p_k-q_k|. \end{aligned}$$

(E6)

We also have

$$\begin{aligned} \sum _kp_k\left\| \rho _k-\varsigma _k\right\| _1&= \sum _k\left\| p_k\rho _k-p_k\varsigma _k\right\| _1 \nonumber \\&\le \sum _k\left\| p_k\rho _k-q_k\varsigma _k\right\| _1 + \sum _k\left\| q_k\varsigma _k-p_k\varsigma _k\right\| _1 \nonumber \\&= \left\| \rho ^{AK}-\varsigma ^{AK}\right\| _1 + \sum _k|p_k-q_k|, \end{aligned}$$

(E7)

which implies the first inequality in (69). The second inequality simply follows from the monotonicity of the trace distance under discarding of system A.$\square $

Proof of Lemma 19

Consider arbitrary finite dimensional quantum system C and any subnormalized state $\xi $ on AC such that the reduced state on A takes the form of $\xi ^A=\bigoplus _{j=1}^Jq_j \varpi _j^{A_l}\otimes \pi _j^{A_r}$. Due to the triangle inequality for the trace norm, it holds that

$$\begin{aligned}&\Vert {\mathcal {E}}^{A\rightarrow B}(\xi ^{AC})+{\mathcal {F}}^{A\rightarrow B}(\xi ^{AC})\Vert _1 \le \Vert {\mathcal {E}}^{A\rightarrow B}(\xi ^{AC})\Vert _1+\Vert {\mathcal {F}}^{A\rightarrow B}(\xi ^{AC})\Vert _1 \\&\quad \le \Vert {\mathcal {E}}^{A\rightarrow B}\Vert _{\mathrm{DSP}}+\Vert {\mathcal {F}}^{A\rightarrow B}\Vert _{\mathrm{DSP}}. \end{aligned}$$

By taking the supremum over all C and $\xi $ in the first line, we obtain Lemma 19. $\square $

Proof of Lemma 20

Due to the completeness of the set of projectors, it holds that $ \varrho =\sum _{j,k}\Pi _j\varrho \Pi _k. $. This yields $ \mathrm{Tr}[\varrho ^\dagger \varrho ] = \sum _{j,j',k}\mathrm{Tr}[\Pi _j\varrho \Pi _k \Pi _k \varrho \Pi _{j'}] = \sum _{j,k}\mathrm{Tr}[\Pi _j\varrho \Pi _k \Pi _k \varrho \Pi _{j}] $ and completes the proof.$\square $

Proof of Lemma 29

Let ${|\varphi _k\rangle }^{ABC}$ be a purification of $\rho _k^{AB}$ for each k. A purification of $\rho ^{ABK_1K_2}$ is given by ${|\varphi \rangle }^{ABCK_1K_2K_3}:=\sum _k\sqrt{p_k}{|\varphi _k\rangle }^{ABC}{|k\rangle }^{K_1}{|k\rangle }^{K_2}{|k\rangle }^{K_3}$. Due to the duality of the conditional entropies (Lemma 24), Lemma 27 and isometric invariance (Lemma 22), we have

$$\begin{aligned} H_{\mathrm{max}}^\epsilon (AK_1|BK_2)_\rho&= -H_{\mathrm{min}}^\epsilon (AK_1|CK_3)_\varphi = -H_{\mathrm{min}}^\epsilon (A|CK_3)_\varphi \nonumber \\&= H_{\mathrm{max}}^\epsilon (A|BK_1K_2)_\rho = H_{\mathrm{max}}^\epsilon (A|BK_2)_\rho , \end{aligned}$$

(E8)

which completes the proof. $\square $

Proof of Lemma 30

Consider $\rho '\in {\mathcal {B}}^\epsilon (\rho )$ such that $H_{\mathrm{max}}^\epsilon (K_1A|K_2B)_\rho =H_{\mathrm{max}}(K_1A|K_2B)_{\rho '}$. Introduce a projector $\Pi ^{K_1K_2}:=\sum _k{{|k\rangle }\!{\langle k|}}^{K_1}\otimes {{|k\rangle }\!{\langle k|}}^{K_2}$, and define ${\hat{\rho }}^{K_1K_2AB}:=\Pi ^{K_1K_2}\rho '^{K_1K_2AB}\Pi ^{K_1K_2}$. Using the monotonicity of purified distance under trace non-increasing CP map (Property 2 in Lemma 15), and noting that $\rho ^{K_1K_2AB}=\Pi ^{K_1K_2}\rho ^{K_1K_2AB}\Pi ^{K_1K_2}$ by assumption, we have $P({\hat{\rho }}^{K_1K_2AB},\rho ^{K_1K_2AB})\le P(\rho '^{K_1K_2AB},\rho ^{K_1K_2AB})$, which yields ${\hat{\rho }}\in {\mathcal {B}}^\epsilon (\rho )$. Due to the operator monotonicity of the square root function (see e.g. [47]) and $\rho '^{K_1K_2AB}\ge {\hat{\rho }}^{K_1K_2AB}$, we have, for any $\varsigma \in {\mathcal {S}}({\mathcal {H}}^{K_2B})$,

$$\begin{aligned} \left\| \sqrt{\rho '^{K_1AK_2B}}\sqrt{\varsigma ^{K_2B}}\right\| _1&= \mathrm{Tr}\left[ \sqrt{\sqrt{\varsigma ^{K_2B}}\rho '^{K_1AK_2B}\sqrt{\varsigma ^{K_2B}}}\right] \nonumber \\&\ge \mathrm{Tr}\left[ \sqrt{\sqrt{\varsigma ^{K_2B}}{\hat{\rho }}^{K_1AK_2B}\sqrt{\varsigma ^{K_2B}}}\right] \nonumber \\&=\left\| \sqrt{{\hat{\rho }}^{K_1AK_2B}}\sqrt{\varsigma ^{K_2B}}\right\| _1. \end{aligned}$$

(E9)

Recalling the definition of the conditional max entropy (18), (21) and (24), this implies

$$\begin{aligned} H_{\mathrm{max}}(K_1A|K_2B)_{\rho '}\ge H_{\mathrm{max}}(K_1A|K_2B)_{{\hat{\rho }}}\ge H_{\mathrm{max}}^\epsilon (K_1A|K_2B)_\rho , \end{aligned}$$

(E10)

and consequently, $H_{\mathrm{max}}^\epsilon (K_1A|K_2B)_\rho =H_{\mathrm{max}}(K_1A|K_2B)_{{\hat{\rho }}}$. If $\rho $ is also diagonal in $K_1K_2$, we may, without loss of generality, assume that $\rho '$ is diagonal in $K_1K_2$ (see Proposition 5.8 in [48]), which completes the proof. $\square $

Proof of Lemma 31

Let ${\hat{\rho }}_k^{AB}\in {\mathcal {B}}^{\epsilon _k}(\rho _k^{AB})$ be such that $H_{\mathrm{min}}^{\epsilon _k}(A|B)_{\rho _k}=H_{\mathrm{min}}(A|B)_{{\hat{\rho }}_k}$ for each k, and define a subnormalized state ${\hat{\rho }}^{ABK}:=\sum _kp_k{\hat{\rho }}_k^{AB}\otimes {{|k\rangle }\!{\langle k|}}^K$. From Lemma 26, we have $H_{\mathrm{min}}(A|BK)_{{{\hat{\rho }}}}=-\log (\sum _kp_k\cdot 2^{-H_{\mathrm{min}}(A|B)_{{\hat{\rho }}_k}})$. Due to the property of the purified distance (Lemma 17), we also have ${\hat{\rho }}^{ABK}\in {\mathcal {B}}^{\sqrt{2\varepsilon }}(\rho ^{ABK})$, where $\varepsilon =\sum _kp_k\epsilon _k$. This completes the proof. $\square $

Proof of Lemma 32

Let $\{{|i\rangle }\}_{i=1}^{d_A}$ and $\{{|j\rangle }\}_{j=1}^{d_B}$ be the Schmidt bases of ${|\Phi \rangle }^{AA'}$ and ${|\Phi \rangle }^{BB'}$, respectively, and suppose that $X=\sum _{i,j}x_{ij}{{|j\rangle }\!{\langle i|}}$ and $Y=\sum _{i,j}y_{ij}{{|j\rangle }\!{\langle i|}}$. The statement follows by noting that $\mathrm{Tr}[X^TY]=\sum _{i,j}x_{ij}y_{ij}$. $\square $

Proof of Lemma 33

Suppose that $\varrho ^2$ is classically coherent. For any $x\ne y$, it holds that

$$\begin{aligned} 0&= {\langle x|}^X{\langle y|}^Y\varrho ^2{|x\rangle }^X{|y\rangle }^Y \nonumber \\&= \sum _{x',y'}{\langle x|}^X{\langle y|}^Y\varrho {|x'\rangle }^X{|y'\rangle }^Y\cdot {\langle x'|}^X{\langle y'|}^Y\varrho {|x\rangle }^X{|y\rangle }^Y \nonumber \\&\ge ({\langle x|}^X{\langle y|}^Y\varrho {|x\rangle }^X{|y\rangle }^Y)^2, \end{aligned}$$

(E11)

which implies ${\langle x|}^X{\langle y|}^Y\varrho {|x\rangle }^X{|y\rangle }^Y=0$ and completes the proof. $\square $

Proof of Lemma 34

The first inequality is proved as

$$\begin{aligned} \left\| \rho ^{AR}-\pi ^A\otimes \rho ^{R}\right\| _2^2&= \mathrm{Tr}[(\rho ^{AR}-\pi ^A\otimes \rho ^{R})^2] \nonumber \\&= \mathrm{Tr}[(\rho ^{AR})^2-\rho ^{AR}(\pi ^A\otimes \rho ^{R}) -(\pi ^A\otimes \rho ^{R})\rho ^{AR}+(\pi ^A\otimes \rho ^{R})^2] \nonumber \\&= \mathrm{Tr}[(\rho ^{AR})^2] - \frac{1}{d_A} \mathrm{Tr}[(\rho ^{R})^2] \nonumber \\&\le \mathrm{Tr}[(\rho ^{AR})^2] =\left\| \rho ^{AR}\right\| _2^2. \end{aligned}$$

(E12)

Similarly, we obtain the second one as

$$\begin{aligned}&\left\| \rho ^{AR}-{\mathcal {C}}^A(\rho ^{AR})\right\| _2^2 \nonumber \\&\quad = \mathrm{Tr}[(\rho ^{AR}-{\mathcal {C}}^A(\rho ^{AR}))^2] \nonumber \\&\quad = \mathrm{Tr}[(\rho ^{AR})^2]+\mathrm{Tr}[({\mathcal {C}}^A(\rho ^{AR}))^2] - 2\mathrm{Tr}[ \rho ^{AR}{\mathcal {C}}^A(\rho ^{AR})] \nonumber \\&\quad = \mathrm{Tr}[(\rho ^{AR})^2]+ \sum _{i,j}\mathrm{Tr}[({{|i\rangle }\!{\langle i|}}^A\otimes \rho _{ii}^{R})({{|j\rangle }\!{\langle j|}}^A\otimes \rho _{jj}^{R})] - 2\sum _j\mathrm{Tr}[ \rho ^{AR}({{|j\rangle }\!{\langle j|}}^A\otimes \rho _{jj}^{R})] \nonumber \\&\quad = \mathrm{Tr}[(\rho ^{AR})^2] - \sum _j \mathrm{Tr}[(\rho _{jj}^{R})^2] \nonumber \\&\quad \le \mathrm{Tr}[(\rho ^{AR})^2] =\left\| \rho ^{AR}\right\| _2^2, \end{aligned}$$

(E13)

which concludes the proof. $\square $

Proof of Lemma 35

There exist normalized state vectors ${|\psi '\rangle },{|\phi '\rangle }\in {\mathcal {H}}$ such that

$$\begin{aligned} {|\psi \rangle }={\left\langle e|\psi \right\rangle }{|e\rangle }+\alpha {|\psi '\rangle }, \quad {|\phi \rangle }={\left\langle e|\phi \right\rangle }{|e\rangle }+\beta {|\phi '\rangle }, \quad {\left\langle e|\psi '\right\rangle }={\left\langle e|\phi '\right\rangle }=0, \end{aligned}$$

(E14)

where the coefficients $\alpha $ and $\beta $ are given by

$$\begin{aligned} \alpha =\sqrt{1-{\left\langle e|\psi \right\rangle }^2}, \quad \beta =\sqrt{1-{\left\langle e|\phi \right\rangle }^2}. \end{aligned}$$

(E15)

Since ${\left\langle e|\psi \right\rangle }\ge c$, and ${\left\langle e|\phi \right\rangle }\ge c$, we have $\alpha ,\beta \le \sqrt{1-c^2}$, which implies

$$\begin{aligned} |{\left\langle \psi |\phi \right\rangle }| = |{\left\langle \psi |e\right\rangle }{\left\langle e|\phi \right\rangle }+\alpha \beta {\left\langle \psi '|\phi '\right\rangle }| \ge |{\left\langle \psi |e\right\rangle }{\left\langle e|\phi \right\rangle }|-|\alpha \beta {\left\langle \psi '|\phi '\right\rangle }| \ge c^2-(1-c^2). \end{aligned}$$

(E16)

This completes the proof. $\square $

Appendix F: Proof of Ineq. (158)

We prove Ineq. (158), i.e.

$$\begin{aligned} P(\theta _X^{BER},\Theta ^{BER}) \le 2\sqrt{\iota +4\sqrt{20\upsilon +2\delta }} +\sqrt{2\sqrt{20\upsilon +2\delta }}+2\sqrt{2\delta } +2\sqrt{20\upsilon +2\delta } +3\upsilon , \end{aligned}$$

(F1)

under the following conditions that are presented in Sect. 8:

(i)
The $\delta $-partial decoupling condition is satisfied, that is, there exists a state
$$\begin{aligned} \Omega ^{ER}:=\sum _{j=1}^J\varsigma _j^E\otimes \Psi _{jj}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}, \end{aligned}$$
(F2)
where $\{\varsigma _j\}_{j=1}^J$ are normalized states on E, such that
$$\begin{aligned} \left\| {\mathcal {T}}^{A \rightarrow E} ( \Psi ^{AR} ) -\Omega ^{ER} \right\| _1 \le \delta . \end{aligned}$$
(F3)
(ii)
The operator $X\in {\mathcal {P}}({\mathcal {H}}^{ER})$ satisfies
$$\begin{aligned}{}[(X^{ER})^{-\frac{1}{2}},\omega ^{ER}]=0 \end{aligned}$$
(F4)
and
$$\begin{aligned} (\theta ^E)^{-\frac{1}{2}}(X^{ER})^{-\frac{1}{2}}\omega ^{ER}(X^{ER})^{-\frac{1}{2}}(\theta ^E)^{-\frac{1}{2}} =\sum _{k:q_k>0}{{|k\rangle }\!{\langle k|}}^{E_c}\otimes I_k^{E_r}\otimes I_k^{R_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}, \end{aligned}$$
(F5)
where $\omega $ is a subnormalized state defined by
$$\begin{aligned} \omega ^{ER}&:=\sum _{k:q_k>0}q_k{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \theta _k^{E_r}\otimes \theta _k^{R_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}. \end{aligned}$$
(F6)
and
$$\begin{aligned} q_k :=\Vert {\langle k|}^{R_c}{|\theta \rangle }\Vert _1^2, \quad {|\theta _k\rangle }^{E_rR_r}:=q_k^{-1/2}{\langle k|}^{E_c}{\langle k|}^{R_c}{|\theta \rangle }. \end{aligned}$$
(F7)

To this end, we evaluate the distances between purifications of $\Omega ^{ER}\in {\mathcal {S}}_=({\mathcal {H}}^{ER})$ and $\omega ^{ER}\in {\mathcal {S}}_\le ({\mathcal {H}}^{ER})$, in addition to a normalized pure state ${|\Theta \rangle }$ and subnormalized pure states ${|\theta \rangle }$, ${|\theta _X\rangle }$ and ${|\omega _X\rangle }$ on BERD. Recall that ${|\Theta \rangle }$ and ${|\theta \rangle }$ are defined as follows:

${|\Theta \rangle }:=V{|\Psi \rangle }$, where ${|\Psi \rangle }^{ARD}$ is a purification of $\Psi ^{AR}$ and $V^{A\rightarrow BE}$ is a Stinespring dilation of ${\mathcal {T}}^{A\rightarrow E}$.
${|\theta \rangle }$: A subnormalized pure state such that
$$\begin{aligned} H_{\mathrm{max}}(RD|E)_\theta =H_{\mathrm{max}}^{\upsilon }(RD|E)_{\Theta }, \quad P(\theta ^{BERD},\Theta ^{BERD})\le \upsilon , \end{aligned}$$
(F8)
which is classically coherent in $E_cR_c$.

With $\Gamma _X^{ER}$ being a linear operator

$$\begin{aligned} \Gamma _X^{ER}&:= \sqrt{1-\iota }\cdot (X^{ER})^{\frac{1}{2}}((1-\iota )\cdot X^{ER}+\iota \cdot Y^{ER})^{-\frac{1}{2}}, \end{aligned}$$

(F9)

the subnormalized pure states ${|\theta _X\rangle }$ and ${|\omega _X\rangle }$ are define by

$$\begin{aligned} {|\theta _X\rangle }:=\Gamma _X^{ER}{|\theta \rangle }, \quad {|\omega _X\rangle }:=\Gamma _X^{ER}{|\omega \rangle }. \end{aligned}$$

(F10)

Due to the operator monotonicity of the inverse function (see e.g. [47]), we have

$$\begin{aligned} \Gamma _X\Gamma _X^\dagger&= (1-\iota )\cdot (X^{ER})^{\frac{1}{2}}((1-\iota )\cdot X^{ER}+\iota \cdot Y^{ER})^{-1}(X^{ER})^{\frac{1}{2}} \nonumber \\&\le (1-\iota )\cdot (X^{ER})^{\frac{1}{2}}((1-\iota )\cdot X^{ER})^{-1}(X^{ER})^{\frac{1}{2}} = I^{ER}. \end{aligned}$$

(F11)

Consequently, $\Gamma _X^{ER}$ is contractive, and thus ${|\theta _X\rangle }$ and ${|\omega _X\rangle }$ are indeed subnormalized states. Relations among these states are depicted in Figure 2.

1.1 1. Application of triangle inequality

Consider a subnormalized state $\omega ^{ER}$ defined by (F6). Due to Uhlmann’s theorem (Lemma 15), there exists a purification $|\omega \rangle ^{BERD}$ of $\omega ^{ER}$ such that

$$\begin{aligned} P(\theta ^{BERD},\omega ^{BERD})=P( \theta ^{ER},\omega ^{ER} ). \end{aligned}$$

(F12)

By the triangle inequality for the the purified distance, it holds that

$$\begin{aligned}&P(\theta ^{BER}_X,\Theta ^{BER}) \nonumber \\&\quad \le P(\theta ^{BER}_X,\omega _X^{BER}) + P(\omega _X^{BER},\omega ^{BER}) + P(\omega ^{BER},\theta ^{BER}) + P(\theta ^{BER},\Theta ^{BER}) \nonumber \\&\quad \le P(\theta ^{BERD}_X,\omega _X^{BERD}) + P(\omega _X^{BERD},\omega ^{BERD}) + P(\omega ^{BERD},\theta ^{BERD}) \nonumber \\&\qquad + P(\theta ^{BERD},\Theta ^{BERD}) \nonumber \\&\quad \le 2P(\omega ^{BERD},\theta ^{BERD}) + P(\omega _X^{BERD},\omega ^{BERD})+\upsilon \nonumber \\&\quad = 2P(\omega ^{ER},\theta ^{ER}) + P(\omega _X^{BERD},\omega ^{BERD})+\upsilon \nonumber \\&\quad \le 2P(\omega ^{ER},\Omega ^{ER}) + 2P(\Omega ^{ER},\Theta ^{ER}) + 2P(\Theta ^{ER},\theta ^{ER}) + P(\omega _X^{BERD},\omega ^{BERD})+\upsilon \nonumber \\&\quad \le 2P(\omega ^{ER},\Omega ^{ER}) + 2P(\Omega ^{ER},\Theta ^{ER}) + P(\omega _X^{BERD},\omega ^{BERD})+3\upsilon . \end{aligned}$$

(F13)

Here, the third line follows from the monotonicity of the purified distance under partial trace (see Lemma 15); the fourth line from the monotonicity of the purified distance under the trace-nonincreasing CP map $\Gamma _X^{ER}$ and from the condition for $\theta $ given by (F8); the fifth line due to Eq. (F12); and the last line again from (F8). Noting that we have $\Theta ^{ER}={\mathcal {T}}^{A\rightarrow E}(\Psi ^{AR})$ from the definition of $\Theta $, and by using the partial decoupling condition (F3) as well as the relation between the purified distance and the the trace distance (Lemma 16), we have

$$\begin{aligned} P(\Omega ^{ER},\Theta ^{ER}) \le \sqrt{2\left\| \Omega ^{ER}-\Theta ^{ER}\right\| _1} \le \sqrt{2\delta } \end{aligned}$$

(F14)

for the second term in (F13). In the following, we prove that the first and the third term in (F13) are bounded as

$$\begin{aligned}&P(\omega ^{ER},\Omega ^{ER}) \le \sqrt{20\upsilon +2\delta }, \end{aligned}$$

(F15)

$$\begin{aligned}&P(\omega _X^{BERD},\omega ^{BERD}) \le 2\sqrt{\iota +4P(\omega ^{ER},\Omega ^{ER})} + \sqrt{2P(\omega ^{ER},\Omega ^{ER})}, \end{aligned}$$

(F16)

respectively. Combining these all together, we arrive at (F1).

1.2 2. Evaluation of $P(\Omega ^{ER},\omega ^{ER})$

We first evaluate $P(\Omega ^{ER},\omega ^{ER})$ by using the partial decoupling condition (F3). From the normalized state ${|\Theta \rangle }$, define

$$\begin{aligned} p_k:=\Vert {\langle k|}^{R_c}{|\Theta \rangle }\Vert _1^2, \quad {|\Theta _k\rangle }:=p_k^{-1/2}{\langle k|}^{E_c}{\langle k|}^{R_c}{|\Theta \rangle }. \end{aligned}$$

(F17)

From the condition that $\Psi ^{AR}$ is classically coherent in $A_cR_c$ and ${\mathcal {T}}^{A\rightarrow E}$ is trace-preserving, it follows that

$$\begin{aligned} p_k\Theta _k^{R_r}&= \mathrm{Tr}_{BED}[{\langle k|}^{R_c}{{|\Theta \rangle }\!{\langle \Theta |}}^{BER_cR_rD}{|k\rangle }^{R_c}] \nonumber \\&= \mathrm{Tr}_{AD}[{\langle k|}^{R_c}{{|\Psi \rangle }\!{\langle \Psi |}}^{AR_cR_rD}{|k\rangle }^{R_c}] \nonumber \\&={\langle k|}^{R_c}\Psi ^{R_cR_r}{|k\rangle }^{R_c}=\Psi _{kk}^{R_c}. \end{aligned}$$

(F18)

Consequently, the state $\Omega ^{ER}$ defined by (F2) is represented as

$$\begin{aligned} \Omega ^{ER}=\sum _{k=1}^Jp_k\varsigma _k^{E_cE_r}\otimes \Theta _k^{R_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}. \end{aligned}$$

(F19)

Thus, from the definition of $\omega $ given by (F6) and (F7), and by using the property of the trace distance (Lemma 18),we have

$$\begin{aligned}&\left\| \Omega ^{ER}-\omega ^{ER}\right\| _1 \nonumber \\&\quad \le \sum _{k=1}^J|p_k-q_k| + \sum _{k=1}^J p_k\left\| \varsigma _k^{E_cE_r}\otimes \Theta _k^{R_r}-{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \theta _k^{E_r}\otimes \theta _k^{R_r}\right\| _1 \nonumber \\&\quad \le \sum _{k=1}^J|p_k-q_k| + \sum _{k=1}^J p_k\left\| \varsigma _k^{E_cE_r}\otimes \Theta _k^{R_r}-{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \Theta _k^{E_r}\otimes \Theta _k^{R_r}\right\| _1 \nonumber \\&\qquad + \sum _{k=1}^J p_k\left\| \Theta _k^{E_r}\otimes \Theta _k^{R_r}-\Theta _k^{E_r}\otimes \theta _k^{R_r}\right\| _1 + \sum _{k=1}^J p_k\left\| \Theta _k^{E_r}\otimes \theta _k^{R_r}-\theta _k^{E_r}\otimes \theta _k^{R_r}\right\| _1 \nonumber \\&\quad = \sum _{k=1}^J|p_k-q_k| + \sum _{k=1}^J p_k\left\| \varsigma _k^{E_cE_r}-{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \Theta _k^{E_r}\right\| _1 \nonumber \\&\qquad + \sum _{k=1}^J p_k\left\| \Theta _k^{R_r}-\theta _k^{R_r}\right\| _1 + \sum _{k=1}^J p_k\left\| \Theta _k^{E_r}-\theta _k^{E_r}\right\| _1. \end{aligned}$$

(F20)

Noting that $\Theta ^{E_c}$ and $\theta ^{E_c}$ are both diagonal in $\{{|k\rangle }\}_k$, the first term is equal to $\Vert \Theta ^{E_c}-\theta ^{E_c}\Vert _1$. By using Lemma 18 again, the third and the fourth terms are bounded as $\sum _{k=1}^J p_k\Vert \Theta _k^{R_r}-\theta _k^{R_r}\Vert _1\le 2\Vert \Theta ^{R_cR_r}-\theta ^{R_cR_r}\Vert _1$ and $\sum _{k=1}^J p_k\Vert \Theta _k^{E_r}-\theta _k^{E_r}\Vert _1\le 2\Vert \Theta ^{E_cE_r}-\theta ^{E_cE_r}\Vert _1$, respectively. In addition, denoting by ${\mathcal {C}}^{R_c}$ the completely dephasing operation on $R_c$ with respect to the basis $\{{|k\rangle }\}_k$, the second term is bounded as

$$\begin{aligned}&\sum _{k=1}^J p_k\left\| \varsigma _k^{E_cE_r}-{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \Theta _k^{E_r}\right\| _1 \nonumber \\&\quad = \left\| \sum _{k=1}^J p_k\varsigma _k^{E_cE_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}-\sum _{k=1}^J p_k{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \Theta _k^{E_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}\right\| _1 \nonumber \\&\quad = \left\| \Omega ^{E_cE_rR_c}-{\mathcal {C}}^{R_c}(\Theta ^{E_cE_rR_c})\right\| _1 \nonumber \\&\quad \le \left\| \Omega ^{E_cE_rR_c}-\Theta ^{E_cE_rR_c}\right\| _1, \end{aligned}$$

(F21)

where we used $\Omega ^{E_cE_rR_c}={\mathcal {C}}^{R_c}(\Omega ^{E_cE_rR_c})$ in the last line. Substituting all these inequalities to (F20), we arrive at

$$\begin{aligned}&\left\| \Omega ^{ER}-\omega ^{ER}\right\| _1 \nonumber \\&\quad \le \left\| \Theta ^{E_c}-\theta ^{E_c}\right\| _1 + 2\left\| \Theta ^{R_cR_r}-\theta ^{R_cR_r}\right\| _1 + 2\left\| \Theta ^{E_cE_r}-\theta ^{E_cE_r}\right\| _1 \nonumber \\&\qquad + \left\| \Omega ^{E_cE_rR_c}-\Theta ^{E_cE_rR_c}\right\| _1 \nonumber \\&\quad \le 5\left\| \Theta ^{ER}-\theta ^{ER}\right\| _1 + \left\| \Omega ^{ER}-\Theta ^{ER}\right\| _1 \nonumber \\&\quad \le 5\left\| \Theta ^{ER}-\theta ^{ER}\right\| _1 + \delta , \end{aligned}$$

(F22)

where the last line follows from the partial decoupling condition (F3) and $\Theta ^{ER}={\mathcal {T}}^{A\rightarrow E}(\Psi ^{AR})$. From the relation between the trace distance and the purified distance (see Lemma 15), and from the definition of $\theta $, the first term is bounded as

$$\begin{aligned} \left\| \Theta ^{ER}-\theta ^{ER}\right\| _1 \le 2P(\Theta ^{ER},\theta ^{ER}) \le 2\upsilon . \end{aligned}$$

(F23)

Substituting this to (F22), and again using Lemma 15, it follows that

$$\begin{aligned} P(\Omega ^{ER},\omega ^{ER}) \le \sqrt{2 \left\| \Omega ^{ER}-\omega ^{ER}\right\| _1} \le \sqrt{ 20\upsilon +2\delta }, \end{aligned}$$

(F24)

which implies (F15).

1.3 3. Evaluation of $P(\omega _X^{BERD},\omega ^{BERD})$

Due to the property of the purified distance for subnormalized pure states (Property 3 in Lemma 16), we have

$$\begin{aligned} P(\omega _X^{BERD},\omega ^{BERD}) \le \sqrt{1-|{\left\langle \omega _X|\omega \right\rangle }|^2} + \sqrt{\chi _\omega } = \sqrt{1-|{\langle \omega |}(\Gamma _X^{ER})^\dagger {|\omega \rangle }|^2} + \sqrt{\chi _\omega }, \end{aligned}$$

(F25)

where $\chi _\omega :=1-{\left\langle \omega |\omega \right\rangle }$ and the last line follows from the definition of $\omega _X$ given by (F10). To bound the first term, define

$$\begin{aligned} {|{\tilde{\omega }}\rangle }^{BERD}:=\sqrt{\frac{1-\iota }{\alpha }}\cdot (\Gamma _X^{ER})^{-1}{|\omega \rangle }^{BERD}, \end{aligned}$$

(F26)

where

$$\begin{aligned} \alpha :=(1-\iota )\cdot \langle \omega |\omega \rangle +\iota . \end{aligned}$$

(F27)

Note that $\alpha \le 1$ due to the condition $\iota \le 1$. As we prove below, ${|{\tilde{\omega }}\rangle }$ is a normalized pure state. In addition, since $\Gamma _X^\dagger $ is a contraction, $(\Gamma _X^{ER})^\dagger |\omega \rangle $ is a subnormalized pure state. Hence, we can apply Lemma 35 for subnormalized pure states ${|\omega \rangle },(\Gamma _X^{ER})^\dagger |\omega \rangle $ and a normalized pure state ${|{\tilde{\omega }}\rangle }$ to bound the first term in (F25).

Due to the definition of $\tilde{\omega }$ in (F26) and $\alpha \le 1$, we have

$$\begin{aligned} \langle {\tilde{\omega }}|(\Gamma _X^{ER})^\dagger |\omega \rangle = \sqrt{\frac{1-\iota }{\alpha }}\cdot \langle \omega |\omega \rangle \ge \sqrt{1-\iota }\cdot (1-\chi _\omega ). \end{aligned}$$

(F28)

In addition, we have

$$\begin{aligned} \langle \omega |{\tilde{\omega }}\rangle&= \sqrt{\frac{1-\iota }{\alpha }}\cdot \langle \omega | (\Gamma _X^{ER})^{-1} |\omega \rangle \nonumber \\&= \alpha ^{-1/2}\cdot \mathrm{Tr}[ \omega ^{ER} ((1-\iota )\cdot X^{ER}+\iota \cdot Y^{ER})^{\frac{1}{2}} (X^{ER})^{-\frac{1}{2}}] \nonumber \\&= \alpha ^{-1/2}\cdot \mathrm{Tr}[ (X^{ER})^{-\frac{1}{4}} \omega ^{ER} (X^{ER})^{-\frac{1}{4}} ((1-\iota )\cdot X^{ER}+\iota \cdot Y^{ER})^{\frac{1}{2}} ] \nonumber \\&\ge \sqrt{1-\iota } \cdot \mathrm{Tr}[ (X^{ER})^{-\frac{1}{4}} \omega ^{ER} (X^{ER})^{-\frac{1}{4}} \cdot (X^{ER})^{\frac{1}{2}} ] \nonumber \\&= \sqrt{1-\iota } \cdot \mathrm{Tr}[ \omega ^{ER} ] = \sqrt{1-\iota } \cdot (1-\chi _\omega ). \end{aligned}$$

(F29)

Here, the second line follows from the definition of $\Gamma _X$ by (F9), the third line from the commutativity of $(X^{ER})^{-1/2}$ and $\omega ^{ER}$, given by (F4), and the fourth line due to $\alpha \le 1$ and the matrix monotonicity of the square root function. Thus, Lemma 35 yields

$$\begin{aligned} |{\langle \omega |}(\Gamma _X^{ER})^\dagger {|\omega \rangle }| \ge 2(1-\iota ) \cdot (1-\chi _\omega )^2-1 \ge 1-2(\iota +2 \chi _\omega ). \end{aligned}$$

(F30)

Combining this with (F25), and by using $\sqrt{1-(1-x)^2}\le \sqrt{2x}$, we obtain

$$\begin{aligned} \!\! P(\omega _X^{BERD},\omega ^{BERD}) \le 2 \sqrt{\iota +2\chi _\omega } + \sqrt{\chi _\omega }. \end{aligned}$$

(F31)

Noting that $\Omega ^{ER}$ is a normalized state, the triangle inequality for the trace norm and the relation between the trace distance and the purified distance (Lemma 16) lead to

$$\begin{aligned} \chi _\omega =\mathrm{Tr}[ \Omega ^{ER} ]-\mathrm{Tr}[ \omega ^{ER} ] =\Vert \Omega ^{ER} \Vert _1 -\Vert \omega ^{ER} \Vert _1 \le \Vert \omega ^{ER}-\Omega ^{ER}\Vert _1 \le 2P(\omega ^{ER},\Omega ^{ER}). \end{aligned}$$

(F32)

Substituting this to (F31), we arrive at (F16).

To prove that ${|{\tilde{\omega }}\rangle }$ is a normalized pure state, we observe, from the definition of $\Gamma _X$ in (F9) and that of $\tilde{\omega }$ in (F26), that

$$\begin{aligned} \alpha \cdot \langle {\tilde{\omega }}|\tilde{\omega }\rangle&= (1-\iota )\cdot {\langle \omega |}(\Gamma _X^{ER})^{-1\dagger }(\Gamma _X^{ER})^{-1}{|\omega \rangle } \nonumber \\&= {\langle \omega |}(X^{ER})^{-\frac{1}{2}}((1-\iota )\cdot X^{ER}+\iota \cdot Y^{ER})(X^{ER})^{-\frac{1}{2}}{|\omega \rangle } \nonumber \\&= (1-\iota )\cdot \langle \omega |\omega \rangle + \iota \cdot {\langle \omega |}(X^{ER})^{-\frac{1}{2}}Y^{ER}(X^{ER})^{-\frac{1}{2}}{|\omega \rangle } \nonumber \\&= (1-\iota )\cdot \langle \omega |\omega \rangle + \iota \cdot \mathrm{Tr}[ (X^{ER})^{-\frac{1}{2}}\omega ^{ER}(X^{ER})^{-\frac{1}{2}}Y^{ER} ]. \end{aligned}$$

(F33)

Noting that $Y^{ER}$ is classically coherent in $E_cR_c$ due to Lemma 33, we obtain from the property (F5) of $X^{ER}$ that

$$\begin{aligned}&(X^{ER})^{-\frac{1}{2}}\omega ^{ER}(X^{ER})^{-\frac{1}{2}}Y^{ER} \nonumber \\&\quad = (\theta ^E)^{\frac{1}{2}}\cdot (\theta ^E)^{-\frac{1}{2}}(X^{ER})^{-\frac{1}{2}}\omega ^{ER}(X^{ER})^{-\frac{1}{2}}(\theta ^E)^{-\frac{1}{2}} \cdot (\theta ^E)^{\frac{1}{2}}Y^{ER} \nonumber \\&\quad = \left( \sum _{k:q_k>0}{{|k\rangle }\!{\langle k|}}^{E_c}\otimes \theta _k^{E_r}\otimes I_k^{R_r}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}\right) Y^{ER} \nonumber \\&\quad = (\theta ^E\otimes I^R)Y^{ER}. \end{aligned}$$

(F34)

Substituting this to (F33), we obtain

$$\begin{aligned} \alpha \cdot \langle \tilde{\omega }|\tilde{\omega }\rangle = (1-\iota )\cdot \langle \omega |\omega \rangle + \iota \cdot \mathrm{Tr}[ \theta ^EY^{ER} ]. \end{aligned}$$

(F35)

Note that we have $ \mathrm{Tr}[\theta ^EY^{ER}]=\mathrm{Tr}[\theta ^EY^{ERD}]=1 $ from the definition of the conditional max-entropy and the definition of $Y^{ERD}$. Thus, using the definition of $\alpha $ in (F27), we arrive at $ \langle \tilde{\omega }|\tilde{\omega }\rangle =1 $. $\square $

Appendix G: Proof of Ineq. (174)

We prove Ineq. (174), that is,

$$\begin{aligned} {\bar{\lambda }}:=\sum _kr_k\lambda _k\le \lambda \left( \iota ,\sqrt{2}\root 4 \of {24\upsilon +2\delta }\right) +\lambda (\iota ,4)\cdot \sqrt{2}\root 4 \of {24\upsilon +2\delta }, \end{aligned}$$

(G1)

under the partial decoupling condition (141). Recall that $\lambda (\iota ,x)$ is defined by $\lambda (\iota ,x):=2\sqrt{\iota +2x}+\sqrt{x}+2x$, and that $r_k$ and $\lambda _k$ are given by

$$\begin{aligned}&r_k:=\Vert {\langle k|}^{R_c}{|{\hat{\Psi }}\rangle }\Vert _1^2, \quad \lambda _k:=2\sqrt{\iota +4\sqrt{2\delta _k}} +\sqrt{2\sqrt{2\delta _k}}+4\sqrt{2\delta _k}, \end{aligned}$$

(G2)

where

$$\begin{aligned} \delta _k:= \left\| \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_r}) - \varsigma _k^E\otimes {\hat{\Psi }}_{k}^{R_r} \right\| _1 \end{aligned}$$

(G3)

and

$$\begin{aligned}&|{\hat{\Psi }}_{k}\rangle ^{A_rR_rD}:=r_k^{-1/2}{\langle k|}^{E_c}{\langle k|}^{R_c}{|{\hat{\Psi }}\rangle },\nonumber \\&\quad \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\tau ) = {{|k\rangle }\!{\langle k|}}^{E_c}\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({{|k\rangle }\!{\langle k|}}^{A_c}\otimes \tau ^{A_r}){{|k\rangle }\!{\langle k|}}^{E_c}. \end{aligned}$$

(G4)

We introduce similar notations for ${|\Psi \rangle }$ and ${\mathcal {T}}_{{\mathcal {C}}}^{A\rightarrow E}:={\mathcal {T}}^{A\rightarrow E}\circ {\mathcal {C}}$ as follows:

$$\begin{aligned}&p_k:=\Vert {\langle k|}^{R_c}{|\Psi \rangle }\Vert _1^2, \quad |\Psi _{k}\rangle ^{A_rR_rD}:=p_k^{-1/2}{\langle k|}^{E_c}{\langle k|}^{R_c}{|\Psi \rangle }, \nonumber \\&{\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\tau ) = {{|k\rangle }\!{\langle k|}}^{E_c}{\mathcal {T}}_{{\mathcal {C}}}^{A\rightarrow E}({{|k\rangle }\!{\langle k|}}^{A_c}\otimes \tau ^{A_r}){{|k\rangle }\!{\langle k|}}^{E_c}. \end{aligned}$$

(G5)

Note that $\Psi _{kk}^{R_r}=p_k\Psi _{k}^{R_r}$. It is straightforward to verify that the states $\hat{{\mathcal {T}}}_{\mathcal {C}}^{A \rightarrow E}( {\hat{\Psi }}^{AR} )$ and ${\mathcal {T}}_{\mathcal {C}}^{A \rightarrow E} ( \Psi ^{AR} )$ are represented by

$$\begin{aligned}&\hat{{\mathcal {T}}}_{\mathcal {C}}^{A \rightarrow E} ( {\hat{\Psi }}^{AR} ) = \sum _kr_k{{|k\rangle }\!{\langle k|}}^{E_c}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}\otimes \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_r}), \end{aligned}$$

(G6)

$$\begin{aligned}&{\mathcal {T}}_{\mathcal {C}}^{A \rightarrow E} ( \Psi ^{AR} ) = \sum _kp_k{{|k\rangle }\!{\langle k|}}^{E_c}\otimes {{|k\rangle }\!{\langle k|}}^{R_c}\otimes {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}). \end{aligned}$$

(G7)

Since $\Psi ^{AR}$ is assumed to be classically coherent in $A_cR_c$ (Converse Condition 2), the partial decoupling condition (141) implies that there exists $\{ \varsigma _j^E \}$ ($\varsigma _j^E \in {\mathcal {S}}_={{\mathcal {H}}^E} $) satisfying

$$\begin{aligned} \Vert {\mathcal {T}}^{A \rightarrow E}\circ {\mathcal {C}}^A ( \Psi ^{AR} ) -\Omega ^{ER} \Vert _1 \le \delta , \end{aligned}$$

(G8)

where $\Omega ^{ER}:=\sum _{j=1}^J\varsigma _j^E\otimes \Psi _{jj}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}=\sum _{j=1}^Jp_j\varsigma _j^E\otimes \Psi _{j}^{R_r}\otimes {{|j\rangle }\!{\langle j|}}^{R_c}$.

From (G2) and the definition of $\lambda (\iota ,x)$, we have $\lambda _k=\lambda (\iota ,2\sqrt{2\delta _k})$. Noting that $\delta _k\le 2$ by the definition of the trace distance, and that $\sum _kr_k\cdot 2\sqrt{2\delta _k}\le 2\sqrt{2{\bar{\delta }}}$ by Jensen’s inequality, where ${\bar{\delta }}:=\sum _kr_k\delta _k$, we can apply Lemma 36 for $f(x)=\lambda (\iota ,x)$, $c=4$ and $\epsilon _k=2\sqrt{2\delta _k}$ to obtain

$$\begin{aligned} {\bar{\lambda }}=\sum _kr_k\lambda (\iota ,2\sqrt{2\delta _k}) \le \lambda \left( \iota ,\sqrt{2\sqrt{2{\bar{\delta }}}}\right) +\lambda (\iota ,4)\cdot \sqrt{2\sqrt{2{\bar{\delta }}}}. \end{aligned}$$

(G9)

The ${{\bar{\delta }}}$ can further be calculated as follows. By the triangle inequality, we have

$$\begin{aligned} {\bar{\delta }}&=\sum _kr_k\left\| \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_r}) - \varsigma _k^E\otimes {\hat{\Psi }}_{k}^{R_r} \right\| _1 \nonumber \\&\le \sum _kr_k\left\| \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_r}) - {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) \right\| _1 \nonumber \\&\quad + \sum _kr_k\left\| {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) - \varsigma _k^E\otimes \Psi _{k}^{R_r} \right\| _1 \nonumber \\&\quad + \sum _kr_k\left\| \Psi _{k}^{R_r} - {\hat{\Psi }}_{k}^{R_r} \right\| _1 \nonumber \\&\le 2\sum _kr_k\left\| \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_r}) - {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) \right\| _1 \nonumber \\&\quad + \sum _kr_k\left\| {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) - \varsigma _k^E\otimes \Psi _{k}^{R_r} \right\| _1, \end{aligned}$$

(G10)

where the last line follows from the monotonicity of the trace distance under partial trace.

Using the property of the trace distance (Lemma 18 and 16 ), and Eqs. (G6) and (G7), the first term in (G10) is bounded as

$$\begin{aligned}&\sum _kr_k\left\| \hat{{\mathcal {T}}}_{{\mathcal {C}},k}^{A_r\rightarrow E}({\hat{\Psi }}_{k}^{A_rR_r}) - {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) \right\| _1 \nonumber \\&\quad \le 2\left\| \hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{AR}) - {{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}(\Psi ^{AR}) \right\| _1 \nonumber \\&\quad \le 4P( \hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{AR}) , {{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}(\Psi ^{AR}) ). \end{aligned}$$

(G11)

Noting that $\hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{AR})={\hat{\theta }}_{{\mathcal {C}}}^{ER}$ and ${\mathcal {T}}_{{\mathcal {C}}}^{A\rightarrow E}(\Psi ^{AR})=\Theta _{{\mathcal {C}}}^{ER}$ from the definitions of $\hat{{\mathcal {T}}}_{{\mathcal {C}}}$ and $\Theta _{{\mathcal {C}}}$, and recalling Ineq. (166), we have $ P( \hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{AR}) , {{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}(\Psi ^{AR}) ) \le P( {\hat{\theta }}_{{\mathcal {C}}}^{ERD}, \Theta _{{\mathcal {C}}}^{ERD} ) \le \upsilon $. Whereas, noting that the total variation distance is no greater than 2, the second term is calculated to be

$$\begin{aligned}&\sum _kr_k\left\| {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) - \varsigma _k^E\otimes \Psi _{k}^{R_r} \right\| _1 \nonumber \\&\quad \le 2\sum _k|p_k-r_k| + \sum _kp_k\left\| {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) - \varsigma _k^E\otimes \Psi _{k}^{R_r} \right\| _1 \nonumber \\&\quad \le 2\left\| \hat{{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}({\hat{\Psi }}^{AR}) - {{\mathcal {T}}}_{{\mathcal {C}}}^{A\rightarrow E}(\Psi ^{AR}) \right\| _1 + \sum _kp_k\left\| {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) - \varsigma _k^E\otimes \Psi _{k}^{R_r} \right\| _1 \nonumber \\&\quad \le 4\upsilon + \sum _kp_k\left\| {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) - \varsigma _k^E\otimes \Psi _{k}^{R_r} \right\| _1 \nonumber \\&\quad = 4\upsilon + \left\| \sum _kp_k {\mathcal {T}}_{{\mathcal {C}},k}^{A_r\rightarrow E}(\Psi _{k}^{A_rR_r}) \otimes {{|k\rangle }\!{\langle k|}}^{R_c} - \sum _kp_k \varsigma _k^{E}\otimes \Psi _{k}^{R_r} \otimes {{|k\rangle }\!{\langle k|}}^{R_c} \right\| _1 \nonumber \\&\quad = 4\upsilon + \left\| {\mathcal {T}}^{A\rightarrow E}\circ {\mathcal {C}}^A(\Psi ^{AR}) - \Omega ^{ER} \right\| _1 \le 4\upsilon + \delta , \end{aligned}$$

(G12)

where the fourth line follows from the similar argument to show the bound of the first term, and the last line follows from the partial decoupling condition (G8).

Combining these all together, we obtain $ {\bar{\delta }} \le 12\upsilon +\delta $. Substituting this to (G9), we arrive at

$$\begin{aligned} {\bar{\lambda }} \le \lambda \left( \iota ,\sqrt{2}\root 4 \of {24\upsilon +2\delta }\right) +\lambda (\iota ,4)\cdot \sqrt{2}\root 4 \of {24\upsilon +2\delta }. \end{aligned}$$

(G13)

$\square $

Appendix H: List of notations

The followings are the lists of notations used in the proofs of the main theorems.

General notation
${\mathcal {L}}({\mathcal {H}})$	The set of linear operators on ${\mathcal {H}}$
${\mathcal {L}}({\mathcal {H}}^A,{\mathcal {H}}^B)$	The set of linear operators from ${\mathcal {H}}^A$ to ${\mathcal {H}}^B$
$\mathrm{Her}({\mathcal {H}})$	$\{\rho \in {\mathcal {L}}({\mathcal {H}}) : \rho = \rho ^\dagger \}$
${\mathcal {P}}({\mathcal {H}})$	$\{\rho \in \mathrm{Her}({\mathcal {H}}) : \rho \ge 0 \}$
${\mathcal {S}}_\le ({\mathcal {H}})$	$\{\rho \in {\mathcal {P}}({\mathcal {H}}) : {\mathrm {Tr}}[\rho ] \le 1 \}$
${\mathcal {S}}_=({\mathcal {H}})$	$\{\rho \in {\mathcal {P}}({\mathcal {H}}) : {\mathrm {Tr}}[\rho ]=1 \}$
${\mathcal {C}}{\mathcal {P}}(A\rightarrow B)$	The set of CP maps from A to B
${\mathcal {C}}{\mathcal {P}}_\le (A\rightarrow B)$	The set of trace non-increasing CP maps from A to B
${\mathcal {C}}{\mathcal {P}}_=(A\rightarrow B)$	The set of trace preserving CP maps from A to B
$\Psi ^{AR}$	A subnormalized (resp. normalized) state on AR in Theorem 1 and 3 (resp. Theorem 4)
${\mathcal {T}}^{A\rightarrow E}$	A completely-positive superoperator from ${\mathcal {L}}({\mathcal {H}}^A)$ to ${\mathcal {L}}({\mathcal {H}}^B)$ (trace-preserving in Theorem 4)
${\mathcal {T}}^{A\rightarrow B}$	A complementary superoperator of ${\mathcal {T}}^{A\rightarrow E}$
$\Phi ^{AA'}$	Maximally entangled state between A and $A'$ (${\mathcal {H}}^A\cong {\mathcal {H}}^{A'}$)
$\tau ^{AE}$, $\tau ^{AB}$	The Choi–Jamiołkowski state of ${\mathcal {T}}^{A\rightarrow E}$ and ${\mathcal {T}}^{A\rightarrow B}$:
	$\tau ^{AE}={\mathcal {T}}^{A'\rightarrow E}(\Phi ^{AA'})$, $\tau ^{AB}={\mathcal {T}}^{A'\rightarrow B}(\Phi ^{AA'})$
${\mathbb {U}}(d)$	Unitary group of degree d

Norms and distances
$\Vert X\Vert _1$	The trace norm of a linear operator X: $\Vert X\Vert _1=\mathrm{Tr}[\sqrt{XX^\dagger }]$
$\Vert X\Vert _2$	The Hilbert–Schmidt norm of a linear operator X: $\Vert X\Vert _2=\sqrt{\mathrm{Tr}[XX^\dagger ]}$
$\|\!\| X^{VW} \|\!\|_{2,\varsigma ^W}$	$\|\!\| (\varsigma ^W)^{-1/4} X^{VW} (\varsigma ^W)^{-1/4} \|\!\|_{2}$ for $\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^W)$
${\bar{F}}(\rho ,\rho ')$	Generalized fidelity between subnormalized states $\rho ,\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})$:
	${\bar{F}}(\rho ,\rho ')=\Vert \sqrt{\rho }\sqrt{\rho '}\Vert _1+\sqrt{(1-\mathrm{Tr}[\rho ])(1-\mathrm{Tr}[\rho '])}$
$P(\rho ,\rho ')$	Purified distance between subnormalized states $\rho ,\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})$: $P(\rho ,\rho ')=\sqrt{1-{\bar{F}}(\rho ,\rho ')^2}$
${\mathcal {B}}^\epsilon (\rho )$	The $\epsilon $-ball of a subnormalized state $\rho $: ${\mathcal {B}}^\epsilon (\rho )=\{\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})\|\,P(\rho ,\rho ')\le \epsilon \}$

Conditional Entropies for $\rho \in {\mathcal {P}}({\mathcal {H}}^{AB})$ and $\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^B)$
$H_{\mathrm{min}}(A\|B)_{\rho \|\varsigma } $	$\sup \{ \lambda \in {\mathbb {R}}\| 2^{-\lambda } I^A \otimes \varsigma ^B \ge \rho ^{AB} \}$
$H_{\mathrm{max}}(A\|B)_{\rho \|\varsigma }$	$\log {\Vert \sqrt{\rho ^{AB}}\sqrt{I^A\otimes \varsigma ^B}\Vert _1^2}$
$H_2(A\|B)_{\rho \|\varsigma } $	$- \log {\mathrm {Tr}}\bigl [ \bigl ( (\varsigma ^B)^{-1/4} \rho ^{AB} (\varsigma ^B)^{-1/4} \bigr )^2 \bigr ]$
$H_{\mathrm{min}}(A\|B)_{\rho }$	$\sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_{\mathrm{min}}(A\|B)_{\rho \|\varsigma }$
$H_{\mathrm{max}}(A\|B)_{\rho }$	$\sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_{\mathrm{max}}(A\|B)_{\rho \|\varsigma }$
$H_2(A\|B)_{\rho }$	$\sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_2(A\|B)_{\rho \|\varsigma }$
$H_{\mathrm{min}}^\epsilon (A\|B)_{\rho }$	$\sup _{{\hat{\rho }}^{AB} \in {\mathcal {B}}^\epsilon (\rho )}H_{\mathrm{min}}(A\|B)_{{{\hat{\rho }}}}$ for $\rho \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})$
$H_{\mathrm{max}}^\epsilon (A\|B)_{\rho }$	$\inf _{{\hat{\rho }}^{AB} \in {\mathcal {B}}^\epsilon (\rho )}H_{\mathrm{max}}(A\|B)_{{{\hat{\rho }}}}$ for $\rho \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})$

Notations when a Hilbert space ${\mathcal {H}}^A$ is decomposed into $\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}$ (Theorem 1)
$l_j$ and $r_j$	$\dim {{\mathcal {H}}_j^{A_l}}$ and $\dim {{\mathcal {H}}_j^{A_r}}$, respectively
$\Pi _j^A\in {\mathcal {P}}({\mathcal {H}}^A)$	The projection onto ${\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}$
$\Phi _j^l$, $\Phi _j^r$	Maximally entangled states on ${\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{{\bar{A}}_l}$ and ${\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}_j^{{\bar{A}}_r}$ (${\mathcal {H}}_j^{A_l}\cong {\mathcal {H}}_j^{{\bar{A}}_l}$, ${\mathcal {H}}_j^{A_r}\cong {\mathcal {H}}_j^{{\bar{A}}_r}$)
$\Phi ^{AA'}$	Maximally entangled state between A and $A'$:
	${\|\Phi \rangle }^{AA'}=\sum _{j=1}^J\sqrt{l_jr_j/d_A}\|\Phi _j^l\rangle ^{A_lA_l'}\|\Phi _j^r\rangle ^{A_rA_r'}$
$A^*$	A quantum system represented by a Hilbert space
	${\mathcal {H}}^{A^*}:=\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}_j^{{\bar{A}}_r}$ (${\mathcal {H}}_j^{A_r}\cong {\mathcal {H}}_j^{{\bar{A}}_r}$)
$F^{A{\bar{A}}\rightarrow A^*}$	A linear operator from ${\mathcal {H}}^A\otimes {\mathcal {H}}^{{{\bar{A}}}}$ to ${\mathcal {H}}^{A^*}$:
	$F^{A{\bar{A}}\rightarrow A^*}= \bigoplus _{j=1}^J \sqrt{d_Al_j/r_j} \langle \Phi _j^l\|^{A_l{\bar{A}}_l}(\Pi _j^{A} \otimes \Pi _j^{{\bar{A}}})$
${\Lambda }(\Psi ,{\mathcal {T}})$	An unnormalized state on $A^RE$: ${\Lambda }(\Psi ,{\mathcal {T}})=F(\Psi ^{AR}\otimes \tau ^{{\bar{A}}E})F^\dagger \in {\mathcal {P}}({\mathcal {H}}^{A^RE})$
$\Psi _{jk}^{A_lA_rR}$	$\Pi _j^A\Psi ^{AR}\Pi _k\in {\mathcal {L}}({\mathcal {H}}_k^{A_l}\otimes {\mathcal {H}}_k^{A_r}\otimes {\mathcal {H}}^R,{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}^R)$
$\tau _{jk}^{A_lA_rE}$	$\Pi _j^A\tau ^{AE}\Pi _k\in {\mathcal {L}}({\mathcal {H}}_k^{A_l}\otimes {\mathcal {H}}_k^{A_r}\otimes {\mathcal {H}}^E,{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}^E)$
$\pi _j^{A_r}\in {\mathcal {S}}({\mathcal {H}}_j^{A_r})$	The maximally mixed state on ${\mathcal {H}}_j^{A_r}$
$\mathsf{H}_j$	The Haar measure on ${\mathbb {U}}(r_j)$
$\mathsf{H}_\times $	A product measure $\mathsf{H}_1\times \cdots \times \mathsf{H}_J$ on ${\mathbb {U}}(r_1)\times \cdots \times {\mathbb {U}}(r_J)$
$\Psi _{\mathrm{av}}^{AR}$	A subnormalized state on AR: $\Psi _{\mathrm{av}}^{AR}={\mathbb {E}}_{U\sim \mathsf{H}_\times }[{\mathcal {U}}^A(\Psi ^{AR})]$
$\Vert {\mathcal {E}}^{A \rightarrow B} \Vert _{\mathrm{DSP}}$	The DSP-diamond norm of a supermap ${\mathcal {E}}$ from ${\mathcal {L}}({\mathcal {H}}^A)$ to ${\mathcal {L}}({\mathcal {H}}^B)$:
	$\Vert {\mathcal {E}}^{A \rightarrow B} \Vert _{\mathrm{DSP}}=\sup _{C,\,\xi } \{\Vert {\mathcal {E}}^{A \rightarrow B}(\xi ^{AC}) \Vert _1:\xi \in {\mathcal {S}}_\le ({\mathcal {H}}^{AC}),\,\xi ^A=\bigoplus _{j=1}^Jq_j \varpi _j^{A_l}\otimes \pi _j^{A_r}\}$
${\mathcal {B}}_{\mathrm{DSP}}^\epsilon ({\mathcal {E}})$	$\{{\mathcal {E}}'\in {\mathcal {C}}{\mathcal {P}}_=(A\rightarrow B)\,\|\,\Vert {\mathcal {E}}'-{\mathcal {E}}\Vert _{\mathrm{DSP}}\le \epsilon \}$
$ H_{\mathrm{min}}^{\epsilon ,\mu }(A^*\|RE)_{{\Lambda }(\Psi ,{\mathcal {T}})}$	$\sup _{\Psi '\in {\mathcal {B}}^\epsilon (\Psi )}\sup _{{\mathcal {T}}'\in {\mathcal {B}}_{\mathrm{DSP}}^{\mu }({\mathcal {T}})} H_{\mathrm{min}}(A^*\|RE)_{{\Lambda }(\Psi ',{\mathcal {T}}')}$

Notations when $l_j=1$ and $r_j=r$ for $1\le j\le J$ (Theorem 3 and 4 )
$\alpha (J)$	A function that is equal to 0 when $J=1$ and to $1/(J-1)$ if $J\ge 2$
${\mathbb {P}}$	The permutation group on $[1,\ldots ,J]$
$\mathsf P$	The uniform distribution on ${\mathbb {P}}$
$G_\sigma $	A unitary in ${\mathcal {H}}^A$: $G_\sigma =\sum _{j=1}^J{{\|\sigma (j)\rangle }\!{\langle j\|}}^{A_c} \otimes I^{A_r}$ for any $\sigma \in {{\mathbb {P}}}$
${\mathcal {C}}$	The completely dephasing operation on $A_c$ with respect to the basis $\{{\|j\rangle }\}_{j=1}^J$
$\Psi _{\mathrm{dp}}^{AR}$	A normalized state on AR: $\Psi _{\mathrm{dp}}^{AR}={\mathcal {C}}(\Psi ^{AR})$
$\pi ^{A_r}$	The maximally mixed state on ${\mathcal {H}}^{A_r}$

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wakakuwa, E., Nakata, Y. One-Shot Randomized and Nonrandomized Partial Decoupling. Commun. Math. Phys. 386, 589–649 (2021). https://doi.org/10.1007/s00220-021-04136-5

Download citation

Received: 27 May 2019
Accepted: 03 June 2021
Published: 16 July 2021
Issue Date: September 2021
DOI: https://doi.org/10.1007/s00220-021-04136-5

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

General notation
\({\mathcal {L}}({\mathcal {H}})\)	The set of linear operators on \({\mathcal {H}}\)
\({\mathcal {L}}({\mathcal {H}}^A,{\mathcal {H}}^B)\)	The set of linear operators from \({\mathcal {H}}^A\) to \({\mathcal {H}}^B\)
\(\mathrm{Her}({\mathcal {H}})\)	\(\{\rho \in {\mathcal {L}}({\mathcal {H}}) : \rho = \rho ^\dagger \}\)
\({\mathcal {P}}({\mathcal {H}})\)	\(\{\rho \in \mathrm{Her}({\mathcal {H}}) : \rho \ge 0 \}\)
\({\mathcal {S}}_\le ({\mathcal {H}})\)	\(\{\rho \in {\mathcal {P}}({\mathcal {H}}) : {\mathrm {Tr}}[\rho ] \le 1 \}\)
\({\mathcal {S}}_=({\mathcal {H}})\)	\(\{\rho \in {\mathcal {P}}({\mathcal {H}}) : {\mathrm {Tr}}[\rho ]=1 \}\)
\({\mathcal {C}}{\mathcal {P}}(A\rightarrow B)\)	The set of CP maps from A to B
\({\mathcal {C}}{\mathcal {P}}_\le (A\rightarrow B)\)	The set of trace non-increasing CP maps from A to B
\({\mathcal {C}}{\mathcal {P}}_=(A\rightarrow B)\)	The set of trace preserving CP maps from A to B
\(\Psi ^{AR}\)	A subnormalized (resp. normalized) state on AR in Theorem 1 and 3 (resp. Theorem 4)
\({\mathcal {T}}^{A\rightarrow E}\)	A completely-positive superoperator from \({\mathcal {L}}({\mathcal {H}}^A)\) to \({\mathcal {L}}({\mathcal {H}}^B)\) (trace-preserving in Theorem 4)
\({\mathcal {T}}^{A\rightarrow B}\)	A complementary superoperator of \({\mathcal {T}}^{A\rightarrow E}\)
\(\Phi ^{AA'}\)	Maximally entangled state between A and \(A'\) (\({\mathcal {H}}^A\cong {\mathcal {H}}^{A'}\))
\(\tau ^{AE}\), \(\tau ^{AB}\)	The Choi–Jamiołkowski state of \({\mathcal {T}}^{A\rightarrow E}\) and \({\mathcal {T}}^{A\rightarrow B}\):
	\(\tau ^{AE}={\mathcal {T}}^{A'\rightarrow E}(\Phi ^{AA'})\), \(\tau ^{AB}={\mathcal {T}}^{A'\rightarrow B}(\Phi ^{AA'})\)
\({\mathbb {U}}(d)\)	Unitary group of degree d

Norms and distances
\(\Vert X\Vert _1\)	The trace norm of a linear operator X: \(\Vert X\Vert _1=\mathrm{Tr}[\sqrt{XX^\dagger }]\)
\(\Vert X\Vert _2\)	The Hilbert–Schmidt norm of a linear operator X: \(\Vert X\Vert _2=\sqrt{\mathrm{Tr}[XX^\dagger ]}\)
\(\|\!\| X^{VW} \|\!\|_{2,\varsigma ^W}\)	\(\|\!\| (\varsigma ^W)^{-1/4} X^{VW} (\varsigma ^W)^{-1/4} \|\!\|_{2}\) for \(\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^W)\)
\({\bar{F}}(\rho ,\rho ')\)	Generalized fidelity between subnormalized states \(\rho ,\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})\):
	\({\bar{F}}(\rho ,\rho ')=\Vert \sqrt{\rho }\sqrt{\rho '}\Vert _1+\sqrt{(1-\mathrm{Tr}[\rho ])(1-\mathrm{Tr}[\rho '])}\)
\(P(\rho ,\rho ')\)	Purified distance between subnormalized states \(\rho ,\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})\): \(P(\rho ,\rho ')=\sqrt{1-{\bar{F}}(\rho ,\rho ')^2}\)
\({\mathcal {B}}^\epsilon (\rho )\)	The \(\epsilon \)-ball of a subnormalized state \(\rho \): \({\mathcal {B}}^\epsilon (\rho )=\{\rho '\in {\mathcal {S}}_\le ({\mathcal {H}})\|\,P(\rho ,\rho ')\le \epsilon \}\)

Conditional Entropies for \(\rho \in {\mathcal {P}}({\mathcal {H}}^{AB})\) and \(\varsigma \in {\mathcal {S}}_=({\mathcal {H}}^B)\)
\(H_{\mathrm{min}}(A\|B)_{\rho \|\varsigma } \)	\(\sup \{ \lambda \in {\mathbb {R}}\| 2^{-\lambda } I^A \otimes \varsigma ^B \ge \rho ^{AB} \}\)
\(H_{\mathrm{max}}(A\|B)_{\rho \|\varsigma }\)	\(\log {\Vert \sqrt{\rho ^{AB}}\sqrt{I^A\otimes \varsigma ^B}\Vert _1^2}\)
\(H_2(A\|B)_{\rho \|\varsigma } \)	\(- \log {\mathrm {Tr}}\bigl [ \bigl ( (\varsigma ^B)^{-1/4} \rho ^{AB} (\varsigma ^B)^{-1/4} \bigr )^2 \bigr ]\)
\(H_{\mathrm{min}}(A\|B)_{\rho }\)	\(\sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_{\mathrm{min}}(A\|B)_{\rho \|\varsigma }\)
\(H_{\mathrm{max}}(A\|B)_{\rho }\)	\(\sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_{\mathrm{max}}(A\|B)_{\rho \|\varsigma }\)
\(H_2(A\|B)_{\rho }\)	\(\sup _{\varsigma ^B \in {\mathcal {S}}_=({\mathcal {H}}^B)}H_2(A\|B)_{\rho \|\varsigma }\)
\(H_{\mathrm{min}}^\epsilon (A\|B)_{\rho }\)	\(\sup _{{\hat{\rho }}^{AB} \in {\mathcal {B}}^\epsilon (\rho )}H_{\mathrm{min}}(A\|B)_{{{\hat{\rho }}}}\) for \(\rho \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})\)
\(H_{\mathrm{max}}^\epsilon (A\|B)_{\rho }\)	\(\inf _{{\hat{\rho }}^{AB} \in {\mathcal {B}}^\epsilon (\rho )}H_{\mathrm{max}}(A\|B)_{{{\hat{\rho }}}}\) for \(\rho \in {\mathcal {S}}_\le ({\mathcal {H}}^{AB})\)

Notations when a Hilbert space \({\mathcal {H}}^A\) is decomposed into \(\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}\) (Theorem 1)
\(l_j\) and \(r_j\)	\(\dim {{\mathcal {H}}_j^{A_l}}\) and \(\dim {{\mathcal {H}}_j^{A_r}}\), respectively
\(\Pi _j^A\in {\mathcal {P}}({\mathcal {H}}^A)\)	The projection onto \({\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}\)
\(\Phi _j^l\), \(\Phi _j^r\)	Maximally entangled states on \({\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{{\bar{A}}_l}\) and \({\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}_j^{{\bar{A}}_r}\) (\({\mathcal {H}}_j^{A_l}\cong {\mathcal {H}}_j^{{\bar{A}}_l}\), \({\mathcal {H}}_j^{A_r}\cong {\mathcal {H}}_j^{{\bar{A}}_r}\))
\(\Phi ^{AA'}\)	Maximally entangled state between A and \(A'\):
	\({\|\Phi \rangle }^{AA'}=\sum _{j=1}^J\sqrt{l_jr_j/d_A}\|\Phi _j^l\rangle ^{A_lA_l'}\|\Phi _j^r\rangle ^{A_rA_r'}\)
\(A^*\)	A quantum system represented by a Hilbert space
	\({\mathcal {H}}^{A^*}:=\bigoplus _{j=1}^J{\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}_j^{{\bar{A}}_r}\) (\({\mathcal {H}}_j^{A_r}\cong {\mathcal {H}}_j^{{\bar{A}}_r}\))
\(F^{A{\bar{A}}\rightarrow A^*}\)	A linear operator from \({\mathcal {H}}^A\otimes {\mathcal {H}}^{{{\bar{A}}}}\) to \({\mathcal {H}}^{A^*}\):
	\(F^{A{\bar{A}}\rightarrow A^*}= \bigoplus _{j=1}^J \sqrt{d_Al_j/r_j} \langle \Phi _j^l\|^{A_l{\bar{A}}_l}(\Pi _j^{A} \otimes \Pi _j^{{\bar{A}}})\)
\({\Lambda }(\Psi ,{\mathcal {T}})\)	An unnormalized state on \(A^RE\): \({\Lambda }(\Psi ,{\mathcal {T}})=F(\Psi ^{AR}\otimes \tau ^{{\bar{A}}E})F^\dagger \in {\mathcal {P}}({\mathcal {H}}^{A^RE})\)
\(\Psi _{jk}^{A_lA_rR}\)	\(\Pi _j^A\Psi ^{AR}\Pi _k\in {\mathcal {L}}({\mathcal {H}}_k^{A_l}\otimes {\mathcal {H}}_k^{A_r}\otimes {\mathcal {H}}^R,{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}^R)\)
\(\tau _{jk}^{A_lA_rE}\)	\(\Pi _j^A\tau ^{AE}\Pi _k\in {\mathcal {L}}({\mathcal {H}}_k^{A_l}\otimes {\mathcal {H}}_k^{A_r}\otimes {\mathcal {H}}^E,{\mathcal {H}}_j^{A_l}\otimes {\mathcal {H}}_j^{A_r}\otimes {\mathcal {H}}^E)\)
\(\pi _j^{A_r}\in {\mathcal {S}}({\mathcal {H}}_j^{A_r})\)	The maximally mixed state on \({\mathcal {H}}_j^{A_r}\)
\(\mathsf{H}_j\)	The Haar measure on \({\mathbb {U}}(r_j)\)
\(\mathsf{H}_\times \)	A product measure \(\mathsf{H}_1\times \cdots \times \mathsf{H}_J\) on \({\mathbb {U}}(r_1)\times \cdots \times {\mathbb {U}}(r_J)\)
\(\Psi _{\mathrm{av}}^{AR}\)	A subnormalized state on AR: \(\Psi _{\mathrm{av}}^{AR}={\mathbb {E}}_{U\sim \mathsf{H}_\times }[{\mathcal {U}}^A(\Psi ^{AR})]\)
\(\Vert {\mathcal {E}}^{A \rightarrow B} \Vert _{\mathrm{DSP}}\)	The DSP-diamond norm of a supermap \({\mathcal {E}}\) from \({\mathcal {L}}({\mathcal {H}}^A)\) to \({\mathcal {L}}({\mathcal {H}}^B)\):
	\(\Vert {\mathcal {E}}^{A \rightarrow B} \Vert _{\mathrm{DSP}}=\sup _{C,\,\xi } \{\Vert {\mathcal {E}}^{A \rightarrow B}(\xi ^{AC}) \Vert _1:\xi \in {\mathcal {S}}_\le ({\mathcal {H}}^{AC}),\,\xi ^A=\bigoplus _{j=1}^Jq_j \varpi _j^{A_l}\otimes \pi _j^{A_r}\}\)
\({\mathcal {B}}_{\mathrm{DSP}}^\epsilon ({\mathcal {E}})\)	\(\{{\mathcal {E}}'\in {\mathcal {C}}{\mathcal {P}}_=(A\rightarrow B)\,\|\,\Vert {\mathcal {E}}'-{\mathcal {E}}\Vert _{\mathrm{DSP}}\le \epsilon \}\)
\( H_{\mathrm{min}}^{\epsilon ,\mu }(A^*\|RE)_{{\Lambda }(\Psi ,{\mathcal {T}})}\)	\(\sup _{\Psi '\in {\mathcal {B}}^\epsilon (\Psi )}\sup _{{\mathcal {T}}'\in {\mathcal {B}}_{\mathrm{DSP}}^{\mu }({\mathcal {T}})} H_{\mathrm{min}}(A^*\|RE)_{{\Lambda }(\Psi ',{\mathcal {T}}')}\)

Notations when \(l_j=1\) and \(r_j=r\) for \(1\le j\le J\) (Theorem 3 and 4 )
\(\alpha (J)\)	A function that is equal to 0 when \(J=1\) and to \(1/(J-1)\) if \(J\ge 2\)
\({\mathbb {P}}\)	The permutation group on \([1,\ldots ,J]\)
\(\mathsf P\)	The uniform distribution on \({\mathbb {P}}\)
\(G_\sigma \)	A unitary in \({\mathcal {H}}^A\): \(G_\sigma =\sum _{j=1}^J{{\|\sigma (j)\rangle }\!{\langle j\|}}^{A_c} \otimes I^{A_r}\) for any \(\sigma \in {{\mathbb {P}}}\)
\({\mathcal {C}}\)	The completely dephasing operation on \(A_c\) with respect to the basis \(\{{\|j\rangle }\}_{j=1}^J\)
\(\Psi _{\mathrm{dp}}^{AR}\)	A normalized state on AR: \(\Psi _{\mathrm{dp}}^{AR}={\mathcal {C}}(\Psi ^{AR})\)
\(\pi ^{A_r}\)	The maximally mixed state on \({\mathcal {H}}^{A_r}\)

One-Shot Randomized and Nonrandomized Partial Decoupling

Abstract

Similar content being viewed by others

Efficient methods for one-shot quantum communication

Decoupling with Random Quantum Circuits

Reliability Function of Quantum Information Decoupling via the Sandwiched Rényi Divergence

1 Introduction

2 Preliminaries

2.1 Notations

2.2 Norms and distances

2.3 One-shot entropies

2.4 Choi–Jamiołkowski representation

2.5 Random unitaries

3 Main Results

3.1 Non-randomized partial decoupling

Theorem 1

3.2 Randomized partial decoupling

Definition 2

Theorem 3

3.3 A converse bound

Theorem 4

3.4 Reduction to the existing results

Corollary 5

Corollary 6

Corollary 7

Corollary 8

4 Implementing the Random Unitary with the DSP Form

5 Structure of the Proof

5.1 Key lemmas and the structure of the proofs

Lemma 9

Lemma 10

Lemma 11

Lemma 12

5.2 List of useful lemmas

5.2.1 Properties of norms and distances

Lemma 13

Lemma 14

Lemma 15

Lemma 16

Lemma 17

Lemma 18

Lemma 19

Lemma 20

5.2.2 Properties of conditional entropies

Lemma 21

Lemma 22

Lemma 23

Lemma 24

Lemma 25

Lemma 26

Lemma 27

Lemma 28

Lemma 29

Lemma 30

Lemma 31

5.2.3 Other technical lemmas

Lemma 32

Lemma 33

Lemma 34

Lemma 35

Lemma 36

6 Proof of the Non-randomized Partial Decoupling (Theorem 1)

6.1 Proof of the non-smoothed non-randomized partial decoupling

6.2 Proof of the smoothed non-randomized partial decoupling

7 Proof of the Randomized Partial Decoupling (Theorem 3)

7.1 Proof of the non-smoothed randomized partial decoupling under WA 1 and WA 2

7.1.1 Upper bound on the average trace norm

Lemma 37

Proof

7.1.2 Generalization of the dequantizing theorem

Lemma 38

Proof

7.1.3 Proof of the non-smoothed randomized partial decoupling

7.2 Proof of the randomized partial decoupling under the conditions WA 1 and WA 2

7.3 Dropping working assumptions WA 1 and WA 2

8 Proof of the Converse

8.1 Proof of Ineq. (142) under WA 1 and WA 2

8.1.1 Proof of Ineq. (148)

8.1.2 Proof of Ineq. (149)

8.1.3 Proof of Ineq. (150)