Operational Restrictions in General Probabilistic Theories

Filippov, Sergey N.; Gudder, Stan; Heinosaari, Teiko; Leppäjärvi, Leevi

doi:10.1007/s10701-020-00352-6

Operational Restrictions in General Probabilistic Theories

Open access
Published: 12 July 2020

Volume 50, pages 850–876, (2020)
Cite this article

Download PDF

You have full access to this open access article

Foundations of Physics Aims and scope Submit manuscript

Operational Restrictions in General Probabilistic Theories

Download PDF

1710 Accesses
12 Citations
2 Altmetric
Explore all metrics

Abstract

The formalism of general probabilistic theories provides a universal paradigm that is suitable for describing various physical systems including classical and quantum ones as particular cases. Contrary to the usual no-restriction hypothesis, the set of accessible meters within a given theory can be limited for different reasons, and this raises a question of what restrictions on meters are operationally relevant. We argue that all operational restrictions must be closed under simulation, where the simulation scheme involves mixing and classical post-processing of meters. We distinguish three classes of such operational restrictions: restrictions on meters originating from restrictions on effects; restrictions on meters that do not restrict the set of effects in any way; and all other restrictions. We fully characterize the first class of restrictions and discuss its connection to convex effect subalgebras. We show that the restrictions belonging to the second class can impose severe physical limitations despite the fact that all effects are accessible, which takes place, e.g., in the unambiguous discrimination of pure quantum states via effectively dichotomic meters. We further demonstrate that there are physically meaningful restrictions that fall into the third class. The presented study of operational restrictions provides a better understanding on how accessible measurements modify general probabilistic theories and quantum theory in particular.

Post-Classical Probability Theory

Hierarchical axioms for quantum mechanics

Article 24 September 2019

Quantum Mechanics as a Theory of Probability

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The framework of general probabilistic theories (GPTs) provides an abstract setting for possible physical theories based on operational principles. Containing not only quantum and classical theories but also countless toy theories in between and beyond, GPTs give us means to study well-known properties of quantum theory (such as measurement incompatibility [1], steering [2, 3], entanglement [4] and no-information-without-disturbance [5]) in a more general setting. This allows us to formulate and examine these properties in different theories, quantify them and even compare different theories to each other based on how these properties behave within them. Many properties that were thought to be special features of quantum theory have actually been shown to be general among all non-classical probabilistic theories, the no-broadcasting theorem being perhaps the most well known example [6].

One of the most long-standing motivations has been to provide a set of physical principles, formulated in the GPT framework, that would lead to an axiomatic derivation of quantum theory. In recent years, followed by the success of quantum information theory, there has been a new boom of such efforts and many information-theoretic axioms have been proposed from which the quantum theory has been successfully derived [7]. In addition to a full physical axiomatization, one can focus on some specific property of interest and study it independently of the underlying theory with the aim of finding something meaningful on the nature of the property itself.

GPTs are based on operational notions of states, effects, measurements, transformations, and composite systems so that by specifying them one fixes the theory. The most important operational principle for describing the state space ${\mathcal {S}}$ of the theory is the statistical mixing of states which then leads to ${\mathcal {S}}$ being a convex subset of a real vector space. As the most simple type of measurements, the effects are then taken to be affine functionals $e: {\mathcal {S}}\rightarrow [0,1]$ that give probabilities on states so that e(s) can then be interpreted as the probability of observing the effect e when the system is measured in state $s \in {\mathcal {S}}$. The affinity of effects is a result of the basic statistical correspondence between states and measurements. A meter that corresponds to a measurement device can then be described as a normalized collection of effects. A meter provides a generalization of the positive operator-valued measure (POVM) in quantum theory.

The assumption of taking all mathematically valid affine functionals that give probabilities on states as physical effects of the theory has been coined as the no-restriction hypothesis [8]. The no-restriction hypothesis is satisfied in both classical and quantum theories, so it is usually accepted in other theories too for the purpose of mathematical convenience. If the no-restriction hypothesis is assumed, then the (single-system) theory is completely determined by the state space alone. However, as it has been pointed out, e.g., in [9], the no-restriction hypothesis has no operational grounds. In fact, it is possible to provide different kinds of consistent restrictions on the set of effects that then give rise to new models and have consequences even on the way the composite systems could be formed [9]. Other works beyond the no-restriction hypothesis are, e.g., [10,11,12].

Interestingly, in the recent work [13] it was shown that the no-restriction hypothesis plays a significant role in the correlations that can be achieved within quantum theory. In particular, it was shown that a set of correlations that is close to the set of quantum correlations, called the almost-quantum correlations, violate the no-restriction hypothesis. This means that no GPT with the no-restriction hypothesis is able to reproduce the almost-quantum correlations. Therefore, the no-restriction hypothesis may be a crucial part of singling out the quantum correlations from other non-signalling theories.

Even if we restrict to the quantum theory, there is also a practical motivation to investigate restrictions on meters and their consequences. For example, conventional measurement schemes for superconducting qubits and polarized photons perform dichotomic measurements in the computational basis or the rotated computational basis [14]. Measurements with more than two outcomes cannot be directly implemented for such two-level systems. To obtain more than two outcomes one usually resorts to mixing and post-processing dichotomic observables instead. Therefore, only effectively dichotomic observables are available in conventional quantum experimental setups with no entanglement between the system and an ancilla. The use of the ancilla enables one to perform measurements with a greater number of outcomes, the number of measurement outcomes depending on the dimension of the ancillary system. Moreover, even the dichotomic measurements are never perfectly projective [14, 15], which imposes a restriction on the noise content of accessible meters. Another example of practical restrictions is that the effects for fermionic systems are not arbitrary and must satisfy the parity superselection rule [16].

In the current work we consider restrictions not only at the level of effects but also on the level of meters, and we show that the previously studied effect restrictions are not enough to capture all operationally valid restrictions. We propose an operational condition that any restriction on meters should satisfy, namely the simulation closedness criterion. In accordance with the operational interpretation of GPTs, for a given set of meters there are two classical operations one can always implement that will lead to some outcome statistics differing from those of any other meter that may be used. In particular, similarly to mixing states, one can choose to mix meters, and after the measurement it is possible to post-process the obtained outcomes. The scheme consisting of both mixing and post-processing of meters, called the measurement simulability, has been previously studied in [17,18,19]. Our operational condition of simulation closedness for meters then states that given a set of allowed meters as a restriction, also all meters that can be obtained by the simulation scheme from the allowed ones should be included in the restriction as well. A violation of this condition would mean that some classical procedure consisting of mixing and post-processing of outcomes is not allowed, and that would therefore be a weird and unphysical restriction.

We show that the introduced operational restrictions can be divided into three disjoint classes: (R1) restrictions on meters that are dictated by the restrictions on effects, (R2) restrictions on meters that do not restrict the effects in any way, and (R3) restrictions on meters that cannot be reproduced by any restriction solely on effects, but nevertheless restrict the set of effects as well. We demonstrate these restrictions in quantum theory.

Our investigation is organized as follows. A brief overview of the relevant concepts is given in Sect. 2. In Sect. 3 we introduce the classification of operational restrictions into three disjoint classes (R1)–(R3). In Sect. 4 we characterize those effect restrictions that give simulation closed restrictions of type (R1) and examine convex effect algebras and their subalgebras and see how they are related to these restrictions. Effectively n-tomic theories are presented as a class of restrictions of type (R2) and they are examined in Sect. 5. In Sect. 6 we give examples of restrictions that belong to (R3). Finally, in Sect. 7 we summarize our investigation.

2 Preliminaries

2.1 States, Effects, Meters

We start by recalling the ordered vector space formulation of GPTs (for more details see, e.g., [20]). The state space ${\mathcal {S}}$ of a GPT is a compact convex subset of a finite-dimensional real vector space V. Whereas compactness and the finite-dimensionality of the state space are merely technical assumptions, the convexity follows from the possible statistical mixing of the states: if we can prepare our system in states $s_1 \in {\mathcal {S}}$ or $s_2 \in {\mathcal {S}}$, by fixing some $p \in [0,1]$ we can choose to use state $s_1$ with probability p and state $s_2$ with probability $1-p$ in each round of the experiment so that $p s_1 + (1-p) s_2$ must be a valid state in ${\mathcal {S}}$.

If $\dim (\mathrm {aff}({\mathcal {S}})) =d$, then V can be chosen to be $(d+1)$-dimensional and ${\mathcal {S}}$ forms a compact base for a closed generating proper cone $V_+$^{Footnote 1}. The cone $V_+$ defines a partial order in V in the usual way; we denote $v \le w$ (or $v \le _{V_+} w$ if we want to explicitly write the cone to avoid confusion) if $w-v \in V_+$. Thus, $V_+$ consists of all of the positive elements induced by this order. As a base of $V_+$, the state space ${\mathcal {S}}$ can be expressed in terms of a strictly positive functional $u \in V^*$ as

$$\begin{aligned} {\mathcal {S}}= \{ s \in V \, | \, s \ge 0, \ u(s)=1 \}. \end{aligned}$$

(1)

The effect space ${\mathcal {E}}({\mathcal {S}})$ consists of affine functionals $e: {\mathcal {S}}\rightarrow [0,1]$ giving probabilities on states: we interpret e(s) as the probability that the effect e is observed when the system is measured in state $s \in {\mathcal {S}}$. Affinity of effects is a result of them respecting the basic statistical correspondence of states and effects:

$$\begin{aligned} e(p s_1 +(1-p)s_2) = p e(s_1) +(1-p) e(s_2) \end{aligned}$$

(2)

for all $p \in [0,1]$, $s_1,s_2 \in {\mathcal {S}}$ and $e \in {\mathcal {E}}({\mathcal {S}})$.

In the ordered vector space formulation we can express the effect space as ${\mathcal {E}}({\mathcal {S}}) = V^*_+ \cap (u- V^*_+)$, where $V^*_+$ is the (closed generating proper) positive dual cone^{Footnote 2} of $V_+$ in the dual space $V^*$ and u is the unit effect in $V^*_+$. Explicitly,

$$\begin{aligned} {\mathcal {E}}({\mathcal {S}}) = \{ e \in V^* \, | \, o \le e \le u \}, \end{aligned}$$

(3)

where o is the zero effect that gives value 0 for every state and where the partial order is now the dual order induced by the dual cone $V^*_+$.

An effect $f \in {\mathcal {E}}({\mathcal {S}}) \subset V^*$, $f \ne o$, is called indecomposable if whenever a decomposition $f= f_1 +f_2$ of f into a sum of some other nonzero effects $f_1, f_2 \in {\mathcal {E}}({\mathcal {S}})$ implies that $f= \alpha _1 f_1 = \alpha _2 f_2$ for some $\alpha _1, \alpha _2 >0$. The indecomposable effects are precisely the effects lying on the extreme rays of the dual cone $V^*_+$. It was shown in [21] that every effect can be decomposed into a sum of some indecomposable effects and that indecomposable extreme effects exist in all GPTs.

For an effect $f \in {\mathcal {E}}({\mathcal {S}})$, we denote by $\lambda _{\min }(f)$ and $\lambda _{\max }(f)$ its smallest and largest values on ${\mathcal {S}}$, i.e., $\lambda _{\min }(f) = \inf _{s \in {\mathcal {S}}} f(s)$ and $\lambda _{\max }(f) = \sup _{s \in {\mathcal {S}}} f(s)$. We note that these are attained because ${\mathcal {S}}$ is compact and f is continuous.

A meter ${\mathsf {A}}$ with n outcomes is a mapping ${\mathsf {A}}: x \rightarrow {\mathsf {A}}_x$ from an outcome set $\Omega _{\mathsf {A}}= \{1, \ldots , n\} \subset {\mathbb {N}}$ to the set of effects ${\mathcal {E}}({\mathcal {S}})$ such that the normalization condition $\sum _{x \in \Omega _{\mathsf {A}}} {\mathsf {A}}_x = u$ is satisfied. Thus, the set $\Omega _{\mathsf {A}}$ includes all the possible outcomes of the experiment where the meter ${\mathsf {A}}$ is used, the normalization condition guarantees that some outcome is registered, and ${\mathsf {A}}_x(s)$ can be then interpreted as the probability that outcome $x \in \Omega _{\mathsf {A}}$ was observed when the systems was in the state $s \in {\mathcal {S}}$ and meter ${\mathsf {A}}$ was used to measure the system. We denote the set of meters on ${\mathcal {S}}$ as ${\mathcal {M}}({\mathcal {S}})$, or simply as ${\mathcal {M}}$ if the state space is understood from the context.

For the purpose of this work it is worth noting that when we presented the usual definition of the effect space, no further restrictions on its elements was given. This means that all mathematically valid functionals (i.e., affine functionals that give probabilities on states) are also considered to be valid physical effects in the theory. This assumption is commonly called the no-restriction hypothesis. In this work we give operationally justifiable restrictions that we pose on the unrestricted set of effects/meters but unless otherwise stated, the underlying set of effects of the theory is taken to be unrestricted.

Example 1

In finite-dimensional quantum theory, the state space ${\mathcal {S}}({\mathcal {H}})$ consists of positive trace-1 operators on a finite-dimensional Hilbert space ${\mathcal {H}}$, i.e.,

$$\begin{aligned} {\mathcal {S}}({\mathcal {H}}) := \{ \varrho \in {\mathcal {L}}_s({\mathcal {H}})\, | \, \varrho \ge O, \ \mathrm{tr}\left[ {\varrho }\right] = 1\}, \end{aligned}$$

(4)

where O is the zero operator on ${\mathcal {H}}$, ${\mathcal {L}}_s({\mathcal {H}})$ denotes the real vector space of self-adjoint operators on ${\mathcal {H}}$ and the order is induced by the cone of positive-semidefinite operators on ${\mathcal {H}}$, i.e., $A \ge O$ if and only if $\left\langle \,{\varphi }\,|\,{A \varphi }\,\right\rangle \ge 0$ for all $\varphi \in {\mathcal {H}}$.

The effect space ${\mathcal {E}}({\mathcal {S}}({\mathcal {H}}))$ can be shown to be isomorphic to the set of selfadjoint operators between the zero operator O and the identity operator I i.e.,

$$\begin{aligned} {\mathcal {E}}({\mathcal {S}}({\mathcal {H}})) \cong {\mathcal {E}}({\mathcal {H}}):= \{ E \in {\mathcal {L}}_s({\mathcal {H}})\, | \, O \le E \le I \}, \end{aligned}$$

(5)

where naturally the zero operator O corresponds to the zero functional o and the identity operator I corresponds to the unit effect u.

Each meter on ${\mathcal {S}}({\mathcal {H}})$ with a finite number of outcomes can be associated with a positive operator-valued measure (POVM) ${\mathsf {A}}: x \rightarrow {\mathsf {A}}(x)$ from a finite outcome set $\Omega _{\mathsf {A}}$ to the set of effects ${\mathcal {E}}({\mathcal {H}})$ such that $\sum _{x \in \Omega _{\mathsf {A}}} {\mathsf {A}}(x) = I $.

2.2 Simulation of Meters

Given a set of measurement devices (meters) one can always choose to do some classical manipulations with the measurement data outputted by the devices. For instance, one can consider if it is possible to construct some new meters by classically manipulating the pre-existing meters and their measurement data. These type of considerations have led to the concept of measurement simulability and have been studied in [17,18,19, 22].

By classical manipulations we mean mixing the meters and/or post-processing the outcomes of the meters: If we have meters ${\mathsf {B}}^{(1)}, \ldots , {\mathsf {B}}^{(m)}$ we can assign to them probabilities $p_1, \ldots , p_m$ of using the different meters in each round of the measurement process so that we obtain a mixed meter ${\mathsf {B}}= \sum _i p_i {\mathsf {B}}^{(i)}$. In addition to mixing, we can classically post-process the measurement outcomes of any ${\mathsf {B}}^{(i)}$ by assigning a stochastic post-processing matrix $\nu ^{(i)} = (\nu _{xy}^{(i)})_{x \in \Omega _{{\mathsf {B}}^{(i)}}, y \in \Omega _{{\mathsf {A}}^{(i)}}}$ to each ${\mathsf {B}}^{(i)}$, where $\Omega _{{\mathsf {B}}^{(i)}}$ is the outcome set of the pre-existing meter ${\mathsf {B}}^{(i)}$ and $\Omega _{{\mathsf {A}}^{(i)}}$ is some other outcome set such that $\nu ^{(i)}_{xy} \ge 0$ and $\sum _{y \in \Omega _{{\mathsf {A}}^{(i)}}} \nu ^{(i)}_{xy} =1$ for all $x \in \Omega _{{\mathsf {B}}^{(i)}}$, $y \in \Omega _{{\mathsf {A}}^{(i)}}$. We can use $\nu ^{(i)}$ to define a new meter ${\mathsf {A}}^{(i)} = \nu ^{(i)} \circ {\mathsf {B}}^{(i)}$ with outcome set $\Omega _{\mathsf {A}^{(i)}}$ by setting ${\mathsf {A}}^{(i)}_y = \sum _{x \in \Omega _{{\mathsf {B}}^{(i)}}} \nu ^{(i)}_{xy} {\mathsf {B}}^{(i)}_x$ for all $y \in \Omega _{{\mathsf {A}}^{(i)}}$. Here the matrix element $\nu ^{(i)}_{xy}$ can thus be interpreted as the transition probability that the outcome x is mapped into outcome y.

By combining both mixing and post-processing we get the simulation scheme which results in a new meter ${\mathsf {A}}$ defined by

$$\begin{aligned} {\mathsf {A}}_y = \sum _{i=1}^m p_i (\nu ^{(i)} \circ {\mathsf {B}}^{(i)})_y = \sum _{i=1}^m \sum _{x \in \Omega _{\mathsf {B}}} p_i \nu ^{(i)}_{xy} {\mathsf {B}}^{(i)}_x, \end{aligned}$$

(6)

for all $y \in \Omega _{\mathsf {A}}$, where we have set all the outcome sets $\Omega _{{\mathsf {B}}^{(i)}}$ equal, and denoted the resulting outcome set $\Omega _{\mathsf {B}}$, by adding zero outcomes if needed, and similarly for $\Omega _{\mathsf {A}}$.

We denote the set of meters obtained from the meters ${\mathsf {B}}^{(1)}, \ldots , {\mathsf {B}}^{(m)}$ by this simulation scheme with some probability distribution $(p_i)_i$ and post-processings $\nu ^{(i)}$ by ${\mathfrak {sim}}(\{ {\mathsf {B}}^{(1)}, \ldots , {\mathsf {B}}^{(m)} \})$. If we have a (possibly infinite) set of meters ${\mathcal {B}}$, we denote by ${\mathfrak {sim}}({\mathcal {B}})$ the set of meters that can be simulated by using some finite subset of ${\mathcal {B}}$, and call meters in ${\mathcal {B}}$ as simulators. One can show that ${\mathfrak {sim}}({\mathcal {B}})$ is closed both under post-processing and mixing, i.e., ${\mathfrak {sim}}({\mathcal {B}})$ is convex and $\nu \circ {\mathsf {B}}\in {\mathfrak {sim}}({\mathcal {B}})$ for any post-processing $\nu $ and meter ${\mathsf {B}}\in {\mathfrak {sim}}({\mathcal {B}})$.

Being considered as a mapping on the power set $2^{\mathcal {M}}$, the simulation map ${\mathfrak {sim}}(\cdot )$ can be shown to be a closure operator so that it satisfies the following three properties for all subsets ${\mathcal {B}}, {\mathcal {C}} \subseteq {\mathcal {M}}$:

(SIM1)
${\mathcal {B}} \subseteq {\mathfrak {sim}}({\mathcal {B}})$
(SIM2)
${\mathfrak {sim}}({\mathfrak {sim}}({\mathcal {B}})) = {\mathfrak {sim}}({\mathcal {B}})$
(SIM3)
${\mathcal {B}} \subseteq {\mathcal {C}} \Rightarrow {\mathfrak {sim}}({\mathcal {B}}) \subseteq {\mathfrak {sim}}({\mathcal {C}})$

We call a subset of meters ${\mathcal {B}}$simulation closed if the equality holds in (SIM1), i.e., ${\mathfrak {sim}}({\mathcal {B}}) ={\mathcal {B}}$. By the property (SIM2) we see that ${\mathfrak {sim}}({\mathcal {B}})$ is simulation closed for any ${\mathcal {B}} \subseteq {\mathcal {M}}$. Simulation closed sets have some basic properties. In particular, if ${\mathcal {B}}_i$, $i\in I$, are simulation closed sets, then also $\bigcap _{i \in I} {\mathcal {B}}_i$ is simulation closed.

3 Three Types of Operational Restrictions

In this work, by a restriction we will mean that the allowed or possible meters belong to a subset $\tilde{{\mathcal {M}}}\subset {\mathcal {M}}$. We require the following condition for all restrictions:

(SC)
simulation closedness: ${\mathfrak {sim}}(\tilde{{\mathcal {M}}})=\tilde{{\mathcal {M}}}$

As has been explained above, given a set of meters, we can always choose to mix them or post-process their outcomes so that any meter that can be simulated this way from the pre-existing meters should always be a feasible meter as well. Given a non-simulation closed restriction $\tilde{{\mathcal {M}}}$, we can make it simulation closed by taking its simulation closure ${\mathfrak {sim}}(\tilde{{\mathcal {M}}})$.

We note that simulation closedness implies that all trivial meters are always included in the restriction as they can be post-processed from any meter. By a trivial meter we mean a meter ${\mathsf {T}}$ of the form ${\mathsf {T}}_x = p_x u$ for all $x \in \Omega _{\mathsf {T}}$ for some probability distribution $(p_x)_x$ on $\Omega _{\mathsf {T}}$ so that it does not give any information about the input state. In practice, trivial meters can always be implemented just by ignoring the input state and choosing an outcome according to some fixed probability distribution.

In the following, by a restriction we mean a choice $\tilde{{\mathcal {M}}}\subset {\mathcal {M}}$ that satisfies the condition (SC). We recall that the range of a meter ${\mathsf {A}}$ can be expressed as $\mathrm{ran}\,({\mathsf {A}}) = \{ \sum _{y \in \tilde{\Omega }} {\mathsf {A}}_y \, | \, \tilde{\Omega } \subseteq \Omega _{\mathsf {A}}\}$. We use the following notation.

For a subset $\tilde{{\mathcal {M}}}\subset {\mathcal {M}}$, we denote by ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ the set of all $e\in {\mathcal {E}}$ such that $e\in \mathrm{ran}\,({\mathsf {A}})$ for some ${\mathsf {A}}\in \tilde{{\mathcal {M}}}$.

Given a restriction $\tilde{{\mathcal {M}}}$, the set of possible effects is then ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}$.

We can also consider restrictions on meters induced by some restriction on effects. For this, we also use the following notation:

For a subset $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$, we denote by ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ the set of all ${\mathsf {A}}\in {\mathcal {M}}$ such that $\mathrm{ran}\,({\mathsf {A}}) \subset \tilde{{\mathcal {E}}}$.

As in [9], we impose some consistency conditions for $\tilde{{\mathcal {E}}}$ to generate a restriction ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$:

(E1)
$u \in \tilde{{\mathcal {E}}}$ as it is an essential part of the definition of a meter, and
(E2)
for every $e \in \tilde{{\mathcal {E}}}$, there exists ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ such that $e \in \mathrm{ran}\,({\mathsf {A}})$, i.e., for every physical effect $e \in \tilde{{\mathcal {E}}}$ we must have a way to implement it as a part of some meter.

As previously, (SC) is also required to hold for restrictions ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ given by some effect restriction $\tilde{{\mathcal {E}}}$.

The previous considerations lead to the following classification of measurement restrictions into three disjoint cases. Firstly, we can have

(R1)
$\tilde{{\mathcal {M}}}= {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ for some $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$.

In this case the restriction takes place essentially on the level of effects and the limitations on meters can be seen as a consequence. We show later in Proposition 9 that under the consistency conditions (E1) and (E2), the effect restriction $\tilde{{\mathcal {E}}}$ in ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ is unique. We further emphasize that the set $\tilde{{\mathcal {E}}}$ must be chosen specifically so that ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ is simulation closed. We will show that a necessary and sufficient condition for an effect restriction $\tilde{{\mathcal {E}}}$ that satisfies the conditions (E1) and (E2) to be simulation closed is that $\tilde{{\mathcal {E}}}$ is a convex subset of ${\mathcal {E}}$. In particular, this is the case when $\tilde{{\mathcal {E}}}$ is a convex subalgebra of ${\mathcal {E}}$; this will be discussed in detail in Sect. 4.

Secondly, we can have

(R2)
${\mathcal {E}}_{\tilde{{\mathcal {M}}}}={\mathcal {E}}$ (but $\tilde{{\mathcal {M}}}\ne {\mathcal {M}}$).

In this case, the restriction does not limit the possible effects but only how they compose into meters. A restriction satisfying (R2) cannot satisfy (R1), as ${\mathcal {M}}_{\mathcal {E}}={\mathcal {M}}$ and we are assuming that a restricted set $\tilde{{\mathcal {M}}}$ is a proper subset of ${\mathcal {M}}$. An important class of restrictions of type (R2) are restrictions to effectivelyn-tomic meters [18, 19] and, in fact, any restriction of type (R2) contains effectively dichotomic meters. This class of restrictions is described and studied in Sect. 5.

The third possibility is that the restriction is neither (R1) nor (R2). This means that

(R3)
${\mathcal {E}}_{\tilde{{\mathcal {M}}}}\subset {\mathcal {E}}$ and $\tilde{{\mathcal {M}}}\ne {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ for any $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$.

In this case there are limitations already at the level of effects, but there are also limitations that come visible only at the level of meters. Restrictions of this type will be considered in Sect. 6.

Finally, we note that there can also be other operational requirements that one might want to hold depending on the restriction. One such requirement might be tomographic completeness:

(TC)
tomographic completeness: ${\mathsf {A}}(s_1)={\mathsf {A}}(s_2) \quad \forall {\mathsf {A}}\in \tilde{{\mathcal {M}}}\Rightarrow s_1=s_2$.

This requirement is relevant, e.g., if one starts from a more general framework of convex structures and then needs to justify that the set of states is a convex subset of a real vector space [23]. However, in this work we concentrate on (SC) and we do not study other requirements.

Remark 1

In [9], in addition to (E1) and (E2), also convexity of $\tilde{{\mathcal {E}}}$ along with two other consistency conditions are required to hold:

(E3)
for any two effects $e, f \in \tilde{{\mathcal {E}}}$ such that $e,f \in \mathrm{ran}\,({\mathsf {A}})$ for some physical meter ${\mathsf {A}}$, we must have $e+f \in \tilde{{\mathcal {E}}}$, and
(E4)
the adjoint $T^*$ of a linear state transformation $T: {\mathcal {S}}\rightarrow {\mathcal {S}}$, defined by $[T^*(e)](s) =e(T(s))$ for all states and effects, must give a valid effect for all valid effects, i.e., $T^*(e) \in \tilde{{\mathcal {E}}}$ for all $e \in \tilde{{\mathcal {E}}}$.

In particular, one can show that the effect restrictions considered in [9] induce restrictions on meters that are simulation closed. We see that the condition (E3) is built in the definition of ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$: if $e,f \in \tilde{{\mathcal {E}}}$ such that $e,f \in \mathrm{ran}\,({\mathsf {A}})$ for some physical meter ${\mathsf {A}}$, then according to our definition of physicality, we must have ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ so that in particular $e+f \in \mathrm{ran}\,({\mathsf {A}}) \subset \tilde{{\mathcal {E}}}$.

The point we want to emphasize is that even if we are considering restrictions on meters given by restrictions on effects, we must also consider how our physical effects are connected to our physical meters. In our work this is done by defining ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ and in [9] this is addressed by the condition (E3). Thus, the condition (E3) is different in nature to (E1) and (E2) as it is not expressed only in terms of effects but involves also meters. Regarding (E4), we do not consider state transformations in our current work.

4 Restriction Class (R1) and Convex Effect Algebras

In this section we provide a characterization of restrictions of type (R1). We then consider a more special case of convex effect restrictions, namely the convex effect subalgebras. This type of restriction has been used, e.g., in [11]. We derive a representation theorem for convex effect subalgebras and we also demonstrate that there are physically meaningful (R1) restrictions that do not have the structure of a convex effect subalgebra. The material presented in Sects. 4.2 and 4.3 has some overlap with the recent work [24] of one of the present authors. We include this material to make the present investigation self-contained.

4.1 Characterization of (R1) Restrictions

As was described earlier, we consider restrictions of type (R1) to be induced by a subset $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$ of effects that satisfies the consistency conditions (E1) and (E2) such that ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ describes the physical, restricted set of meters that is simulation closed. We start by showing some simple consequences of the consistency conditions (E1) and (E2) which will be seen useful later.

Lemma 1

Let$\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$be a restriction on effects such that consistency conditions (E1) and (E2) are satisfied. Then

(a)
$o \in \tilde{{\mathcal {E}}}$,
(b)
for each$e \in \tilde{{\mathcal {E}}}$also the complement effect$u-e \in \tilde{{\mathcal {E}}}$.

Proof

(a)
By (E1) and (E2) there exists a meter ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ such that $u \in \mathrm{ran}\,({\mathsf {A}})$ and since $o \in \mathrm{ran}\,({\mathsf {B}})$ for any meter ${\mathsf {B}}\in {\mathcal {M}}$, we must have from the definition of ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ that $o \in \mathrm{ran}\,({\mathsf {A}}) \subset \tilde{{\mathcal {E}}}$.
(b)
By (E2) for any $e \in \tilde{{\mathcal {E}}}$, there exists a meter ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ such that $e \in \mathrm{ran}\,({\mathsf {A}})$. Since $u-e \in \mathrm{ran}\,({\mathsf {A}})$, we have from the definition of ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ that $u-e \in \tilde{{\mathcal {E}}}$.$\square $

We can now give a complete characterization of effect restrictions $\tilde{{\mathcal {E}}}$ that give rise to restrictions of type (R1).

Theorem 1

Let$\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$be a restriction on effects such that consistency conditions (E1) and (E2) are satisfied. Then${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$is simulation closed if and only if$\tilde{{\mathcal {E}}}$is convex.

Proof

Let first ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ be simulation closed. If $e, f \in \tilde{{\mathcal {E}}}$ then from (E2) it follows that there exist ${\mathsf {A}},{\mathsf {B}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ such that $e \in \mathrm{ran}\,({\mathsf {A}})$ and $f \in \mathrm{ran}\,({\mathsf {B}})$. In fact, we have that $u-e \in \mathrm{ran}\,({\mathsf {A}})\subset \tilde{{\mathcal {E}}}$ and $u-f \in \mathrm{ran}\,({\mathsf {B}})\subset \tilde{{\mathcal {E}}}$ so that if we define two dichotomic meters ${\mathsf {E}}$ and ${\mathsf {F}}$ with effect $e, u-e$ and $f, u-f$ respectively, then ${\mathsf {E}}, {\mathsf {F}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$. Now from (SC) it follows that $t {\mathsf {E}}+(1-t) {\mathsf {F}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ for any $t \in [0,1]$ so that $t e +(1-t)f \in \mathrm{ran}\,(t {\mathsf {E}}+(1-t) {\mathsf {F}}) \subset \tilde{{\mathcal {E}}}$. Thus, $\tilde{{\mathcal {E}}}$ is convex.

Let now $\tilde{{\mathcal {E}}}$ be convex. Let ${\mathsf {A}}\in {\mathfrak {sim}}({\mathcal {M}}_{\tilde{{\mathcal {E}}}})$ so that there exist meters $\{{\mathsf {B}}^{(i)}\}_i \subset {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$, post-processings $\nu ^{(i)}: \Omega _{\mathsf {B}}\rightarrow \Omega _{\mathsf {A}}$ and a probability distribution $(p_i)_i$ such that ${\mathsf {A}}= \sum _i p_i (\nu ^{(i)} \circ {\mathsf {B}}^{(i)})$. We need to show that ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$, i.e., that $\mathrm{ran}\,({\mathsf {A}}) \subset \tilde{{\mathcal {E}}}$. Since $\mathrm{ran}\,({\mathsf {A}}) = \{ \sum _{y \in \tilde{\Omega }} {\mathsf {A}}_y \, | \, \tilde{\Omega } \subseteq \Omega _{\mathsf {A}}\}$, we take $\tilde{\Omega } \subseteq \Omega _{\mathsf {A}}$ and consider the effect

$$\begin{aligned} \sum _{y \in \tilde{\Omega }} {\mathsf {A}}_y = \sum _{y \in \tilde{\Omega }} \sum _i \sum _{x \in \Omega _{\mathsf {B}}} p_i \nu ^{(i)}_{xy} {\mathsf {B}}^{(i)}_x =\sum _i p_i \left[ \sum _{x \in \Omega _{\mathsf {B}}} \left( \sum _{y \in \tilde{\Omega }} \nu ^{(i)}_{xy} \right) {\mathsf {B}}^{(i)}_x \right] . \end{aligned}$$

(7)

Let us denote $\tilde{\nu }^{(i)}_x := \sum _{y \in \tilde{\Omega }} \nu ^{(i)}_{xy} \in [0,1]$ so that

$$\begin{aligned} \sum _{y \in \tilde{\Omega }} {\mathsf {A}}_y = \sum _i p_i \left( \sum _{x \in \Omega _{\mathsf {B}}} \tilde{\nu }^{(i)}_{x} {\mathsf {B}}^{(i)}_x \right) . \end{aligned}$$

(8)

From the convexity of $\tilde{{\mathcal {E}}}$ we see that if $\sum _{x \in \Omega _{\mathsf {B}}}\tilde{\nu }^{(i)}_{x} {\mathsf {B}}^{(i)}_x \in \tilde{{\mathcal {E}}}$ for all i, then $\sum _{y \in \tilde{\Omega }} {\mathsf {A}}_y \in \tilde{{\mathcal {E}}}$ which would prove the claim. Thus, we will fix i and focus on $\sum _{x \in \Omega _{\mathsf {B}}}\tilde{\nu }^{(i)}_{x} {\mathsf {B}}^{(i)}_x$ and show that it is contained in $\tilde{{\mathcal {E}}}$.

Since $\Omega _{\mathsf {B}}= \{1, \ldots , n\}$ for some $n \in {\mathbb {N}}$, we can rename the effects of ${\mathsf {B}}^{(i)}$ such that $\tilde{\nu }^{(i)}_x \le \tilde{\nu }^{(i)}_{x'}$ for $x < x'$. If we set $\tilde{\nu }^{(i)}_0 = 0$, one can confirm that

$$\begin{aligned} \sum _{x=1}^n \tilde{\nu }^{(i)}_x {\mathsf {B}}^{(i)}_x = \sum _{k=1}^n \left[ \left( \tilde{\nu }^{(i)}_{k} - \tilde{\nu }^{(i)}_{k-1} \right) \sum _{x=k}^n {\mathsf {B}}^{(i)}_x \right] . \end{aligned}$$

(9)

One sees that $\sum _{x=k}^n {\mathsf {B}}^{(i)}_x \in \mathrm{ran}\,({\mathsf {B}}^{(i)}) \subset \tilde{{\mathcal {E}}}$ and that $\tilde{\nu }^{(i)}_{k} - \tilde{\nu }^{(i)}_{k-1} \ge 0$ for all $k \in \{1, \ldots , n\}$. Furthermore, we see that $\sum _{k=1}^n \left( \tilde{\nu }^{(i)}_{k} - \tilde{\nu }^{(i)}_{k-1} \right) = \tilde{\nu }^{(i)}_n \in [0,1]$ so that we can make the RHS of Eq. (9) a convex sum of the terms $\sum _{x=k}^n {\mathsf {B}}^{(i)}_x \in \mathrm{ran}\,({\mathsf {B}}^{(i)})$ by adding a zero element $(1- \tilde{\nu }^{(i)}_n ) o$ which by Lemma 1 must be included in $\tilde{{\mathcal {E}}}$.

Hence, $\sum _{x=1}^n \tilde{\nu }^{(i)}_x {\mathsf {B}}^{(i)}_x$ can be expressed as a convex combination of elements in $\tilde{{\mathcal {E}}}$ so that from the convexity of $\tilde{{\mathcal {E}}}$ is follows that $\sum _{x \in \Omega _{\mathsf {B}}}\tilde{\nu }^{(i)}_{x} {\mathsf {B}}^{(i)}_x \in \tilde{{\mathcal {E}}}$ for all i. $\square $

4.2 Convex Effect Algebras

We start by recalling the notion of (abstract) convex effect algebra and the operational basis of this mathematical structure. An effect algebra [25] is a non-empty set ${\mathcal {E}}$ with two distinguished elements ${\mathfrak {0}}$ and ${\mathfrak {1}}$ and a partially defined operation $\oplus $ that satisfies the following conditions:

(EA1)
if $e \oplus f$ is defined, then $f \oplus e$ is defined and $e \oplus f =f \oplus e$.
(EA2)
if $e \oplus f$ and $(e \oplus f)\oplus g$ are defined, then $f \oplus g$ and $e \oplus (f\oplus g)$ are defined and $(e \oplus f)\oplus g = e \oplus (f\oplus g)$.
(EA3)
for every $e \in {\mathcal {E}}$, there is a unique $e'$ such that $e \oplus e' = {\mathfrak {1}}$.
(EA4)
if $e \oplus {\mathfrak {1}}$ is defined, then $e={\mathfrak {0}}$.

A physical interpretation of an effect algebra is that ${\mathcal {E}}$ is a collection of events and the partial operation $\oplus $ describes joining of events. The element ${\mathfrak {0}}$ corresponds to the event that never happens whereas ${\mathfrak {1}}$ corresponds to the event that always happens. An important example of an effect algebra is the collection of all fuzzy sets on some set X. An abstract effect algebra can be seen as a generalization of this structure, including the Hilbert space effect algebra as an important example. It is clear that the set of all effects in a GPT also forms an effect algebra.

When thinking about the interpretation of an effect algebra as a collection of events, one could come up with some additional properties that would seem reasonable to require as axioms. However, several such properties can be derived from the defining conditions (EA1)–(EA4). For instance, it can be shown [25] that $(e')'=e$ and that the cancellation law holds: if $e \oplus f= e \oplus g$, then $f=g$.

Let us then consider an effect algebra that describes events that correspond to outcomes, or collections of outcomes, in a measurement device or devices. An operational interpretation of the partial operation $\oplus $ is that two outcomes are merged into one. Merging two outcomes is an irreversible action; if we are given the newly formed device, we cannot know which effects have been merged. There is, however, a way to split one outcome into two so that merging is a one side inverse to this procedure. This splitting goes as follows. When an outcome related to an effect e occurs, we toss a coin and, depending on the result, either record the outcome as it was, or mark it as a new outcome. We thus obtain two effects, $e_{same}$ and $e_{new}$. Clearly, merging of the outcomes should give the original effect, thus $e_{same} \oplus e_{new} = e$. In this way we have introduced a map $e \mapsto e_{same}$ for every coin tossing probability $\alpha $.

Mathematically speaking, an effect algebra ${\mathcal {E}}$ is a convex effect algebra [26] if for every effect $e\in {\mathcal {E}}$ and real number $\alpha \in [0,1]$, we can form a new effect, denoted by $\alpha e$ such that the following conditions hold for every $\alpha ,\beta \in [0,1]$ and $e,f\in {\mathcal {E}}$:

(CEA1)
$\alpha (\beta e) = (\alpha \beta )e$.
(CEA2)
$1 e = e$.
(CEA3)
If $\alpha + \beta \le 1$, then $\alpha e \oplus \beta e$ is defined and $(\alpha +\beta )e = \alpha e \oplus \beta e$.
(CEA4)
If $e\oplus f$ is defined, then $\alpha e \oplus \alpha f$ is defined and $\alpha (e \oplus f) = \alpha e \oplus \alpha f$.

As we have described above, the map $(\alpha ,e) \mapsto \alpha e$ can be interpreted as a splitting of e into two effects, $\alpha e$ and $(1-\alpha ) e$. We point out that this mathematical structure describes the action only at the level of individual effects, not meters, which allows for other interpretations. We can, for instance, interpret the action in a way that the residual effect $(1-\alpha ) e$ does not generate a new outcome but is combined into some already existing outcomes.

It is shown in [26] that if $\alpha ,\beta \in [0,1]$ with $\alpha + \beta \le 1$, then $\alpha e \oplus \beta f$ is defined for every $e,f\in {\mathcal {E}}$. This further implies that for any $\alpha \in [0,1]$ and any $e,f\in {\mathcal {E}}$, the effect sum $\alpha e \oplus (1-\alpha ) f$ is defined. The resulting element is called a mixture of e and f. Mixing of effects is therefore a derived notion in convex effect algebras.

4.3 Characterization of Convex Effect Algebras and Subalgebras

As we have seen earlier, the partial order in an effect algebra is derived from the partially defined effect sum. To construct concrete convex effect algebras, we can start from an ordered vector space and use that structure to form an effect algebra. This construction works as follows. Let W be a finite dimensional real vector space, and let $C\subset W$ be a proper cone. For any nonzero $u\in C$, we then denote $[0,u]_C:=\{e\in C: e \le _C u \}$. Then, for any $e,f \in [0,u]_C$, the combination $e \oplus f$ is defined if $e+f \le _C u$, and then $e \oplus f := e+f$. The set $[0,u]_C$ is a convex subset of C and $0\in C$. Therefore, $\alpha e \in [0,u]_C$ for any $e \in [0,u]_C$ and $0\le \alpha \le 1$. In this way, $[0,u]_C$ is a concrete convex effect algebra, also called a linear effect algebra [27]. The chosen vector u is the identity element in $[0,u]_C$.

When forming linear effect algebras, we typically want $[0,u]_C$ to generate the vector space W, which means that W is the linear span of vectors of the form $\alpha e$, where $\alpha \in {\mathbb {R}}^+$ and $e\in [0,u]_C$. Due to the following result it is not restrictive to consider this kind of linear effect algebras when we investigate the properties of convex effect algebras.

Theorem 2

( [26]) Let${\mathcal {E}}$be a convex effect algebra. There exists a real vector spaceW, a coneCand a nonzero element$u\in C$such that$[0,u]_C$generatesWand${\mathcal {E}}$is affinely isomorphic to$[0,u]_C$.

We remark that this characterization of convex effect algebras shows a natural connection to the GPT framework. Namely, if one starts from a GPT state space ${\mathcal {S}}\subset V$ (see Sect. 2.1), then W is the dual space $V^*$ and C is the positive dual cone $V^*_+$. More detailed discussions about this connection are provided in [28, 29].

As with any algebraic structures, there are natural notions of substructures for effect algebras and convex effect algebras. Namely, let ${\mathcal {E}}$ be an effect algebra. A nonempty subset $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$ is a subalgebra of ${\mathcal {E}}$ if the following conditions hold:

(SA1)
${\mathfrak {1}}\in \tilde{{\mathcal {E}}}$.
(SA2)
$e'\in \tilde{{\mathcal {E}}}$ for all $e\in \tilde{{\mathcal {E}}}$.
(SA3)
$e\oplus f\in \tilde{{\mathcal {E}}}$ for all $e,f\in \tilde{{\mathcal {E}}}$ such that $e\oplus f$ is defined in ${\mathcal {E}}$.

If ${\mathcal {E}}$ is a convex effect algebra, then a subalgebra $\tilde{{\mathcal {E}}}$ is a convex subalgebra of ${\mathcal {E}}$ if it satisfies also the following condition:

(SA4)
$\alpha e\in \tilde{{\mathcal {E}}}$ for all $\alpha \in [0,1]$ and $e\in \tilde{{\mathcal {E}}}$.

We note that every convex effect algebra ${\mathcal {E}}$ has two trivial convex subalgebras: ${\mathcal {E}}$ itself and $\{ \alpha {\mathfrak {1}}\, | \, \alpha \in [0,1] \}$. The following result characterizes all convex subalgebras.

Theorem 3

LetVbe a finite-dimensional vector space, Ca cone and${\mathcal {E}}=[0,u]_C$a convex effect algebra generatingV. A subset$\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$is a convex subalgebra if and only if$u\in \tilde{{\mathcal {E}}}$and there exist$e_1,\ldots ,e_n\in {\mathcal {E}}$such that

$$\begin{aligned} \tilde{{\mathcal {E}}}&= \mathrm {span}_{\mathbb {R}}\{ e_1,\ldots ,e_n\} \cap {\mathcal {E}}\nonumber \\&= \left\{ e\in {\mathcal {E}}: e=\sum _i r_i e_i, \, r_i\in {\mathbb {R}}\right\} . \end{aligned}$$

(10)

Proof

Let us assume that $\tilde{{\mathcal {E}}}$ is a subset given in (10) by some elements $e_1,\ldots ,e_n\in {\mathcal {E}}$, and $u\in \tilde{{\mathcal {E}}}$. It follows that $u=\sum _i \bar{r}_i e_i$ for some $\bar{r}_i\in {\mathbb {R}}$. Using this fact, we see that (SA2) is valid: if $e\in \tilde{{\mathcal {E}}}$ and hence $e=\sum _i r_i e_i$ for some $r_i\in {\mathbb {R}}$, then $e'=\sum _i (\bar{r}_i -r_i) e_i \in \tilde{{\mathcal {E}}}$. It is clear from (10) that also (SA3) and (SA4) are valid.

Let us then assume that $\tilde{{\mathcal {E}}}$ is a convex subalgebra of $[0,u]_C$. Let $v_1,\dots ,v_m$ be a linear basis in V. Since $[0,u]_C$ generates V, every $v_i$ can be written as $v_i=c_i^+ v_i^+ - c_i^- v_i^-$ for some $v_i^+,v_i^-\in [0,u]_C$ and $c_i^+,c_i^- \ge 0$. We denote $e_i=v_i^+$ and $e_{m+i}=v_i^-$ for $i=1,\ldots ,m$. Since $\{v_i\}_{i=1}^m$ is a basis, (10) holds. $\square $

Using the same premises, we can also rephrase Theorem 3 as follows: A subset $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$ is a convex subalgebra if and only if $\tilde{{\mathcal {E}}}= U \cap {\mathcal {E}}$ for some linear subspace $U \subset V$ such that $u \in U$. Thus, convex subalgebras are always determined by some linear subspace that contains the unit effect. The smallest nontrivial convex subalgebras are generated by u and some other effect e.

4.4 Subalgebras and Restrictions

We are now ready to explain the connection between convex effect algebras and operational restrictions. In the following ${\mathcal {E}}({\mathcal {S}})$ is the set of all effects on a state space ${\mathcal {S}}$. The following statement follows from Theorem 1. Here we give a short direct proof as a consequence of Theorem 3.

Proposition 1

Let$\tilde{{\mathcal {E}}}$be a convex subalgebra of${\mathcal {E}}({\mathcal {S}})$. Then${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$is simulation closed.

Proof

We need to show that ${\mathfrak {sim}}({\mathcal {M}}_{\tilde{{\mathcal {E}}}})\subseteq {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$. Let ${\mathsf {A}}\in {\mathfrak {sim}}({\mathcal {M}}_{\tilde{{\mathcal {E}}}})$. Then

$$\begin{aligned} {\mathsf {A}}_x = \sum _i p_i \sum _y \nu ^{(i)}_{yx} {\mathsf {B}}^{(i)}_y, \end{aligned}$$

where ${\mathsf {B}}^{(i)}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ and hence ${\mathsf {B}}^{(i)}_y\in \tilde{{\mathcal {E}}}$. Since $\tilde{{\mathcal {E}}}$ is a convex subalgebra, by Theorem 3 it has representation (10) for some $e_1,\ldots ,e_m\in {\mathcal {E}}({\mathcal {S}})$. It follows that ${\mathsf {A}}_x \in \tilde{{\mathcal {E}}}$, and therefore ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$. $\square $

In the following we demonstrate with two propositions that there are restrictions of the type (R1) where the restricted set of effects does not form a subalgebra.

Let ${\mathsf {A}}$ be a meter and denote $\tilde{{\mathcal {M}}}={\mathfrak {sim}}({\mathsf {A}})$. The set $\tilde{{\mathcal {M}}}$ is simulation closed as ${\mathfrak {sim}}({\mathfrak {sim}}({\mathsf {A}}))={\mathfrak {sim}}({\mathsf {A}})$. The restricted set of effects ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ is given as

$$\begin{aligned} {\mathcal {E}}_{\tilde{{\mathcal {M}}}}=\left\{ e \in {\mathcal {E}}: e = \sum _x r_x {\mathsf {A}}_x, r_x \in [0,1] \right\} . \end{aligned}$$

(11)

The set ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ satisfies the conditions (SA1), (SA2), and (SA4). However, the condition (SA3) is satisfied only for specific choices of ${\mathsf {A}}$; this is the content of the second part of the following proposition.

Proposition 2

Let${\mathsf {A}}$be a meter such that$\{{\mathsf {A}}_1,\ldots ,{\mathsf {A}}_n\}$is linearly independent and let$\tilde{{\mathcal {M}}}={\mathfrak {sim}}({\mathsf {A}})$. Then

(a)
$\tilde{{\mathcal {M}}}= {\mathcal {M}}_{{\mathcal {E}}_{\tilde{{\mathcal {M}}}}}$, hence$\tilde{{\mathcal {M}}}$is a restriction of type (R1).
(b)
${\mathcal {E}}_{\tilde{{\mathcal {M}}}}$is a convex subalgebra of${\mathcal {E}}({\mathcal {S}})$if and only if$\lambda _{\max }({\mathsf {A}}_x)=1$for everyx.

Proof

(a)
From the definitions of ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ and ${\mathcal {M}}_{{\mathcal {E}}_{\tilde{{\mathcal {M}}}}}$ it follows that $\tilde{{\mathcal {M}}}\subseteq {\mathcal {M}}_{{\mathcal {E}}_{\tilde{{\mathcal {M}}}}}$ for any $\tilde{{\mathcal {M}}}$. For the other direction, let us take ${\mathsf {B}}\in \tilde{{\mathcal {M}}}_{{\mathcal {E}}_{\tilde{{\mathcal {M}}}}}$, where ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ is given by Eq. (11). Thus, for each $y \in \Omega _{\mathsf {B}}$ there exist $\{r^{(y)}_x\}_{x \in \Omega _{\mathsf {A}}} \subset [0,1]$ such that ${\mathsf {B}}_y = \sum _{x} r^{(y)}_x {\mathsf {A}}_x$. From the normalization of ${\mathsf {A}}$ and ${\mathsf {B}}$ we see that
$$\begin{aligned} \sum _{x \in \Omega _{\mathsf {A}}} {\mathsf {A}}_x = u = \sum _{y \in \Omega _{\mathsf {B}}} {\mathsf {B}}_y =\sum _{x \in \Omega _{\mathsf {A}}} \left( \sum _{y \in \Omega _{\mathsf {B}}} r^{(y)}_x \right) {\mathsf {A}}_x \end{aligned}$$
(12)
so that from the linear independence of the effects of ${\mathsf {A}}$ it follows that $\sum _y r^{(y)}_x = 1$ for all $x \in \Omega _{\mathsf {A}}$. Thus, we can define a post-processing $\nu : \Omega _{\mathsf {A}}\rightarrow \Omega _{\mathsf {B}}$ by setting $\nu _{xy} = r^{(y)}_x$ for all $x \in \Omega _{\mathsf {A}}$ and $y \in \Omega _{\mathsf {B}}$ so that ${\mathsf {B}}= \nu \circ {\mathsf {A}}\in {\mathfrak {sim}}({\mathsf {A}}) = \tilde{{\mathcal {M}}}$. Hence, also ${\mathcal {M}}_{{\mathcal {E}}_{\tilde{{\mathcal {M}}}}} \subseteq \tilde{{\mathcal {M}}}$ holds in this case.
(b)
Let us assume that $\lambda _{\max }({\mathsf {A}}_x)=1$ for every x. Suppose that $e,f\in {\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ and $e+f\le u$. We have $e=\sum _x \alpha _x {\mathsf {A}}_x$ and $f=\sum _x \beta _x {\mathsf {A}}_x$, and thus $e+f=\sum _x (\alpha _x + \beta _x) {\mathsf {A}}_x$. For every x, fix $s_x\in {\mathcal {S}}$ such that ${\mathsf {A}}_x(s_x)=1$. Then
$$\begin{aligned} 1 \ge (e+f)(s_x) = \sum _y (\alpha _y + \beta _y) {\mathsf {A}}_y(s_x) = \alpha _x + \beta _x. \end{aligned}$$
Therefore, $e+f\in {\mathcal {E}}_{\tilde{{\mathcal {M}}}}$.

Let us then assume that $1>\lambda _{\max }({\mathsf {A}}_1)\equiv \lambda $. Then $\tfrac{1}{\lambda } {\mathsf {A}}_1\in {\mathcal {E}}({\mathcal {S}})$ and $\tfrac{1}{\lambda }>1$. Let $0<\mu <1$ and $\mu <\tfrac{1-\lambda }{\lambda }$. Then $\mu {\mathsf {A}}_1\in {\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ and ${\mathsf {A}}_1 + \mu {\mathsf {A}}_1 \le \tfrac{1}{\lambda } {\mathsf {A}}_1$, thus ${\mathsf {A}}_1 + \mu {\mathsf {A}}_1 \in {\mathcal {E}}({\mathcal {S}})$. But ${\mathsf {A}}_1+\mu {\mathsf {A}}_1\notin {\mathcal {E}}_{\tilde{{\mathcal {M}}}}$ because otherwise we would have
$$\begin{aligned} (1+\mu ){\mathsf {A}}_1 = \sum _x r_x {\mathsf {A}}_x \end{aligned}$$
and by linear independence $r_1 = 1+\mu >1$, which is a contradiction to Eq. (11). Thus, if $\lambda <1$, then (SA3) is not satisfied.$\square $

Proposition 3

If$\tilde{{\mathcal {E}}}\subsetneq {\mathcal {E}}({\mathcal {S}})$is an effect restriction that satisfies (E1) and (E2) such that there exists an affine bijection$\Phi : {\mathcal {E}}({\mathcal {S}}) \rightarrow \tilde{{\mathcal {E}}}$, then$\tilde{{\mathcal {E}}}$is not a convex subalgebra of${\mathcal {E}}({\mathcal {S}})$but it nevertheless gives a restriction of the type (R1).

Proof

Since $\tilde{{\mathcal {E}}}= \Phi ({\mathcal {E}}({\mathcal {S}}))$, ${\mathcal {E}}({\mathcal {S}})$ is convex, and $\Phi $ is convexity preserving, we have that $\tilde{{\mathcal {E}}}$ is convex. By Theorem 1, we have that ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ is simulation closed and thus gives a restriction of type (R1).

To see that $\tilde{{\mathcal {E}}}$ is not a convex subalgebra of ${\mathcal {E}}({\mathcal {S}})\subset V$ for a finite-dimensional vector space V that is spanned by ${\mathcal {E}}({\mathcal {S}})$, we note that $\dim (\mathrm {span} (\tilde{{\mathcal {E}}})) =\dim (\mathrm {span} ({\mathcal {E}}({\mathcal {S}}))) =\dim (V)$ because of the bijectivity of $\Phi $. However, from Theorem 3 we see that the only generating convex subalgebra must be the effect algebra ${\mathcal {E}}({\mathcal {S}})$ itself: namely, if $\tilde{{\mathcal {E}}}= U \cap {\mathcal {E}}({\mathcal {S}})$ for some subspace U, then from the previous equality of the dimensions it follows that also $\dim (U)=\dim (V)$ and this can only be the case when $U=V$ so that $\tilde{{\mathcal {E}}}= V \cap {\mathcal {E}}({\mathcal {S}}) = {\mathcal {E}}({\mathcal {S}})$. Since we have that $\tilde{{\mathcal {E}}}$ is a proper subset of the effect algebra we have arrived at a contradiction and $\tilde{{\mathcal {E}}}$ cannot be a convex subalgebra. $\square $

5 Restriction Class (R2) and Effectively n-Tomic Meters

For every integer $n\ge 2$, we use the notation ${\mathcal {M}}_{n-\mathrm {eff}} = {\mathfrak {sim}}({\mathcal {M}}_n)$, where ${\mathcal {M}}_n$ is the set of all meters that have n or less outcomes. We call ${\mathcal {M}}_{n-\mathrm {eff}}$ the set of effectively n-tomic meters because they can be reduced to meters with n or less outcomes. The foundational interest to investigate and test these type of restrictions has been discussed in [30,31,32].

It is clear that ${\mathcal {M}}_{n-\mathrm {eff}} \subseteq {\mathcal {M}}_{n+1-\mathrm {eff}}$. The set ${\mathcal {M}}_{2-\mathrm {eff}}$ contains all dichotomic meters, therefore ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}={\mathcal {E}}$ for any choice $\tilde{{\mathcal {M}}}={\mathcal {M}}_{n-\mathrm {eff}}$. We conclude that these restrictions are of the type (R2). The restriction to effectively dichotomic meters ${\mathcal {M}}_{2-\mathrm {eff}}$ is the smallest restriction of the type (R2) in the following sense, and motivates to look this restriction in more details.

Proposition 4

Let$\tilde{{\mathcal {M}}}\subset {\mathcal {M}}$be an operational restriction of the type (R2). Then${\mathcal {M}}_{2-\mathrm {eff}} \subseteq \tilde{{\mathcal {M}}}$.

Proof

Since ${\mathcal {E}}_{\tilde{{\mathcal {M}}}}={\mathcal {E}}$, all dichotomic meters are in $\tilde{{\mathcal {M}}}$. As $\tilde{{\mathcal {M}}}$ is simulation closed, it follows that all effectively dichotomic meters are in $\tilde{{\mathcal {M}}}$. $\square $

Depending on the theory, it can happen that ${\mathcal {M}}_{2-\mathrm {eff}} = {\mathcal {M}}$ [19]. The specific nature of these type of restrictions is hence different in different theories. There are, however, some general properties of ${\mathcal {M}}_{2-\mathrm {eff}}$ and ${\mathcal {M}}_{n-\mathrm {eff}}$ that are theory independent; in the following we demonstrate some of these features.

All of the following results are related to the minimal and maximal values $\lambda _{\min }({\mathsf {A}}_x)$ and $\lambda _{\max }({\mathsf {A}}_x)$ of the effects of a meter ${\mathsf {A}}$. We start by making some simple observations. For a dichotomic meter we have ${\mathsf {A}}_1(s)+{\mathsf {A}}_2(s)=1$ for all states s, and hence

$$\begin{aligned} \lambda _{\max }({\mathsf {A}}_2)=1-\lambda _{\min }({\mathsf {A}}_1). \end{aligned}$$

(13)

It follows that

$$\begin{aligned} \lambda _{\max }({\mathsf {A}}_1)+\lambda _{\max }({\mathsf {A}}_2) \ge 1. \end{aligned}$$

(14)

Further, equality holds in (14) if and only if ${\mathsf {A}}$ is a trivial meter, i.e., if ${\mathsf {A}}$ is of the form ${\mathsf {A}}_1 = p u$ and ${\mathsf {A}}_2 = (1-p ) u$ for some $p\in [0,1]$. Clearly, if ${\mathsf {A}}$ is trivial meter, then $\lambda _{\max }({\mathsf {A}}_1) + \lambda _{\max }({\mathsf {A}}_2) = p +(1-p) = 1$. On the other hand, if $\lambda _{\max }({\mathsf {A}}_1) + \lambda _{\max }({\mathsf {A}}_2)=1$, then if we denote by $s_1$ and $s_2$ the states maximizing ${\mathsf {A}}_1(s)$ and ${\mathsf {A}}_2(s)$ respectively, we see that

$$\begin{aligned} 1 = {\mathsf {A}}_1(s) + {\mathsf {A}}_2(s) \le {\mathsf {A}}_1(s_1) + {\mathsf {A}}_2(s) \le {\mathsf {A}}_1(s_1) + {\mathsf {A}}_2(s_2) =1 \end{aligned}$$

for all $s \in {\mathcal {S}}$ so that all the inequalities must actually be equalities and particularly from the first inequality we get that ${\mathsf {A}}_1(s) = {\mathsf {A}}_1(s_1)=:q$ for all $s \in {\mathcal {S}}$. Similarly then ${\mathsf {A}}_2(s) = {\mathsf {A}}_2(s_2)=1-q$ for all $s \in {\mathcal {S}}$. Hence, ${\mathsf {A}}_1 = q u$ and ${\mathsf {A}}_2 = (1-q)u$ so that ${\mathsf {A}}$ is trivial.

Proposition 5

Any effectively dichotomic meter${\mathsf {A}}$can be simulated from dichotomic meters${\mathsf {B}}$that satisfy$\lambda _{\max }({\mathsf {B}}_1)=\lambda _{\max }({\mathsf {B}}_2)=1$.

Proof

It is enough to show that any dichotomic meter ${\mathsf {A}}$ can be post-processed from a dichotomic meter ${\mathsf {A}}'$ with $\lambda _{\max }({\mathsf {A}}'_1)=\lambda _{\max }({\mathsf {A}}'_2)=1$. A trivial meter can be post-processed from any meter, so we can further assume that ${\mathsf {A}}$ is non-trivial. We denote $\alpha =\lambda _{\max }({\mathsf {A}}_1) + \lambda _{\max }({\mathsf {A}}_2) -1>0$ and define

$$\begin{aligned} {\mathsf {A}}'_1 = \tfrac{1}{\alpha } {\mathsf {A}}_1 + \tfrac{\lambda _{\max }({\mathsf {A}}_2) - 1}{\alpha } u, \quad {\mathsf {A}}'_2 = \tfrac{1}{\alpha } {\mathsf {A}}_2 + \tfrac{\lambda _{\max }({\mathsf {A}}_1) - 1}{\alpha } u. \end{aligned}$$

We have $\lambda _{\min }({\mathsf {A}}'_1)=\lambda _{\min }({\mathsf {A}}'_2)=0$ and ${\mathsf {A}}'_1+{\mathsf {A}}'_2=u$, hence ${\mathsf {A}}'$ is a meter. Further, $\lambda _{\max }({\mathsf {A}}'_1)=\lambda _{\max }({\mathsf {A}}'_2)=1$. Finally, ${\mathsf {A}}$ is a post-processing of ${\mathsf {A}}'$. $\square $

An obvious question is: when is a meter ${\mathsf {A}}\in {\mathcal {M}}$ with $m>n$ outcomes effectively n-tomic? In the following we develop some criteria.

Proposition 6

Let${\mathsf {A}}$be a meter.

(a)
If there exists$y \in \Omega _{\mathsf {A}}$such that$\sum _{x\ne y} \lambda _{\max }({\mathsf {A}}_x) \le 1$, then${\mathsf {A}}$is effectively dichotomic.
(b)
If$\sum _x \lambda _{\max }({\mathsf {A}}_x) > n$, then${\mathsf {A}}$is not effectivelyn-tomic.

Proof

(a)
This is a direct generalization of Lemma 5 in the Supplemental Material for Ref. [17], where it was shown to hold for POVMs.
(b)
Let ${\mathsf {A}}$ be effectively n-tomic, i.e., there exist n-outcome meters $\{ {\mathsf {B}}^{(i)}\}_i$, post-processings $\nu ^{(i)}: \{1, \ldots , n\} \rightarrow \Omega _{\mathsf {A}}$, and a probability distribution $(p_i)_i$ such that ${\mathsf {A}}_x = \sum _i p_i \sum _j \nu ^{(i)}_{jx} {\mathsf {B}}^{(i)}_j$ for all $x \in \Omega _{\mathsf {A}}$. Now we see that
$$\begin{aligned} \sum _x \lambda _{\max }({\mathsf {A}}_x)&= \sum _{x} \max _{s \in {\mathcal {S}}} {\mathsf {A}}_x(s) \\&= \sum _{i,x} p_i \max _{s \in {\mathcal {S}}} \left( \sum _j \nu ^{(i)}_{jx} {\mathsf {B}}^{(i)}_j\right) (s) \\&= \sum _{i,x} p_i \max _{s \in {\mathcal {S}}} \left( \sum _j \nu ^{(i)}_{jx} {\mathsf {B}}^{(i)}_j(s)\right) \\&\le \sum _{i,x} p_i \sum _j \nu ^{(i)}_{jx} = \sum _i p_i \left[ \sum _j \left( \sum _x \nu ^{(i)}_{jx} \right) \right] = n. \end{aligned}$$

$\square $

The previous result already shows some tasks that may be possible in general but not in a theory where the effective number of outcomes is restricted. Namely, perfect discrimination of n states requires that $\sum _x \lambda _{\max }({\mathsf {A}}_x) \ge n$. Hence, by Proposition 6(b) an effectively n-tomic meter can discriminate at most n states.

As a consequence of Proposition 5, there exists a dichotomic meter ${\mathsf {A}}$ with $\sum _x \lambda _{\max }({\mathsf {A}}_x) = 2$. Therefore, the bound for effectively dichotomic meters in Proposition 6(b) cannot be improved without additional assumptions. The following statement has specific assumptions and for that reason gives a tighter bound. In Example 2 below we show that this result has interesting implications.

Proposition 7

Let${\mathsf {A}}$be ann-outcome meter such that${\mathsf {A}}_1, \ldots , {\mathsf {A}}_m$are indecomposable effects for some$m\le n$and for all$i,j \in \{1, \ldots , m \}$such that$i\ne j$we have

i)
${\mathsf {A}}_j \ne t {\mathsf {A}}_i$for all$t>0$,
ii)
$t_i{\mathsf {A}}_i + t_j {\mathsf {A}}_j \ne u$for all$t_i,t_j>0$.

If$\sum _{k=1}^m \lambda _{\max }({\mathsf {A}}_k) >1$, then${\mathsf {A}}$is not effectively dichotomic.

Proof

Suppose ${\mathsf {A}}$ is effectively dichotomic so that there exist dichotomic meters $\{{\mathsf {B}}^{(i)}\}_{i=1}^l$, a probability distribution $(p_i)_{i=1}^l$ and post-processings $\nu ^{(i)}: \{+, -\} \rightarrow \{1, \ldots , n\}$ for all $i=1, \ldots ,l$ such that

$$\begin{aligned} {\mathsf {A}}_j = \sum _{i=1}^l p_i \left( \nu ^{(i)}_{+j} {\mathsf {B}}^{(i)}_+ + \nu ^{(i)}_{-j} {\mathsf {B}}^{(i)}_- \right) \end{aligned}$$

for all $j \in \{1, \ldots , n \}$, where we may assume that $p_i \ne 0$ for all $i=1, \ldots ,l$. By the assumption, ${\mathsf {A}}_j$ is indecomposable for all $j \in \{1, \ldots , m\}$. Thus, for each j, there exists index sets $I^{(j)}_\pm := \{ 1\le i \le l \, | \, \nu ^{(i)}_{\pm j} \ne 0 \}$ such that ${\mathsf {B}}^{(i)}_+ = \alpha ^{(j)}_i {\mathsf {A}}_j$ and ${\mathsf {B}}^{(k)}_- = \beta ^{(j)}_k {\mathsf {A}}_j$ for some $\alpha ^{(j)}_i, \beta ^{(j)}_k \in (0,1]$ for all $i \in I^{(j)}_+$ and $k \in I^{(j)}_-$.

First of all, we note that $I^{(j)}_+ \cap I^{(j)}_- = \emptyset $ for all $j \in \{1, \ldots ,m\}$ because otherwise ${\mathsf {A}}_j$ would be proportional to u due to the normalisation of ${\mathsf {B}}^{(i)}$’s. Secondly, from i) it follows that $I^{(j)}_+ \cap I^{(k)}_+ =I^{(j)}_- \cap I^{(k)}_- = \emptyset $ for all $k,j \in \{1, \ldots ,m\}$ such that $j\ne k$. Thirdly, from ii) it follows that $I^{(j)}_+ \cap I^{(k)}_- = \emptyset $ for all $k,j \in \{1, \ldots ,m\}$ such that $j\ne k$. Thus, the sets $\{I^{(j)}_\pm \}_{j=1}^m$ form a partition of their union $I:= \bigcup _{j=1}^m \left( I^{(j)}_+ \cup I^{(j)}_- \right) \subseteq \{1, \ldots , l\}$.

We can now write

$$\begin{aligned} {\mathsf {A}}_j = \sum _{i \in I^{(j)}_+} p_i \nu ^{(i)}_{+j} {\mathsf {B}}^{(i)}_+ +\sum _{k \in I^{(j)}_+} p_k \nu ^{(k)}_{+j} {\mathsf {B}}^{(k)}_+ \le \left( \sum _{i \in I^{(j)}_+} p_i + \sum _{k \in I^{(j)}_+} p_k \right) u. \end{aligned}$$

(15)

From the above expression and the properties of the index sets it follows that

$$\begin{aligned} \sum _{j=1}^m \lambda _{\max }({\mathsf {A}}_j) \le \sum _{j=1}^m \left( \sum _{i \in I^{(j)}_+} p_i + \sum _{k \in I^{(j)}_-} p_k \right) =\sum _{i \in I} p_i \le \sum _{i=1}^l p_i =1. \end{aligned}$$

$\square $

Example 2

(Unambiguous discrimination of two qubit states) Let $\varrho _1 = |{\psi _1}\rangle \langle {\psi _1}|$ and $\varrho _2 =|{\psi _2}\rangle \langle {\psi _2}|$ be two pure qubit states with a priori probabilities $p_1 = p_2 = \frac{1}{2}$. The unambiguous discrimination of these states involves a 3-outcome POVM with effects ${\mathsf {A}}_1,{\mathsf {A}}_2,{\mathsf {A}}_{?}$ such that observation of the outcome 1 (2) guarantees that the input state was $\varrho _1$ ($\varrho _2$). This implies

$$\begin{aligned} \mathrm{tr}[\varrho _1 A_2] = \mathrm{tr}[\varrho _2 A_1] =0 \end{aligned}$$

and hence

$$\begin{aligned} {\mathsf {A}}_1 = q_1 (I - |{\psi _2}\rangle \langle {\psi _2}|), \quad {\mathsf {A}}_2 = q_2 (I - |{\psi _1}\rangle \langle {\psi _1}|) \end{aligned}$$

for some $q_1,q_2 > 0$ such that ${\mathsf {A}}_{?} = I - {\mathsf {A}}_1 - {\mathsf {A}}_2$ is a valid effect, i.e., ${\mathsf {A}}_{?} \ge O$. Suppose ${\mathsf {A}}$ is effectively dichotomic. Then by Proposition 7 we have $q_1 + q_2 \le 1$ and the success probability is

$$\begin{aligned} p_\mathrm{success}&= \tfrac{1}{2}\mathrm{tr}\left[ {\varrho _1 A_1}\right] + \tfrac{1}{2}\mathrm{tr}\left[ {\varrho _2 A_2}\right] \nonumber \\&= \frac{q_1 + q_2}{2} \left( 1 - | \langle \psi _1 | \psi _2 \rangle |^2 \right) \le \frac{1}{2} \left( 1 - | \langle \psi _1 | \psi _2 \rangle |^2 \right) . \end{aligned}$$

(16)

However, it is known [33] that the optimal success probability without any limitations is $1-\left| {\left\langle \,{\psi _1}\,|\,{\psi _2}\,\right\rangle }\right| $. This is strictly higher than the bound in (16) whenever $\varrho _1$ and $\varrho _2$ are two different states. We conclude that the restriction to dichotomic meters decreases the optimal success probability in unambiguous discrimination.

6 Restriction Class (R3), Noise and Compatibility

In this section we present some examples of restrictions that arise quite naturally and belong to the class (R3).

6.1 Compatibility Restriction

We recall that two meters ${\mathsf {A}}$ and ${\mathsf {B}}$ are compatible if they can be simulated with a single meter ${\mathsf {C}}$, i.e., $\{{\mathsf {A}},{\mathsf {B}}\} \subset {\mathfrak {sim}}({\mathsf {C}})$. Let us fix a meter ${\mathsf {A}}$ and consider all meters that are compatible with ${\mathsf {A}}$; we denote this set by $C({\mathsf {A}})$. The conditions for $C({\mathsf {A}})\ne {\mathcal {M}}$ have been characterized in [5]. In the following we assume that $C({\mathsf {A}})\ne {\mathcal {M}}$ and choose $\tilde{{\mathcal {M}}}=C({\mathsf {A}})$.

Proposition 8

${\mathfrak {sim}}(C({\mathsf {A}}))=C({\mathsf {A}})$.

Proof

To see this, take ${\mathsf {D}}\in {\mathfrak {sim}}(C({\mathsf {A}}))$ so that there exist meters $\{{\mathsf {B}}^{(i)}\}_i \subset C({\mathsf {A}})$ such that ${\mathsf {D}}= \sum _{i=1}^n p_i (\nu ^{(i)} \circ {\mathsf {B}}^{(i)})$ for some probability distribution $(p_i)_{i=1}^n$ and post-processings $\nu ^{(i)}: \Omega _{{\mathsf {B}}^{(i)}} \rightarrow \Omega _{\mathsf {D}}$ for all $i \in \{1, \ldots ,n\}$. If we define a new meter $\tilde{{\mathsf {B}}}$ as $\tilde{{\mathsf {B}}}_{(i,x)} = p_i {\mathsf {B}}^{(i)}_x$ for all $i \in \{1, \ldots , n\}$ and $x \in \Omega _{{\mathsf {B}}^{(i)}}$ (where we can take $\Omega _{{\mathsf {B}}^{(i)}} = \Omega _{{\mathsf {B}}^{(j)}}=: \Omega _{\mathsf {B}}$ for all i, j), we see that

$$\begin{aligned} {\mathsf {D}}_y = \sum _{i,x} p_i \nu ^{(i)}_{xy} {\mathsf {B}}^{(i)}_x = (\nu \circ \tilde{{\mathsf {B}}})_y \end{aligned}$$

for all $y \in \Omega _{\mathsf {D}}$, where we have defined a post-processing $\nu : \{1, \ldots ,n\} \times \Omega _{\mathsf {B}}\rightarrow \Omega _{\mathsf {D}}$ by $\nu _{(i,x)y}= \nu ^{(i)}_{xy}$ for all $i \in \{1, \ldots ,n\}$, $x \in \Omega _{\mathsf {B}}$ and $y \in \Omega _{\mathsf {D}}$. Thus, ${\mathsf {D}}\in {\mathfrak {sim}}(\tilde{{\mathsf {B}}})$.

Since ${\mathsf {B}}^{(i)} \in C({\mathsf {A}})$, for any $i \in \{1, \ldots , n\}$ there exists ${\mathsf {C}}^{(i)} \in {\mathcal {M}}$ such that $\{{\mathsf {A}}, {\mathsf {B}}^{(i)}\} \subset {\mathfrak {sim}}({\mathsf {C}}^{(i)})$. Similarly to $\tilde{{\mathsf {B}}}$, we can define $\tilde{{\mathsf {C}}}$ by setting $\tilde{{\mathsf {C}}}_{(i,z)} = p_i {\mathsf {C}}^{(i)}_z$ for all $i \in \{1, \ldots , n\}$ and $z \in \Omega _{\mathsf {C}}$. Since ${\mathsf {A}}\in {\mathfrak {sim}}({\mathsf {C}}^{(i)})$, there exists a post-processing $\mu ^{(i)}: \Omega _{\mathsf {C}}\rightarrow \Omega _{\mathsf {A}}$ for all $i \in \{1, \ldots , n\}$ such that ${\mathsf {A}}= \mu ^{(i)} \circ {\mathsf {C}}^{(i)}$ so that

$$\begin{aligned} {\mathsf {A}}_k = \sum _i p_i {\mathsf {A}}_k = \sum _{i} p_i (\mu ^{(i)} \circ {\mathsf {C}}^{(i)})_k =\sum _{i,z} p_i \mu ^{(i)}_{zk} {\mathsf {C}}^{(i)}_z = (\mu \circ \tilde{{\mathsf {C}}})_k \end{aligned}$$

for all $k \in \Omega _{\mathsf {A}}$, where we have defined another post-processing $\mu : \{1, \ldots ,n\} \times \Omega _{\mathsf {C}}\rightarrow \Omega _{\mathsf {A}}$ by $\mu _{(i,z)k}= \mu ^{(i)}_{zk}$ for all $i \in \{1, \ldots ,n\}$, $z \in \Omega _{\mathsf {C}}$ and $k \in \Omega _{\mathsf {A}}$. Thus, ${\mathsf {A}}\in {\mathfrak {sim}}(\tilde{{\mathsf {C}}})$.

On the other hand, since ${\mathsf {B}}^{(i)} \in {\mathfrak {sim}}({\mathsf {C}}^{(i)})$ for all $i \in \{1, \ldots , n\}$, there exists post-processings $\kappa ^{(i)}: \Omega _{\mathsf {C}}\rightarrow \Omega _{\mathsf {B}}$ such that ${\mathsf {B}}^{(i)} = \kappa ^{(i)} \circ {\mathsf {C}}^{(i)}$ so that

$$\begin{aligned} \tilde{{\mathsf {B}}}_{(i,x)} = p_i {\mathsf {B}}^{(i)}_x = \sum _z \kappa ^{(i)}_{zx} p_i {\mathsf {C}}^{(i)}_z = ( \kappa \circ \tilde{{\mathsf {C}}})_{(i,x)} \end{aligned}$$

for all $(i,x) \in \{1, \ldots , n\} \times \Omega _{\mathsf {B}}$, where we have defined yet another post-processing $\kappa : \{1, \ldots ,n\} \times \Omega _{\mathsf {C}}\rightarrow \{1, \ldots , n\} \times \Omega _{\mathsf {B}}$ by $\kappa _{(j,z)(i,x)}= \delta _{ij} \kappa ^{(j)}_{zx}$ for all $i,j \in \{1, \ldots ,n\}$, $z \in \Omega _{\mathsf {C}}$ and $x \in \Omega _{\mathsf {B}}$. Hence, $\tilde{{\mathsf {B}}} \in {\mathfrak {sim}}(\tilde{{\mathsf {C}}})$ and ${\mathsf {D}}\in {\mathfrak {sim}}(\tilde{{\mathsf {C}}})$.

To conclude, we have shown that $\{{\mathsf {A}}, {\mathsf {D}}\} \subset {\mathfrak {sim}}(\tilde{{\mathsf {C}}})$, i.e., ${\mathsf {A}}$ and any ${\mathsf {D}}\in {\mathfrak {sim}}(C({\mathsf {A}}))$ are compatible, therefore ${\mathfrak {sim}}(C({\mathsf {A}})) \subset C({\mathsf {A}})$ and $C({\mathsf {A}})$ is simulation closed. $\square $

Interestingly, in quantum theory the restriction $C({\mathsf {A}})$ can be either (R1) or (R3), depending on ${\mathsf {A}}$. Firstly, if ${\mathsf {A}}$ is a sharp quantum meter, i.e., every ${\mathsf {A}}_x$ is a projection operator, then a meter ${\mathsf {B}}$ is compatible with ${\mathsf {A}}$ if and only if $[{\mathsf {A}}_x,{\mathsf {B}}_y]=O$ for all outcomes x, y [34]. The restriction $C({\mathsf {A}})$ is then of the type (R1) as $C({\mathsf {A}})={\mathcal {M}}_{\tilde{{\mathcal {E}}}}$, where

$$\begin{aligned} \tilde{{\mathcal {E}}}= \{E \in {\mathcal {E}}({\mathcal {H}}): [E,{\mathsf {A}}_x]=O \; \forall x \}. \end{aligned}$$

Secondly, to see that the restriction $C({\mathsf {A}})$ can be of the type (R3) we recall the result in [35], which demostrates the existence of quantum meters ${\mathsf {A}}$ and ${\mathsf {B}}$ in ${\mathbb {C}}^3$ such that ${\mathsf {A}}$ is a dichotomic, ${\mathsf {B}}$ is trichotomic, and they are coexistent but not compatible. The coexistence of ${\mathsf {A}}$ and ${\mathsf {B}}$ means that all coarse-grainings of ${\mathsf {B}}$ into dichotomic meters are compatible with ${\mathsf {A}}$. The union of the ranges of all dichotomic coarse-grainings of ${\mathsf {B}}$ is the same as the range of ${\mathsf {B}}$. This result implies that there is no $\tilde{{\mathcal {E}}}$ such that $C({\mathsf {A}})= {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$. Finally, to see that $C({\mathsf {A}})$ cannot be of the type (R2), we observe that ${\mathcal {E}}_{C({\mathsf {A}})}={\mathcal {E}}$ implies that ${\mathsf {A}}$ is compatible with all dichotomic meters. If this is the case, every ${\mathsf {A}}_x$ commutes with all projection operators and hence ${\mathsf {A}}_x$ is a multiple of the identity operator I. But then $C({\mathsf {A}})={\mathcal {M}}$ and we do not have a restriction at all, which is a contradiction in general so ${\mathcal {E}}_{C({\mathsf {A}})} \ne {\mathcal {E}}$.

6.2 Noise Restriction on Meters

Let us denote by ${\mathcal {P}}(\Omega )$ the set of probability distributions on a (finite) set $\Omega $. Let us fix $t \in [0,1]$ and define a restriction $\tilde{{\mathcal {M}}}_t$ on meters as

$$\begin{aligned} \tilde{{\mathcal {M}}}_t = \{ t {\mathsf {B}}+(1-t) p u \, | \, {\mathsf {B}}\in {\mathcal {M}}, \ p \in {\mathcal {P}}(\Omega _{\mathsf {B}}) \} . \end{aligned}$$

(17)

Clearly, if $t =1$, we have $\tilde{{\mathcal {M}}}_1 = {\mathcal {M}}$, and if $t= 0$ we have $\tilde{{\mathcal {M}}}_0 = {\mathcal {T}}$, where ${\mathcal {T}}$ is the set of trivial meters. Thus, we can interpret the parameter t as noise on the meters so that the smaller t gets, the noisier the meters in $\tilde{{\mathcal {M}}}_t$ become.

Let now $t \in (0,1)$. We will show that then $\tilde{{\mathcal {M}}}_t$ is a restriction of type (R3). First of all, we see that $\tilde{{\mathcal {M}}}_t$ is simulation closed: If we take ${\mathsf {A}}\in {\mathfrak {sim}}(\tilde{{\mathcal {M}}}_t)$ so that ${\mathsf {A}}= \sum _i p_i (\nu ^{(i)} \circ {\mathsf {B}}^{(i)} )$ for some meters $\{{\mathsf {B}}^{(i)} = t {\mathsf {C}}^{(i)} +(1-t) q^{(i)} u\}_i \subset \tilde{{\mathcal {M}}}_t$, some post-processings $\nu ^{(i)}: \Omega _{\mathsf {B}}\rightarrow \Omega _{\mathsf {A}}$, and some probability distribution $(p_i)_i$, then

$$\begin{aligned} {\mathsf {A}}_y = \sum _{i,x} p_i \nu ^{(i)}_{xy} {\mathsf {B}}^{(i)}_x =\sum _{i,x} p_i \nu ^{(i)}_{xy} [t {\mathsf {C}}^{(i)}_x +(1-t) q^{(i)}_x u] = t {\mathsf {C}}_y +(1-t) q_y u, \end{aligned}$$

where we have defined a new meter ${\mathsf {C}}= \sum _i p_i (\nu ^{(i)} \circ {\mathsf {C}}^{(i)}) \in {\mathcal {M}}$ and a new probability distribution $q \in {\mathcal {P}}(\Omega _{\mathsf {A}})$ by setting $q_y = \sum _{i,x} p_i \nu ^{(i)}_{xy} q^{(i)}_x$. Thus, ${\mathsf {A}}= t{\mathsf {C}}+(1-t) q u \in \tilde{{\mathcal {M}}}_t$ so that $\tilde{{\mathcal {M}}}_t$ is simulation closed.

Next we see that

$$\begin{aligned} {\mathcal {E}}_{\tilde{{\mathcal {M}}}_t} = \{ t e + (1-t) r u \, | \, e \in {\mathcal {E}}, \ r \in [0,1] \} \end{aligned}$$

(18)

so that ${\mathcal {E}}_{\tilde{{\mathcal {M}}}_t} \subsetneq {\mathcal {E}}$ since $t \ne 1$. Thus, $\tilde{{\mathcal {M}}}_t$ is not of type (R2). What remains to show is that $\tilde{{\mathcal {M}}}_t \ne {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ for all effect restrictions $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$.

Our first observation is that if a restriction on meters $\tilde{{\mathcal {M}}}$ is induced by some effect restriction $\tilde{{\mathcal {E}}}$, i.e., $\tilde{{\mathcal {M}}}={\mathcal {M}}_{\tilde{{\mathcal {E}}}}$, then the effect restriction $\tilde{{\mathcal {E}}}$ is unique and is given by the induced effects ${\mathcal {E}}_{\tilde{{\mathcal {M}}}} ={\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}}$.

Proposition 9

For a restriction${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$induced by an effect restriction$\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$satisfying the consistency conditions (E1) and (E2) we have that${\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}}= \tilde{{\mathcal {E}}}$.

Proof

Let us take $e \in {\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}}$ so that there exists ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ such that $e \in \mathrm{ran}\,({\mathsf {A}})$. From the definition of ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ it follows that $e \in \mathrm{ran}\,({\mathsf {A}}) \subset \tilde{{\mathcal {E}}}$. Thus, ${\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}} \subseteq \tilde{{\mathcal {E}}}$.

For the other direction let us take $f \in \tilde{{\mathcal {E}}}$. As it was stated earlier, any dichotomic meter ${\mathsf {F}}$ with effects f and $u-f$ must be in ${\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ so that from the definition of ${\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}}$ we see that $f \in {\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}}$. Thus, $\tilde{{\mathcal {E}}}\subseteq {\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}}$. Combining ${\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}} \subseteq \tilde{{\mathcal {E}}}$ and $\tilde{{\mathcal {E}}}\subseteq {\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}}$, we have the claim. $\square $

Thus, for $\tilde{{\mathcal {M}}}_t$, the previous result implies that if $\tilde{{\mathcal {M}}}_t = {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ for some $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$, then $\tilde{{\mathcal {E}}}= {\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}$. First of all, one can readily see that ${\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}$ is convex and hence, by Theorem 1, ${\mathcal {M}}_{{\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}}$ is simulation closed as it should be. We will proceed by constructing a meter ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ such that ${\mathsf {A}}\notin \tilde{{\mathcal {M}}}_t$.

To see when a given meter is in $\tilde{{\mathcal {M}}}_t$, we give a convenient characterization for $\tilde{{\mathcal {M}}}_t$ in terms of the noise content of a meter [36]. The noise content $w({\mathsf {B}}; {\mathcal {N}})$ of a meter ${\mathsf {B}}\in {\mathcal {M}}$ with respect to a noise set ${\mathcal {N}}\subset {\mathcal {M}}$ is defined as

$$\begin{aligned} w({\mathsf {B}}; {\mathcal {N}}) = \sup \{ 0 \le \lambda \le 1 \, | \, \exists {\mathsf {C}}\in {\mathcal {M}}, {\mathsf {N}}\in {\mathcal {N}}: \ {\mathsf {B}}= \lambda {\mathsf {N}}+(1-\lambda ) {\mathsf {C}}\}. \end{aligned}$$

The noise content $w({\mathsf {B}}; {\mathcal {N}})$ thus characterizes how much of ${\mathsf {B}}$ is in ${\mathcal {N}}$ with respect to the convex structure of meters. When ${\mathcal {N}}$ is chosen to represent some noise in the meters, the noise content can be interpreted as the amount of the intrinsic noise that is present in the meter (contrary to the external noise that is typically added to a meter). A typical choice for ${\mathcal {N}}$ is to set ${\mathcal {N}}= {\mathcal {T}}$, the set of trivial meters. In this case it can be shown that

$$\begin{aligned} w({\mathsf {B}}; {\mathcal {T}}) = \sum _{x \in \Omega _{\mathsf {B}}} \lambda _{\min }({\mathsf {B}}_x). \end{aligned}$$

(19)

We can now give the following characterization for $\tilde{{\mathcal {M}}}_t$:

Lemma 2

Meter${\mathsf {B}}\in \tilde{{\mathcal {M}}}_t$if and only if$w({\mathsf {B}}; {\mathcal {T}}) \ge 1-t$.

Proof

Let first ${\mathsf {B}}\in \tilde{{\mathcal {M}}}_t$ so that there exists ${\mathsf {C}}\in {\mathcal {M}}$ and $p \in {\mathcal {P}}(\Omega _{\mathsf {B}})$ such that ${\mathsf {B}}= t {\mathsf {C}}+(1-t) p u= t {\mathsf {C}}+(1-t) {\mathsf {T}}$, where we have defined a trivial meter ${\mathsf {T}}\in {\mathcal {T}}$ by ${\mathsf {T}}_x =p_x u$ for all $x \in \Omega _{\mathsf {B}}$. From the definition of the noise content we see that $w({\mathsf {B}}; {\mathcal {T}}) \ge 1-t$.

Let then $w({\mathsf {B}}; {\mathcal {T}}) \ge 1-t$. Since we have the noise set ${\mathcal {N}}= {\mathcal {T}}$, by Eq. (19) the supremum in the definition of the noise content is attained so there exist ${\mathsf {D}}\in {\mathcal {M}}$ and ${\mathsf {T}}\in {\mathcal {T}}$ such that ${\mathsf {B}}= w({\mathsf {B}}; {\mathcal {T}}) {\mathsf {T}}+(1-w({\mathsf {B}};{\mathcal {T}})) {\mathsf {D}}$. We have

$$\begin{aligned} {\mathsf {B}}&= w({\mathsf {B}}; {\mathcal {T}}) {\mathsf {T}}+ (1-w({\mathsf {B}}; {\mathcal {T}}) ) {\mathsf {D}}\\&= (1-t+t+w({\mathsf {B}}; {\mathcal {T}})-1) {\mathsf {T}}+(1- w({\mathsf {B}}; {\mathcal {T}})) {\mathsf {D}}\\&= (1-t) {\mathsf {T}}+ t \left[ \dfrac{t+w({\mathsf {B}};{\mathcal {T}})-1}{t} {\mathsf {T}}+\dfrac{1- w({\mathsf {B}}; {\mathcal {T}})}{t} {\mathsf {D}}\right] \\&= (1-t) {\mathsf {T}}+ t \tilde{{\mathsf {D}}} \in \tilde{{\mathcal {M}}}_t, \end{aligned}$$

where $\tilde{{\mathsf {D}}}= \frac{t+w({\mathsf {B}};{\mathcal {T}})-1}{t} {\mathsf {T}}+ \frac{1- w({\mathsf {B}}; {\mathcal {T}})}{t} {\mathsf {D}}\in {\mathcal {M}}$ is a convex mixture of ${\mathsf {T}}$ and ${\mathsf {D}}$. $\square $

Now we are ready to prove that $\tilde{{\mathcal {M}}}_t$ is a restriction of type (R3) for all $t \in (0,1)$.

Proposition 10

Let$t \in (0,1)$. Then$\tilde{{\mathcal {M}}}_t \ne {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$for any$\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$.

Proof

Let us suppose that $\tilde{{\mathcal {M}}}_t = {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ for some effect restriction $\tilde{{\mathcal {E}}}\subset {\mathcal {E}}$. As it was mentioned earlier, by Proposition 9 we then have that $\tilde{{\mathcal {E}}}= {\mathcal {E}}_{{\mathcal {M}}_{\tilde{{\mathcal {E}}}}} = {\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}$. We will construct a meter ${\mathsf {A}}\in {\mathcal {M}}_{\tilde{{\mathcal {E}}}}$ such that ${\mathsf {A}}\notin \tilde{{\mathcal {M}}}_t$, which will then be a contradiction.

Let us start the construction of ${\mathsf {A}}$, by constructing another meter ${\mathsf {B}}$ with $w({\mathsf {B}};{\mathcal {T}}) =0$ and $\max _{x \in \Omega _{\mathsf {B}}} \lambda _{\max }({\mathsf {B}}_x) \in [t,1)$. We will then use ${\mathsf {B}}$ to construct ${\mathsf {A}}$ and use Lemma 2 together with the previously listed properties of ${\mathsf {B}}$ to show that ${\mathsf {A}}\notin \tilde{{\mathcal {M}}}_t$.

Let us fix an extreme indecomposable effect $e \in {\mathcal {E}}({\mathcal {S}})$. If we set $e_1 = e$ and decompose $u-e$ into indecomposable effects $u-e = \sum _{i=2}^n e_i$ for some $n \in {\mathbb {N}}$, we can define a meter $\tilde{{\mathsf {B}}}$ as $\tilde{{\mathsf {B}}}_i = e_i$ for all $i \in \{1, \ldots ,n\}$ such that all of the effects of $\tilde{{\mathsf {B}}}$ are indecomposable. Since indecomposable effects lie on the boundary of the positive cone of the effect space, we have that $\lambda _{\min }(\tilde{{\mathsf {B}}}_i) =0$ for all $i \in \{1, \ldots ,n\}$ so that by Eq. (19) we have that $w(\tilde{{\mathsf {B}}}; {\mathcal {T}}) =0$.

Let us relabel the outcomes of $\tilde{{\mathsf {B}}}$ in such a way that $\lambda _{\max }(\tilde{{\mathsf {B}}}_i) = 1$ for all $i \in \{1, \ldots , m\}$ for some $m \le n$. We recall that since e is an extreme effect we know that $\lambda _{\max }(e) =1$, and since $e \in \mathrm {ran}(\tilde{{\mathsf {B}}})$ we must have $m\ge 1$. We take $q \in [t,1)$ and define a new meter ${\mathsf {B}}$ with effects

$$\begin{aligned} {\mathsf {B}}_i ={\left\{ \begin{array}{ll} q \tilde{{\mathsf {B}}}_i, &{} i\in \{1, \ldots ,m\} \\ \tilde{{\mathsf {B}}}_i, &{} i \in \{m+1, \ldots , n\} \\ (1-q) \tilde{{\mathsf {B}}}_{i-n}, &{} i \in \{n+1, \ldots , n+m\} \end{array}\right. }. \end{aligned}$$

(20)

By construction we have that $w({\mathsf {B}};{\mathcal {T}}) =0$ and $l_{\mathsf {B}}:=\max _{x \in \Omega _{\mathsf {B}}} \lambda _{\max }({\mathsf {B}}_x) \in [t,1)$.

Let us now take numbers $\{r_i\}_{i=1}^{n+m} \subset [0,1]$ such that $r:= \sum _{i=1}^{n+m} r_i \in \left[ \frac{l_{\mathsf {B}}-t}{(1-t)l_{\mathsf {B}}}, 1 \right) $ and define a new meter ${\mathsf {A}}$ by ${\mathsf {A}}_i = t a_i +(1-t)r_i u$, where we have defined $a_i = \frac{1-(1-t)r}{t} {\mathsf {B}}_i$ for all $i \in \{1, \ldots , n+m\}$. Once we show that ${\mathsf {A}}$ is well-defined and ${\mathsf {A}}\in {\mathcal {M}}_{{\mathcal {E}}_{{\mathcal {M}}_t}}$, we can use Eq. (19) to see that $w({\mathsf {A}}; {\mathcal {T}}) < 1-t$ so that by Lemma 2 we have ${\mathsf {A}}\notin \tilde{{\mathcal {M}}}_t$ which completes the proof.

In order to show that ${\mathsf {A}}$ is well-defined we need to show that ${\mathsf {A}}$ is a meter and that we can choose $\{r_i\}_i$ like we wanted. The problematic parts in the definition of the sequence $\{r_i\}_i$ are that we might have that $\frac{l_{\mathsf {B}}-t}{(1-t)l_{\mathsf {B}}}<0$, which might lead to $r<0$, or $\frac{l_{\mathsf {B}}-t}{(1-t)l_{\mathsf {B}}}\ge 1$, which would leave the interval $\left[ \frac{l_{\mathsf {B}}-t}{(1-t)l_{\mathsf {B}}}, 1 \right) $ empty. However, from the definition of ${\mathsf {B}}$ we see that $l_{\mathsf {B}}=\max _{x \in \Omega _{\mathsf {B}}} \lambda _{\max }({\mathsf {B}}_x) \ge t$ so that $\frac{l_{\mathsf {B}}-t}{(1-t)l_{\mathsf {B}}} \ge 0$, and since $l_{\mathsf {B}}<1$ it is easy to see that $\frac{l_{\mathsf {B}}-t}{(1-t)l_{\mathsf {B}}} < 1$. Thus, we can choose the sequence $\{r_i\}_i$ like we wanted.

In order to show that ${\mathsf {A}}\in {\mathcal {M}}_{{\mathcal {E}}_{{\mathcal {M}}_t}}$ we need to show that $a_i \in {\mathcal {E}}({\mathcal {S}})$ for all $i \in \{1, \ldots ,n+m\}$ and that $\sum _i {\mathsf {A}}_i = u$. Since $r<1<\frac{1}{1-t}$ we see that $\frac{1-(1-t)r}{t} >0$ so that $a_i = \frac{1-(1-t)r}{t} {\mathsf {B}}_i\ge o$ for all $i \in \{1, \ldots , n+m\}$. On the other hand, we have $a_i \le u$ if and only if $\frac{1-(1-t)r}{t} \lambda _{\max }({\mathsf {B}}_i) \le 1$ which is equivalent to $r \ge \frac{\lambda _{\max }({\mathsf {B}}_i) -t}{(1-t) \lambda _{\max }({\mathsf {B}}_i)}$. Since $r \ge \frac{l_{\mathsf {B}}-t}{(1-t) l_{\mathsf {B}}} \ge \frac{\lambda _{\max }({\mathsf {B}}_i) -t}{(1-t) \lambda _{\max }({\mathsf {B}}_i)}$, it follows that $a_i \le u$ for all $i \in \{1, \ldots ,n+m\}$. Thus, $a_i \in {\mathcal {E}}({\mathcal {S}})$ so that ${\mathsf {A}}_i \in {\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}$ for all $i \in \{1, \ldots ,n+m\}$. Furthermore, we see that

$$\begin{aligned} \sum _{i=1}^{n+m} {\mathsf {A}}_i = (1-(1-t)r) \left( \sum _{i=1}^{n+m} {\mathsf {B}}_i \right) +(1-t)r u = (1-(1-t)r)u +(1-t)r u = u. \end{aligned}$$

Hence, ${\mathsf {A}}\in {\mathcal {M}}_{{\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}}$.

For the noise content of ${\mathsf {A}}$, we see that

$$\begin{aligned} w({\mathsf {A}};{\mathcal {T}})&= \sum _{i=1}^{n+m} \lambda _{\min }({\mathsf {A}}_i) =t\left( \sum _{i=1}^{n+m} \lambda _{\min }(a_i)\right) + (1-t)r \\&= (1-(1-t)r) \left( \sum _{i=1}^{n+m} \lambda _{\min }({\mathsf {B}}_i) \right) +(1-t)r \\&= (1-t)r < 1-t, \end{aligned}$$

which by Lemma 2 shows that ${\mathsf {A}}\notin \tilde{{\mathcal {M}}}_t$. $\square $

Thus, we have just demonstrated that if the noise is introduced at the level of meters as in Eq. (17), the induced restriction cannot be reproduced by considering noise on effects alone. However, one can of course start with Eq. (18) and use it as a restriction on its own so that we will naturally arrive at (R1) type of restriction instead. The next example will illustrate this point in quantum theory.

Example 3

(Depolarizing noise in quantum theory) In quantum theory, the standard depolarizing channel $\Phi _t: {\mathcal {L}}({\mathcal {H}})\rightarrow {\mathcal {L}}({\mathcal {H}})$ on a d-dimensional Hilbert space ${\mathcal {H}}$ is defined as

$$\begin{aligned} \Phi _t(\varrho ) = t \varrho +(1-t) \mathrm{tr}\left[ {\varrho }\right] \frac{I}{d} \end{aligned}$$

(21)

for all $\varrho \in {\mathcal {L}}({\mathcal {H}})$ with some noise parameter $t \in [0,1]$. In the Heisenberg picture, the depolarizing noise can be alternatively ascribed to the meters, which results in the restricted set of effects

$$\begin{aligned} \tilde{{\mathcal {E}}}_t = \Phi ^*_t({\mathcal {E}}({\mathcal {S}})) = \left\{ t E +(1-t)\frac{\mathrm{tr}\left[ {E}\right] }{d}I \, : \, E \in {\mathcal {E}}({\mathcal {H}})\right\} , \end{aligned}$$

(22)

where $\Phi ^*_t$ is dual to $\Phi _t$. Clearly $\tilde{{\mathcal {E}}}_t \subseteq {\mathcal {E}}({\mathcal {H}})$ for all $t \in (0,1]$ and the equality holds only if $t=1$. For $t \in (0,1]$ it is straigthforward to verify that $\Phi ^*_t$ is an affine isomorphism between ${\mathcal {E}}({\mathcal {H}})$ and $\tilde{{\mathcal {E}}}_t$ so that by Proposition 3 we can deduce that $\tilde{{\mathcal {E}}}_t$ is a restriction of type (R1) that does not form a convex subalgebra of ${\mathcal {E}}({\mathcal {S}})$. For $t=0$ we have that $\tilde{{\mathcal {E}}}_0 = \mathrm {span}_{[0,1]}\{I\}$, which is a trivial convex subalgebra of every effect algebra.

However, if we consider a class of general (shifted) depolarizing channels $\Psi _{t,\xi }(\varrho ) = t \varrho + (1-t) \mathrm{tr}[\varrho ] \xi $ with a general state $\xi $ instead of the maximally mixed state I/d, then a wider class of effects is achievable. This describes a physically relevant situation when the considered qubit is coupled to a two-level fluctuator [37]. The dual map then reads $\Psi ^*_{t,\xi }(E) = t E+ (1-t) \mathrm{tr}\left[ {E \xi }\right] I$ so that in the case of quantum theory it can be confirmed that $\{\Psi ^*_{t,\xi }({\mathcal {E}}({\mathcal {H}}))\, | \, \xi \in {\mathcal {S}}({\mathcal {H}})\} ={\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}$, i.e., we get all the effects provided by Eq. (18). Clearly, ${\mathcal {E}}_{\tilde{{\mathcal {M}}}_t} \ne \tilde{{\mathcal {E}}}_t$. The effect restrictions ${\mathcal {E}}_{\tilde{{\mathcal {M}}}_t}$ and $\tilde{{\mathcal {E}}}_t$ are depicted in Fig. 1.

7 Discussion and Conclusions

Our primary goal in this paper was to establish a natural criterion that any operational restriction is to satisfy and to classify such restrictions. Given a set of meters one can always randomly switch among the meters and classically postprocess their measurement outcomes. As a result, one readily gets a simulation closure of the original set of meters. Equipped with the natural operational requirement of simulation closedness, we have divided all operational restrictions into three classes.

Class (R1) describes such restrictions that originate from the truncation of the set of effects. We have characterized such sets of effects in Theorem 1. We have demonstrated that a restriction to any convex subalgebra of the set of all effects induces a proper operational restriction of class (R1). Further in Proposition 3 we have proved that there exist operational restrictions of class (R1) that do not reduce to convex effect subalgebras. Proposition 9 clarifies that the effect restriction is unique if the consistency conditions (E1) and (E2) are satisfied.

Surprisingly enough, there exist operational restrictions of class (R2) on meters such that every effect within the no-restriction hypothesis is accessible, however, the set of meters is severely truncated. The most prominent example is effectively dichotomic meters, a simulation closure of dichotomic meters. Moreover, any restriction of class (R2) must contain effectively dichotomic meters as a subset (Proposition 4).

It is worth mentioning that effectively dichotomic meters naturally emerge in conventional experiments with polarized photons and superconducting qubits, and therefore are of great practical interest. Despite the fact that restrictions of class (R2) seem quite innocent as compared to the restrictions of class (R1), they do impose some strong physical limitations. In Example 2 we have demonstrated that the success probability of unambiguous discrimination of nonorthogonal pure qubit states with effectively dichotomic meters is strictly less than that with trichotomic meters. From a wider viewpoint of resource theories [38, 39], Example 2 opens an avenue for the study of the resource theory of n-tomicity. Within such a resource theory, n-outcome meters are free and any simulation scheme for meters is a free operation. A meter that is not n-tomic may represent a resource for some task (as a trichotomic meter in the unambiguous discrimination in Example 2). As a byproduct of this research direction, we have also derived some sufficient and (separately) necessary conditions for effectively n-tomic observables (Propositions 5–7).

Finally, we have demonstrated that there are restrictions that arise rather naturally but belong to neither (R1) nor (R2). We have shown that such restrictions can emerge when one considers meters compatible with a given meter (Proposition 8) or when one tries to account for noise in the meters (Proposition 10 and Example 3). We believe that the operational restrictions of type (R3) can be further analyzed in subsequent works.

Notes

A subset $C \subset V$ of a vector space V is a (convex) cone if $C +C \subseteq C$ and $\alpha C \subseteq C$ for every $\alpha \in {\mathbb {R}}^+$. Furthermore, C is a proper cone if $C \cap (- C) = \{0\}$ and generating if $C -C = V$. A subset $B \subset C$ is a base of C if for every $x \in C \setminus \{0\}$ there exists unique $\beta > 0$ and $b \in B$ such that $x = \beta b$.
Dual cone $C^*\subset V^*$ of a cone $C \subset V$ consists of positive linear functionals on C, i.e., $C^* = \{ f \in V^* \, | \, f(x) \ge 0 \ \forall x \in C\}$.

References

Busch, P., Heinosaari, T., Schultz, J., Stevens, N.: Comparing the degrees of incompatibility inherent in probabilistic physical theories. EPL 103, 10002 (2013)
Article ADS Google Scholar
Stevens, N., Busch, P.: Steering, incompatibility, and Bell inequality violations in a class of probabilistic theories. Phys. Rev. A 89, 022123 (2014)
Article ADS Google Scholar
Banik, M.: Measurement incompatibility and Schrödinger–Einstein–Podolsky–Rosen steering in a class of probabilistic theories. J. Math. Phys. 56, 052101 (2015)
Article ADS MathSciNet Google Scholar
Aubrun, G., Lami, L., Palazuelos, C., Plavala, M.: Entangleability of Cones, (2019). arXiv:1911.09663 [math.FA]
Heinosaari, T., Leppäjärvi, L., Plávala, M.: No-free-information principle in general probabilistic theories. Quantum 3, 157 (2019)
Article Google Scholar
Barnum, H., Barrett, J., Leifer, M., Wilce, A.: Generalized no-broadcasting theorem. Phys. Rev. Lett. 99, 240501 (2007)
Article ADS Google Scholar
Chiribella, G., Spekkens, R.W. (eds.): Quantum Theory: Informational Foundations and Foils. FTPH, vol. 181. Springer, Dordrecht (2016)
Chiribella, G., D’Ariano, G.M., Perinotti, P.: Probabilistic theories with purification. Phys. Rev. A 81, 062348 (2010)
Article ADS Google Scholar
Janotta, P., Lal, R.: Generalized probabilistic theories without the no-restriction hypothesis. Phys. Rev. A 87, 052131 (2013)
Article ADS Google Scholar
Chiribella, G., D’Ariano, G.M., Perinotti, P.: Informational derivation of quantum theory. Phys. Rev. A 84, 012311 (2011)
Article ADS Google Scholar
Barnum, H., Müller, M.P., Ududec, C.: Higher-order interference and single-system postulates characterizing quantum theory. New J. Phys. 16, 123029 (2014)
Article ADS Google Scholar
Wilce, A.: Conjugates, filters and quantum mechanics. Quantum 3, 158 (2019)
Article Google Scholar
Sainz, A.B., Guryanova, Y., Acín, A., Navascués, M.: Almost-quantum correlations violate the no-restriction hypothesis. Phys. Rev. Lett. 120, 200402 (2018)
Article ADS Google Scholar
Clerk, A.A., Devoret, M.H., Girvin, S.M., Marquardt, F., Schoelkopf, R.J.: Introduction to quantum noise, measurement, and amplification. Rev. Mod. Phys. 82, 1155 (2010)
Article ADS MathSciNet Google Scholar
Navascués, M., Popescu, S.: How energy conservation limits our measurements. Phys. Rev. Lett. 112, 140502 (2014)
Article ADS Google Scholar
Amosov, G.G., Filippov, S.N.: Spectral properties of reduced fermionic density operators and parity superselection rule. Quantum Inf. Process. 16, 2 (2017)
Article ADS MathSciNet Google Scholar
Oszmaniec, M., Guerini, L., Wittek, P., Acín, A.: Simulating positive-operator-valued measures with projective measurements. Phys. Rev. Lett. 119, 190501 (2017)
Article ADS MathSciNet Google Scholar
Guerini, L., Bavaresco, J., Cunha, M.T., Acín, A.: Operational framework for quantum measurement simulability. J. Math. Phys. 58, 092102 (2017)
Article ADS MathSciNet Google Scholar
Filippov, S.N., Heinosaari, T., Leppäjärvi, L.: Simulability of observables in general probabilistic theories. Phys. Rev. A 97, 062102 (2018)
Article ADS Google Scholar
Barnum, H., Wilce, A.: Information processing in convex operational theories. Electron. Notes Theor. Comput. Sci. 270, 3 (2011)
Article Google Scholar
Kimura, G., Nuida, K., Imai, H.: Distinguishability measures and entropies for general probabilistic theories. Rep. Math. Phys. 66, 175–206 (2010)
Article ADS MathSciNet Google Scholar
Carmeli, C., Heinosaari, T., Miyadera, T., Toigo, A.: Noise-disturbance relation and the Galois connection of quantum measurements. Found. Phys. 49, 492 (2019)
Article ADS MathSciNet Google Scholar
Gudder, S.: Convex structures and operational quantum mechanics. Commun. Math. Phys. 29, 249 (1973)
Article ADS MathSciNet Google Scholar
Gudder, S.: Finite-Dimensional Convex Effect Algebras (2019). arXiv:1912.05110 [quant-ph]
Foulis, D., Bennett, M.K.: Effect algebras and unsharp quantum logics. Found. Phys. 24, 1331 (1994)
Article ADS MathSciNet Google Scholar
Gudder, S., Pulmannová, S.: Representation theorem for convex effect algebras. Comment. Math. Univ. Carol. 39, 645 (1998)
MathSciNet MATH Google Scholar
Gudder, S., Pulmannová, S., Bugajski, S., Beltrametti, E.: Convex and linear effect algebras. Rep. Math. Phys. 44, 359 (1999)
Article ADS MathSciNet Google Scholar
Beltrametti, E., Bugajski, S.: Effect algebras and statistical physical theories. J. Math. Phys. 38, 3020 (1997)
Article ADS MathSciNet Google Scholar
Bugajski, S., Gudder, S., Pulmannová, S.: Convex effect algebras, state ordered effect algebras, and ordered linear spaces. Rep. Math. Phys. 45, 371 (2000)
Article ADS MathSciNet Google Scholar
Kleinmann, M., Cabello, A.: Quantum correlations are stronger than all nonsignaling correlations produced by $n$-outcome measurements. Phys. Rev. Lett. 117, 150401 (2016)
Article ADS Google Scholar
Kleinmann, M., Vértesi, T., Cabello, A.: Proposed experiment to test fundamentally binary theories. Phys. Rev. A 96, 032104 (2017)
Article ADS Google Scholar
Hu, X.-M., Liu, B.-H., Guo, Y., Xiang, G.-Y., Huang, Y.-F., Li, C.-F., Guo, G.-C., Kleinmann, M., Vértesi, T., Cabello, A.: Observation of stronger-than-binary correlations with entangled photonic qutrits. Phys. Rev. Lett. 120, 180402 (2018)
Article ADS Google Scholar
Helstrom, C.W.: Quantum Detection and Estimation Theory. Academic Press, New York (1976)
MATH Google Scholar
Holevo, A.S.: Probabilistic and Statistical Aspects of Quantum Theory. North-Holland, Amsterdam (1982)
MATH Google Scholar
Reeb, D., Reitzner, D., Wolf, M.M.: Coexistence does not imply joint measurability. J. Phys. A: Math. Theor. 46, 462002 (2013)
Article ADS MathSciNet Google Scholar
Filippov, S.N., Heinosaari, T., Leppäjärvi, L.: Necessary condition for incompatibility of observables in general probabilistic theories. Phys. Rev. A 95, 032127 (2017)
Article ADS Google Scholar
Paladino, E., Galperin, Y.M., Falci, G., Altshuler, B.L.: $1/f$ noise: implications for solid-state quantum information. Rev. Mod. Phys. 86, 361–418 (2014)
Article ADS Google Scholar
Coecke, B., Fritz, T., Spekkens, R.W.: A mathematical theory of resources. Inf. Comput. 250, 59 (2016)
Article MathSciNet Google Scholar
Chitambar, E., Gour, G.: Quantum resource theories. Rev. Mod. Phys. 91, 025001 (2019)
Article ADS MathSciNet Google Scholar

Download references

Acknowledgements

Open access funding provided by University of Turku (UTU) including Turku University Central Hospital. S.N.F. and T.H. acknowledge the support of the Academy of Finland for mobility grants to visit University of Turku and Moscow Institute of Physics and Technology, respectively. T.H. and L.L. acknowledge the support from the Academy of Finland via the Centre of Excellence program (Grant No. 312058) as well as Grant No. 287750. L.L. acknowledges financial support from University of Turku Graduate School (UTUGS).

Author information

Authors and Affiliations

Steklov Mathematical Institute of Russian Academy of Sciences, Moscow, Russia, 119991
Sergey N. Filippov
Valiev Institute of Physics and Technology of Russian Academy of Sciences, Moscow, Russia, 117218
Sergey N. Filippov
Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, Russia, 141700
Sergey N. Filippov
Department of Mathematics, University of Denver, Denver, CO, 80208, USA
Stan Gudder
Department of Physics and Astronomy, QTF Centre of Excellence, University of Turku, 20014, Turku, Finland
Teiko Heinosaari & Leevi Leppäjärvi

Authors

Sergey N. Filippov
View author publications
You can also search for this author in PubMed Google Scholar
Stan Gudder
View author publications
You can also search for this author in PubMed Google Scholar
Teiko Heinosaari
View author publications
You can also search for this author in PubMed Google Scholar
Leevi Leppäjärvi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Leevi Leppäjärvi.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Filippov, S.N., Gudder, S., Heinosaari, T. et al. Operational Restrictions in General Probabilistic Theories. Found Phys 50, 850–876 (2020). https://doi.org/10.1007/s10701-020-00352-6

Download citation

Received: 08 January 2020
Accepted: 15 June 2020
Published: 12 July 2020
Issue Date: August 2020
DOI: https://doi.org/10.1007/s10701-020-00352-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Operational Restrictions in General Probabilistic Theories

Abstract

Similar content being viewed by others

Post-Classical Probability Theory

Hierarchical axioms for quantum mechanics

Quantum Mechanics as a Theory of Probability

1 Introduction

2 Preliminaries

2.1 States, Effects, Meters

Example 1

2.2 Simulation of Meters

3 Three Types of Operational Restrictions

Remark 1

4 Restriction Class (R1) and Convex Effect Algebras

4.1 Characterization of (R1) Restrictions

Lemma 1

Proof

Theorem 1

Proof

4.2 Convex Effect Algebras

4.3 Characterization of Convex Effect Algebras and Subalgebras

Theorem 2

Theorem 3

Proof

4.4 Subalgebras and Restrictions

Proposition 1

Proof

Proposition 2

Proof

Proposition 3

Proof

5 Restriction Class (R2) and Effectively n-Tomic Meters

Proposition 4

Proof

Proposition 5

Proof

Proposition 6

Proof

Proposition 7

Proof

Example 2

6 Restriction Class (R3), Noise and Compatibility

6.1 Compatibility Restriction

Proposition 8

Proof

6.2 Noise Restriction on Meters

Proposition 9

Proof

Lemma 2

Proof

Proposition 10

Proof

Example 3

7 Discussion and Conclusions

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation