Conjugates, Filters and Quantum Mechanics
Alexander Wilce
Department of Mathematics and Computer Science, Susquehanna University
July 2, 2019
The Jordan structure of ﬁnite-dimensional quantum theory is derived, in a conspic-
uously easy way, from a few simple postulates concerning abstract probabilistic models
(each deﬁned by a set of basic measurements and a convex set of states). The key
assumption is that each system A can be paired with an isomorphic conjugate system,
A, by means of a non-signaling bipartite state η
A
perfectly and uniformly correlating
each basic measurement on A with its counterpart on A. In the case of a quantum-
mechanical system associated with a complex Hilbert space H, the conjugate system
is that associated with the conjugate Hilbert space H, and η
A
corresponds to the stan-
dard maximally entangled EPR state on H H. A second ingredient is the notion
of a reversible ﬁlter, that is, a probabilistically reversible process that independently
attenuates the sensitivity of detectors associated with a measurement. In addition to
oﬀering more ﬂexibility than most existing reconstructions of ﬁnite-dimensional quan-
tum theory, the approach taken here has the advantage of not relying on any form
of the “no restriction” hypothesis. That is, it is not assumed that arbitrary eﬀects
are physically measurable, nor that arbitrary families of physically measurable eﬀects
summing to the unit eﬀect, represent physically accessible observables. (An appendix
shows how a version of Hardy’s “subpace axiom” can replace several assumptions
native to this paper, although at the cost of disallowing superselection rules.)
1 Introduction and Overview
A number of recent papers, notably [11, 14, 20, 25, 28], have succeeded in deriving the mathemat-
ical apparatus of ﬁnite-dimensional quantum mechanics (henceforth: QM) from various packages
of broadly operational, probabilistic, or information-theoretic assumptions. These assumptions
are, however, rather strong, and the derivations themselves are not trivial. This paper aims at a
slightly broader target, and ﬁnds it much easier to hit.
Speciﬁcally, the Jordan structure of ﬁnite-dimensional quantum theory is derived, in a con-
spicuously easy way, from a few simple principles. This still brings us within hailing distance
of standard QM, owing to the classiﬁcation theorem for ﬁnite-dimensional formally real Jordan
algebras as direct sums of real, complex and quaternionic quantum systems, spin factors (“bits”
of arbitrary dimension), and the exceptional Jordan algebra [22]. In contrast, all of the cited
reconstructions make use of strong axioms that rule out real and quaternionic systems, and even
complex quantum systems with superselection rules, more or less by ﬁat. Since there are good
arguments for taking real and quaternionic quantum systems seriously (see [4] for a forceful ar-
gument in this direction), it is of interest to have an axiomatic scheme that accommodates them.
Alexander Wilce: wilce@susqu.edu
Accepted in Quantum 2019-06-24, click title to verify 1
arXiv:1206.2897v8 [quant-ph] 1 Jul 2019
I shall have more to say on this point below.
Correlation in quantum mechanics The approach taken here begins with a simple and well-
known observation about ﬁnite-dimensional quantum systems. Let H be an n-dimensional com-
plex Hilbert space
1
, representing a ﬁnite-dimenional quantum system. Recall that the conjugate
Hilbert space,
H, is the same abelian group, but endowed with the scalar multiplication (c, x) 7→ cx
(where the scalar multiplication on the right is that in H, and c is the complex conjugate of c C),
and with inner product (x, y) 7→ hy, xi. It is customary to write x for the vector x H, regarded
as a vector in H, so that cx = cx, or, equivalently, cx = c x. The inner product on H is then
given by hx, yi = hy, xi = hx, yi.
2
Suppose now that W is any density operator on H, with spectral decomposition W =
P
xE
λ
x
p
x
for some orthonormal basis E, where p
x
is the rank-one projection associated with a
unit vector x E. Then W is the marginal, or reduced state, of the pure bipartite state
Ψ
W
:=
X
xE
λ
1/2
x
x x H H (1)
The fact that mixed quantum-mechanical states arise in this way as marginals of pure states on
larger systems is the starting point for the reconstruction of QM in [11]. Here, we focus instead
on the correlational features of Ψ
W
. A straightforward calculation shows that if a, b are any two
operators on H, then
h(a b
W
, Ψ
W
i = Tr(W
1/2
aW
1/2
b).
In particular, if a and b commute with W , then we have
h(a b
W
, Ψ
W
i = Tr(W ab). (2)
It follows that the state Ψ
W
perfectly correlates any projection-valued observable that commutes
with W , with its counterpart on H: if a and b are mutually orthogonal projections, both com-
muting with W , then the joint probability of observing a and b is hΨ
W
, a bi = Tr(Wab) = 0,
while the joint probability of a and a is hΨ
W
, a ai = Tr(W a). Where a = p
x
is the rank-one
projection associated with a unit vector x, this means that the conditional state of the conjugate
system, given a measurement result x on the system corresponding to H, is the “collapsed” state
corresponding x. In eﬀect, the entangled state Ψ
W
allows the conjugate system to retain a record
of the measurement result on the ﬁrst system even though no signal need have passed between
the two.
A striking special case arises where W =
1
n
1, the maximally mixed state: in this case, Ψ
W
is
the “EPR” state
Ψ =
1
n
X
xE
x x,
the expansion being independent of the choice of the orthonormal basis E. As every observable
commutes with W , Ψ perfectly, and uniformly, correlates every observable on H with its coun-
terpart on H. Thus, if we imagine that the system corresponding to H is controlled by Alice and
that corresponding to H, by Bob, then if Alice and Bob happen to make the same measurement,
1
A word on notation: I follow the mathematicians’ convention that a complex inner product h , i is conjugate-
linear in the second, rather than the ﬁrst argument. Thus, in terms of Dirac notation, hx, yi = hy|xi.
2
One can think of H as the space of bras hx| corresponding to the kets |xi H, but I prefer to avoid this
representation, since I want to stress the idea that H represents a quantum system in its own right. Thus, using
Dirac notation we might write |xi = hx|.
Accepted in Quantum 2019-06-24, click title to verify 2
they are bound to obtain the same result, with uniform probability 1/n. Notice, also, that by (2)
we have
h(a b, Ψi =
1
n
Tr(ab)
for all observables a and b, so the state Ψ in some sense explains the normalized trace inner product.
Correlation in General Probabilistic Theories These correlational features make sense in
a much more general setting. As explained in more detail below, a probabilistic model is char-
acterized by a set of basic measurements or experiments, and a convex set of states, with each
state α assigning a probability α(x) to every outcome x of every basic measurement. Given two
such models A and B, a bipartite state ω on A and B is an assignment of joint probabilities
ω(x, y) to all outcomes x and y of basic A- and B-measurements, respectively, having well deﬁned
conditional and marginal (reduced) probability weights corresponding to states of A and B.
We now impose some restrictions on the probabilistic models under consideration. First, we
require all state spaces to be ﬁnite-dimensional (we are, after all, only attempting to recover
ﬁnite-dimensional QM). Secondly, we require that models be uniform, in the sense that
(i) all basic measurements have a common, ﬁnite number of outcomes, n, called the rank of A;
and
(ii) there exists a maximally mixed state, ρ, deﬁned by ρ(x) = 1/n for all basic measurement
outcomes x
These conditions are satisﬁed by ﬁnite-dimensional quantum-mechanical models, including those
involving superselection sectors, provided that we restrict attention to maximal observables, i.e.,
those consisting of rank-one projections. More generally, condition (i) is reasonable if we think
of basic measurements as maximally informative, so that each has the largest possible number of
outcomes, and cannot be further reﬁned. Given condition (i), the maximally-mixed state is well-
deﬁned mathematically, so in (ii), we are only requiring that it count as a physically accessible
state.
The following is a direct translation of the correlational features discussed above for quantum-
mechanical systems, into the language of probabilistic models.
Deﬁnition 1. A conjugate of a (uniform) probabilistic model A is a model A, together with an
isomorphism γ taking each basic measurement outcome x of A to an outcome x := γ(x) of A,
such that
(a) Every state α of A is the marginal of some state ω on A and
A (in general, depending
on α) correlating some basic measurement E of A with its counterpart on A so that
for all x E,
ω(x, x) = α(x)
so that ω(x, y) = α(x)δ
x,y
.
(b) The maximally mixed state ρ arises as the marginal of a bipartite state η
A
uniformly
correlating every basic measurement with its counterpart, in the sense that
η
A
(x, x) =
1
n
for all basic measurement outcomes x, where n is the rank of A.
Evidently, in the quantum-mechanical case, where α corresponds to a density operator W ,
the state Ψ
W
supplies the correlating state ω, while the bipartite state η corresponds to the EPR
state Ψ
1
n
1
= Ψ.
Accepted in Quantum 2019-06-24, click title to verify 3
Mathematically, the existence of a conjugate system has aﬃnities with the puriﬁcation postu-
late of [11], though we do not require the correlating bipartite state ω above to be pure. Physically,
a conjugate system A allows for the formation of records of the outcomes of measurements on A in
causally separated systems, exactly as in the quantum case. Condition (a) above simply requires
that, for every state α, there be at least one basic measurement on A that can be thus recorded
and later “read oﬀ” by performing the corresponding measurement on
A. Condition (b) requires
that, where A is in the maximally mixed state, it be possible to record every basic measurement
in this way.
From correlation to Jordan algebras Remarkably little is required, beyond the existence of a
conjugate, to secure a representation of A in terms of a formally real Jordan algebra. This depends
on a classic mathematical result, the Koecher-Vinberg Theorem [18]. A ﬁnite-dimensional ordered
vector space E with positive cone E
+
is self-dual if it carries an inner product such that a E
+
iﬀ ha, bi 0 for all b E
+
. If the group of invertible linear mappings E E carrying E
+
onto
itself acts transitively on the interior of the cone E
+
, then E is said to be homogeneous. The
Koecher-Vinberg Theorem asserts that if E is both homogeneous and self-dual, it can be endowed
with a formally real Jordan structure for which E
+
= {a
2
|a E}.
Any probabilistic model A gives rise in a natural way to two ordered vector spaces: a space
V(A), generated by A’s states, and a space E(A) V(A)
generated by evaluation functionals
b
x : α 7→ α(x) associated with basic measurement outcomes x. Since we are assuming that the state
space is ﬁnite dimensional, both of the spaces E(A) and V(A) are also ﬁnite dimensional, and it
is easy to see that they have the same dimension. If we can show that E(A) is homogeneous and
self-dual, then the Koecher-Vinberg Theorem will provide a formally real (equivalently, euclidean)
Jordan structure on E(A) for which the cone of squares coincides with E(A)
+
.
Call a probabilistic model A sharp if, for every basic measurement outcome x, there is a unique
state δ
x
with δ
x
(x) = 1. Physically, this is a way of saying that basic measurements are maximally
informative: if we can predict the outcome with certainty, we know the system’s state exactly.
Theorem 1. Suppose A is sharp and has a conjugate. Then the state η
A
gives rise to a self-
dualizing inner product on E(A), with respect to which E(A) and V(A) are isomorphic as ordered
vector spaces.
It follows that if V(A) is homogeneous, so is E(A), whence, by the Koecher-Vinberg Theorem,
the latter carries a formally real Jordan structure. But the homogeneity of V(A) has a direct
physical interpretation: it says that for every non-singular state that is, every state α with
α(x) > 0 for every basic measurement outcome — there exists a probabilistically reversible process
T deﬁned below, but, roughly, one that can be reversed by another process with non-zero
probability such that T (ρ) = rα where r [0, 1]. In other words, every non-singular state
can be prepared, up to normalization, by applying a probabilistically reversible process to the
maximally mixed state.
In fact, it is enough to assume less. By a ﬁlter for a basic measurement with outcome-set E, I
mean a process Φ that is, a positive linear mapping Φ : V(A) V(A) that independently
attenuates the reliability of each outcome x E, so that for every state α, Φ(α)(x) = t
x
α(x) for
some constant t
x
(independent of α). If we think of basic measurement outcomes as detectors,
the existence of such ﬁlters, with arbitrary coeﬃcients, is plausible; in standard QM, not only do
they exist but, if t
x
> 0 for every x E, then Φ can be chosen to be probabilistically reversible.
Where a probabilistic model A shares this feature, I will say that A has arbitrary reversible ﬁlters.
Corollary 1. Suppose that A is sharp, has a conjugate, and has arbitrary reversible ﬁlters. Then
E(A) is homogeneous and self-dual.
Accepted in Quantum 2019-06-24, click title to verify 4
If we adopt a stronger assumption about ﬁlters, we can weaken the requirement that A have
a conjugate, and eliminate entirely the hypothesis that A is sharp. Let us say that A has a weak
conjugate if the maximally mixed state ρ is the marginal of a uniformly correlating state η
A
on A
and A, as in condition (b) in Deﬁnition 1, but not assuming that every state is the marginal of a
correlating state, i.e., not assuming condition (a). That is, we require only that there exist a joint
state on two copies of A — an analogue of the EPR state — in which, if the same measurement is
performed on each copy, the results are guaranteed to be the same, but are otherwise completely
random.
By applying the ﬁlter Φ to the system A, and then computing the canonical bipartite state
η
A
, we obtain a new bipartite state. Equally, we could begin by applying the counterpart of Φ to
the conjugate system, obtaining another bipartite state. If these two bipartite states are in fact
the same, we say that Φ is symmetric.
Corollary 2. Let A have a weak conjugate. If every non-singular state of A can be prepared, up
to normalization, from the maximally mixed state by a symmetric, reversible ﬁlter, then E(A) is
homogeneous and self-dual.
The proofs of these results are all quite short and straightforward. In summary, we recover
euclidean Jordan algebras from either of two distinct, but related, sets of assumptions about
physical systems represented by uniform, ﬁnite-dimensional probabilistic models:
(a) systems are sharp;
(b) Systems have conjugates; and
(c) Systems have arbitrary reversible ﬁlters.
Alternatively, and more compactly:
(b
0
) Systems have weak conjugates;
(c
0
) All non-singular states can be prepared by reversible symmetric ﬁlters
Any euclidean Jordan algebra gives rise to a probabilistic model in which basic measurements
correspond to Jordan frames, i.e., sets {e
i
} of minimal idempotents satisfying e
i
·
e
j
= 0 for i 6= j,
and
P
i
e
i
= 1, where 1 is the Jordan unit. (In standard QM, these would correspond to maximally
ﬁne-grained projective measurements.) In Appendix A, it is shown that any such model satisﬁes
all of the assumptions above and, conversely, if A satisﬁes either package of assumptions, the
Jordan product on E(A) can be chosen so that the set of basic measurements is precisely the
set of Jordan frames. Thus, these two sets of assumptions are in fact equivalent, and exactly
characterize this class of euclidean Jordan-algebraic probabilistic models.
There is actually a third possibility, in which condition (b) in the deﬁnition of a conjugate is
replaced by a rather weak symmetry assumption and a version of Hardy’s subspace axiom [20].
Again, the resulting package of assumptions is satisﬁed by, and hence, characterizes, Jordan mod-
els. The details are spelled out in Appendix B.
Other reconstructions of QM The approach of this paper oﬀers some signiﬁcant advantages
over the reconstructions of quantum mechanics cited earlier. First, it is simply easier, in the sense
that our results are obtained with less mathematical eﬀort. (This, notwithstanding the length
Accepted in Quantum 2019-06-24, click title to verify 5
of this paper, which owes to the inclusion of many details intended to make the paper easier to
follow.)
3
Secondly, it rests on fewer and (arguably) simpler assumptions. Certainly, the second package
of asumptions (that is, (b
0
) and (c
0
) above) is smaller than anything found in earlier reconstructions
of QM. Other reconstructions tend to impose strong constraints on subsystems, in eﬀect assuming
that every face of the state space corresponds to the state space of a “sub-system”, satisfying the
remaining axioms. Nothing like this is needed here (though, as mentioned above, if one ﬁnds a such
a “subspace axiom” compelling, it can be put to good use in the present approach; see Appendix
B for details). A related assumption, also used in several of the cited papers, is that all systems
having the same “information capacity” — the maximal number of states sharply distinguishable
by single-shot measurement are isomorphic. The present approach entirely avoids such an
assumption. It also does without the assumption, commonly called the no-restriction hypothesis
[21], used in [25] for bits, that all mathematically possible eﬀects that is, aﬃne functionals
assigning probabilities to states — correspond to physically accessible measurement results. More
recently, the interesting paper [7] derives the same Jordan-algebraic structure arrived at here, but
in a diﬀerent way. In addition to a strong symmetry postulate, this paper assumes a weak form of
the no restriction hypothesis, namely, that all ﬁnite sets of “allowed” eﬀects that sum to the unit
eﬀect (the eﬀect identically 1 on all states) correspond to accessible measurements, along with a
kind of spectral decomposition for states. Here, we manage without any form of no-restriction
assumption, and a spectral decomposition for states is derived, rather than postulated.
Finally, all of the earlier reconstructions of QM cited above assume some form of local to-
mography. This is the doctrine that the state of a bipartite system is determined by the joint
probabilities it assigns to outcomes of measurements on the two component systems. This prin-
ciple has a certain intuitive appeal; moreover, it is well known, and easy to see on dimensional
grounds, that among ﬁnite-dimensional real, complex and quaternionic quantum mechanics, only
in the complex version are composites locally tomographic.
More generally [19, 8], the only probabilistic theory in which systems correspond to Jordan
models and composite systems are locally tomographic, and which includes at least one system
having the structure of a qubit, is ﬁnite-dimensional complex quantum mechanics. Thus, if one in-
sists on local tomography, it can be added to the list of assumptions discussed above, and leads to
standard, complex QM (with superselection rules). One should perhaps not rush to embrace local
tomography as a universal principle, however. The very fact that it excludes real and quaternionic
quantum theory suggests that it is too strong. There are natural ways of representing complex
Hilbert spaces in terms of real or quaternionic ones, and vice versa
4
; moreover, these representa-
tions have physical meaning, in that bosonic or fermionic (complex) quantum systems can very
naturally be modelled in terms of the corresponding real or, respectively, quaternionic Hilbert
spaces. Again, see, e.g., [4] a cogent development of this line of thought. In any case, it seems
valuable to be able to delineate clearly what does and what does not depend on this assumption,
particularly if we are interested in the possibilities for a “post-quantum” theory.
3
And also notwithstanding my appeal to the Koecher-Vinberg Theorem, as this is itself a very accessible result.
See [18] for a not terribly taxing proof. A number of the other reconstructions mentioned here also depend on
nontrivial mathematical results, e.g., the classiﬁcation of transitive actions of compact groups on spheres is used in
[25].
4
A real or quaternionic Hilbert space can be regarded as a complex Hilbert space equipped with a designated anti-
unitary operator J satisfying, respectively, J
2
= 1 or J
2
= 1; conversely, a complex Hilbert space is essentially
equivalent to a real or quaternionic one equipped with, respectively, an orthogonal or a simplectic operator J
satisfying J
2
= 1 or J
2
= 1.
Accepted in Quantum 2019-06-24, click title to verify 6
Organization The balance of this paper is arranged as follows. Section 2 provides general
background on probabilistic models, ordered vector spaces, Jordan algebas and so on, making
more precise many of the technical terms used above. This material will be familiar to many, but
probably not to all, readers. Section 3 contains the proof of Theorem 1; Corollaries 1 and 2 are
proved in Section 4. Section 5 collects some ﬁnal thoughts, inlcuding a few further remarks on
how the approach of this paper compares to the reconstructions of QM cited above. Appendix A
contains additional information on probabilistic models associated with euclidean Jordan algebras,
and Appendix B shows how a version of the “subspace axiom”, plus a symmetry assumption, can
replace some of the assumptions native to this paper.
Several of the ideas developed here were earlier explored, and somewhat similar results derived,
in [30] and [31], but the approach taken here is much simpler and more direct, and seems to go a
good deal farther.
2 Background
The mathematical framework for this paper is that of “generalized probabilistic theories” [10], in
the idiom of [30, 9], which I now quickly review.
5
In a few places, set oﬀ in numbered deﬁnitions,
my usage diﬀers slightly from that of these last-cited works. See [18, 2] for more information on
ordered vector spaces and Jordan algebras.
Ordered vector spaces An ordered vector space is a real vector space E equipped with a closed,
convex cone E
+
with E
+
E
+
= {0} and E = E
+
E
+
that is, E is spanned by E
+
.
The cone induces a partial order, invariant under translation and multiplication by non-negative
scalars, given by a b iﬀ b a E
+
. As an illustration, the space R
X
of real-valued functions on
a set X is ordered by the cone R
X
+
of functions taking non-negative values. Another example is
the space L
sa
(H) of hermitian operators on a real, complex or quaternionic Hilbert space, ordered
by the cone of positive semi-deﬁnite operators.
A linear mapping T : E F between ordered vector spaces is positive iﬀ T (E
+
) F
+
. If
T is bijective and T
1
is also positive, then T is an order isomorphism. If E and F are ﬁnite
dimensional with dim(E) = dim(F ), T is an order isomorphism iﬀ T (E
+
) = F
+
. The dual space
E
of a ﬁnite-dimensional ordered linear space carries a natural ordering, deﬁned by the dual
cone, E
+
, consisting of positive linear functionals f E
.
Probabilistic Models As discussed above, a probabilistic model is characterized by a set M(A)
of basic measurements or tests, and a set Ω(A) of states. It is convenient to identify each test
with its outcome-set, so that M(A) is simply a collection of non-empty sets (a test space, in the
language of [9]). Let X(A) stand for the union of this collection; that is, X(A) is the space of
all outcomes of all basic measurements. States are understood as assignments of probabilities to
measurement-outcomes, that is, as functions α : X(A) [0, 1] such that
P
xE
α(x) = 1 for all
tests E M(A) (but not all such functions necessarily correspond to states). As mentioned above,
a state α Ω(A) is non-singular iﬀ α(x) > 0 for all x X(A). To reﬂect the possibility of forming
statistical mixtures, I also assume that Ω(A) is convex, that is, if p
1
, ..., p
n
are non-negative real
numbers summing to 1, and α
1
, ..., α
n
are states in Ω(A), then the function p
1
α
1
+ · · · + p
n
α
n
also
5
This is a variant of the standard “convex-operational” framework developed in the 1960s and 1970s by Ludwig,
Davies and Lewis and others (e.g., [24, 15, 16]), specialized to ﬁnite dimensions, and with additional structure
deriving from work of Foulis and Randall [17].
Accepted in Quantum 2019-06-24, click title to verify 7
belongs to Ω(A). Finally, I assume that Ω(A) is closed under pointwise limits, whence, compact
as a subset of [0, 1]
X(A)
in the product topology.
By way of illustration, in the simplest classical model, M(A) consists of a single, ﬁnite test,
and Ω(A) is the simplex of all probability weights on that test. Of more immediate interest to
us is the quantum model A(H) = (M(H), Ω(H)) associated with a complex Hilbert space H.
The test space M(H) is the set of orthonormal bases of H; thus, the outcome-space X(H) is
the set of unit vectors of H. The state space Ω(H) consists of the quadratic forms associated
with density operators on H, so that a state α Ω(H) has the form α(x) = hW
α
x, xi for some
density operator W
α
, and all unit vectors x X(H). Real and quaternionic quantum models,
corresponding to real or quaternionic Hilbert spaces, are deﬁned in the same way.
Remark: Not every physically accessible observable on a ﬁnite-dimensional quantum system is
represented by an orthonormal basis. Rather, the general observable corresponds to a positive-
operator-valued measure. Similarly, for an arbitrary probabilistic model A, the test space M(A)
may, but need not, represent a complete catalogue of all possible measurements one might make
on the system represented by A: rather, it is some privileged (or perhaps, simply convenient)
catalogue of such measurements, suﬃciently large to determine the system’s states.
The spaces V(A) and E(A). Any probabilistic model A gives rise in a canonical way to an
ordered vector space V(A). This is simply the span of the state space Ω(A) in the space R
X(A)
,
ordered by the cone V(A)
+
of non-negative multiples of states; that is, β V(A)
+
iﬀ β =
for some state α Ω(A) and some real constant t 0. An element of V(A)
+
of the form
with α Ω(A) and t 1 is said to be sub-normalized. One can show that the interior of V(A)
+
consists exactly of multiples of non-singular states. The dimension of a model A is the dimension
of V(A). As mentioned in the introduction, it is assumed in this paper that all probabilistic models
are ﬁnite-dimensional.
There is a canonical positive linear functional u
A
: V(A) R, called the unit eﬀect of A, given
by u
A
(α) =
P
xE
α(x), where E is any test in M(A). Note that if α V(A)
+
, then u
A
(α) = 1
precisely when α Ω(A), and that every non-zero element α V(A)
+
has the form β = for
a unique t = u
A
(β) > 0. Thus, any non-zero β V(A)
+
can be normalized to yield a state
e
β = u
A
(β)
1
β Ω(A). A positive linear functional f V(A)
+
with f u
A
is usually called
an eﬀect on A, and can be thought of as representing an “in-principle” measurement outcome,
with probability 0 f(α) 1 in state α. Every outcome x X(A) corresponds to an eﬀect
b
x : V(A) R, given by
b
x(α) = α(x) for all α V(A), and u
A
=
P
xE
b
x.
If A = A(H) is the quantum mechanical model associated with a ﬁnite-dimensional Hilbert
space H, as discussed above, then, identifying Ω(H) with the convex set of density operators on
H, V(A) can be identiﬁed with the ordered vector space L(H) of self-adjoint operators on H,
with its usual cone of positive operators. We can then also identify V(H)
with L(H), using the
trace inner product. That is, if a L(H), we can deﬁne a positive linear functional a V(A)
by setting a(α) = Tr() for all α V(A) = L(H), and all such functionals arise uniquely from
elements of L(H). In this setting, measurement-outcomes are unit vectors of H, and, for each
x X(H), and for each density operator α on V(A),
b
x(α) = Tr(αp
x
) where p
x
is the rank-one
projection associated with x. More generally, eﬀects correspond to positive operators between 0
and 1.
The spectral theorem for self-adjoint operators tells us that every eﬀect in V(A(H)) = L(H)
is a positive linear combination of functionals
b
x corresponding to measurement outcomes. This
will not be the case for probabilistic models in general. It is therefore useful to deﬁne a smaller
cone, as follows:
Accepted in Quantum 2019-06-24, click title to verify 8
Deﬁnition 2. The space E(A) is the span, in V(A)
, of the set of eﬀects
b
x associated with
outcomes x X(A), ordered by the cone E(A)
+
of ﬁnite linear combinations
P
i
t
i
b
x
i
, x
i
X(A),
with coeﬃcients t
i
0.
Since we are assuming that V(A) and (hence) V(A)
are ﬁnite-dimensional, the set of func-
tionals
b
x in fact spans V(A)
(since they separate points of V(A).) Thus, E(A) and V(A)
are
identical as vector spaces. However, their cones are generally quite diﬀerent. If a E(A)
+
, then
a(α) 0 for all α V(A)
+
, so the cone E(A)
+
is contained in the dual cone V(A)
+
, but the
inclusion is usually proper. Thus, E(A) and V(A) are generally distinct as ordered vector spaces.
Remark: The space E(A) will be a useful technical tool in what follows, but should not necessarily
be regarded as anything more than that. In particular, it is not assumed that all physically mean-
ingful eﬀects reside in E(A)
+
, nor that every eﬀect in E(A)
+
is physically meaningful. In fact, it
will not be necessary to take any position at all on which eﬀects, other than those associated with
measurement outcomes, are physically signiﬁcant. Thus, as mentioned in the introduction, we
avoid the so-called no-restriction hypothesis [21], namely, the asumption that all eﬀects in V(A)
+
are physically accessible. This assumption is often made in the literature, sometimes explicitly
(e.g., [25]), sometimes not.
Processes A physical process on a system represented by a probabilistic model A is naturally
represented by an aﬃne (that is, convex-linear) mapping T : Ω(A) V(A) such that, for every
α Ω(A), T (α) = for some β Ω(A) and some constant 0 p 1 (depending on α), which
we can regard as the probability that the process occurs, given that the initial state is α.
6
Such
a mapping extends uniquely to a positive linear mapping T : V(A) V(A) with T (α)(u
A
) 1
for all α Ω(A).
Deﬁnition 3. A process T : V(A) V(A) is probabilistically reversible hereafter, just p-
reversible
7
iﬀ there is another process, S, such that, for every state α, there exists a constant
p (0, 1] with S(T (α)) = .
In other words, S allows us to recover α from T (α), up to normalization. It is not hard to see
that p must be independent of α, so that S = pT
1
. In particular, T is an order-automorphism
of V(A).
A process T : V(A) V(B) has a dual action on V (A)
, given by T
(f) = f T for all
f V(A)
, with T
(u
A
) u
A
. T is lossless iﬀ T
(u
A
) = u
A
. In our ﬁnite-dimensional setting,
we can identify V(A)
with E(A) as vector spaces, but not, generally, as ordered vector spaces.
While T
will preserve the dual cone V(A)
+
, it is not required, a priori, that T
preserve the
cone E(A)
+
V(A)
. This reﬂects the idea that not every physically accessible measurement
need appear among the tests in M(A), as discussed above.
Self-Duality and Jordan Algebras. For both classical and quantum models, the ordered spaces
E(A) and V(A) are isomorphic. In the former case, where M(A) consists of a single test E and
Ω(A) is the simplex of all probability weights on E, we have V(A) ' R
E
and E(A) ' (R
E
)
, with
the standard inner product on R
E
providing the order-isomorphism. If H is a ﬁnite-dimensional
6
To be clear, we are not suggesting that all such positive mappings on V(A) represent physically allowable
processes. Indeed, in QM, only completely positive mappings are physically allowable.
7
It should be noted that my usage is slightly nonstandard here: ordinarily, the adjective reversible is reserved
for processes that are probabilistically reversible, in the above sense, with probability one.
Accepted in Quantum 2019-06-24, click title to verify 9
real or complex Hilbert space, we have an aﬃne isomorphism between the state space of Ω(H)
and the set of density operators on H, allowing us to identify V(A(H)) with the space L
sa
(H)
of self-adjoint operators on H, ordered by the cone of positive operators. For any x X(H),
the evaluation functional
b
x V(A) is then given by W 7→ hW x, xi = Tr(W P
x
). It follows that
E(A(H)) ' L
sa
(H)
' L
sa
(H), with the latter isomorphism implemented by the trace inner
product.
More generally, call an inner product h , i on an ordered vector space E positive iﬀ ha, bi 0
for all a, b E
+
. We then have a positive linear mapping E E
, namely a 7→ ha, ·i. If this
is an order-isomorphism, one says that E is self-dual with respect to this inner product. This is
equivalent to the condition a E
+
iﬀ ha, bi 0 for all b E
+
. In this language, the standard
inner product on R
E
and the trace inner product on L
sa
(H) are self-dualizing, for any ﬁnite set
E and ﬁnite-dimensional Hilbert space H.
In fact, any euclidean Jordan algebra, ordered by its cone of squares, is self-dual with respect
to its canonical inner product. Recall here that a Jordan algebra is a real commutative (but not
necessarily associative) unital algebra (J ,
·
) satisfying the Jordan identity
a · (a
2
·
b) = a
2
·
(a
·
b)
for all a, b J (with a
2
= a
·
a). A euclidean Jordan algebra (EJA) is a ﬁnite-dimensional Jordan
algebra J equipped with an inner product h , i such that ha
·
b, ci = hb, a
·
ci for all a, b, c J . This
is equivalent to the condition that J be formally real, i.e, that
P
k
i=1
a
2
i
= 0 implies a
i
= 0 for all i.
A EJA J is also an ordered vector space with positive cone J
+
= {a
2
|a E}, and it can be shown
that this cone is self-dual with respect to the given inner product [18]. Examples of euclidean
Jordan algebras include the space L
sa
(H) of self-adjoint operators on a ﬁnite-dimensional real,
complex or quaternionic Hilbert space H, with a · b =
1
2
(ab + ba), and with ha|bi = Tr(ab).
The exceptional Jordan algebra of self-adjoint hermitian matrices over the Octonions is also a
euclidean Jordan algebra. Finally, one obtains a euclidean Jordan algebra, called a spin factor, by
deﬁning on V
n
:= R × R
n
a product (t, x) · (s, y) = (ts + hx, yi, ty + sx). This essentially exhausts
the possibilities: according to the Jordan-von Neumann-Wigner classiﬁcation theorem [22], every
euclidean Jordan algebra is a direct sum of euclidean Jordan algebras of these ﬁve types.
We can associate a probabilistic model to an EJA J in the following way. An idempotent in
J is an element e = e
2
. An idempotent is minimal, or primitive, iﬀ for any idempotent f e,
f = 0 or f = e. Two idempotents e, f are Jordan-orthogonal iﬀ e
·
f = 0. A maximal pairwise
Jordan-orthogonal set {e
1
, ..., e
n
} of primitive idempotents summing to the Jordan unit is called
a Jordan frame.
Deﬁnition 4. The Jordan model A(J ) = (X(J ), M(J ), Ω(J )) corresponding to a euclidean
Jordan algebra J has X(J ) the set of primitive idempotents, M(J ), the set of Jordan frames of
J , and Ω(J ) the set of states of the form α(x) := ha, xi where a E(A)
+
satisﬁes ha, 1i = 1. The
spectral theorem for EJAs ([18], Theorem III.1.2) expresses every a J in the form a =
P
xE
t
x
x
where E is a Jordan frame. Therefore, J = E(A), and the model A(J ) is self-dual.
Besides self-duality, all euclidean Jordan algebras share a property called homogeneity: the
group of order-automorphisms of J acts transitively on the interior of the positive cone J
+
.
The Koecher-Vinberg Theorem [18] states that, conversely, any ﬁnite-dimensional homogeneous,
self-dual, homogenous ordered vector space J can be equipped with the structure of a euclidean
Jordan algebra.
Deﬁnition 5. A probabilistic model A is homogeneous iﬀ V(A) is homogeneous, and self-dual iﬀ
E(A) carries an inner product with respect to which it is self-dual and E(A)
+
' V(A)
+
, in the
Accepted in Quantum 2019-06-24, click title to verify 10
sense that a E(A)
+
iﬀ α(x) := ha,
b
xi deﬁnes an element of V(A)
+
, and every element of V(A)
+
arises in this way.
If A is both homogeneous and self-dual — henceforth, HSD — then E(A) is also homogeneous
and self-dual, and thus, by the Koecher-Vinberg Theorem, can be made into a Jordan algebra.
In Appendix A, it is shown that this can be done in such a way that A is actually isomorphic to
the Jordan model corresponding to E(A).
Bipartite States and Conditioning A joint probability weight on a pair of models A and B is
a mapping ω : X(A) × X(B) R such that, for all E M(A) and F M(B),
X
(x,y)E×F
ω(x, y) = 1.
Such a weight is said to be non-signaling if, in addition, the marginal weights
ω
1
(x) :=
X
yF
ω(x, y) and ω
2
(y) :=
X
xE
ω(x, y)
are well-deﬁned, i.e., independent of the choice of tests F M(B) and E M(A), respectively.
The idea is that such a state precludes the sending of signals between A and B based solely on
the choice of what test to perform.
If ω is non-signalling, then given outcomes y X(B) and x X(A), we can deﬁne conditional
probability weights ω
1|y
and ω
2|x
on A and B, respectively, by setting
ω
1|y
(x) =
ω(x, y)
ω
2
(y)
and ω
2|x
(y) :=
ω(x, y)
ω
1
(x)
,
when ω
2
(y) and ω
1
(x) are non-zero. This gives us the following bipartite law of total probability
[17]
ω
2
=
X
xE
ω
1
(x)ω
2|x
and ω
1
=
X
yF
ω
2
(y)ω
1|y
(3)
which will be exploited below.
Deﬁnition 6. Let ω be a non-signaling joint probability weight on A and B. If all conditional
weights ω
1|x
and ω
2|y
(and hence, the marginals ω
1
and ω
2
) of ω belong to Ω(A) and Ω(B),
respectively, then we say that ω is a bipartite state on the models A and B.
If H and K are real or complex Hilbert spaces, every density operator W on H K gives rise
to a bipartite state on A(H) and A(K), given by ω(x, y) = hW x y, x yi.
The conditioning map If ω is a bipartite state on A and B, deﬁne the associated conditioning
maps
b
ω : X(A) V(B) and
b
ω
: X(B) V(A) by
b
ω(x)(y) = ω(x, y) =
b
ω
(y)(x).
Note that
b
ω(x) = ω
1
(x)ω
2|x
for every x X(A), i.e.,
b
ω(x) can be understood as the un-normalized
conditional state of B given the outcome x on A, and similarly for
b
ω
(y).
The conditioning map
b
ω extends uniquely to a positive linear mapping E(A) V(B), which
I also denote by
b
ω, such that
b
ω(
b
x) =
b
ω(x) for all outcomes x X(A). (To see this, consider
the positive linear mapping T : V(A)
R
X(B)
deﬁned, for f V(A)
, by T (f )(y) = f(
b
ω
(y))
Accepted in Quantum 2019-06-24, click title to verify 11
for all y X(B). If f =
b
x, we have T (
b
x) = ω
1
(x)ω
2|x
V(B)
+
. Thus, the range of T lies in
V(B).) In the same way,
b
ω
deﬁnes a positive linear mapping
b
ω
: E(B) V(A). Notice that
b
ω need not take V(A)
+
into V(B)
+
. This is the principal reason for working with E(A) rather
than V(A)
. If
b
ω : E(A) V(A) is an order isomorphism, ω is said to be an isomorphism state [5].
Composite Systems As the language here suggests, one wants to view (some) bipartite states
as elements of the state space of a composite model. Broadly, a composite of two probabilistic
models A and B, is a model AB equipped with a mapping X(A) × X(B) V(AB)
+
, taking
every pair of outcomes x X(A) and y X(B), to a product eﬀect xy, such that every state
ω Ω(AB) pulls back to a bipartite state ω(x, y) := ω(xy) on A and B. While nothing in the
mathematical development to follow depends on the choice of such a composite model, questions
of interpretation may hinge on such a choice.
3 Conjugate Systems
Let H be an n-dimensional complex Hilbert space and H is its conjugate space. As discussed
in the introduction, the maximally engangled “EPR” state, deﬁned by Ψ =
1
n
P
xE
x x,
where E is any orthonormal basis for H, establishes a perfect, uniform correlation betweeen
every projection-valued observable on the system associated with H, and its counterpart on H.
Moreover, Ψ eﬀectively deﬁnes the normalized trace inner product on E(H) = L
sa
(H). Since it
is precisely this inner product that makes L
sa
(H) self-dual, one might guess that the existence of
a uniformly correlating bipartite state is implicated in self-duality more generally.
As a ﬁrst step, we need to generalize the relationship between the models A(H) and A(H). In
order to do this, as mentioned in the introduction, we need ﬁrst to impose the minor restriction
that, henceforth, all models are uniform, meaning that all tests have a common cardinality n, and
that the maximally mixed state ρ, given by ρ(x) = 1/n for all x X(A), belongs to Ω(A). An iso-
morphism between two models A and B is a bijection φ : X(A) X(B) such that φ(E) M(B)
iﬀ E M(A), and β φ Ω(A) iﬀ β Ω(B). It is straightforward that such a mapping gives
rise to an order isomorphism which I’ll also denote by φ from E(A) to E(B), deﬁned by
φ(
b
x) =
d
φ(x) for all x X(A). The following reprises, and makes more precise, the deﬁnition of a
conjugate system (Deﬁnition 1).
Deﬁnition 1 (bis) Let A be a uniform probabilistic model of rank n. A conjugate for A is a triple
(A, γ
A
, η
A
) consisting of a probabilistic model A, an isomorphism γ
A
: A ' A, and a bipartite
state η
A
on A and A such that
(a) η
A
(x, γ
A
(x)) = 1/n for every x X(A).
(b) Every state α Ω(A) is the marginal of some bipartite state ω on A and A that correlates
some test E M(A) with its counterpart on A, so that ω(x, γ(x)) = α(x) for every x E.
As remarked earlier, the marginals of the perfectly correlating state η in part (a) are the maxi-
mally mixed states ρ on A and ρ on A and A, respectively. Where no ambiguity is likely, I write
x for γ
A
(x). If A satisﬁes (a), but not necessarily (b), I will call it a weak conjugate for A.
Given any bipartite state on A and A satisfying (a) i.e., η(x, γ
A
(x)) = 1/n for all x X(A),
the bipartite state η
t
deﬁned by η
t
(x, γ(y)) = η(y, γ(x)) is also perfectly uniformly correlating,
Accepted in Quantum 2019-06-24, click title to verify 12
whence, so is the symmetic state (η + η
t
)/2. We therefore can, and do, assume in what follows
that the chosen correlating state η
A
is always symmetric. If A is sharp, it is easy to show that
η is uniquely determined by condition (a) of Deﬁnition 1: since η(x, y) = 0 for outcomes y 6= x
belonging to a common test, and η(y, y) = 1/n, we have η
1|y
(y) = 1, i.e,. η
1|y
= δ
y
where δ
y
is the
unique state in which y has probability 1. Thus, η(x, y) = δ
y
(x). In this case, therefore, η = η
t
,
i.e., η is automatically symmetric.
8
If A(H) is the quantum probabilistic model associated with a ﬁnite-dimensional Hilbert space,
then the EPR state Ψ turns A(H) := A(H) into a conjugate in this sense, with η
A
(x, y) =
|hΨ, x yi|
2
. In fact, as pointed out in the introduction, A(H) is a conjugate for A(H), since
every density operator is the marginal of a state Ψ
W
correlating an eigenbasis for W with its
conjugate. All of this works equally well for real quantum systems, taking H = H. With a little
care, it can be shown to work for quaternionic systems as well.
Remark: One might wonder whether one can use the isomorphism γ
A
to identify A with its con-
jugate. Certainly one can deﬁne a bipartite state η
0
A
(x, y) = η
A
(x, γ
A
(y)). However, whether
this corresponds to a legitimate state of any physically reasonable composite AA of A with it-
self, depends on the particular probabilistic theory at hand. For example, if A = A(H) is the
quantum model associated with a complex Hilbert space H, then (a, b) 7→ Tr(ab) corresponds
to no state on A(H H). On the other hand, any choice of an anti-unitary operator J acting
on H yields a unitary isomorphism J : H H, given by J(x) = J(x) for all x H. Deﬁning
η
0
(x, y) := |h(1 J)
1
Ψ, x yi|
2
= |hΨ, x Jyi|
2
gives a state on A(H H), correlating along
the anti-unitary isomorphism γ
0
(x) := J
1
x. Thus, whether we choose to treat A(H) as its own
conjugate, or as distinct from its conjugate, is to an extent a matter of convention.
Conjugates and Self-Duality We are now ready to prove Theorem 1, which, for convenience,
I restate. Recall that a model A is sharp iﬀ, for every outcome x X(A), there is a unique state
δ
x
Ω(A) with δ
x
(x) = 1. Both classical and quantum models are sharp.
Theorem 1 (bis) Let A be sharp and have a conjugate A. Then ha, bi := η
A
(a, γ
A
(b)) is a
self-dualizing inner product on E(A), and induces an order-isomorphism E(A) V(A) given by
a 7→ η
A
(a, · ) for a E(A).
9
The proof is not diﬃcult. It will be convenient to break it up into a sequence of even easier
lemmas. In the interest of readability, below I conﬂate x X(A) with the corresponding eﬀect
b
x E(A), and write x for γ
A
(x). Until further notice, the hypotheses of Theorem 1 are in force.
The ﬁrst step is to obtain a kind of weak “spectral” decomposition for states in Ω(A) in terms of
the states δ
x
.
Lemma 1. For every α V(A)
+
, there exists a test E such that
α =
X
xE
α(x)δ
x
(4)
Proof: We can assume that α is a normalized state, i.e, that α Ω(A). Since A is a conjugate
for A, α = ω
1
where ω correlates some test E M(A) with E M(A) along the bijection
8
It follows that, where δ
x
and δ
y
are the unique states making x and y certain, we have δ
y
(x) = δ
x
(y) for all
x, y X(A). Indeed, this last condition could substitute for condition (a) in the deﬁnition of η, as it implies that
η(x, y) := δ
x
(y) is a valid non-signaling probabilty weight on A and A.
9
Here, I am identifying V(A), as a vector space, with E(A)
.
Accepted in Quantum 2019-06-24, click title to verify 13
x 7→ x. By the law of total probability (1) for non-signaling states, α =
P
xE
ω
2
(x)ω
1|x
. Since ω
is correlating we have ω
1|x
(x) = 1. Thus, by sharpness, ω
1|x
= δ
x
. Hence, α =
P
xE
ω
2
(x)δ
x
. It
follows that ω
2
(x) = α(x) for every x E, giving us (4).
We will refer to the decomposition in equation (4) as a spectral decomposition for α.
Lemma 2. η
A
is an isomorphism state.
Proof: We need to show that
c
η
A
: E(A) V(A) is an order-isomorphism. Since E(A) and V(A)
have the same dimension, it is enough to show that
c
η
A
maps the positive cone of E(A) onto that
of V(A). Since x 7→ x is an isomorphism between A and A, we can apply Lemma 1 to A: if
α V
+
(A), we have α =
P
xE
α(x)δ
x
. Since η
A
(x, x) = 1/n, we have
c
η
A
(x) =
1
n
δ
x
for every
x X(A). Hence,
c
η
A
(
P
xE
(x)x) = α.
Lemma 3. Every a E(A) has a representation a =
P
xE
t
x
x for some test E M(A) and
some coeﬃcients t
x
.
Proof: If a E(A)
+
, then by Lemma 1,
c
η
A
(a) =
P
xE
t
x
δ
x
for some E M(A) and coeﬃcients
t
x
0. By Lemma 2,
b
η
A
is an order-isomorphism. Applying
b
η
1
A
to the expansion above gives
a =
P
xE
t
x
x. For an arbitrary a E(A), we have a = a
1
a
2
with a
1
, a
2
E(A)
+
. Choose
N 0 with a
2
Nu. Thus, b := a + Nu
A
= a
1
+ (Nu
A
a
2
) 0. So b :=
P
xE
t
x
x for some
E M(A) and thus
a = b Nu
A
=
X
xE
t
x
x N(
X
xE
x) =
X
xE
(t
x
N )x.
Lemma 4. The function ha, bi := η
A
(a, γ
A
(b)) is an inner product on E(A).
Proof: η
A
is bilinear and, by assumption, symmetric. We need to show that h , i is positive
deﬁnite. Let a E(A). From Lemma 3, we have a =
P
xE
t
x
x for some test E and coeﬀcients
t
x
. Now
ha, ai =
*
X
xE
t
x
x,
X
yE
t
y
y
+
=
X
x,yE×E
t
x
t
y
hx, yi =
X
x,yE×E
t
x
t
y
η
A
(x, y) =
1
n
X
xE
t
x
2
0.
This is zero only when all coeﬃcients t
x
are zero, i.e., only for a = 0.
Proof of Theorem 1, concluded: Lemma 2 tells us that E(A) ' V(A), so it remains only to show
that the inner product h , i is self-dualizing. Clearly ha, bi = η(a, b) 0 for all a, b E(A)
+
.
Suppose a E(A) is such that ha, bi 0 for all b E(A)
+
. Then ha, yi 0 for all y X. By
Lemma 3, a =
P
xE
t
x
x for some test E. Thus, for all y E we have ha, yi = t
y
0, whence,
a E(A)
+
.
Remarks There are several directions in which we can usefully modify the assumptions of Theorem
1.
(1) In the proof of Theorem 1, the only point at which we needed to assume that A satisﬁes
condition (b) in the deﬁnition of a conjugate was in order to obtain the spectral decomposition
equation (4) of Lemma 1. Thus, if we are willing simply to assume such decompositions
are available, as in [7], then a weak conjugate suﬃces. Alternatively, any postulate or postulates
leading to such decompositions can replace condition (b). For instance, certain versions of the
symmetry and “subspace” axioms used in [20, 14, 25] imply a spectral decomposition. This is
Accepted in Quantum 2019-06-24, click title to verify 14
spelled out in Appendix B. Another approach to obtaining such a decomposition can be found in
a recent paper of G. Chiribella and C. M. Scandolo [12].
(2) In fact, it is even enough if (4) holds for states in the interior of Ω(A). From this we have,
as in the proof of Lemma 2, that the interior, V
+
, of the cone V(A)
+
is contained in
b
η
A
(E
+
),
from which it follows that
b
η is a linear isomorphism, and hence (as the vector spaces involved are
ﬁnite-dimensional) an homeomorphism. Thus,
b
η
A
(E
+
) is closed, and so, contains the closure of
V
+
, i.e., V
+
. In other words,
b
η is an order-isomorphism. The proofs of Lemmas 3 and 4, and the
rest of the proof of Theorem 1, then proceed just as before.
(3) The deﬁnition of a conjugate for a probabilistic model A requires the existence of the
uniformly, universally correlating state η
A
, and that arbitrary states of A arise as marginals of
bipartite states on A and A correlating some test E M(A) with its conjugate twin. One might
wonder whether there is some reasonably simple postulate that will imply both of these condi-
tions. Suppose that G is a group acting transitively on the outcome-space X(A) of the model A,
and leaving the state-space Ω(A) invariant. If G is compact, there will exist an invariant state, ρ,
obtained by group averaging; by the transitivity of G on outcomes, this state must be constant. It
follows that all tests have the same ﬁnite size size, say n, and that ρ is the maximally mixed state
ρ(x) 1/n. That is, the model is uniform. Now let γ
A
: x 7→ x be an isomorphism between A and
a model A. Suppose that every state α Ω(A) is the marginal of a correlating state ω Ω(AA)
such that ω(gx, gy) = ω(x, y) for all g G with α g = α. It is easily checked that this is satisﬁed
by ﬁnite-dimensional quantum models. Applied to the maximally mixed state ρ, this produces a
perfectly, uniformly correlating state η
A
. Thus, A is a conjugate in the sense of Deﬁnition 3.
4 Filters
We have just seen that if A is sharp and has a conjugate, then E(A) is self-dual, and isomorphic
to V(A). Suppose now that every non-singular state of A can be prepared, up to normalization,
from the maximally mixed state ρ(x) 1/n by some reversible process. This guarantees that
V(A), and hence, E(A), is homogeneous, so that, by the Koecher-Vinberg Theorem, E(A) carries
a euclidean Jordan structure making E(A)
+
the cone of squares.
In fact, we can say something more interesting. In many kinds of laboratory experiments, the
distinct outcomes of an experiment correspond to physical detectors, the eﬃciency of which can
independently be attenuated, if desired, by the experimenter. This can always be done through
post-processing, using a classical ﬁlter. In QM, it can also be accomplished by subjecting the
system to a suitable process prior to measurement. To see this, let A be a ﬁnite-dimensional
quantum system, with corresponding Hilbert space H, and identify E(A) with L
sa
(H
A
). If E is
an orthonormal basis representing a basic measurement on this system, deﬁne a positive operator
V : H H by setting V x = t
1/2
x
x for every x E, where 0 t
x
1. This gives us a completely
positive linear mapping Φ : E(A) E(A), namely Φ(a) = V aV . If t
x
> 0 for every x E, Φ
has a completely positive inverse Φ
1
(a) = V
1
aV
1
. For each x E, the corresponding eﬀect
b
x E(A) ' L
sa
(H) is the rank-one projection operator p
x
. It is easy to check that V p
x
V = t
x
p
x
,
i.e., that Φ(
b
x) = t
x
b
x for every x E.
Deﬁnition 7. A ﬁlter for a test E of a probabilistic model A is a positive linear mapping
Φ : V(A) V(A) such that, for every outcome x E, there exists a coeﬃcient 0 t
x
1 with
Φ(α)(x) = t
x
α(x)
for all all states α Ω(A). Equivalently, Φ
(x) = t
x
x for every x E.
Accepted in Quantum 2019-06-24, click title to verify 15
As noted above, in QM, not only do ﬁlters with arbitrary coeﬃcients exist for every test, but
they can be implemented p-reversibly, so long as the coeﬃcients t
x
are all non-zero. I will say
that a general probabilistic model with this feature has arbitrary reversible ﬁlters.
Corollary 1 (bis) Suppose that A is sharp and has a conjugate A. If A has arbitrary reversible
ﬁlters, then E(A) is homogeneous and self-dual.
Proof: A is self-dual by Theorem 1. Let α be a normalized state in the interior of V(A)
+
.
By Lemma 1, α has a spectral decomposition α =
P
xE
α(x)δ
x
. Let Φ be a ﬁlter for E with
coeﬃcients α(x). Since α is non-singular, α(x) > 0 for all x E, so Φ can be chosen to be
reversible. Now expand the maximally mixed state ρ, with ρ(x) 1/n, as ρ =
P
xE
1
n
δ
x
. Then
Φ(ρ) =
1
n
P
xE
α(x)δ
x
=
1
n
α. Thus, any non-singular state can be prepared, up to normalization,
by a reversible ﬁlter, and it follows that V(A) is homogeneous. In view of Theorem 1, E(A) is
self-dual, and E(A) ' V(A), whence, also homogeneous.
State preparation by reversible ﬁlters Suppose now that A has only a weak conjugate A,
and that Φ is a ﬁlter for a test E M(A). By applying Φ to one of the two systems A and A,
we can convert the correlator η
A
into a new sub-normalized bipartite state ω, given by ω(x, y) =
η
A
x, y) for all x X(A), y X(B). Noticing that Φ
(x) = t
x
x for every x E, we see that
ω correlates E with E: if x, y E with x 6= y, we have
ω(x, y) = η
A
(t
x
x, y) = t
x
η
A
(x, y) = 0.
In other words, ω is correlating. It follows that the normalized bipartite state
e
ω :=
1
ω(u
A
, u
A
)
ω
is likewise correlating. Since ω
1
= Φ(ρ), it follows that any state preparable from ρ by a ﬁlter
that is, any state of the form α =
]
Φ(ρ), where Φ is a ﬁlter and
e
α :=
1
u
A
(α)
α is the marginal
of a correlating state, and hence enjoys a spectral decomposition as in Equation (4). Thus, if
every state is so preparable, the weak conjugate A is actually a conjugate. So, in the presence of
sharpness, we can replace the assumption that the conjugate is strong, by the requirement that
every state be preparable by a ﬁlter. In fact, by strengthening this preparability assumption, it
is even possible to omit the hypothesis that A is sharp.
The isomorphism γ
A
: A ' A extends to an order-automorphism V(A) ' V(A), given by
α 7→ α, with α(x) = α(x) for all x X(A). Hence, a positive linear mapping Φ : V(A) V(A)
has a counterpart Φ : V(A) V(A), given by Φ(α) = Φ(α). Let us say that Φ is symmetric
with respect to η
A
iﬀ η
A
(x), y) = η
A
(x, Φ
(y)) for all x, y X(A), i.e., iﬀ η
A
id
A
) =
η
A
(id
A
Φ
).
Lemma 5 Let A have a weak conjugate A. Suppose that every state of A is preparable by a
symmetric ﬁlter. Then ha, bi := η
A
(a, γ
A
(b)) is a self-dualizing inner product on E(A).
Proof: Let α = Φ(ρ), where Φ is a symmetric ﬁlter for some test E. Consider the bipartite state
ω := η
A
id
A
) = η
A
(id
A
Φ
).
Accepted in Quantum 2019-06-24, click title to verify 16
For each outcome x X(A), let δ
x
denote the conditional state (η
A
)
1|x
. Then for all x E, and
all outcomes y X, we have
ω
1|x
(y) =
η
A
(y), x)
η
A
(u
A
), x)
=
η
A
(y, Φ
(x))
η
A
(u
A
, Φ
(x))
=
η
A
(y, t
x
x)
η
A
(u
A
, t
x
x)
=
η
A
(y, x)
η
A
(u
A
, x)
= (η
A
)
1|x
(y) = δ
x
(y).
It follows that ω
1|x
= δ
x
. It is easy to check that ω
1
= Φ((η
A
)
1
) = Φ(ρ) = α; also, by the law of
total probability (3), ω
1
=
P
xE
ω
2
(x)ω
1|x
=
P
xE
t
x
δ
x
, where t
x
= ω
2
(x). Thus, every state in
Ω(A) is a convex combination of the states δ
x
, and the cone generated by these states coincides
with V(A)
+
. It follows that
b
η maps E(A)
+
onto V(A)
+
, as in the proof of Lemma 2. The proof
that ha, bi := η(a, b) deﬁnes an inner product on E(A) now proceeds as in the proof of Lemmas 3
and 4.
In fact, we can do a bit better:
Corollary 2 (bis) Let A have a weak conjugate, and suppose that every interior state is prepara-
ble by a reversible symmetric ﬁlter. Then A is homogeneous and self-dual.
Proof: The preparability assumption clearly makes V(A) homogeneous. The proof of Lemma 5
shows that all states in the interior of can be decomposed as in equation (4) with respect to
the states δ
x
= η
1|x
. As noted in Remark (2) following the proof of Theorem 1, this is enough to
secure the self-duality of E(A), and its isomorphism with V(A).
It follows from the KV theorem that, for any model A satisfying the hypotheses of either
Corollary 1 or Corollary 2, E(A) carries a Jordan product compatible with the inner product
arising from η
A
, i.e, E(A) is a euclidean Jordan algebra. In fact, one can prove more: the unit
eﬀect u coincides with the Jordan unit, and M(A) is precisely the set of Jordan frames. In other
words, E(A) is a Jordan moel. The proof is given in Appendix A, where it is also shown that any
Jordan model satisﬁes the hypotheses of both corollaries. Thus, these two sets of hypotheses are
equivalent, and exactly characterize the class of Jordan models. To summarize:
Theorem 2 For a ﬁnite-dimensional, uniform probabilistic model A, the following statements are
equivalent:
(a) A is sharp, has a conjugate, and has arbitrary reversible ﬁlters
(b) A has a weak conjugate, and all non-singular states can be prepared by reversible
symmetric ﬁlters
(c) A is a Jordan model.
It should be stressed that all of the assumptions going into (a) and (b) are what [7] calls single-
system postulates, at least to the extent that the existence of a conjugate (or weak conjugate) is
a property of a single system. In any event, these assumptions, whether seen as pertaining to a
single system A or to the pair (A, A), are quite diﬀerent in ﬂavor from local tomography or the
subspace axiom, which place constraints on an entire theory’s worth of probabilistic models.
Accepted in Quantum 2019-06-24, click title to verify 17
5 Conclusion
We’ve seen that either of two related packages of assumptions — given in (a) and (b) of Theorem
2 — lead in a very simple way the homogeneity and self-duality of the space E(A) associated with
a probabilistic model A, and hence, by the Koecher-Vinberg Theorem, to A’s having a euclidean
Jordan structure. While this is not the only route one can take to deriving this structure (see, e.g,
[27] and [31] for approaches stressing symmetry principles), it does seem especially straightforward.
As discussed in the introduction, several other recent papers (e.g, [20, 28, 14, 25, 11]) have
derived standard ﬁnite-dimensional quantum mechanics, over C, from operational or information-
theoretic axioms. Besides the fact that the mathematical development here is quicker and easier,
the axiomatic basis is considerably diﬀerent, and arguably leaner, making no appeal to the struc-
ture of subsystems, or to the isomorphism of systems with the same information-carrying capacity,
or to local tomography. The last two points are particularly important: by avoiding local tomog-
raphy, we allow for real and quaternionic quantum systems; by not insisting that physical systems
having the same information capacity be isomorphic, we allow for quantum theory with supers-
election rules, and for physical theories in which real and quaternionic systems can coexist. Of
course, the door has been opened a bit wider than this: our postulates are also compatible with
spin factors and with the exceptional Jordan algebra.
10
Of the reconstructions cited above, the one having the strongest aﬃnity with the approach of
this paper is that of [11], the key postulate of which is that every state dilates to that is, arise
as the marginal of a pure state on a larger, composite system, unique up to symmetries of the
ancillary system. Condition (a) in the deﬁnition of a conjugate, requiring that every state dilate
to a correlating state, has a somewhat similar character, albeit with the emphasis on the dilated
state’s correlational properties, rather than its purity. To make the connection more explicit,
suppose we require that every non-singular state α on A dilate to a correlating isomorphism state
ω (which is the case, in the presence of our other assumptions). If µ is another isomorphism
state with the same marginal state α, then φ :=
b
µ
b
ω
1
is a reversible transformation on V(A)
with
b
µ = φ
b
ω, i.e., µ(a, b) = ω(a, φ(b)) for all a, b E(A). Now, as shown in [5], if V(A)
is irreducible as an ordered vector space, isomorphism states are pure. Thus, in the irreducible
case, we have a version of the puriﬁcation postulate for non-singular states. In view of these
connections, it seems plausible that the approach taken here might be adapted to considerably
simplify the mathematical development in [11].
11
An assumption that is common to nearly all of the cited earlier reconstructions is some version
of Hardy’s subspace postulate, which requires (roughly speaking) that the result of constraining
a physical system to the set of states making a particular measurement-outcome impossible, also
count as a physical system. This very powerful assumption, while not needed in the development
above, can readily be adapted to the framework of this paper, and can to a large extent replace our
assumptions above about the existence of reversible ﬁlters. The details can be found in Appendix
B.
10
It can be shown [6] that the exceptional Jordan algebra can be ruled out on the grounds that one can form no
satisfactory composite of two euclidean Jordan algebras if either has an exceptional direct summand. Whether the
spin factors can also be discarded, or whether they have some physical role to play, remains an open question.
11
Going in the other direction, in [12], the authors derive a version of part (a) of our deﬁnition of a conjugate, that
is, the existence of a dilation perfectly correlating two tests, from axioms similar to those of [11]. More recently, in
the conext of a compact closed category of processes, the authors of [29] introduce a stronger, “symmetric” version
of the puriﬁcation postulate, and show that when combined with suitable versions of sharpness and the existence of
a “classical interface”, this implies that all states can be prepared from the maximally mixed state by a reversible
process, allowing them to prove that their analogue of the cone V(A)
+
is homogeneous and self-dual.
Accepted in Quantum 2019-06-24, click title to verify 18
It is worth remarking that the subspace axiom applies, not to an indidual probabilistic model
but to a class of probabilistic models, that is, to an entire probabilistic theory. (In the language of
[7], it is not a single-system postulate.) As a rule, one wants to think of a physical theory, not as a
loosely structured class, but as a category of systems, with morphisms corresponding to processes.
To allow for composite systems, it is natural to take this to be a symmetric monoidal category
[1]. This brings us to the interesting question of whether one can construct symmetric monoidal
categories of probabilistic models, in which (say) the hypotheses of Corollary 1 are satisﬁed by
all systems. This is indeed possible for special Jordan algebras (those not having the exceptional
Jordan algebra as a direct summand). Restricting attention to Jordan models corresponding to
direct sums of real, complex and quaternionic matrix algebras, one can even arrange for this cat-
egory to be compact closed [5]. This implies that many standard quantum-information theoretic
protocols, notably conclusive teleportation and entanglement-swapping, are still available in this
non-locally tomographic setting.
12
Acknowledgements I wish to thank Giulio Chiribella, Chris Heunen, Matt Leifer and Markus
M¨uller for helpful comments on earlier drafts of this paper. This work was supported in part by
a grant (FQXi-RFP3-1348) from the FQXi foundation.
References
[1] S. Abramsky and B. Coecke, Categorical quantum mechanics, in D. Gabbay, K. Engesser and
D. Lehman, Handbook of Quantum Logic and Quantum Structures vol II, Elsevier, 2008;
DOI:10.1016/B978-0-444-52869-9.5001-4; arXiv:quant-ph/0402130)
[2] E. Alfsen and F. Shultz, Geometry of state spaces of operator algebras, Birkh¨auser, 2003 DOI:
10.1007/978-1-4612-0019-2
[3] C. Aliprantis and D. Toukey, Cones and Duality, Springer, 2007 DOI: 10.1090/gsm/084
[4] J. Baez, Division algebras and quantum theory, Foundations of Physics 42 819-855 (2012)
DOI: 10.1007/s10701-011-9566-z; arXiv:1101.5690
[5] H. Barnum, C. Gaebler and A. Wilce, Ensemble steering, weak self-duality and the structure
of probabilistic theories, Foundations of Physics 43 1411-1437 (2013) doi:10.1007/s10701-013-
9752-2; arXiv:0912.5532
[6] H. Barnum, M. Graydon and A. Wilce, Some Nearly Quantum Theories, in C. Heunen, P.
Selinger and J. Vicary, eds., Proceedings of the 12th International Workshop on Quantum
Physics and Logic, EPTCS 195 (2015), 59-70 10.4204/EPTCS.195.5; arXiv:1507.06278
[7] H. Barnum, M. Mueller and C. Ududec, Higher-order interference and single-system postu-
lates characterizing quantum theory, New Journal of Physics 16 (2014) DOI: 10.1088/1367-
2630/16/12/123029; arXiv:1403.4147
12
More generally, the existence of conjugate systems is very suggestive of the “caps” that deﬁne part of a compact
structure on a symmetric monoidal category (an observation reﬂected in my choice of the notation η). For a further
development of this connection, see [32]. See also [29] for a reconstruction of ﬁnite-dimensional quantum theory from
a “symmetric puriﬁcation” postulate on the structure of a suitable dagger-monoidal category allowing “classical
control”.)
Accepted in Quantum 2019-06-24, click title to verify 19
[8] H. Barnum and A. Wilce, Local tomography and the Jordan structure of quantum theory,
Found. Phys. 44 (2014), 192-212 DOI: 10.1007/s10701-014-9777-1; arXiv:1202.4513
[9] H. Barnum and A. Wilce, Post-classical probability theory, in G. Chiribella and R. Spekkens,
eds., Quantum Theory: Informational Foundations and Foils, Springer, 2017 doi:10.1007/978-
94-017-7303-4 11; arXiv:1205.3833
[10] J. Barrett, Information processing in generalized probabilistic theories, Physical Review A
75 (2005) DOI: 10.1103/PhysRevA.75.032304; arXiv:quant-ph/0508211
[11] G. Chiribella, M. D’Ariano and P. Perinotti, Informational derivation of quantum theory,
Physical Review A 84 (2011), 012311 DOI: 10.1103/PhysRevA.84.012311; arXiv:1011.6451
[12] G. Chiribella and C. M. Scandolo, Operational axioms for diagonalizing states, in C. Heunen,
P. Selinger and J. Vicary, Proceedings of the 12th International Workshop on Quantum Physics
and Logic, EPTCS 195 (2015) 96-115 DOI: 10.4204/EPTCS.195.8; arXiv:1608.04459
[13] B. Coecke and A. Kissinger, Categorical Quantum Mechanics I: causal quantum pro-
cesses, in E. Landry, ed., Categories for the Working Philosopher, Oxford, 2017 DOI:
10.1093/oso/9780198748991.003.0012; arXiv:1510.05468
[14] B. Dakic and C. Brukner, Quantum theory and beyond: is entanglement special?
in H. Halvorson, ed., Deep Beauty, Cambridge, 2011 DOI: 10.1017/CBO9780511976971;
arXiv:0911.0695
[15] E. B. Davies and J. Lewis, An operational approach to quantum probability, Communications
in Mathematical Physics 17 (1970) 239-260 DOI: 10.1007/BF01647093
[16] C. M. Edwards, The operational approach to algebraic quantum theory I, Communications
in Mathematical Physics 16 (1970), 207-230 DOI: 10.1007/bf01646788
[17] D. J. Foulis and C. H. Randall, Empirical logic and tensor products, in H. Neun-
mann (ed.), Foundations of Interpretations and Foundations of Quantum Mechanics, B.I.-
Wissenshaftsverlag, 1981
[18] J. Faraut and A. Kor´anyi, Analysis on Symmetric Cones, Oxford, 1994
[19] H. Hanche-Olsen, JB algebras with tensor products are C
algebras, in H. Araki et al. (eds.),
Operator Algebras and their Connections with Topology and Ergodic Theory, Lecture Notes
in Mathematics 1132 (1985), 223-229 DOI: 10.1007/BFb0074886
[20] L. Hardy, Quantum theory from ﬁve reasonable axioms, arXiv:quant-ph/0101012, 2001
[21] P. Janotta and R. Lal, Generalized probabilistic theories without the no-restriction hypoth-
esis, Physical Review A. 87 (2013) DOI: 10.1103/PhysRevA.87.052131; arXiv:1302.2632
[22] P. Jordan, J. von Neumann and E. Wigner, On an algebraic generalization of the quantum
mechanical formalism, Annals of Mathematics 35 (1934), 29-64 DOI: 10.2307/1968117
Accepted in Quantum 2019-06-24, click title to verify 20
[23] M. Koecher, The Minnesota Notes on Jordan Algebras and their Applications, Ed. A.
Krieg and S. Walcher, Springer Lecture Notes in Mathematics 1710, Springer, 1999 DOI:
10.1007/BFb0096285
[24] G. Ludwig, Foundations of Quantum Mechanics, Springer, 1983 DOI: 10.1007/978-3-642-
86751-4
[25] Ll. Masanes and M. uller, A derivation of quantum theory from physical requirements,
New Journal of Physics 13 (2011) DOI: 10.1088/1367-2630/13/6/063001; arXiv:1004.1483
[26] M. Mueller and Ll. Masanes, Information-theoretic postulates for quantum mechanics, in
G. Chiribella and R. Spekkens, eds. Quantum Theory: Informational Foundations and Foils,
Springer, 2016 doi:10.1007/978-94-017-7303-4 5; arXiv:1203.451
[27] M. M¨uller and C. Ududec, The structure of reversible computation determines the self-
duality of quantum theory, Physical Review Letters 108 (2012), 130401 DOI: 10.1103/Phys-
RevLett.108.130401; arXiv:1110.3516
[28] J. Rau, On quantum vs. classical probability, Annals of Physics 324 (2009) 2622–2637 DOI:
10.1016/j.aop.2009.09.013; arXiv:0710.2119
[29] J. Selby, C. M. Scandolo and B. Coecke, Reconstructing quantum theory from diagrammatic
postulates, arXiv:1802.00367
[30] A. Wilce, Four and a half axioms for ﬁnite-dimensional quantum theory in Y. Ben-Menahem
and M. Hemmo (eds.), Probability in Physics, Springer, 2012 doi:10.1007/978-3-642-21329-
8
17; arXiv:0912.5530
[31] A. Wilce, Symmetry, self-duality and the Jordan structure of ﬁnite-dimensional quantum
theory, DOI: 10.4204/EPTCS.95.19; arXiv:1210.0622
[32] A. Wilce, A shortcut from categorical quantum mechanics to convex operational theories, in
B. Coecke and A. Kissinger (Eds.), 14th International Conference on Quantum Physics and
Logic (QPL), EPTCS 266 (2018) 222-236; DOI: 10.4204/eptcs.266.15; arXiv:1206.2897
A Jordan Models
Let J be a euclidean Jordan algebra. As discussed earlier, this is associated with a probabilistic
model A(J ) = (X(J ), M(J ), Ω(J )), where X(J ) is the set of primitive idempotents (that is,
minimal projections) in J , M(J ) is the set of Jordan frames (maximal pairwise orthogonal sets
of minimal projections), and Ω(J ) is the set of states on (X(J ), M(J )) arising from states on J ,
that is, restrictions