Fermat's Library | q-2020-06-08-280 annotated/explained version.

Quantifying Bell: the Resource Theory of Nonclassicality

of Common-Cause Boxes

Elie Wolfe

, David Schmid

1,2

, Ana Bel

en Sainz

3,1

, Ravi Kunjwal

1,4

, and Robert W. Spekkens

Perimeter Institute for Theoretical Physics, 31 Caroline St. N, Waterloo, Ontario, N2L 2Y5, Canada

Institute for Quantum Computing and Dept. of Physics and Astronomy, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada

International Centre for Theory of Quantum Technologies, University of Gda

nsk, 80-308 Gda

nsk, Poland

Centre for Quantum Information and Communication, Ecole polytechnique de Bruxelles, CP 165, Universit

e libre de Bruxelles, 1050

Brussels, Belgium

June 5, 2020

We take a resource-theoretic approach to

the problem of quantifying nonclassicality in

Bell scenarios. The resources are conceptual-

ized as probabilistic processes from the set-

ting variables to the outcome variables hav-

ing a particular causal structure, namely, one

wherein the wings are only connected by

a common cause. We term them “common-

cause boxes”. We deﬁne the distinction be-

tween classical and nonclassical resources in

terms of whether or not a classical causal

model can explain the correlations. One can

then quantify the relative nonclassicality of

resources by considering their interconvert-

ibility relative to the set of operations that

can be implemented using a classical com-

mon cause (which correspond to local oper-

ations and shared randomness). We prove

that the set of free operations forms a poly-

tope, which in turn allows us to derive an ef-

ﬁcient algorithm for deciding whether one re-

source can be converted to another. We more-

over deﬁne two distinct monotones with sim-

ple closed-form expressions in the two-party

binary-setting binary-outcome scenario, and

use these to reveal various properties of

the pre-order of resources, including a lower

bound on the cardinality of any complete set

of monotones. In particular, we show that the

information contained in the degrees of viola-

tion of facet-deﬁning Bell inequalities is not

suﬃcient for quantifying nonclassicality, even

though it is suﬃcient for witnessing nonclas-

sicality. Finally, we show that the continuous

set of convexly extremal quantumly realiz-

able correlations are all at the top of the pre-

order of quantumly realizable correlations. In

addition to providing new insights on Bell

nonclassicality, our work also sets the stage

for quantifying nonclassicality in more gen-

eral causal networks.

Accepted in Quantum 2020-05-28, click title to verify. Published under CC-BY 4.0.1 1

arXiv:1903.06311v4 [quant-ph] 4 Jun 2020

Contents

1 Introduction 3

1.1 Summary of main results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

1.2 How to read this article . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2 Motivating our approach and contrasting it with alternatives 4

2.1 Three views on Bell’s theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2.2 The resource theory suggested by the causal modelling paradigm . . . . . . . . . . . . . . . . 6

2.3 Contrast to the strictly operational paradigm . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

2.4 Contrast to the superluminal causation paradigm . . . . . . . . . . . . . . . . . . . . . . . . . 11

3 Details of the resource theory 12

3.1 Free and nonfree common-cause boxes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

3.2 The free operations on common-cause boxes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

3.3 Convexity of the set of free operations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

4 Resource theory preliminaries 18

4.1 Global features of a pre-order . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

4.2 Features of resource monotones . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

4.3 Monotone constructions for any resource theory . . . . . . . . . . . . . . . . . . . . . . . . . . 19

5 A linear program for determining the ordering of any pair of resources 21

6 Two useful monotones 22

6.1 Preliminary facts regarding CHSH inequalities and PR boxes . . . . . . . . . . . . . . . . . . 22

6.2 Deﬁning the two useful monotones . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

6.3 Closed-form expressions for M

CHSH

and M

NPR

for

(

2 2

)

-type resources . . . . . . . . . . . . . 25

7 Properties of the pre-order of common-cause boxes 26

7.1 Inferring global properties of the pre-order . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

7.2 Incompleteness of the two monotones . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

7.3 At least eight independent measures of nonclassicality . . . . . . . . . . . . . . . . . . . . . . 30

8 Properties of the pre-order of quantumly realizable common-cause boxes 31

9 Conclusions and outlook 34

Appendices 37

A Comparing our framework with prior work 37

A.1 WPICC versus LOSR as the set of free operations . . . . . . . . . . . . . . . . . . . . . . . . 37

A.2 An oversight in the literature concerning how to formalize LOSR . . . . . . . . . . . . . . . . 37

A.3 Generalizing from Bell scenarios to more general causal structures . . . . . . . . . . . . . . . 41

B Proofs 44

B.1 Proof of Proposition 19: closed-form expression for M

NPR

(R) . . . . . . . . . . . . . . . . . . 44

B.2 Proof of Proposition 21: when the two monotones are complete . . . . . . . . . . . . . . . . . 46

B.3 Proof of Proposition 23: all nonfree resources of type

(

2 2

)

are orbital . . . . . . . . . . . . . . 47

B.4 Proof of Proposition 25: lower bound on the number of monotones in any complete set . . . 49

References 50

1 Introduction

Bell’s theorem [1, 2] highlights a precise sense in

which quantum theory requires a departure from a

classical worldview. Furthermore, violations of Bell

inequalities provide a means for certifying the non-

classicality of nature, independently of the correct-

ness of quantum theory. This is because Bell inequal-

ities can be tested directly on experimental data.

Experimental tests under very weak assumptions

have conﬁrmed this nonclassicality [3–5]. Correla-

tions that violate Bell inequalities have also found

applications in information theory. Speciﬁcally, they

constitute an information-theoretic resource inso-

far as they can be used to perform various cryp-

tographic tasks in a device-independent way [6–14].

Consequently, much previous eﬀort has been made

to quantify resourcefulness of correlations within

Bell scenarios [15–22].

In this paper, we take a resource-theoretic ap-

proach to quantifying the nonclassicality of a given

correlation in a Bell scenario, grounded in a new per-

spective on Bell’s theorem. This is the perspective

of causal modelling, which diﬀers from the tradi-

tional operational approaches both conceptually and

in practice. Nevertheless, the natural choice of the

set of free operations for the Bell scenario in our

framework coincides with the one proposed in some

previous works [16, 17], namely, local operations

and shared randomness (LOSR)

. See also the

subsequent works of Refs. [23–25].

Our causal perspective on quantifying Bell non-

classicality also generalizes naturally to a framework

for quantifying the nonclassicality of correlations in

more general causal scenarios. We discuss this gen-

eralization in Section A.2, but leave its development

to future work.

1.1 Summary of main results

We now summarize the content and main results of

our article.

In Section 2, we articulate the view on Bell’s

theorem that motivates our approach—the causal

There is widespread agreement that the free operations should

somehow consist of local operations supplemented with shared

randomness, however, diﬀerent authors have been led to for-

malize this idea diﬀerently, that is, they have been led to

distinct proposals for the the set of free operations. Indeed,

the formalization provided in Refs. [

] is inconsistent

with the one given in Refs. [

] and therefore also with

the one presented here. A detailed discussion of this issue can

be found in Appendix A.2.

modelling paradigm—and contrast it with two other

views on Bell’s theorem, namely, the strictly oper-

ational and superluminal causation paradigms. In

particular, we explain how the diﬀerences between

these views impacts how one conceptualizes Bell in-

equality violations as a resource, and we highlight

some of the advantages of our approach relative to

the alternatives. We also introduce the notion of par-

titioned process theories [26] as the mathematical

framework for resource theories that we adopt in

this article.

In Section 3, we provide a formal deﬁnition of the

resource theory to be studied. For bipartite Bell sce-

narios, we argue that the set of processes which nat-

urally constitute the resources in our approach is the

set of all bipartite processes with classical inputs and

outputs that can arise within a causal model with

a (possibly nonclassical) common cause between the

wings. We also argue that the natural set of free oper-

ations on such processes are those that are achieved

by embedding the process in a circuit for which the

only connection between the wings is a classical com-

mon cause, and we demonstrate that this is equiv-

alent to the set of local operations and shared ran-

domness, as the latter is formalized in Refs. [16, 17].

In Section 4, we introduce some of the central

concepts of any resource theory, including the no-

tion of a pre-order and its features, the notion of

monotones and complete sets thereof, and the no-

tions of cost and yield monotones, which underlie

the explicit monotone constructions that follow.

In Section 5, we show how one can use two in-

stances of a linear program to determine the or-

dering relation which holds between any pair of re-

sources (see Proposition 15 and the discussion that

follows it).

In Section 6, we deﬁne two monotones of particu-

lar interest. The ﬁrst (deﬁned in Eq. (33)) is based

on a yield construction relative to all resources in the

Clauser-Horne-Shimony-Holt (CHSH) scenario [27]

(a bipartite Bell scenario where the settings and out-

comes all have cardinality two) and where the yield

is measured by the value of the canonical CHSH

functional. The second (deﬁned in Eq. (36)) is based

on a cost construction relative to a one-parameter

family of resources in the CHSH scenario and where

the cost is measured again by the value of the canon-

ical CHSH functional. Although both of these mono-

tones are originally deﬁned in terms of an optimiza-

tion problem, we derive closed-form expressions for

each of them for resources within the CHSH scenario

(see Propositions 17 and 19 respectively). We show

that within the CHSH scenario [27], a variety of

monotones which have been previously studied are

all equivalent (up to a monotonic function) to the

ﬁrst of these monotones (see Corollary 18). Because

our two monotones are provably not equivalent, this

result implies that the second of our monotones pro-

vides information beyond that given by previously

studied monotones.

In Section 7 we leverage our two monotones to

derive various global properties of the pre-order

induced by single-copy deterministic conversions.

Speciﬁcally, we prove that the pre-order:

• is not complete (i.e., there exist incomparable

resources),

• is not weak (the incomparability relation is not

transitive),

• has both inﬁnite width and inﬁnite height,

• is locally inﬁnite.

We also prove that the two monotones just men-

tioned do not completely characterize the pre-order

of resources, by showing that they fail to do so even

for the special case of the CHSH scenario. We further

show (in Theorem 26) that no fewer than eight con-

tinuous monotones can do the job. We also show (in

Proposition 23) that the equivalence classes among

nonfree resources in the CHSH scenario (though not

in general) are given exactly by the orbits of the

symmetry group of deterministic free operations.

Finally, in Section 8, we show that all of the global

features of the pre-order hold even for the strict sub-

set of resources which can be realized in quantum

theory. We also prove (in Lemma 27) that every ex-

tremal quantumly realizable resource is at the top of

the pre-order of quantumly realizable resources, and

(in Proposition 28) that there are a continuous set of

incomparable resources at the top of this pre-order.

1.2 How to read this article

We will demonstrate in Section 3 that in spite of

the diﬀerence in our attitude towards Bell’s theo-

rem, the deﬁnition of the set of resources and the

set of free operations that is natural for the Bell

scenario within the causal modelling paradigm coin-

cides with a deﬁnition that has been proposed within

the strictly operational paradigm, namely, the one

proposed in Refs. [16, 17]. Because Bell scenarios

are the focus of our article, any reader who would

rather take the strictly operationalist attitude to-

wards Bell’s theorem can reinterpret all of our re-

sults through that lens. In particular, readers who

are already sympathetic to the notion that LOSR,

as deﬁned in Refs. [16, 17], is the right choice of free

operations may wish to skip Sections 2 and 3.

To understand our conviction that LOSR consti-

tutes the right choice of free operations for Bell sce-

narios, however, readers are advised to read Sec-

tions 2 and 3.2. In particular, to understand how

our approach diﬀers (advantageously) from other

approaches, readers are encouraged to examine Sec-

tions 2.3 and 2.4 as well as Appendix A.

Because Section 4 reviews basic deﬁnitions and

terminology for concepts related to resource theories,

any reader who has expertise on resource theories

may wish to skip this section. We note, however, that

some of the material presented therein is not found

in standard treatments, such as our discussion of

global properties of a pre-order and our discussion

of a scheme for constructing useful cost and yield

monotones.

The presentation of our novel technical results be-

gins in Section 5.

2 Motivating our approach and con-

trasting it with alternatives

2.1 Three views on Bell’s theorem

The traditional commentary on Bell’s theorem [28,

29] takes a particular view on how to articulate the

assumptions that are necessary to derive Bell in-

equalities. Among these assumptions, two are typ-

ically highlighted as deserving of the most scrutiny,

namely, the assumptions that are usually termed re-

alism and locality

. Abandoning one or the other of

these two assumptions is the starting point of most

commentaries on what to do in the face of violations

of Bell inequalities.

Furthermore, a schism seems to

have developed between the camps that advocate for

each of these two views [30].

Among the researchers who take Bell’s theorem

to demonstrate the need to abandon realism, there

is a contingent which adopts a purely operational at-

titude towards quantum theory, that is, an attitude

wherein the scientist’s job is merely to predict the

statistical distribution of outcomes of measurements

performed on speciﬁc preparations in a speciﬁed ex-

perimental scenario. We shall refer to the members

Note, however, that diﬀerent authors will formalize these

assumptions in diﬀerent ways.

See, however, the discussion of superdeterminism in footnote 7.

of this camp as operationalists [31]. For such re-

searchers, a violation of a Bell inequality is simply a

litmus test for the inadequacy of a classical realist ac-

count of the experiment. One particular type of oper-

ationalist attitude, which we shall term the strictly

operational paradigm, advocates that physical

concepts ought to be deﬁned in terms of operational

concepts, and consequently that any properties of

a Bell-type experiment, such as whether it is sig-

nalling or not and what sorts of causal connections

might hold between the wings, must be expressed in

the language of the classical input-output function-

ality of that experiment. In other words, they advo-

cate that the only concepts that are meaningful for

such an experiment are those that supervene

upon

its input-output functionality.

Most prior work on

quantifying the resource in Bell experiments has

been done within this paradigm, and the characteris-

tic of experimental correlations that is usually taken

to quantify the resource is simply some notion of dis-

tance from the set of correlations that satisfy all the

Bell inequalities.

Consider, on the other hand, the researchers who

take realism as sacrosanct, and in particular those

who take Bell’s theorem to demonstrate the failure

of locality—that is, the existence of superluminal

causal inﬂuences [33, 34].

Researchers in this camp,

whom we shall refer to as advocates of the super-

luminal causation paradigm, would presumably

ﬁnd it natural to quantify the resource of Bell in-

equality violations in terms of the strength of the

superluminal causal inﬂuences required to account

for them (within the framework of a classical causal

model). An approach along these lines is described

-properties are said to supervene on

-properties if every

A-diﬀerence implies a B-diﬀerence.

Some might describe what we have here called the strictly op-

erational paradigm as the “device-independent” paradigm [

however, we avoid using the latter term here because its usage

is not restricted to describing a particular type of empiricist

philosophy of science: it also has a more technical meaning in

the context of quantum information theory, wherein it indi-

cates whether or not a given information-theoretic protocol

depends on a prior characterization of the devices used therein.

Indeed, Bell-inequality-violating correlations have been shown

to be a key resource in cryptography because they allow for

device-independent implementations of cryptographic tasks[

–

14].

Although such inﬂuences do not imply the possibility of su-

perluminal signalling, they do imply a certain tension with

relativity theory if one believes that the latter does not merely

concern anthropocentric concepts such as signalling, but also

physical concepts such as causation.

in Refs. [35, 36]. Earlier work on the communication

cost of simulating Bell-inequality violations [37, 38]

is also naturally understood in this way.

In recent years, a third attitude toward Bell’s

theorem—inspired by the framework of causal infer-

ence [42]—has been gaining in popularity. In this ap-

proach, the assumptions that go into the derivation

of Bell inequalities are [43]: Reichenbach’s princi-

ple (that correlations need to be explained causally),

the framework of classical causal modelling, and the

principle of no ﬁne-tuning (that statistical indepen-

dences should not be explained by ﬁne-tuning of the

values of parameters in the causal model). Here, a

violation of a Bell inequality does not lead to the tra-

ditional dilemma between realism and locality, but

rather attests to the impossibility of providing a non-

ﬁne-tuned explanation of the experiment within the

framework of classical causal models. This attitude

implies the possibility of a new option for what as-

sumption to give up in the face of such a violation.

Speciﬁcally, the new possibility being contemplated

is that one can hold fast to Reichenbach’s principle

and the principle of no ﬁne-tuning—and hence to the

possibility of achieving satisfactory causal explana-

tions of correlations—by replacing the framework of

classical causal models with an intrinsically nonclas-

sical generalization thereof.

As is shown in Ref. [43], because the correlations

in a Bell experiment do not provide a means of send-

ing superluminal signals between the wings, the only

causal structure that is a candidate for explaining

these correlations without ﬁne-tuning is one wherein

there is a purely common-cause relation between the

wings, that is, one which admits no causal inﬂuences

between the wings. Therefore, the new approach to

achieving a causal explanation of Bell inequality vi-

olations is one that posits a common cause mech-

A less common view on how to maintain realism in the face

of Bell inequality violations is to hold fast to locality but give

up on a diﬀerent assumption that goes into the derivation of

Bell inequalities, namely, that the hidden variables are sta-

tistically independent of the setting variables. This is known

as the “superdeterministic” response to Bell’s theorem [

Advocates of this approach would presumably ﬁnd it natural

to quantify the resource of Bell inequality violations in terms

of the deviation from such statistical independence that is

required to explain a given violation. In particular, the results

of Refs. [

] and [

] seeking to quantify the nonindependence

needed to explain a given Bell inequality violation might be

framed within a resource-theoretic framework. However, given

that the setting variables can no longer be considered as freely

speciﬁable inputs within such an approach, it would be inap-

propriate to conceptualize a Bell experiment as a box-type

process as we have done here.

anism but replaces the usual formalism for causal

models with one which allows for more general pos-

sibilities on how to represent its components [44].

We refer to this attitude as the causal modelling

paradigm.

The causal modelling paradigm implies not only

a novel attitude towards Bell’s theorem, but also a

change in how one conceives of the resource that

powers the information-theoretic applications of

Bell-inequality violations. The resource is not taken

to be some abstract notion of distance from the set

of Bell-inequality-satisfying correlations within the

space of all nonsignalling correlations, as advocates

of the strictly operational paradigm seem to favour,

nor to consist of the strength of superluminal causal

inﬂuences, as advocates of the superluminal causa-

tion paradigm would presumably have it. Rather, we

take the resource to be the nonclassicality required

by any generalized causal model which can explain

the Bell inequality violations without ﬁne-tuning.

We shall show that in the resource theory that

emerges by adopting this attitude, the nonclassical-

ity of common-cause processes in Bell experiments

cannot be captured solely by the degree of violation

of facet-deﬁning Bell inequalities. That is, there are

distinctions among such common-cause processes—

diﬀerent ways for these to be nonclassical—which

do not correspond to distinctions in the degree of

violation of any facet-deﬁning Bell inequality.

2.2

The resource theory suggested by the

causal modelling paradigm

2.2.1 Generalized causal models

We will work with the notion of a generalized (i.e.,

not necessarily classical) causal model that has been

developed in Refs. [45, 46] using the framework of

generalized probabilistic theories (GPTs) [47–49]),

and refer to it as a GPT causal model.

Since

we are interested in the distinction between classical

For instance, for the notion of a quantum causal model pro-

posed in Ref. [

], reversible deterministic causal dependencies

are represented by unitaries rather than bijective functions,

and lack of knowledge is represented by density operators

rather than by classical probability distributions.

In the language of operational probabilistic theories [

we are considering

free

and

causal

GPTs. A GPT is said to

be free if, for any mathematically well-formed closed circuit,

it speciﬁes a joint probability distribution over the outcomes

of the instruments. A GPT is said to be causal if there exists

a unique deterministic eﬀect (which is often interpreted as

excluding backwards-in-time signalling in any circuit).

and nonclassical, without speciﬁcally distinguishing

quantum and post-quantum types of nonclassical-

ity, we will not be making use of any of the recent

work [44, 52, 53] on devising an intrinsically quan-

tum notion of a causal model.

Deﬁnition 1.

GPT causal model

consists of

a causal structure, represented by a directed acyclic

graph (DAG), and a set of GPT parameters. The

parameters specify, for each node in the DAG, a GPT

operation from the composite system associated to

the parents of that node to the system associated to

the node.

One can approach the study of nonclassicality in

arbitrary causal structures from within the scope of

these GPT causal models, and pursue the develop-

ment of a resource theory of such nonclassical fea-

tures.

We focus on experimental scenarios that are mul-

tipartite. The diﬀerent wings of the experiment

are commonly conceptualized as the laboratories

of diﬀerent parties, particularly when discussing

information-theoretic tasks that may be undertaken

by the parties. If one restricts attention to scenar-

ios wherein locally, at each wing, systems can be

put into arbitrary causal relations with one another

(consistent with the absence of backwards-in-time

causal inﬂuences), then the only freedom in stipulat-

ing the causal structure is in stipulating the causal

relations that hold among the wings of the exper-

iment. Causal relations among the wings come in

two forms: (i) a relation indicating the potential for

causal inﬂuence from one wing to another, corre-

sponding to having access to a GPT channel from

one to the other, and (ii) a relation indicating the po-

tential for a common cause to act on a set of wings,

corresponding to having a source which distributes a

multipartite GPT state among them. The GPT op-

erations and GPT states representing, respectively,

these cause-eﬀect and common-cause relations, to-

gether with the GPT operations representing causal

inﬂuences between systems at a given wing, consti-

tute the parameters of the GPT causal model.

The possible operational statistics that one can

observe in this scenario hence arise from all the pos-

sible ways one may assign values to the parameters

However, we will consider the question of when certain cor-

relations that arise in a GPT causal model can be quan-

tumly realized. Moreover, the follow-up work described in

Ref. [

] explicitly explores the distinction between resources

that are quantumly realizable and those that necessitate a

post-quantum GPT.

of the GPT causal model – both to those pertain-

ing to the causal structure among the wings, and to

those pertaining to the local actions in each wing.

In this article, we focus on a particular type

of causal structure among the wings, namely, one

wherein there is a common cause that acts on all

of the wings, but no causal inﬂuences between any

of them, which we term a Bell scenario. However,

in Appendix A.3, we do include some discussion re-

garding other possible causal structures among the

wings. Details about how entangled states and oper-

ations are represented in a GPT causal model can

be found in Refs. [45, 47, 48], and explicit descrip-

tions of these for the Bell scenario are provided in

Sec. 3.1 and in Appendix A.3.

2.2.2

The distinction between free and nonfree re-

sources in the causal modelling paradigm

We conceptualize any experimental conﬁguration as

a process from its inputs to its outputs. In the frame-

work of GPT causal models, one has the capacity to

consider processes that have GPT systems as inputs

and outputs at the various wings. However, we will

restrict our attention to processes that have only

classical inputs and outputs. Such processes can be

conceptualized as black-box processes, to which one

inputs classical variables and from which classical

variables are output. They are therefore precisely

the sorts of processes considered in the strictly op-

erational paradigm. We further restrict our atten-

tion to processes with a classical input and classical

output at each wing, where the input temporally

precedes the output.

In the strictly operational

paradigm, the term “box” is generally used as jar-

gon for such processes (for instance, as it is used in

the term “PR box” [57]). We therefore refer to such

processes as box-type processes or simply boxes.

Deﬁnition 2.

box

is a process with a classical

input and a classical output at each wing, represented

formally by a stochastic map from the tuple of inputs

to the tuple of outputs.

We use the term common-cause box to refer

to box-type processes which can be realized using a

causal structure consisting of a common cause act-

ing on all of the wings. These will be the resources

Thus, we do not consider processes which involve a sequence

over time of classical input variables and classical output

variables; that is, in the language of Refs [

], we do not

consider general n-combs.

that we focus on in this article. In GPT causal mod-

els, all common-cause boxes can be decomposed into

the preparation of a GPT state on a multipartite

system, followed by the distribution of the compo-

nent subsystems among the wings, followed by each

subsystem being subjected to a GPT measurement,

chosen from a ﬁxed set according to the classical

input variable at that wing (the local setting vari-

able), and the result of which is the classical output

variable at that wing (the local outcome variable). In

short, such processes can be decomposed in the same

manner in which a multipartite Bell experiment is

decomposed in quantum theory.

Deﬁnition 3.

common-cause box

(or, equiv-

alentely, a

GPT-realizable common-cause box

)

is a box that can arise from a GPT causal model of

a multipartite Bell scenario, so that the inputs and

outputs correspond, respectively, to the setting and

outcome variables associated to a set of local GPT

measurements implemented on a multipartite GPT

state.

The distinction between common-cause boxes

that are classically realizable and those that are

not (illustrated for the bipartite case in Fig. 1) is sim-

ply the distinction between whether there is a classi-

cal causal model underlying the process, or whether

it is only realizable by a causal model which invokes

a nonclassical GPT.

Classical causal models are causal models wherein

all systems mediating causal inﬂuences are repre-

sented by classical variables, so that every common-

cause source is represented by a joint probability dis-

tribution (referred to as shared randomness) and ev-

ery channel is represented by a conditional probabil-

ity distribution. Equivalently, classical causal mod-

els can be understood to arise as a subset of GPT

causal models wherein the systems are presumed

to be nonclassical (for instance, they might be pre-

sumed to be quantum), but every common-cause

source is represented by a GPT-unentangled state

and every channel is taken to be GPT-entanglement-

breaking. Particularizing to the case of common-

cause boxes, we have:

Deﬁnition 4.

classically realizable common-

cause box

is a common-cause box that admits of a

classical causal model, such that the common-cause

source consists of shared randomness. Equivalently,

it is a common-cause box that admits of a GPT causal

model wherein the common-cause source consists of

a GPT-unentangled state.

(a)

(b)

(a)

(b)

Figure 1: In the bipartite scenario, the distinction between

(a) a generic GPT-realizable common-cause box and (b) a

classically realizable common-cause box. Here and through-

out this article, single-line edges denote classical systems,

and single-line boxes denote processes that have only classi-

cal inputs and outputs (depicted in light blue). Double-line

edges denote nonclassical systems and double-line boxes

denote processes that have one or more nonclassical in-

puts or outputs (depicted in pink). Any common-cause

box whose input-output functionality is consistent with an

internal structure of the type indicated in (b) (regardless of

its actual internal structure) is termed classically realizable

and is considered free, while a common-cause box whose

input-output functionality is not consistent with the struc-

ture of (b) but instead is only consistent with an internal

structure of the type indicated in (a) is considered nonfree.

It follows that the free common-cause boxes are

precisely the nonsignalling boxes that satisfy all

the Bell inequalities, while the costly common-cause

boxes are the nonsignalling boxes that violate some

Bell inequality

2.2.3

Quantifying resourcefulness in the causal mod-

elling paradigm

In order to quantify the nonclassicality of common-

cause boxes (that is, the extent to which they fail

to be classically realizable), we will use an approach

to resource theories described in Ref. [26], namely,

the framework of partitioned process theories. An en-

veloping theory of processes must be speciﬁed,

together with a subtheory of processes that can be

implemented at no cost, called the free subtheory

of processes. This partitions the set of all processes

in the enveloping theory into free and costly (i.e.,

nonfree) processes. One can then ask of any pair of

Indeed, since the GPT colloquially known as Boxworld realizes

all and only the nonsignalling boxes in Bell scenarios [

], it

follows that all nonsignalling boxes admit of a GPT causal

model.

processes in the enveloping theory of a given type

whether the ﬁrst can be converted to the second by

composing it with a process (of the appropriate type)

from the free subtheory. If interconversion between

processes of type T require composition with a pro-

cess of type T

, then the set of free operations

on processes of type T are the elements of the free

subtheory of processes that are of type T

. Pairwise

convertibility relations under the set of free opera-

tions deﬁne a pre-order on the set of the resources

of interest, and a partial order over the equivalence

classes of such resources. One can then quantify the

relative worth of diﬀerent resources by their rela-

tive positions in this partial order. Functions over

resources that preserve ordering relations, termed

monotones, provide a particularly simple means of

quantifying the worth of resources.

The resource theory considered in this article will

be described in full detail in Sec. 3. Nonetheless, we

provide a sketch of its deﬁnition here in order to be

able to highlight the ways in which it contrasts with

other approaches.

We take the enveloping theory of processes to

include all GPT-realizable common-cause boxes as

well as the GPT-realizable processes that take ev-

ery such box to another such box while only making

use of a common cause (depicted in Fig. 2(a)).

process that takes a box to a box we will refer to

as a clamp because it is a process that has the form

of a comb with two teeth (relative to the notion of

“comb” introduced in [55, 56]). More precisely, a pro-

cess taking boxes to boxes is a clamp with classical

inputs and outputs. Those that only make use of a

common cause, we refer to as common-cause clamps

with classical inputs and outputs. Such clamps are

the most general type of process required in our en-

veloping theory because common-cause boxes are a

special case of these (for instance, when all the sys-

tems at the inputs and outputs on the bottom teeth

of the clamp are trivial).

We take the free subtheory of processes in our re-

source theory to consist of the subset of common-

cause clamps with classical inputs and outputs that

can be realized in a classical causal model, termed

classically realizable. The distinction between a

generic GPT-realizable common-cause clamp and a

This is a general approach to determining a pre-order over

resources of a given type—deﬁne the enveloping theory to

include processes corresponding to the resource type of interest

as well as the processes that are required to interconvert

between such resources.

classically realizable one is depicted in Fig. 2. By

virtue of boxes being a special type of clamp, this

deﬁnition is consistent with Deﬁnition 4.

(a) (b)

Figure 2: In the bipartite scenario, the distinction between

(a) a generic GPT-realizable common-cause clamp with

classical inputs and outputs and (b) a classically realiz-

able common-cause clamp with classical inputs and outputs.

Any common-cause clamp whose input-output functionality

is consistent with an internal structure of the type indi-

cated in (b) (regardless of its actual internal structure) is

termed classically realizable and is considered free, while a

common-cause clamp whose input-output functionality is

not consistent with the structure of (b) but instead is only

consistent with an internal structure of the type indicated

in (a) is considered nonfree.

To determine the ordering relations that hold

among these common-cause boxes, one must deter-

mine the convertibility relations among them. Given

the deﬁnition of our resource theory, whether one

GPT-realizable common-cause box can be converted

to another is determined by whether this can be

achieved by processing it with a classically realiz-

able common-cause clamp, as depicted in Fig. 3. This

subsumes correlated local processings of the inputs

and outputs of the box, as we describe in Section 3.2.

For both the GPT-realizable and classically realizable varieties

of these processes, one can deﬁne notions of sequential and

parallel composition such that the set of processes, together

with these composition relations, satisfy the formal deﬁnition

of a process theory [

], thereby justifying the claim that

the resource theory we have deﬁned is formally a partitioned

process theory. The proof of this fact, however, is not rele-

vant to any of the results in this article and is postponed to

forthcoming work [58].

Figure 3: In the bipartite scenario, the most general form of

a free operation (in blue) taking a GPT-realizable common-

cause box (in pink) to another.

2.2.4 A note about nomenclature

In this article, we avoid describing the resource be-

hind Bell inequality violations as “nonlocality”. This

is because we believe that it is only for those who

take the lesson of Bell’s theorem to be the existence

of superluminal causal inﬂuences that it is appro-

priate to describe violations of Bell inequalities by

this term. Researchers in the operationalist camp

have not, generally speaking, avoided using the term

“nonlocality”, but seem instead to use it as a syn-

onym for “violation of a Bell inequality” rather than

to imply a commitment to superluminal causal inﬂu-

ences. However, we believe that such a usage invites

confusion and so we opt instead to avoid the term

altogether. Nevertheless, our project is very much in

line with earlier projects that describe themselves as

developing a “resource theory of nonlocality”, such

as Refs. [15–18].

2.3

Contrast to the strictly operational

paradigm

As noted in the introduction and as will be demon-

strated in Section 3.2, in the special case of Bell

scenarios—the focus of this article—the natural

set of free operations within our causal modelling

paradigm is equivalent to one of the proposals for the

set of free operations made in earlier works within

the strictly operational paradigm, namely, local oper-

ations and shared randomness (LOSR), as the latter

is deﬁned in Refs. [16, 17]. Additionally, the nat-

ural enveloping theory adopted in the strictly op-

erational approach, namely, the set of no-signalling

boxes, also coincides with that of our enveloping the-

ory for the case of Bell scenarios, namely, the set

of GPT-realizable common-cause boxes (where the

equivalence of these two sets can be inferred from the

results of Ref. [48]). Therefore, in spite of the diﬀer-

ence in the attitude we take towards Bell’s theorem,

the resource theory that we deﬁne for Bell scenarios

is the same as the one studied in Refs. [16, 17].

Nonetheless, the diﬀerence in our attitude towards

Bell’s theorem is not inconsequential. We presently

outline its signiﬁcance for the project of this article

as well as for potential future generalizations of this

project.

Most importantly, the causal modelling approach

diverges sharply from any strictly operational ap-

proach once one considers causal structures beyond

Bell scenarios. As discussed in Appendix A.3, in

a resource theory of nonclassicality for more gen-

eral causal structures, both the free subtheory and

the enveloping theory proposed by the causal mod-

elling approach are radically diﬀerent from those

suggested by the strictly operational approach. In

particular, the free subtheory need not be LOSR in

a general causal structure and the enveloping the-

ory need not be the set of all nonsignalling opera-

tions. Our approach allows us to deﬁne a resource

theory that is speciﬁc to a scenario in which only

strict subsets of the wings are connected by common

causes [46, 59] (such as the triangle-with-settings sce-

nario described in Appendix A.3) and this provides

a concrete example of a case where the free subthe-

ory is not LOSR and the enveloping theory is not

all nonsignalling operations. In these cases, the free

operations are“local operations and causally admiss-

able shared randomness”,wherein only those subsets

of wings that are connected by a common cause have

shared randomness. This is distinct from the LOSR

operations, which assume that randomness is shared

between all the wings. It seems unlikely that the re-

source theory we propose in these cases can be mo-

tivated (or even fully characterized) in the strictly

operational paradigm.

Even for Bell scenarios, however, the causal mod-

elling approach oﬀers advantages over its competi-

tors. In particular, it singles out a unique set of free

operations, while the strictly operational approach

does not. From our perspective, the resource under-

lying Bell inequality violations is the nonclassical-

ity of the causal model required to explain them

with a common cause, so clearly the free operations

should involve only classical common causes act-

ing between the wings. In the strictly operational

paradigm, by contrast, any operation which pre-

serves no-signalling and takes local boxes to local

boxes might constitute a legitimate candidate for

a “free” operation. This ambiguity is reﬂected in

the existence of distinct proposals for the set of free

operations in strictly operational resource theories.

Aside from LOSR, there is also a proposal called

wirings and prior-to-input classical communication

(WPICC) [18] which allows for classical causal inﬂu-

ences between the wings prior to when the parties

receive their inputs (See Appendix A.1). If one be-

lieves that there is a singular concept which under-

lies the violation of Bell inequalities, then at most

one of these proposals (LOSR or WPICC) can be

taken as the relevant set of free operations.

Al-

though WPICC operations meet all desired opera-

tional criteria, they are immediately ruled out as

candidates for the free operations within the causal

modelling paradigm, on the grounds that they in-

volve nontrivial cause-eﬀect inﬂuences between the

wings.

Another advantage of our approach for the Bell

scenario is that it highlights the fact that LOSR is

by construction a convex set, a fact which is criti-

cal for the algorithmic method that we derive for

determining the ordering relation between any two

resources. In highlighting this fact, our approach led

us to notice an oversight in some previous attempts

to formalize LOSR, as discussed in Appendix A.2.

Finally, we note that prior work of Geller and

Piani [17] departs from the strictly operational

paradigm through their use of the uniﬁed operator

formalism [60, 61], which is analogous to the quan-

tum formalism, but where nonpositive Hermitian op-

erators are allowed to represent states. They do not

characterize boxes primarily by their input-output

functionality, but rather as a composition of a bi-

partite source with local measurements. Indeed in

their Fig. 4, they explicitly depict the internal struc-

ture of the box. It is in this sense that their approach

does not quite ﬁt the mould of a strictly operational

approach but is rather somewhat more in the ﬂavour

of the causal modelling approach we have described

here.

Nonetheless, the uniﬁed operator formalism dif-

fers signiﬁcantly from the GPT formalism of

Competing sets of free operations may be interesting for study-

ing phenomena other than the resource that powers violations

of Bell inequalities, but this is not the issue at stake in this

article.

Refs. [45, 46] with respect to the independence of the

nonclassical common cause from the measurements

employed in realizing nonclassical boxes. In the uni-

ﬁed operator formalism, the Hermitian operator de-

scribing the shared state cannot be chosen freely for

a given set of quantum measurements, because some

choices would yield negative numbers rather than

valid probabilities. By contrast, in the GPT formal-

ism that we adopt here, the set of GPT states is

contained within the dual of the set of GPT product

measurements, and hence any measurement scheme

can be paired with any shared state while yielding

valid probabilities. The causal modelling paradigm

must reject any dependence of the shared state on

the choice of measurements, while such dependence

is unavoidable within the uniﬁed operator formalism.

As deﬁned in Ref. [42], a causal model is a directed

acyclic graph, or equivalently, a circuit of causal pro-

cesses, wherein the distinct processes in the circuit

are required to be autonomous (i.e., independently

variable). We therefore classify Ref. [17] as neither

within the causal modelling paradigm nor within the

strictly operational paradigm, while still exhibiting

some features of each of these approaches.

2.4

Contrast to the superluminal causation

paradigm

To our knowledge, advocates of the superluminal

causation paradigm have not attempted to develop

a resource theory for Bell inequality violations (al-

though Refs. [35, 36] are related in spirit). If it were

attempted (within the framework of Ref. [26]), then

the commitments of the approach suggest that it

would also be done diﬀerently from the way we have

done so here. Those who endorse the superluminal

causation paradigm do not shy away from the notion

of causation, and hence a resource theory developed

within their paradigm could be presented using the

same framework that we use here — that of causal

models. However, such an approach would likely be

framed entirely in terms of classical causal models,

rather than introducing the notion of GPT causal

models.

Advocates of the superluminal causation

paradigm would naturally deﬁne the free boxes

to be those that involve only subluminal causes.

Hence, in scenarios wherein the inputs and the

outputs at one wing are space-like separated from

those at the other wings, so that subluminal causal

inﬂuences cannot act between the wings, a box is

free if and only if it can be realized by a classical

common cause. Thus, the natural choice of the free

subtheory in the superluminal causation paradigm

coincides with the free subtheory in the causal mod-

elling paradigm. On the other hand, the natural

choice of the enveloping theory in the superluminal

causation paradigm consists of the set of boxes that

are classically realizable given superluminal causal

inﬂuences between the wings. This diﬀers from the

enveloping theory in the causal modelling paradigm

because it includes boxes that are signalling. In the

superluminal causation paradigm, therefore, it is

natural to try and quantify the resource in terms

of the strength of the superluminal causal inﬂuence

between the wings that is required to explain it in

a classical causal model.

Because the enveloping theory within this

paradigm includes not only non-signalling boxes

that violate Bell inequalities but signalling boxes

as well, the resource theory is rich enough to de-

scribe communication between the wings. Therefore,

deﬁning the resource theory in this way would not

distinguish common-cause resources that are classi-

cally realizable from those that are not (as we pro-

pose to do here), but would instead draw a line be-

tween common-cause resources that are classically

realizable and everything else — including classical

signalling resources.

If one were to go this route,

then all of classical Shannon theory would be sub-

sumed in the resource theory. A potential response

to this expansion in the scope of the project might

be to try to eliminate such signalling resources by

hand, by demanding that the enveloping theory was

constrained to those boxes that are non-signalling

among the wings. Such a response, however, seems

to compromise the ideals of the superluminal cau-

sation paradigm, because no-signalling is an opera-

tional notion rather than a realist one.

It should be noted that no ﬁnite speed of superluminal causal

inﬂuences can satisfactorily account for the predictions of

quantum theory, per Ref. [

], so such inﬂuences would need

to be assumed to be of inﬁnite speed.

That is, if one seeks to partition resources of a given type

into classical and nonclassical varieties, then deﬁning the

enveloping theory correctly is just as important as deﬁning

the free subtheory correctly.

John Bell famously argued against the idea that no-signalling

could embody an assumption of locality in a fundamental

physical theory on the grounds that it was too anthropocen-

tric [63]:

...the “no signaling” notion rests on concepts which

are desperately vague, or vaguely applicable. The

3 Details of the resource theory

3.1 Free and nonfree common-cause boxes

We begin by formalizing the relevant deﬁnitions

from Sec. 2.2.2 and 2.2.3, and providing more details

about the deﬁnition of the resource theory. For ease

of presentation, we focus throughout on the bipar-

tite Bell scenario, but the multipartite Bell scenario

can be formalized analogously.

Fig. 1(a) depicts the structure of a generic GPT-

realizable common-cause box. The classical variables

that range over the (ﬁxed) choices of local measure-

ments are termed the setting variables, denoted

S (left wing) and T (right wing), while the classi-

cal variables that range over the possible results of

these measurements are termed the outcome vari-

ables, denoted X (left wing) and Y (right wing). In

this article, we will refer to the cardinality of the set

over which a variable X varies as “the cardinality of

X” and we will also sometimes refer to the cardinal-

ity of the setting (outcome) variable as simply “the

cardinality of the setting (outcome)”.

Let us label the system distributed to the left wing

by A and the one to the right wing by B. In the

GPT framework, states and eﬀects on A (B) are

represented by vectors in a real vector space of di-

mension d

), that is, in R

). States and

eﬀects on the composite AB are represented by vec-

tors in the tensor product of these vector spaces

⊗R

. If the GPT representation of the X

outcome of the S

s measurement on system A

is r

x|s

∈ R

and that of the Y

y outcome of

the T

t measurement on system B is r

y|t

∈ R

and if s

∈ R

⊗ R

denotes the GPT state

assertion that “we cannot signal faster than light”

immediately provokes the question: Who do we

think we are? We who can make “measurements”,

we who can manipulate “external ﬁelds”, we who

can “signal” at all, even if not faster than light? Do

we include chemists, or only physicists, plants, or

only animals, pocket calculators, or only mainframe

computers?

Strictly speaking, we do allow for a GPT where states and

eﬀects may correspond to vectors outside of the tensor product

of the local vector spaces. That is, we do not assume local

tomography. However, since the causal structure of a Bell

scenario is such that the measurements are assumed to be local,

we can focus on the eﬀects within

⊗ R

without loss of

generality, and therefore also on the states within

⊗ R

In other words, any so-called holistic degrees of freedom in

the GPT play no role in Bell scenarios, and therefore we can

ignore them without loss of generality.

of the composite AB, then the conditional proba-

bility distribution associated to this GPT-realizable

common-cause box is

XY |ST

(xy|st) = (r

x|s

⊗ r

y|t

) · s

, (1)

where · denotes the Euclidean inner product.

By virtue of their internal causal structure,

all GPT-realizable common-cause boxes satisfy

the no-signalling conditions P

Y |ST

= P

Y |T

and

X|ST

= P

X|S

. It is straightforward to verify that

this follows from Eq. (1) using the fact that

x|s

, where u

is the unique determin-

istic eﬀect on A, which is independent of value s of

the setting variable, and using the analogous fact for

The common-cause boxes that are considered to

be free in our resource theory are those that can

be realized when the GPT governing the internal

workings of the box is classical probability theory,

as depicted in Fig. 1(b).

In such cases, the scope of possibilities for the

overall functionality of the common-cause box can

be characterized as follows. The systems A and B

are described by classical variables,

and

(here assumed to be discrete). Classically, the com-

posite system AB is prepared in a joint distribu-

tion over these, P

. The GPT state in this

case is

[

]

(

, λ

)

, where

[

]

de-

notes the ith component of a vector v living in

vector space R

|Λ

⊗ R

|Λ

. Without loss of gen-

erality, we can take systems A and B to be per-

fectly correlated (by incorporating any noise into

the measurements), corresponding to the case where

(

, λ

) =

,λ

(

)

for some dis-

tribution P

(

)

, and where δ denotes the Kronecker-

delta function. This distribution over

and

can be conceptualized as follows: sample a variable

from some distribution, then let

and

copies of it.

Classically, the X=x outcome of the S=s mea-

surement on system A is modelled by a conditional

probability distribution P

X|SΛ

. The GPT eﬀect as-

sociated to this measurement on A is r

x|s

with com-

ponents

[

x|s

]

X|SΛ

(

x|sλ

)

. Similarly, the

GPT eﬀect associated to the measurement on B is

y|t

and has components

[

y|t

]

Y |T Λ

(

y|tλ

)

Substituting these expressions into Eq. (1), we con-

clude that a classically realizable common-cause box

satisﬁes

XY |ST

(xy|st)

X|SΛ

(x|sλ

Y |T Λ

(y|tλ

(λ

)

X|SΛ

(x|sλ)P

Y |T Λ

(y|tλ)P

(λ). (2)

This is recognized to be the expression for a condi-

tional probability distribution P

XY |ST

that satisﬁes

the Bell inequalities.

3.2

The free operations on common-cause

boxes

The most general free operation taking a bipartite

common-cause box with settings S, T and outcomes

X, Y to a bipartite common-cause box with settings

, T

and outcomes X

, Y

is the clamp depicted

in blue in Fig. 3. It is the most general processing

which makes use of a classical common cause that

can act on the local pre-processings and the local

post-processings at each of the wings. It subsumes as

special cases processings wherein classical common

causes act on any of the subsets of these four local

processings.

Note that the most general free operation allows

arbitrary feed-forward of classical information on

each wing, since this does not require any causal

inﬂuences between the wings.

But any such op-

eration can always also be put into the canonical

form depicted in blue in Fig. 4. It suﬃces to note

that the system that mediates the action of the com-

mon cause on the post-processings on a given wing

can always be passed down the classical side-channel.

Henceforth, we use this canonical form when describ-

ing the most general free operation.

Formally, such an operation transforms the condi-

tional probability distribution P

XY |ST

to P

XY ST

ST |XY S

XY |ST

(3)

Because the only physical restriction we are imagining is that

no cause-eﬀect inﬂuences are present between wings, feed-

forward of nonclassical information (that is, of arbitrary GPT

systems) at each wing is also a free LOSR operation. Without

loss of generality, however, we consider only feed-forward of

classical systems in this work, because this is already suf-

ﬁcient to generate any conditional probability distribution

ST |XY S

consistent with the causal structure, i.e.,

satisfying Eqs. (7-8).

where the conditional probability distribution

ST |XY S

satisﬁes certain constraints, which

we specify below.

Figure 4: The canonical form of a generic bipartite LOSR op-

eration

ST |XY S

(in blue) taking a common-cause

box

XY |ST

(in pink) to a common-cause box

Circuit fragments that map processes to processes

(such as the ones depicted in blue in Figs. 3 and 4)

have been studied extensively in recent years in a

variety of frameworks, most notably the quantum

combs framework of Refs. [55, 56], and the process

matrix framework of Refs. [64, 65]. If the source and

target resources are denoted by R and R

, respec-

tively, and the free operation is denoted by τ , we

represent Eq. (3) as

= τ ◦R, (4)

where ◦ is a particular instance of the link product

of Ref. [55].

On the left wing, the most general local pre-

processing takes as input the setting variable of

the target resource (S

) and the variable originating

from the common cause, and it generates as output

the setting variable of the source resource (S) as well

as an arbitrary variable which propagates down the

side-channel. The most general post-processing on

the left wing takes as input the outcome variable of

the source resource (X) and the side-channel vari-

able, and it generates as output the outcome vari-

able of the target resource (X

). Included as spe-

cial cases among these pre- and post-processings are

maps from S

to S and from X to X

that constitute

relabellings, coarse-grainings, or ﬁne-grainings of the

variable, where the possibilities are constrained by

the cardinalities of these variables. Also included as

special cases are instances where the map from S

to S or the map from X

to X is chosen probabilisti-

cally, and instances where these two maps are corre-

lated (by making use of the side-channel). The anal-

ogous pre- and post-processings at the right wing

are also possible. Finally, the choices of maps on the

left can also be correlated with the choices of maps

on the right, by leveraging the common cause.

The free operations are characterized by those

ST |XY S

which can be achieved via the type

of circuit fragment depicted in Fig. 4, namely, those

such that

ST |XY S

st|xys

) = (5)

S|XS

s|xs

T |Y T

t|yt

)

×P

(λ

)

for some joint distribution P

and for

some P

S|XS

and P

T |Y T

satisfying no-

retrocausation conditions

S|XS

= P

S|S

T |Y T

= P

T |T

(6)

One can directly check that any P

ST |XY S

admitting of a decomposition as in Eq. (5) satisﬁes

the operational no-signalling constraints

S|XY S

= P

S|XS

T |XY S

= P

T |Y T

(7)

and the operational no-retrocausation conditions

S|XS

= P

S|S

T |Y T

= P

T |T

(8)

The parts of the circuit fragment in Fig. 4 that

are associated to P

S|XS

and P

T |Y T

refer to as local operations. The part associated

to P

corresponds to a joint distribution on

the variables distributed to the two wings and can

therefore be conceived of as shared randomness.

Consequently, the free operations we are endorsing

here can indeed be described as local operations and

shared randomness (LOSR), as noted earlier.

Deﬁnition 5.

An operation is in the set

LOSR

(and termed an

LOSR operation

) if and only if it

is associated to a conditional probability distribution

ST |XY S

that admits of the sort of decompo-

sition speciﬁed by Eqs. (5) and (6).

Previous resource-theoretic approaches to Bell-

inequality violations have also endorsed the intu-

itive notion that local operations supplemented with

shared randomness should constitute the free opera-

tions. Diﬀerent works, however, have made diﬀerent

proposals for how this notion ought to be formal-

ized. The correct formalization, in our opinion, is

the one provided in Geller and Piani [17] and inde-

pendently in deVincente [16], which coincides with

the one given above

. Therefore, in this article we

are endorsing the proposal of Refs. [16, 17] to take

LOSR as the free operations. On the other hand,

Refs. [18, 20, 21] have formalized the notion of lo-

cal operations supplemented with shared random-

ness diﬀerently, deﬁning a strict subset of the set

LOSR deﬁned above (a subset that can be shown

to be nonconvex). Nonetheless, we believe that this

discrepancy was an oversight and that it is unlikely

anyone would defend taking this subset rather the

full set to deﬁne the resource theory. We discuss the

issue in depth in Appendix A.2.

As a ﬁnal comment, note that, without loss of

generality, we can take the joint distribution to be

(

) =

,λ

(

)

for some dis-

tribution P

, and hence express Eq. (5) as

ST |XY S

st|xys

) = (9)

S|XS

s|xs

λ)P

T |Y T

t|yt

λ)P

(λ).

As a consequence, the conditional probability distri-

bution P

ST |XY S

can be conceptualized as the

more familiar object P

Y |

for setting variables

T and outcome variables

Y that are deﬁned

as follows. We take the composite of the outputs of

the circuit fragment on the left wing, X

and S, as a

composite outcome variable

X, so that

= (

, S

)

Similarly, we take the composite of the inputs on the

left wing, X and S

, as a composite setting variable

S, so that

= (

X, S

)

. Making the analogous def-

initions for

Y and

T in terms of Y, T, Y

, T

on the

right wing, Eq. (9) can be rewritten as

Y |

(˜x˜y|˜s

t) = (10)

SΛ

(˜x|˜sλ)P

Y |

T Λ

(˜y|

tλ)P

(λ).

Recalling Eq. (2), it is clear that P

Y |

satisﬁes

all of the Bell inequalities. This illustrates the con-

The deﬁnition of

LOSR

given in Geller and Piani [

] is very

similar to the one provided here (see Fig. 4 therein), while

the one provided in de Vicente[

] is much more cumbersome.

sistency of our proposal for the free operations, for

we have just shown that the free operations on a re-

source P

XY |ST

are those that are achieved by taking

a link product [55] with a process P

ST |XY S

Y |

which satisﬁes all of the Bell inequalities.

3.2.1

Cardinality-based types for boxes and for opera-

tions

Deﬁnition 6.

We deﬁne the

type of a common-

cause box

as the collection of cardinalities of the

setting and outcome variables, and we denote the

type of a resource

as [

]. We introduce the fol-

lowing notational convention to specify types: the

cardinalities of the setting variables for all

wings

and the cardinalities of the outcome variables for all

wings are speciﬁed as the bottom and top rows,

respectively, of a 2

×n

matrix. For example, for the

2-wing common-cause box depicted in Fig. 1, the

type is

(

|X| |Y |

|S | |T |

)

, where

|O|

denotes the cardinality of

a variable O.

If we further particularize to the CHSH scenario,

where the cardinalities of both setting and outcome

variables is 2, then the type is

(

2 2

)

Deﬁnition 7.

Consider a source resource

type [

] and a target resource

of type [

]. We

denote the

type of an operation τ

taking any re-

source of type [

] to any resource of type [

] by

[τ]

= [R

] → [R

]

, and we denote the set of all free

operations of type [R

] → [R

] by LOSR

]→[R

]

Note that operations — including free operations

— can change the type of a resource, and hence spec-

ifying the type of an operation requires specifying

both the type of the initial resource as well as the

type of the ﬁnal resource. This reﬂects the fact that

we have not restricted the cardinalities of X

, Y

, S

or T

in Eq. (3) in any way.

3.2.2

Locally deterministic operations and local sym-

metry operations

It is valuable to consider two special ﬁnite-

cardinality subsets of LOSR operations: those that

are deterministic and those that are invertible. Note

that the invertible LOSR operations are included

among the deterministic ones because any indeter-

minism in the operation would be an obstacle to

invertibility.

Deﬁnition 8.

An LOSR operation is in the set

LDO

(i.e., it is a

locally deterministic oper-

ation

) if and only if the conditional probabilities

ST |XY S

which deﬁne the operation take val-

ues in

{

}

for all values of

and

. We denote the complete set of LDO operations

of type [R

] → [R

] by LDO

]→[R

]

Deterministic LOSR operations—i.e., LDO

operations—factorize in the sense that every LDO

operation can be expressed as the product of two

local deterministic operations such that

det

ST |XY S

= P

det

S|XS

det

T |Y T

. (11)

This follows from the fact that the deterministic de-

pendences preclude any dependence on the shared

random variables λ

and λ

in Eq. (5), which then

reduces to Eq. (11). Furthermore, the no retrocau-

sation assumption of Eq. (8) implies that these de-

terministic dependencies are of the following form:

det

S|XS

= δ

S,f

)

(X,S

)

det

T |Y T

= δ

T,f

)

(Y,T

)

(12)

for some functions f

, g

, f

and g

. Speciﬁcally,

on the left wing, S is generated deterministically

as a function of S

(the pre-processing) and X

generated deterministically as a function of X and

(the post-processing, which is setting-dependent),

and similarly for the right wing. A generic bipartite

locally deterministic operation is depicted in Fig. 5.

The cardinality of the set LDO for a given type

can be easily deduced. Let |S|, |X|, . . . denote

the cardinalities of the variables S, X, . . . The to-

tal number of possibilities for the function g

|X|·|S

, and the total number of possibilities for

the function f

is |S|

, so that the total number

of possibilities for a deterministic operation on the

left wing is



|S| · |X

|X|



. An analogous decom-

position holds for the deterministic operations on

the right wing, and the total number of possibilities

for these is



|T | · |Y

|Y |



. Consequently, the car-

dinality of the set LDO in this bipartite case is

|LDO| =



|S| · |X

|X|





|T | · |Y

|Y |



. (13)

The other important subset of LOSR are those

type-preserving operations which are invertible (and

hence also deterministic). We refer to this subset of

LOSR operations as the local symmetry operations

and denote it LSO.

Figure 5: A generic bipartite locally deterministic operation

ST |XY S

∈

LDO consists of a product of deter-

ministic operations at each wing. The black dots in the

ﬁgure represent classical copy operations, and the output

variables for each gate are deterministic functions of the

input variables for that gate.

Deﬁnition 9.

The set

LSO

(i.e., the

local sym-

metry operations

) is the subset of type-preserving

operations in LDO that are invertible.

Every local symmetry operation, P

sym

ST |XY S

has the form of a locally deterministic operation,

det

ST |XY S

, speciﬁed in Eqs. (11)-(12). That is,

sym

ST |XY S

= P

sym

S|XS

sym

T |Y T

. (14)

where

sym

S|XS

= δ

S,f

)

(X,S

)

sym

T |Y T

= δ

T,f

)

(Y,T

)

(15)

but where f

, g

are such that P

sym

S|XS

deﬁnes an

invertible map from

(

X, S

)

(

, S

)

, and where f

and g

are such that P

sym

T |Y T

deﬁnes an invertible

map from

(

Y, T

)

(

, T

)

. Unlike general LDO op-

erations, LSO operations are always type-preserving,

and hence the type

(

| |Y

| |T

)

always matches the type

(

|X| |Y |

|S | |T |

)

Note that an exchange of the parties is a symme-

try operation (i.e., invertible), but it cannot be im-

plemented by local operations, and so it is not part

of LSO.

As a ﬁnal remark, notice that the set of LSO oper-

ations forms a group. This follows from the fact that

the properties of being deterministic and invertible

persist under composition, and that the inverse of

every LSO operation is in LSO. This group is gen-

erated by the permutations of the value of a set-

ting variable, and the permutations of the value of

an outcome variable, where the choice of the latter

permutation might depend also on the value of the

setting variable on the same wing.

In the bipartite case, the LSO group is a ﬁnite

group of order

|LSO| = (|S|!) · (|X|!)

|S|

· (|T |!) · (|Y |!)

|T |

, (16)

corresponding to the

(

|S|

relabelings for the set-

tings of the left wing, multiplied by the

(

|X|

re-

labelings of outcomes for each of the |S| diﬀer-

ent settings, and similarly for the right wing. The

group can be generated by the relabelings of only

adjacent settings or outcomes, and hence the LSO

group admits a natural representation in terms of

(|S|−1) + |S|(|X|−1) + (|T |−1) + |T |(|Y |−1) gener-

ators (see Ref. [66, App. B]).

For a concrete example, consider the operations

transforming type

(

2 2

)

into type

(

2 2

)

. Throughout

this work, we index the values a variable X can

take as x ∈ {

,..., |X| −

}. Accordingly, in the

(

2 2

)

scenario, X, Y, S, T take values in {

}. Using

this notation, the group of LSO can be generated

explicitly by the four operations which interconvert

XY |ST

(

x, y|s, t

)

with either P

XY |ST

(

x, y|s⊕

, t

)

XY |ST

(

x, y|s, t⊕

, P

XY |ST

(

x⊕s, y|s, t

)

, or

XY |ST

(

x, y⊕t|s, t

)

, where ⊕ denotes summation

modulo two.

One can readily verify [67] that the

order of this group is 64.

Suppose that a resource R is represented as a

real-valued vector

R of conditional probabilities

XY |ST

(

xy|st

)

, or any linear transformation thereof

(such as the representation in terms of correlators

used in Section 6). LSO operations act as invertible

linear maps on such a representation. Assuming f is

a linear function over

R, then its action can be repre-

sented as f

(

) =

f ·

R for some

f. Hence, it is equally

as meaningful to speak about

f being transformed

under LSO group elements as it is to speak about

being so transformed. The action of an LSO opera-

tion on

f can be thought of as applying the inverse

transformation to

R, i.e.,

(π

f) ·

R = f · (π

-1

R). (17)

The order of a group is the cardinality of the set of group

elements, i.e., the order of the

LSO

group quantiﬁes the total

number of invertible LDO operations.

A second generating set of operations for this group is given

by τ

,..., τ

deﬁned in Proposition 16.

Note that many type-changing LOSR operations

are equally well-deﬁned as transformations on lin-

ear functions. The critical requirement is that the

operation be left-invertible, i.e., it should act as an

injective function on the set of conditional probabili-

ties. See Refs. [66, 68, 69] for discussions on the topic

of converting linear functions (and Bell inequalities

in particular).

3.3 Convexity of the set of free operations

We now show that the set of free operations is con-

vex, and that the extremal elements are determin-

istic, and enumerable for ﬁxed type of the source

resource and of the target resource. This implies

that the set of free operations mapping from a given

source resource type to a given target resource type

is a polytope.

We begin by proving convexity.

Proposition 10.

The set LOSR is convex,

i.e., if

∈ LOSR

and

∈ LOSR

, then

wτ

+ (1 − w)τ

∈ LOSR for 0 ≤ w ≤ 1.

This follows from the fact that the resources re-

quired to achieve such a mixing are achievable using

LOSR. Suppose β is a binary variable that decides

whether τ

or τ

will be implemented. It suﬃces to

imagine that β is sampled from a distribution P

where P

(0) =

w, that it is copied and distributed to

both wings (with a copy sent down the side-channel

at each wing), and that the local processings that

are implemented on each wing are made to depend

on β (chosen so that if β

b, then τ

is implemented

overall). Because β can be incorporated into the def-

inition of the shared randomness, the procedure just

described is itself achievable using LOSR.

The convexity of the set of LOSR operations is

crucial for the technique we develop in the next

section to answer questions about resource conver-

sion. Recognizing the full potential of this convex-

ity is one of the key contributions of our work. In

Appendix A.2, we discuss convexity further, in par-

ticular noting that previous formulations of LOSR

did not seem to recognize the physical realizability

of convex mixing within LOSR, but rather imposed

convexity mathematically.

Next, we highlight features of the extremal free

operations.

Proposition 11.

The set of convexly extremal oper-

ations in LOSR are precisely the subset of operations

comprising LDO, namely the deterministic LOSR

operations.

This proposition is a minor generalization of

Fine’s argument [70], since the latter states that

locally deterministic models can generate any con-

ditional distribution that arises in a locally inde-

terministic model. As in Fine’s argument, here too

any indeterminism in the local operations can be

absorbed into the shared randomness, and hence al-

lowing indeterministic local operations provides no

more generality than considering only deterministic

local operations.

Proof.

It suﬃces to run Fine’s argument [

] for

the composite variables

and

. To see this

explicitly, note that the constituent factors in the

expression for an LOSR operation in Eq.

(10)

can

be rewritten as

SΛ

(˜x|˜sλ) =

∈Λ

det,λ

(˜x|˜s)P

|Λ

(λ

|λ),

Y |

T Λ

(˜y|

tλ) =

∈Λ

det,λ

Y |

(˜y|

t)P

|Λ

(λ

|λ),

where for each value of

, the conditional

det,λ

describes a deterministic operation on the left wing

specifying the value of

= (

, S

) for every value

= (

X, S

), and similarly for

det,λ

Y |

on the right

wing. Plugging these back into Eq.

(10)

, we have

that

Y |

(˜x˜y|˜s

t) = (18)

,λ

det,λ

(˜x|˜s)P

det,λ

Y |

(˜y|

t)P

(λ

where we have deﬁned

(

)

λ∈Λ

|Λ

(

|λ

)

|Λ

(

|λ

)

(

)

Eq.

(18)

shows that a generic indeterministic LOSR oper-

ation can always be decomposed into a convex

combination of products of deterministic operations

on each wing. Not only is it the case that the

convexly extremal LOSR operations are included

within the LDO operations, but there is actually

precise equality between these two sets: all LDO

operations are convexly extremal because every

LDO operation is a deterministic map.

What we have shown above is that any element

of LOSR

]→[R

]

admits of a convex decomposition into el-

ements of LDO

]→[R

]

. This implies the following useful

geometric fact:

Proposition 12 (Polytope of free operations).

The set of all free operations of a given type is a

polytope whose vertices are the locally deterministic

operations of that type,

LOSR

]→[R

]

= ConvexHull



LDO

]→[R

]



. (19)

The number of vertices of this polytope corre-

sponds to the cardinality of the set of LDO oper-

ations, as given in Eq. (13).

4 Resource theory preliminaries

A central question in any resource theory is whether

one resource can be converted to another via the free

operations. Many notions of conversion are studied:

single-copy deterministic conversion, single-copy in-

deterministic conversion (where the probability of

success need only be nonzero), multi-copy conver-

sion (where one is given more than one copy of the

resource), asymptotic conversion (where one is given

arbitrarily many copies), and catalytic conversion

(where one has access to another resource that must

be returned intact after the conversion). We here

focus on single-copy deterministic conversion.

As noted earlier, we denote the application of an

operation τ to a resource R by τ ◦ R. If R

can

be converted to R

by free operations, one writes

7−→R

, otherwise one writes R

Y7−→R

. Explic-

itly,

7−→R

denotes that ∃ τ ∈ LOSR

] 7−→[R

]

such that R

= τ ◦R

and R

Y7−→R

denotes that @ τ ∈ LOSR

] 7−→[R

]

such that R

= τ ◦R

If one can determine, for any pair of resources

and R

, whether R

can be converted to R

us-

ing a free operation, then one can determine the

pre-order over all resources that is induced by the

conversion relation. A pre-order, by deﬁnition, is a

transitive and reﬂexive binary relation between re-

sources. The conversion relation is reﬂexive because

the identity operation is free and maps a resource to

itself, while it is transitive because if R

7−→R

and

7−→R

then R

7−→R

There are four possible ordering relations that

might hold between a pair of resources.

is strictly above R

if:



7−→R

and R

Y7−→R



is strictly below R

if:



Y7−→R

and R

7−→R



is incomparable to R

if:



Y7−→R

and R

Y7−→R



is equivalent to R

if:



7−→R

and R

7−→R



If R

is either strictly above or strictly below R

, we

say that R

and R

are strictly ordered.

We pause to comment on the notion of equivalence

of resources. By deﬁnition, if R

is equivalent to R

then the conversion from one to the other is free in

both directions,

∃ τ

∈ LOSR

] 7−→[R

]

such that R

= τ

◦ R

and ∃ τ

∈ LOSR

] 7−→[R

]

such that R

= τ

◦ R

It need not be the case, however, that either of the

free operations τ

or τ

is invertible, nor that one is

the inverse of the other. For instance, if R

and R

are both free resources, then τ

can be the operation

which discards R

and prepares R

, while τ

can be

the operation which discards R

and prepares R

The conversion relation between resources implies

a corresponding conversion relation between equiv-

alence classes of resources (relative to the equiv-

alence relation deﬁned above), wherein for any two

equivalence classes, they are either strictly ordered

or incomparable. The conversion relation between

equivalence classes is therefore antisymmetric and

describes a partial order relation rather than a

pre-order relation. One can therefore conceptualize

the project of characterizing the pre-order as a char-

acterization of the equivalence classes and of the par-

tial order that holds among these. In this work, we

do not provide a characterization of the equivalence

classes, and so our focus will be on directly charac-

terizing features of the pre-order of resources.

4.1 Global features of a pre-order

To have a complete understanding of deterministic

single-copy conversion in a resource theory, one must

have an understanding of the pre-order that this

conversion relation deﬁnes. In this section, we de-

scribe some of the basic features that characterize

pre-orders.

Perhaps the most basic question about a pre-order

of resources is whether or not it is totally pre-

ordered, meaning that every pair of elements in

the pre-order is strictly ordered or equivalent (i.e.,

the pre-order has no incomparable elements). Equiv-

alently, we say that a pre-order is totally pre-ordered

if and only if the partial order over equivalence

classes that it deﬁnes is totally ordered (i.e., has no

incomparable elements).

If there do exist incomparable resources, one can

ask if the binary relation of incomparability is tran-

sitive, in which case the pre-order is termed weak.

A chain is a subset of the pre-order in which ev-

ery pair of elements is strictly ordered. The height

of a pre-order is the cardinality of the largest chain

contained therein. An antichain is a subset of the

pre-order in which every pair of elements is incom-

parable. The width of a pre-order is the cardinality

of the largest antichain contained therein.

Other important properties of the pre-order refer

to the interval between a pair of resources, where

R is in the interval of R

and R

if and only if both

7−→R and R 7−→R

. If the number of equivalence

classes which lie in the interval between a pair of

resources is ﬁnite for every pair of inequivalent re-

sources, then the pre-order is said to be locally ﬁ-

nite, otherwise it is said to be locally inﬁnite.

4.2 Features of resource monotones

A resource monotone is a real-valued function

over

resources whose value cannot increase under any free

operation in the resource theory. Formally,

Deﬁnition 13.

A function

from resources to the

reals is called a

resource monotone

if and only if

7−→R

implies M(R

) ≥ M (R

), (20a)

or equivalently,

M(R

) < M (R

) implies R

Y7−→R

. (20b)

In other words, a resource monotone is an order-

preserving map from the pre-order of resources to

the total order of real numbers. Whenever some

monotone M and a pair of resources R

and R

satis-

ﬁes M

(

)

< M

(

)

, we will say that the monotone

M witnesses the fact that R

Y7−→R

If the pre-order is not totally pre-ordered (i.e., if

there exist incomparable resources), then no single

Technically, it is an extended-real-valued function, where the

set of extended real numbers is obtained by adding

−∞

and

+∞ to the set of real numbers.

monotone can completely characterize the pre-order.

A complete characterization may be achieved, how-

ever, by a family of monotones. Speciﬁcally, a fam-

ily of monotones {M

}

is said to be complete if it

completely characterizes the pre-order, that is, if

∀R

, R

: R

7−→R

if and only if ∀i : M

) ≥ M

(21)

A complete set of monotones is therefore an alterna-

tive way of describing the pre-order.

Strictly speaking, monotones should be functions

from resources of any type in the resource theory

to the reals. However, many natural functions are

only deﬁned for particular types of resources. For in-

stance, the function P

XY |ST

(00

00)

XY |ST

(11

01) +

XY |ST

(20

02)

is only deﬁned for common-cause

boxes where the cardinalities of X and T are three.

To accommodate this, we deﬁne the notion of a

monotone relative to a set S: M is a monotone

relative to a set S of resources if and only if for all

, R

} ∈ S, R

7−→R

implies M

(

)

≥ M

(

)

A family of monotones {M

}

is said to be complete

relative to a set S if it holds that

∀R

, R

∈ S : R

7−→R

if and only if ∀i : M

) ≥ M

(22)

If S is any set of resources all of which are of a

particular type, a monotone relative to S is said to

be type-speciﬁc.

4.3

Monotone constructions for any resource

theory

Here we review a variety of approaches to construct-

ing resource monotones. We will make use of these

versatile constructions to deﬁne an especially use-

ful pair of monotones for the resource theory of

common-cause boxes in Section 6.

4.3.1 Cost and yield monotones

It is possible to upgrade a type-speciﬁc monotone

to a type-independent monotone using either a cost

construction or a yield construction. In fact, a

cost or yield construction takes any function (mono-

tone or not) together with a set of resources and

induces a type-independent monotone from it, as fol-

lows.

Given any function f which maps some set S of

resources to the real numbers, one can deﬁne associ-

ated monotones which are applicable to all resources,

as follows:

M[f -yield, S](R)

= (23)

max

∈S

{f(R

) s.t. R 7−→R

M[f -cost, S](R)

= (24)

min

∈S

{f(R

) s.t. R

7−→R}.

If there does not exist any R

∈ S such that

R 7−→R

, then the yield is deﬁned to be −∞. Sim-

ilarly, if there does not exist any R

∈ S such that

7−→R, then the cost is deﬁned as ∞ [71].

In words, M

[

f-yield, S

]

is a monotone which asks

for the most valuable resource in the set S (as mea-

sured by the function f) that one can create from

the given resource R.

Meanwhile, M

[

f-cost, S

](

)

is a monotone which asks for the least valuable re-

source in the set S (as measured by the function

f) that one can use to create the given resource R.

Note that in both cases, many diﬀerent functions

may yield the same monotone, so there is a conven-

tional element to one’s choice of function. Note also

that S may be restricted to resources of a particu-

lar type (in which case f need only be deﬁned on

resources of that type), and yet the type of the re-

source R for which the monotones may be evaluated

is unrestricted.

4.3.2 Weight and robustness monotones

Various functions have been used as measures of

the distance of a resource from the set of clas-

sically realizable common-cause boxes in previous

work [16, 17, 22, 73–76]. In what follows, we high-

light some of these which are monotones in our re-

source theory.

The maximum of a function

over the set of boxes to which

can be converted can also be thought of as the performance

over the so-called ‘nonlocal game’ deﬁned by the ‘payoﬀ

function’

. Since the set of boxes to which

can be con-

verted (of any given type) is a polytope, it follows that all

forbidden conversions (those from

to a resource outside the

polytope) can be witnessed by a suitable set of payoﬀ functions,

namely, whatever linear functions pick out the facets of

’s

polytope (for any given target type). In other words, any

resource outside the polytope will attain a higher value on at

least one of these functions. It follows, then, that the set of

yield monotones induced by all possible linear functions con-

stitutes a complete set of monotones. While this observation

may not be useful in practice, it does pose an interesting con-

trast with the ﬁndings of Ref. [

]: For common-cause boxes,

we ﬁnd that ‘nonlocal games’ constitute a complete set of

monotones; whereas [

] shows that for the resource theory of

quantum states under LOSR it is semiquantum games instead

of nonlocal games that form a complete set of monotones.

The nonlocal fraction, which we denote here by

, is the minimum weight of the nonfree fraction

in any convex decomposition of the resource,

(R)

= (25)

min

0≤λ≤1

∗

∈S

[R]

L∈L

[R]

{λ s.t. R = λ R

∗

+ (1−λ)L}.

The nonlocal fraction was proven to be a resource

monotone relative to (a superset of) the LOSR free

operations in Ref. [16, Sec. 5.2], though it is there

termed the ‘EPR2’ measure.

Next, there is the case of robustness measures

which quantify the minimum weight of a resource

from some particular class that must be added con-

vexly with the original resource for the mixture to be

free. The two robustness measures that we consider

diﬀer by the class of resources that are mixed with

the original resource. The ﬁrst, which we denote by

RBST,L

(

)

, considers mixing the original resource

R with any element in the set L

[R]

of free resources

of the same type:

RBST,L

(R)

= (26)

min

0≤λ≤1

L∈L

[R]



λ s.t. λ L + (1−λ)R ∈ L

[R]



This robustness measure was shown to be a resource

monotone relative to LOSR in Ref. [17, Sec. 3].

The second robustness measure, which we denote

simply by M

RBST

(

)

considers mixing the original

resource R with any element in the set S

[R]

of all

resources of the same type:

RBST

(R)

= (27)

min

0≤λ≤1

∗

∈S

[R]



λ s.t. λ R

∗

+ (1−λ)R ∈ L

[R]



The uniﬁed resource theory formalism of Ref. [71]

implies that all three of these distance measures are

resource monotones in any resource theory wherein

all of the operations in the free set are convex-

linear

operations, including our resource theory

here. Additionally, in Corollary 18, we show that

each of these three distance measures can be ex-

plicitly related to a monotone for which we pro-

vide a closed-form expression relative to

(

2 2

)

-type

Note that in Ref. [76] these were termed ‘visibilities’.

An operation

is convex-linear if the image

τ ◦

(

) is a given

mixture of

τ ◦

(

) and

τ ◦

(

) whenever the preimage

is the same mixture of

and

. All linear operations are

convex-linear.

resources. By extension, we therefore also provide

closed-form expressions for these three distance mea-

sures relative to

(

2 2

)

-type resources.

5 A linear program for determining

the ordering of any pair of resources

Next, we provide a linear program which allows one

to determine the ordering relation that holds be-

tween any two resources in our enveloping theory.

To do so, it is convenient to set up some useful no-

tation.

Deﬁnition 14.

Let the bold symbol

refer to any

set of resources. We use subscripts to specify the type

of the resources in the set, such as

(

|X| |Y |

|S | |T |

)

[R]

We use superscripts to specify further properties of

a set. For example, the set of all GPT-realizable

common-cause boxes is denoted by

, the set of all

nonfree resources is denoted by

nonfree

, and the set

of all free resources is denoted by

free

. Whenever

we wish to emphasize that a speciﬁc set is discrete,

we denote it

, and whenever we wish to emphasize

that a speciﬁc set is a polytope, we denote it P .

Let P

LOSR

]

(

)

denote the continuous set of re-

sources of type

[

]

into which R

can be converted

under LOSR, that is, the image of R

under LOSR

]→[R

]

Similarly, let V

LDO

]

(

)

denote the discrete set of re-

sources of type

[

]

into which R

can be converted

under LDO, that is, the image of R

under LDO

]→[R

]

From Propositions 10 and 12, and the ﬁnite cardi-

nality of V

LDO

]

(

)

, it follows that P

LOSR

]

(

)

a convex set with a ﬁnite number of vertices, and

hence is a polytope:

Proposition 15

(The polytope of resources obtain-

able from a given resource by LOSR).

The set of all resources of type [

] obtainable from

by LOSR forms a polytope,

LOSR

]

) = ConvexHull



LDO

]

)



. (28)

We can express the content of Proposition 15

equivalently as

7−→R

if and only if

∈ ConvexHull



LDO

]

)



(29)

Therefore, to determine whether R

is higher than

in the pre-order of resources, it suﬃces to imple-

ment the following computational test:

1. Enumerate all of the locally deterministic oper-

ations which take resources of type

[

]

to type

]. (They are ﬁnite in number.)

2. Compute the images of R

under all of these

locally deterministic operations.

3. Determine whether or not R

can be expressed

as a convex combination of these images. (This

is a linear program.)

To determine which of the four possible ordering

relations holds for a given pair of resources, R

and

, it suﬃces to determine whether R

7−→R

or not

and whether R

7−→R

or not. This requires just two

instances of the linear program.

According to Proposition 15, the image of a re-

source under the set of all LOSR free operations is

equivalent to the convex closure of the image of the

resource under only the extremal operations. Replac-

ing the set of all operations with only the extremal

ones is a dramatic shortcut.

In principle, the linear program just described al-

lows one to characterize the pre-order completely.

For instance, this linear program deﬁnes a complete

set of monotones for a given set of resources S,

namely, {M

∈ S} where the monotone M

is deﬁned as follows: for all R ∈ S, M

(

) = 1

if R → R

by LOSR and M

(

) = 0

otherwise.

(

)

reports the answer returned by the linear

program for the question of whether R → R

LOSR, and if one has the answer for all R

∈ S, then

one has located R within the pre-order. However,

such a brute-force characterization of the pre-order

requires one to apply the linear program to every

pair of resources, which is not possible in practice.

Rather, the linear program is primarily useful for

answering questions about conversions among pairs

(or ﬁnite sets) of resources.

To characterize the full pre-order more generally,

one would ideally have a ﬁnite set of resource mono-

tones that characterize the pre-order completely.

Furthermore, in order to determine certain global

properties of the pre-order, such as those described

earlier, knowledge of a few carefully chosen resource

monotones will typically suﬃce. This is the strategy

we will adopt hereafter in the article. Speciﬁcally,

over the next few sections, we deﬁne a pair of re-

source monotones and use these to prove that the

pre-order of single-copy deterministic conversion is

not totally pre-ordered (i.e., there exist incompara-

ble resources), that it is not weak (the incompara-

In the language of Ref. [

], these linear programs constitute

a complete witness for conversion.

bility relation is not transitive), that it has both in-

ﬁnite width and inﬁnite height, and that it is locally

inﬁnite.

6 Two useful monotones

We will deﬁne two monotones, one a cost construc-

tion and the other a yield construction, where the

sets of resources relative to which these costs and

yields are evaluated (to be described below) contain

only resources of type

(

2 2

)

. It is useful to ﬁrst review

some facts about the set of all common-cause boxes

of type

(

2 2

)

, that is, about S

(

2 2

)

6.1

Preliminary facts regarding CHSH inequal-

ities and PR boxes

We adopt the convention of Ref. [78] of parametriz-

ing common-cause boxes of type-

(

2 2

)

in terms of out-

come biases and two-point correlators. The outcome

biases are

i :=

x∈{0,1}

(−1)

X|S

(x|s)

= P

X|S

(0|s) − P

X|S

(1|s)

and hB

i :=

y∈{0,1}

(−1)

Y |T

(y|t)

= P

Y |T

(0|t) − P

Y |T

(1|t),

and the two-point correlators are

i :=

x,y∈{0,1}

(−1)

(x⊕y)

XY |ST

(xy|st).

Recalling that the set of common-cause boxes co-

incides with the set of no-signalling boxes in the Bell

scenario, S

(

2 2

)

constitutes what is conventionally re-

ferred to as the “no-signalling” set for this type.

This set is well-known to be a polytope deﬁned by

16 positivity inequalities [74, 79].

The set of classical (free) resources of type

(

2 2

)

is a subset therein, conventionally termed the “local

set”, and is deﬁned by the same 16 positivity inequal-

ities together with eight additional facet-deﬁning

Bell inequalities, namely, the canonical CHSH in-

equality and its seven variants [80]. A resource is

However, as noted in Appendix A.3, for causal structures

diﬀerent from the Bell scenario, the set

of processes that

can be realized by a GPT causal model on the causal structure

is typically distinct from the no-signalling set.

therefore nonclassical (nonfree) if and only if it vio-

lates a facet-deﬁning Bell inequality.

The eight variants of the canonical CHSH function

are

CHSH

(R)

= +hA

i+hA

i−hA

CHSH

(R)

= +hA

i+hA

i−hA

i+hA

CHSH

(R)

= +hA

i−hA

i+hA

CHSH

(R)

= −hA

i+hA

CHSH

(R)

= −hA

i−hA

i+hA

CHSH

(R)

= −hA

i−hA

i+hA

i−hA

CHSH

(R)

= −hA

i+hA

i−hA

CHSH

(R)

= +hA

i−hA

(31)

The canonical CHSH function is CHSH

, which we

will sometimes denote simply as CHSH.

In terms of these, the eight facet-deﬁning Bell in-

equalities are

CHSH

(R) ≤ 2 for k ∈ {0, . . . , 7}. (32)

Note that the regions deﬁned by strict violation of

each of the eight inequalities are nonoverlapping [74].

It follows that one and only one of the eight CHSH

inequalities can be violated by a given resource, i.e.,

for nonfree R there is precisely one value of k such

that CHSH

(R) > 2.

There are eight extremal nonfree vertices of the

full polytope S

(

2 2

)

. One of these is the canonical

PR box [57, 81], denoted R

and deﬁned explicitly

in Table 2; the other seven are variants of this PR

box. For each k, we denote the associated variant

of the PR-box by R

PR,k

(so that the canonical PR

box is associated to k

= 0

, R

PR,0

). R

PR,k

is the unique resource that maximally violates the

kth CHSH inequality, i.e., that achieves its algebraic

maximum, CHSH

PR,k

) = 4.

Unsurprisingly, the variants of the facet-deﬁning

Bell inequalities are interconvertible under LSO op-

erations, as are the variants of the extremal vertices.

To illustrate this, it is convenient to factorize the

(

2 2

)

LSO group into a subgroup which stabilizes CHSH

and a subgroup which does not, as follows.

Proposition 16.

Consider the following invertible

operations, i.e., elements of the LSO group for

(

2 2

)

-type resources:

: P

XY |ST

(x, y|s, t) ↔ P

XY |ST

(x, y⊕1|s, t)

: P

XY |ST

(x, y|s, t) ↔ P

XY |ST

(x, y|s⊕1, t)

: P

XY |ST

(x, y|s, t) ↔ P

XY |ST

(x, y|s, t⊕1)

: P

XY |ST

(x, y|s, t) ↔ P

XY |ST

(x⊕1, y⊕1|s, t)

: P

XY |ST

(x, y|s, t) ↔ P

XY |ST

(x⊕s, y|s, t⊕1)

: P

XY |ST

(x, y|s, t) ↔ P

XY |ST

(x, y⊕t|s⊕1, t)

Then,

(16a)

The order-64 group

123456

generated by

{τ

, τ

}

is the entire LSO group

for

(

2 2

)

resources.

(16b)

The order-8 subgroup

123

generated by

{τ

, τ

}

has no elements in common with

the subgroup

456

generated by

{τ

, τ

}

other than the identity operation.

(16c)

The order-8 subgroup

456

generated by

{τ

, τ

}

stabilizes the canonical PR box and

the CHSH

inequality.

(16d)

For any

k ∈ {

...

}

, the orbit of

CHSH

under

123

{CHSH

, ..., CHSH

}

, and the orbit of

PR,k

under G

123

is {R

PR,0

, ..., R

PR,7

Proof.

The ﬁrst two claims in Proposition 16 are

readily veriﬁed by standard group theory algo-

rithms [

]. The latter two claims become self-

evident by explicitly examining the actions of the

operations on expectation values (and hence, their

action on resources or functions on resources), per

Table 1. In light of Table 1, the third claim is

easily veriﬁed. The fourth claim simply captures the

fact that the eight CHSH functions are related by

LSO

, and similarly the eight PR boxes are also inter-

convertible under

LSO

. We can explicitly show how

the interconversions are accomplished by

123

by de-

scribing the actions of

{τ

, τ

}

as permutations on

the ordered set of

CHSH

functions, or equivalently,

on the ordered set of PR boxes.

• τ

ﬂips the sign of every correlator, so the action

on the ordered set of

CHSH

functions is

the permutation (0, 4)(1, 5)(2, 6)(3, 7).

• τ

exchanges the roles of

and

, so the ac-

tion of

on the ordered set of

CHSH

functions

is the permutation (0, 1)(2, 3)(4, 5)(6, 7).

• τ

exchanges the roles of

and

, so the ac-

tion of

on the ordered set of

CHSH

functions

is the permutation (0, 2)(1, 3)(4, 6)(5, 7).

Therefore the orbit of

CHSH

under

123

is easily

checked to be {CHSH

, ..., CHSH

}, as claimed.

The ordered set of PR boxes transforms under

LSO operations in exactly the same manner as the

ordered set of CHSH functions, since the values of

the marginals and the correlators for resource

PR,k

coincide with the coeﬃcients of the associated terms

in the linear function

CHSH

(compare, e.g., the

expression for CHSH

in Eq.

(31)

with the values of

the marginals and correlators for

in Table

(2)

Hence, the argument just given also establishes that

the orbit of

PR,k

under

123

PR,0

, ..., R

PR,7

}

6.2 Deﬁning the two useful monotones

Monotone 1: The yield of a resource with re-

spect to the set of resources of type

(

2 2

)

, as

measured by the CHSH function.

To deﬁne our ﬁrst monotone, consider the canon-

ical CHSH function

CHSH(R)

= hA

i + hA

i − hA

The CHSH function is type-speciﬁc

and further-

more is not a monotone [16]. However, we can apply

the prescription of Eq. (23) to this function, taking

the set S to be S

(

2 2

)

, i.e., the set of all common-

cause boxes of type

(

2 2

)

. Doing so, we deﬁne the

following (type-independent) yield-based monotone,

which we will denote by M

CHSH

(R)

= M[CHSH-yield, S

(

2 2

)

](R)

= max

∈S

(

2 2

)

{CHSH(R

) s.t. R 7−→R

(33)

Note that one can always ﬁnd some R

∈ S

(

2 2

)

such that R 7−→R

regardless of the type or details

of R, simply because free resources of type

(

2 2

)

may

always be freely generated after discarding R. Hence,

the value of this monotone is never less than 2, which

is the maximum of the CHSH function when applied

to the subset of free resources.

If one applies this procedure to any of the eight

variants of the CHSH functions in Eq. (31), the

monotones one thereby obtains all turn out to be

equivalent to M

CHSH

. This follows from the fact that

all variants of the CHSH function are interconvert-

ible under LSO and therefore the maximum of any

one in an optimiziation over all LOSR operations is

the same as any other, as noted in Proposition 16d.

Monotone 2: The cost of a resource with re-

spect to a set of noisy PR box resources, as

measured by the CHSH function.

Our second monotone also involves optimizing the

CHSH function, but it is a cost-based monotone, and

the set of resources over which one optimizes is re-

stricted to a particular one-parameter family of re-

sources of type

(

2 2

)

(rather than the full set S

(

2 2

)

To deﬁne this family, we need to highlight a par-

ticular resource in the free set, which we denote

The CHSH function is well-deﬁned only for resources of type

(

2 2

)

i hA

i hB

i hA

i −hB

i −hA

i hA

i hB

i hA

i hB

i hA

−hA

i −hA

i −hB

i hA

i −hA

i hB

i hA

i −hA

i hA

i −hA

i hA

i hB

i −hB

i hA

i −hA

Table 1: Action of each of the six speciﬁed symmetry operations in terms of marginal expectation values and correlators.

NPR

can be deﬁned as the uniform mix-

ture of the PR box with the maximally mixed re-

source L

∅

(deﬁned in Table 2), namely L

NPR

∅

, as enumerated in Table 2. The su-

perscript

in the notation L

NPR

denotes the fact

that this resource sits on the boundary of the free

set, namely, that it saturates the canonical CHSH

inequality, CHSH(L

NPR

) = 2.

The one-parameter family of resources deﬁning

our cost construction are the convex mixtures of

and L

NPR

. We denote the set of these by C

NPR

Formally,

NPR

= {C(α) : α ∈ [0, 1]}, (34)

where

C(α)

= α R

+ (1−α)L

NPR

. (35)

We use “C” because the set of resources forms a

chain (deﬁned in Section 4.1) and “NPR” because

each resource in the chain is a noisy version of the

PR box.

Geometrically, the chain C

NPR

describes a line

segment of resources with endpoints R

and L

NPR

and α parametrizes the distance from C

(

)

to L

NPR

(the bottom of the chain). To see that the elements

of C

NPR

do indeed form a chain in the partial order,

it suﬃces to note that one can move downwards (de-

creasing α) starting from any C

(

)

by mixing C

(

)

with L

NPR

, but one cannot move upwards (increas-

ing α) from any C

(

)

, as doing so would require

increasing the value of the monotone M

CHSH

Table 2 provides an explicit characterization of a

generic resource on the chain, as well as its endpoints

and the maximally-mixed free resource.

The use of

instead of

when describing the resource

NPR

is a nod to the conventional terminology wherein the classically

realizable common-cause boxes are often called the local boxes.

See the discussion in the introduction for why we explicitly

avoid the local-nonlocal terminology here.

Using this one-parameter family of resources, we

deﬁne the following cost-based monotone, which we

denote M

NPR

(R)

= M[CHSH -cost, C

NPR

](R) (36)

= min

∈C

NPR

{CHSH(R

) s.t. R

7−→R},

where if for some R there is no R

∈ C

NPR

such

that R

7−→R, then we deﬁne M

NPR

= ∞.

Critically, note that the CHSH function is an in-

jective (one-to-one) mapping from points on the line

segment C

NPR

to the real numbers, with

CHSH



C(α)



= 2α+2. (37)

Thus, the problem of minimizing the CHSH function

over R

∈ C

NPR

such that R

7−→R is exactly the

same as minimizing the function

α+

under the

constraint C(α) 7−→R, that is,

NPR

= min

α∈[0,1]

{2α+2 s.t. C(α) 7−→R}. (38)

For each variant R

PR,k

of the PR box, where

k ∈ {

, . . . ,

}, we can deﬁne the chain of noisy ver-

sions thereof, that is, C

NPR,k

(

) :

α ∈

}

where C

(

)

α R

PR,k

+ (1

−α

)

NPR,k

, with

NPR,k

PR,k

∅

. One can of course deﬁne a

cost-based monotone for each such chain. However,

all eight of these chains deﬁne the same monotone,

because the local symmetry operations allow one

to move among these, as a consequence of Propo-

sition 16d and the fact that L

∅

is stable under all

(

2 2

)

-type local symmetry operations.

As an aside, note that, unlike the cost with respect to the

chain

NPR

, Eq.

(36)

, the cost with respect to the set

(

2 2

)

of all resources of type

(

2 2

)

, as measured by the CHSH func-

tion, is utterly uninformative with regards to distinguishing

the elements of

(

2 2

)

. This is because the resource

PR,4

can be converted to any other

(

2 2

)

-type resource, and yet

i hA

i hB

i hA

i CHSH

= C(1) 0 0 0 0 +1 +1 +1 −1 4

NPR

= C(0) 0 0 0 0

−1

/2 2

C(α) 0 0 0 0

α+1

−α−1

2α+2

∅

0 0 0 0 0 0 0 0 0

Table 2: An explicit description of the resources referenced in our deﬁnitions.

6.3

Closed-form expressions for

CHSH

and

NPR

for

(

2 2

)

-type resources

The deﬁnitions of M

CHSH

and M

NPR

both involve

an optimization over a continuous set of states. In

this section, we derive closed-form expressions for

these monotones for resources of type

(

2 2

)

Consider ﬁrst M

CHSH

Proposition 17.

For any free resource

of type

(

2 2

)

CHSH

(

) = 2. For any nonfree resource

type

(

2 2

)

, there is a unique

k ∈ {

, . . . ,

}

for which

CHSH

(R) > 2 and such that

CHSH

(R) = CHSH

(R). (39)

Equivalently, each function

CHSH

is a monotone

relative to the subset of

(

2 2

)

-type resources for which

CHSH

(R) ≥ 2.

Proof.

We already noted in Section 6 that

CHSH

(

) = 2 for all resources

that are free,

so it suﬃces to consider the case of nonfree resources.

As noted above, the fact that there is precisely one

value of

such that

CHSH

(

)

2 for a nonfree

resource

follows from the results in Ref. [

]. Thus,

we must show that

CHSH

(

) =

CHSH

(

) for this

value of k.

To prove this, we invoke Theorem 2.2 of

Ref. [

], which informs us that every resource

which violates the

CHSH

inequality ad-

mits a convex decomposition in terms of the

th variant of the PR box and some free re-

source that saturates the

CHSH

inequality,

denoted

, such that

R = λ R

PR,k

+ (1−λ)L

for some

λ ∈

1]. Further,

is spec-

iﬁed uniquely by the linearity of the

CHSH

functions and the fact that

CHSH

PR,k

) = 4

and

CHSH

) = 2

, which together imply that

CHSH

(

PR,4

) =

−

4, the algebraic minimum of the canonical

CHSH function. Consequently, the value of this CHSH-cost

with respect to the set of all resources of type

(

2 2

)

−

4. Since

this monotone is constant on all resources in the scenario, it

is completely uninformative.

CHSH

(

) =

CHSH



λ R

PR,k

+ (1−λ)L



4λ + 2(1−λ)

. Again leveraging this unique decompo-

sition together with linearity of the

CHSH

function

and the linearity of LOSR transformations, it follows

that for any LOSR operation

, we have

CHSH

(

τ ◦

) =

λ CHSH

(τ ◦R

PR,k

) + (1−λ) CHSH

(τ ◦L

)

Clearly

CHSH(τ ◦R

PR,k

) ≤ 4

, since four is the al-

gebraic maximum of the

CHSH

function, and

CHSH

(τ ◦L

) ≤ 2

, since every LOSR operation

takes a free resource

to a free resource

for which

CHSH

(

)

≤

2. For

such that

CHSH

(

)

2, then, it follows that free opera-

tions on

cannot increase its

CHSH

value, and

hence the maximum in Eq.

(33)

is achieved by

itself. This proves Eq. (39).

Using the closed-form expression for M

CHSH

, we

can additionally provide closed-form expressions for

the weight and robustness monotones introduced in

Section 4.3.2 for

(

2 2

)

-type resources:

Corollary 18.

For resources of type

(

2 2

)

, the nonlo-

cal fraction and the robustnesses to mixing are related

to M

CHSH

as follows:

(R) =

CHSH

(R) − 2

, (40a)

RBST,L

(R) =

CHSH

(R) − 2

CHSH

(R) + 2

, (40b)

RBST

(R) =

CHSH

(R) − 2

CHSH

(R) + 4

. (40c)

Proof.

The relationship of these distance measures to

the extent by which the

CHSH

inequality is violated

was derived in Appendix E of Ref. [

]. We simply

recast those results in terms of

CHSH

(

) instead

of CHSH(R) by means of Proposition 17.

The values of the four monotones M

CHSH

(

)

(

)

, M

RBST,L

(

)

, and M

RBST

(

)

are therefore

all expressible as strictly-increasing functions of one

another when applied to resources of type

(

2 2

)

. That

is, if any one of these monotones increases (respec-

tively decreases) between a given pair of resources

of type

(

2 2

)

, then all of monotones will similarly in-

crease (respectively decrease) between that pair of

resources. As we will focus on the

(

2 2

)

type below,

and the three distance-function monotones are no

more informative than M

CHSH

in this case, we will

not discuss them further.

We now turn to providing a closed-form expres-

sion for M

NPR

for resources of type

(

2 2

)

. We ﬁrst

recall some more details of the geometry of S

(

2 2

)

Recall that we use the superscript b to denote

that a resource lies on the particular boundary of

the free set that is deﬁned by the CHSH inequal-

ity (and thus that it saturates this inequality). We

further use the superscript bb to denote that a re-

source both saturates the CHSH inequality and ad-

ditionally lies on the boundary of the full polytope of

resources, S

(

2 2

)

. The set L

of CHSH

-inequality-

saturating resources is 7-dimensional, and the set

of CHSH

-inequality-saturating resources on

the boundary of the full polytope S

(

2 2

)

is 6-

dimensional.

It follows that L

⊆ L

Proposition 19.

For any free resource

of type

(

2 2

)

NPR

(

) = 2. For any nonfree resource

type

(

2 2

)

, there is a unique

k ∈ {

, . . . ,

}

for which

CHSH

(

)

2. Within this region, if

R ∈ C

NPR,k

then we have simply

NPR

(

) =

CHSH

(

). If, on

the other hand, R 6∈ C

NPR,k

, we have

NPR

(R) = 2α+2,

where

is the value appearing in the decomposition

γ L

+ (1

−γ

)

(

), where

(

)

∈ C

NPR,k

∈ L

and

γ ∈

1]. This value of

is un-

ambiguous (and computable from simple geometry)

because there exists a unique resource

∈ L

and

a unique choice of

γ ∈

1] and of

α ∈

1] such

that R = γ L

+ (1−γ)C

(α).

is a facet of the 8-dimensional

(

2 2

)

local polytope, and

facets of polytopes are always one dimension lower than the

dimension of the polytope itself. A resource is within

if it is both a member of the facet deﬁned by the CHSH

inequality and also a member of some other facet deﬁned by a

positivity inequality. The regions deﬁned by the intersection

of adjacent facets are generally termed ‘ridges’, and a ridge

always has dimensionality

d −

2, where

is the dimension

on the polytope.

is a collection of all the eight ridges

adjacent to the

facet. Equivalently,

R ∈ L

if and only

can be convexly decomposed as a mixture over seven-or-

fewer (out of eight) deterministic boxes which saturate the

CHSH inequality. Each possible size-seven subset of CHSH-

inequality-saturating deterministic boxes deﬁnes one of the

eight 6-dimensional ridges comprising L

NPR

C(α)

(1 − γ)

(1 − α)

Figure 6: A depiction of a family of resources parametrized

and

, and the unique decomposition of a particular

point

(

α,γ

) in terms of a point

(

) on the chain

NPR

and a (unique) CHSH-saturating resource

that lies in

the boundary of the set of GPT-realizable common-cause

boxes. Note that the parameters

and (1

− α

) indicate

the fraction of the full line segment attributed to each

sub-segment, and similarly with γ and (1 − γ).

The (unique) relevant decomposition is shown in

Fig. 6 (for the case where k

= 0

). The proof of this

proposition is given in Appendix B.1.

7 Properties of the pre-order of

common-cause boxes

We now leverage the two monotones just introduced

to prove multiple interesting features of the pre-

order of common cause boxes.

7.1

Inferring global properties of the pre-order

Important properties of the pre-order over all re-

sources can already be learned by considering just

these two monotones (M

CHSH

and M

NPR

) and just

resources of type

(

2 2

)

, indeed, just a speciﬁc kind

of two-parameter family of resources within this set.

The kind of two-parameter family that we consider,

denoted S

(

2 2

)

⊂ S

(

2 2

)

, is

(

2 2

)

= {R(α,γ) : α ∈ [0, 1], γ ∈ [0, 1]}, (41)

where

R(α,γ)

= γ L

+ (1−γ)C(α), (42)

with C

(

)

∈ C

NPR

. There are many such families,

one for each choice of a resource L

∈ L

. Each

such family S

(

2 2

)

is the convex hull of the chain

NPR

and the associated point L

, i.e.,

(

2 2

)

= ConvexHull



, R

, L

NPR

}



. (43)

Evaluating M

NPR

for resources in this family

is straightforward, thanks to Proposition 19. The

proposition directly implies that for any R

(

α,γ

)

∈

(

2 2

)

NPR



R(α,γ)



= 2α+2. (44)

We now consider the value of M

CHSH

for resources

in this family. Noting that CHSH



R(α,γ)



≥ 2

for all R(α,γ) ∈ S

(

2 2

)

, Proposition 17 states

that M

CHSH



R(α,γ)



= CHSH



R(α,γ)



. Substitut-

ing the deﬁnition of C

(

)

from Eq. (35) into Eq. (42),

we obtain

R(α,γ) = γ L

+ (1−γ)α R

+ (1−γ)(1−α)L

NPR

Recalling that the CHSH function is linear and

that it satisﬁes CHSH(L

) = 2 for all L

∈ L

and

CHSH(R

) = 4, it follows that

CHSH



R(α,γ)



= CHSH



R(α,γ)



= 2γ + 4(1−γ)α + 2(1−γ)(1−α)

= 2α(1−γ) + 2. (45)

In Fig. 7(a), we plot some of the level

curves

for M

NPR

and M

CHSH

over any such

two-parameter family of resources. The level curve

deﬁned by M

NPR

(R) = 2α+2 is a diagonal line

in Fig. 7(a), extending from the (implicit) point

C(α) to the point L

. The level curve deﬁned

by M

CHSH

(R) = 2α(1−γ) + 2 is a horizontal line in

Fig. 7(a), extending between the two implicit points

C(α) and α R

+ (1−α)L

From these level curves, we can immediately de-

duce a number of features of the pre-order of re-

sources. In particular, we consider those features of

the pre-order that were deﬁned in Section 4.1.

First, we see that the pre-order is locally inﬁ-

nite, simply by virtue of the fact that there exist

chains which are represented by continuous sets of

A level curve of a function

is a set of points that yield the

same value of f; e.g., {x | f(x)=c}.

distinct resources, such as the chain C

NPR

. The in-

terval between any two resources in such a continu-

NPR

(a)

P R

NPR

CHSH

(b)

Figure 7: (a) A plot of the 2-parameter family of resources

(

2 2

)

(deﬁned in Eq.

(41)

), with values for

CHSH

depicted

by a set of level curves (light blue, horizontal lines) and values

for

NPR

depicted by another set of level curves (orange,

diagonal lines). (b) A plot of the same 2-parameter family of

resources, but in a Cartesian coordinate system with

CHSH

and

NPR

as the coordinates. Because all resources on the

bottom border in plot (a) are free, these all map to a single

point in (b), namely (

CHSH

, M

NPR

) = (2

2). The fact

that there are no resources with

CHSH

= 2 and

NPR

is represented by the use of a dashed line at the base of the

plot in (b). Similarly, the hatched region in (b) describes

joint values of the two monotones that are not achieved by

any resource in the family, as

CHSH

(

)

≤ M

NPR

(

) for

all

. Pictured in both plots are three illustrative resources.

The points

and

are incomparable, as are

and

while

and

are strictly ordered. This implies that the

incomparability relation in the pre-order is not transitive.

ous chain contains a continuous inﬁnity of inequiva-

lent resources.

Second, one can also see that the pre-order of re-

sources is not totally pre-ordered. For instance, the

two resources R

and R

in Fig. 7(a) are incompa-

rable, as witnessed by the fact that R

has a larger

value of M

CHSH

than R

does, but R

has a larger

value of M

NPR

than R

does. More generally, the

level curves for the two monotones allow one to im-

mediately construct (by inspection) a continuous in-

ﬁnity of such incomparable pairs.

Furthermore, the binary relation of incomparabil-

ity is not transitive, so the partial order is not weak.

This can be seen by the example of the three re-

sources in Fig. 7(a): R

and R

are incomparable

(as just argued) and R

and R

are incomparable

(by the same logic), yet R

and R

are comparable,

as evidenced by the fact that one can obtain R

from R

, by mixing R

with any free resource that

intersects the line deﬁned by the points R

and R

In addition, one can also see that the height of the

pre-order is inﬁnite. It suﬃces to note that the chain

NPR

is totally ordered and contains a continuum of

elements. The width of the pre-order is also inﬁnite.

Consider, for example, the line segment deﬁned by

the points R

and R

in Fig. 7(a). This subset of

resources constitutes an antichain, as every resource

in it is incomparable to every other: each resource

has a higher M

NPR

value and lower M

CHSH

value

than any of its neighbors towards the left, and has

a lower M

NPR

value and higher M

CHSH

value than

any of its neighbors towards the right. Because this

subset also forms a continuum, it follows that the

width of the pre-order is inﬁnite.

Also by inspection, for a given nonfree resource,

there are a continuum of chains and antichains

which contain it. In order to see this, let us ﬁrst intro-

duce some terminology. Within the plane of the two-

parameter family of resources, depicted in Fig. 8(a),

we refer to a direction from a given point R as an

“antichain direction” relative to that point, if this di-

rection lies strictly clockwise from the direction de-

ﬁned by the M

CHSH

level curve that passes through

R and strictly counterclockwise from the direction

deﬁned by the M

NPR

level curve that passes through

R. Otherwise, it is called a “chain direction”. Thus

an antichain direction relative to R is deﬁned by any

vector originating in R and terminating at a point

strictly within either yellow region in Fig. 8(a), while

a chain direction relative to R is deﬁned by any vec-

tor originating in R and terminating in either blue

region.

A one-dimensional curve of resources in this sub-

set deﬁnes a chain (antichain) if and only if at every

point on the curve, the tangent to the curve at that

NPR

(a)

P R

NPR

CHSH

(b)

Figure 8: (a) and (b) provide the same pair of depictions

of the 2-parameter family of resources

(

2 2

)

as were intro-

duced in Fig. 7. We consider a particular resource

. In

(a), we depict the level curves of

CHSH

(horizontal) and

NPR

(angled) which include

. By monotonicity of the

two monotones,

cannot be freely converted into any re-

source in the upper light-blue region or in the pair of yellow

regions. As we prove in Section 7.2.1, the two monotones

are complete for this subset, which is equivalent to the

fact that an arbitrary resource

can be freely converted

to any resource in the lower dark-blue region; namely, the

entire region wherein

CHSH

and

NPR

do not have a

value greater than the one they have on

. Resources in

the upper light-blue region can be converted to

, while

resources in the pair of yellow regions are incomparable to

point is aimed

in a chain direction (antichain di-

rection) relative to that point.

A ﬁnal lesson we learn from these two mono-

tones is that the set of all monotones induced (via

Eq. (33)) by the facet-deﬁning Bell inequalities for a

given type do not yield a complete set of monotones

for the resources of that type. We have shown that

the set of resources is not totally pre-ordered, and as

stated in Section 4.3.1, the eight facet-deﬁning Bell

inequalities for the

(

2 2

)

-scenario induce only a single

monotone: M

CHSH

. Since no single monotone can be

complete for a pre-order of resources that includes

incomparable resources, it follows immediately that

the monotones induced by the facet-deﬁning Bell in-

equalities for the

(

2 2

)

type are not suﬃcient for fully

characterizing the pre-order of resources of that type.

Since such resources trivially can be lifted to any

nontrivial Bell scenario (where the lifted resource

will violate no facet-deﬁning Bell inequalities other

than CHSH), it follows that:

Proposition 20.

The pre-ordering of resources rel-

ative to LOSR operations cannot be resolved solely

using the degree of violations of facet-deﬁning Bell

inequalities.

Proof.

By deﬁnition, any complete set of mono-

tones allows one to compute the values of any other

monotone from them

. However, although the

value of M

CHSH

(R) can be computed (for any type-

(

2 2

)

resource

) from the eight values of the facet-

deﬁning

CHSH

functionals in Eq.

(31)

, the value of

NPR

(

) cannot. This implies that any complete

set of monotones must include at least one mono-

tone (like

NPR

(

)) which depends on information

beyond the values of the eight

CHSH

functionals.

Proposition 20 shows that the nonclassicality of

common-cause processes is not completely charac-

More precisely: a line deﬁnes two opposing directions, and

both of these directions will point in a chain direction, or both

will point in an antichain direction.

If one has a set of monotones

}

which is complete, then

for a given resource

, the set of values

(

)

}

is suﬃcient

for (in principle) computing the value

(

) of any monotone

on resource

. First, one can deduce the equivalence class

from

(

)

}

; this is possible by the completeness of

the set

}

. Then, one can select any resource

from the

equivalence class of

and can evaluate

(

) for the given

monotone

. Because a monotone must assign the same

value to all resources within an equivalence class, it holds

that

(

) =

(

). (Note that our argument here does not

imply that one can in practice compute the value

(

); this

computation might involve solving a hard problem.)

terized by the monotones that are naturally associ-

ated to facet-deﬁning Bell functionals, despite the

fact that such Bell functionals are suﬃcient to wit-

ness whether or not a resource is nonclassical.

7.2 Incompleteness of the two monotones

In this section, we prove that the two-element set of

monotones {M

CHSH

, M

NPR

} is not a complete set.

We do so by showing that it is not complete even

for resources of type

(

2 2

)

A simple proof is as follows. Consider resources of

the form R

C(½) for diﬀerent choices of

the CHSH-saturating resource L

that lies in the

boundary of S

(

2 2

)

. We will show that there are pairs

of resources of this form which are strictly ordered,

and other pairs of resources of this form which are

incomparable. These facts cannot be captured by

the two monotones, which see all resources of this

form as equivalent, with M

NPR

= 3

and M

CHSH

2.5.

Consider for example the resources L

, L

, and

deﬁned in Table 3. Using the pairwise compar-

ison algorithm described in Section 5, one can ver-

ify that the resource

C(½) is strictly higher

in the order than

C(½), while the two re-

sources

C(½) and

C(½) are incom-

parable. Note that L

is a convexly extremal re-

source, while L

and L

are not.

As an aside, it is worth noting that because

the nonlocal fraction and the two standard robust-

ness measures witness exactly the same ordering

relations as M

CHSH

does (as demonstrated in Sec-

tion 4.3.2), one gains nothing by supplementing

CHSH

and M

NPR

with them. Rather, new mono-

tones are needed.

The incompleteness of the two-element set

CHSH

, M

NPR

} is also established directly from

the argument presented in Section 7.3.

7.2.1

Completeness of the two monotones for certain

families of resources

Although M

CHSH

and M

NPR

do not form a complete

set of monotones for the set of all resources of type

(

2 2

)

, it turns out that they do form a complete set

of monotones for certain subsets thereof.

Proposition 21.

The pair of monotones

CHSH

, M

NPR

}

are a complete set relative

to the subset of resources

(

2 2

)

(deﬁned in Eq.

(41)

)

for any L

∈ L

i hA

i hB

i hA

i M

CHSH

NPR

1 1 1 1 1 1 1 1 2 2

0 0 0 0 1 1 0 0 2 2

0 0 0 0 1 0 1 0 2 2

C(½) 0 0 0 0

−3

/4 3 3

C(½)

/2 3

C(½) 0 0 0 0

−3

/2 3

C(½) 0 0 0 0

−3

/2 3

Table 3: An explicit description of the resources which demonstrate the incompleteness of the pair of monotones

CHSH

, M

NPR

}

. The fact that

i =

1 for the free boxes immediately proves that these do indeed lie on the boundary

of the full set of GPT-realizable common-cause boxes of this type,

(

2 2

)

(since it implies that

0) = 0 =

0),

and hence these boxes saturate positivity inequalities).

Proposition 21 is proven in Appendix B.2. The

logic of the proof is quite simple: we prove that there

always exists a free operation τ

erase−γ

which con-

verts an arbitrary resource R

(

, γ

)

in the family

to some resource R

(

lying on the chain C

NPR

without changing the value of M

CHSH

. By convexity,

it follows that R

(

, γ

)

can be converted to any re-

source in the convex hull of R

(

, γ

)

, R

(

, L

and L

NPR

; namely, the dark-blue region in Fig. 8.

This region corresponds to the set of all resources

with a lower value of both M

CHSH

and M

NPR

. It

follows that if a conversion is not forbidden by con-

sideration of this pair of monotones, then it is achiev-

able. By the deﬁnition of completeness for a set of

monotones (see Eq. (21)), this implies that the two

monotones are indeed a complete set for this family

of resources.

7.3

At least eight independent measures of

nonclassicality

In this section, we tackle the question of how many

independent continuous monotones are required to

fully specify the partial order of resources. This is

the content of Theorem 26. Along the way to prov-

ing this result, we also prove a powerful result about

the equivalence classes under LOSR for nonfree re-

sources of type

(

2 2

)

, stated in Proposition 23.

We begin by drawing a distinction among re-

sources.

Deﬁnition 22.

A resource is said to be

orbital

its equivalence class under type-preserving LOSR is

equal to its equivalence class under LSO.

It follows that if all the resources in a set S are

orbital, then the quotient space [82] of S under the

group LSO provides a representation of the partial

order of LOSR-equivalence classes of resources in S

(despite the fact that the LOSR operations do not

themselves form a group).

This property of resources is pertinent to the dis-

cussion here because of the following result:

Proposition 23.

All nonfree resources of type

(

2 2

)

are orbital.

The proof is provided in Appendix B.3.

Note that for free resources, LOSR-equivalence

is distinct from LSO-equivalence because the LSO-

equivalence class of any resource (including a free

resource) is of ﬁnite cardinality, while the LOSR-

equivalence of a free resource is the entire set of

free resources, which is of inﬁnite cardinality. Thus,

free resources are not orbital. Moreover, the coinci-

dence between being nonfree and being orbital does

not generalize beyond the

(

2 2

)

scenario. For instance,

note that a pair of

(

2 2

)

resources, R

and R

, which

are implemented in parallel can be conceptualized

as a

(

4 4

)

resource, R

1⊗2

, by composing the two bi-

nary setting variables on the left wing into a single

4-valued setting variable on the left wing, and simi-

larly for the other setting variable and the outcome

variables. If R

is free and R

is nonfree, then R

1⊗2

is nonfree, and yet because R

’s equivalence class

is not generated by LSO, neither is the equivalence

For practical purposes, Ref. [

, App. B] provides a technical

discussion regarding how to eﬃciently select a representative

Bell inequality under a ﬁnite symmetry group; the procedure

discussed there is equally applicable for the task of eﬃciently

selecting canonical form resources. Note, however, that the

LSO symmetry group diﬀers from the Bell-polytope automor-

phism group considered in Ref. [

], in that LSO does not

include the symmetry of exchange-of-parties.

class of R

1⊗2

. Thus, R

1⊗2

is a nonfree resource that

is not orbital.

To express the next proposition, we require the

following deﬁnition.

Deﬁnition 24.

The

intrinsic dimension

of a set

of resources

, denoted

IntrinsicDim

(

), is the small-

est cardinality of continuous functions from the set

to the real numbers required to uniquely identify a

resource within S.

Proposition 25.

For any compact set

of resources

that are all orbital, the intrinsic dimension of the set

is a lower bound on the cardinality of a complete set

of continuous monotones for

(and for any superset

of S).

The proof is provided in Appendix B.4.

Recognizing that the set of nonfree resource of

type

(

2 2

)

has intrinsic dimension equal to eight,

then Propositions 23 and 25 together imply the fol-

lowing theorem:

Theorem 26.

For resources of type

(

2 2

)

, the cardi-

nality of a complete set of continuous monotones is

no less than 8.

8 Properties of the pre-order of quan-

tumly realizable common-cause boxes

The bulk of this article has considered the resource

theory which is deﬁned by taking the enveloping the-

ory of resources to be the GPT-realizable common-

cause boxes, and the free subtheory of resources

to be the classically realizable common-cause boxes.

In this section, we consider a slightly diﬀerent re-

source theory, wherein the enveloping theory of re-

sources is taken to be the common-cause boxes that

are realizable in a quantum causal model, which

we term quantumly realizable, while the free

subtheory is chosen to be, as before, the common-

cause boxes that are classically realizable. Eﬀec-

tively, the new resource theory concerns the nonclas-

sicality of common-cause boxes within the scope of

That

IntrinsicDim(S

nonfree

(

2 2

)

) = 8

is evidenced by the char-

acterization of such resources in terms of outcome bi-

ases and two-point correlators. If

indicates any type,

then

IntrinsicDim(S

nonfree

) = IntrinsicDim(S

)

whenever

6= S

free

(think of subtracting one polytope from a circum-

scribing polytope of the same dimension). See Refs.[

] for discussions on the intrinsic dimension of no-signalling

polytopes.

nonclassicality that can be achieved quantumly. In

other words, it concerns the intrinsic quantumness

of common-cause boxes.

Formally, the conditional probability distribution

associated to a quantumly realizable common-cause

box is of the same form as Eq. (1), that is,

XY |ST

(xy|st) = (r

x|s

⊗ r

y|t

) · s

, (46)

but where the vector s

is a real vector represen-

tation of a quantum state on the bipartite system

composed of quantum systems A and B, and the

sets of vectors {r

x|s

}

and {r

y|t

}

are real vector

representations of POVMs on A and on B respec-

tively. (See, e.g., Ref. [45].)

Although the conclusions we drew in Section 7.1

concerned the pre-order of GPT-realizable common-

cause boxes, analogous results hold true for the pre-

order of quantumly realizable common-cause boxes.

This is because the kind of two-parameter family

of GPT-realizable common-cause boxes that was

used to establish global features of the pre-order of

such boxes in Section 7.1 contains a two-parameter

family of quantumly realizable common-cause boxes

that can be used for the same purpose. A carica-

ture of one such quantumly realizable family is pro-

vided in Fig. 9. Speciﬁcally, if one reviews the argu-

ments that were used in Section 7.1 to establish the

various global properties of the pre-order of GPT-

realizable common-cause boxes, it becomes appar-

ent that these apply equally well to the quantumly

realizable common cause boxes.

It is also straightforward to show that the lower

bound on the cardinality of a complete set of mono-

tones, obtained in Section 7.3, also applies to the

resource theory of quantumly realizable common-

cause boxes. It suﬃces to consider the case of the

quantumly realizable resources of type

(

2 2

)

, here-

after S

(

2 2

)

, and to note that the set of nonfree re-

sources therein, that is, the set S

nonfree

(

2 2

)

(

2 2

)

, still

has intrinsic dimension equal to eight.

In the rest of this section, we consider properties

of the pre-order of quantumly realizable common-

cause boxes that are particular to the quantum case.

Unlike for the set S

(

2 2

)

, where the partial or-

der of equivalence classes has a unique element

at the top of the order (the equivalence class

of R

), in S

(

2 2

)

there is no unique element at

the top of the order. An easy way to see this is

by considering the example of the Tsirelson box

i hA

i hB

i hA

i M

CHSH

NPR

Tsirelson

0 0 0 0

√

−

√

/2 2

√

2 2

√

≈0.707 ≈0.707 ≈0.707 ≈−0.707 ≈2.828 ≈2.828

Hardy

5−2

√

5−2 5−2

√

5−2 6

√

5−13 3

√

5−6 3

√

5−6 2

√

5−5 10(

√

5−2) 4

≈0.528 ≈0.236 ≈0.528 ≈0.236 ≈0.416 ≈0.708 ≈0.708 ≈−0.528 ≈2.361

Tilt

(θ) cos(θ) 0

cos(θ)

ξ(θ)

cos(θ)

ξ(θ)

sin

(θ)

ξ(θ)

−sin

(θ)

ξ(θ)

2 ξ(θ)

(

see

caption

)

Tilt

(0) 1 0 1 1 1 0 1 0 2 2

Table 4: An explicit description of the Tsirelson resource, the Hardy resource, and a family of extremal quantum resources

(parametrized by

) which are exposed by tilted Bell inequalities [

]. We employ the shorthand

(

)

sin

(θ)+1

to allow all deﬁnitions to ﬁt within the table. We also analytically derived

NPR



Tilt

(

)



ξ(θ)(ξ(θ)−1)

2(1−cos(θ))−ξ(θ)(ξ(θ)−1)

, for

< θ ≤ π/

2. One can readily verify that

NPR



Tilt

(

)



increases with the amount of tilt (i.e.,

cos

(

)), whereas

CHSH



Tilt

(

)



= 2

2 − cos

(θ)

decreases with added tilt. The opposite behavior of the two monotones implies that

every resource in the tilted family

θ ∈

, π/

2] is incomparable to every other.

Tilt

(0) is a free resource, not violating any

Bell inequality; at the other end of the family, R

Tilt

(

) = R

Tsirelson

) and the Hardy box (R

Hardy

), each of

which is deﬁned explicitly in Table 4. Noting that

CHSH

(

Tsirelson

) =

NPR

(

Tsirelson

) = 2

√

2 ≈

828

, and that M

CHSH

(

Hardy

) = 10(

√

5−

≈

361

and M

NPR

(

Hardy

) = 4

, it follows im-

mediately that the two boxes are incomparable

since M

CHSH

(

Tsirelson

)

> M

CHSH

(

Hardy

)

while

NPR

Tsirelson

) < M

NPR

Hardy

We show these two resources in Fig. 9(a), to-

gether with an approximate sketch

of the extremal

quantumly realizable resources which interpolate be-

tween them (the light-blue curve). The values of

CHSH

and M

NPR

on all of these resources is plotted

in Fig. 9(b). From the ﬁgure, one can immediately

infer that R

Tsirelson

and R

Hardy

are incomparable.

Recall that no quantumly realizable resource can

achieve the algebraic maximum of M

CHSH

, while

some GPT-realizable (such as R

) can achieve the

maximum. In contrast to M

CHSH

, M

NPR

is such

that some quantumly realizable resources (such as

Hardy

) violate it maximally. Furthermore, whereas

maximizes both M

CHSH

and M

NPR

, no sin-

gle quantumly realizable resource maximizes both

those monotones. Therefore, a unique feature of the

enveloping theory of quantumly realizable common-

cause boxes is that inequivalent resources can si-

multaneously be maximally nonclassical (according

An analytic characterization of the set of all extremal quan-

tumly realizable resources within

(

2 2

)

is not known. In

Fig. 9(a), the endpoints and the slope of the curve at the

endpoints are exact, and the rest of the curve is merely an

interpolation.

to distinct monotones), even among

(

2 2

)

-type re-

sources.

The interpolated curve in Figs. 9(a) and 9(b)

furthermore suggests that perhaps all extremal

quantum-realizable resources depicted therein are

relatively incomparable. The following lemma gives

a powerful result regarding maximally nonclassical

resources:

Lemma 27.

If a nonfree resource

is convexly

extremal in the set

(

2 2

)

of quantumly realizable re-

sources of type

(

2 2

)

, then

is at the top of the pre-

order among quantumly realizable resources of type

(

2 2

)

Proof.

Let

R ∈ S

(

2 2

)

be nonfree and extremal in

(

2 2

)

. Then, to prove the proposition, we need only

prove that any quantumly realizable

∈ S

(

2 2

)

that

can be freely converted to

cannot be higher in

the order than

(rather, it must be equivalent).

Assume the existence of some quantumly realizable

such that

7−→R

. Since

is extremal in the

image of

under LOSR,

it must be that

converted to

through extremal operations: that

is, through LDO. But as follows from Lemma 35 in

This is justiﬁed as follows: from

7−→ R

it follows that

R ∈ P

LOSR

[R]

(

), and from the fact that quantumly realizable

boxes remain quantumly realizable under LOSR, it follows that

LOSR

[R]

(

)

⊂ S

(

2 2

)

. Finally,

is by assumption extremal

in S

(

2 2

)

; hence, it is extremal in P

LOSR

[R]

) as well.

Appendix B.3, or as can be explicitly checked,

the

NPR

Tsirelson

Hardy

(a)

NPR

CHSH

Tsirelson

Hardy

√

(b)

Figure 9: (a) and (b) provide the same pair of depictions

of the 2-parameter family of resources

(

2 2

)

as were in-

troduced in Fig. 7. Here, we provide a caricature of some

ordering relations among quantumly realizable common-

cause boxes within this 2-parameter family. We depict the

Tsirelson and Hardy boxes (with scaled-up values of the

monotones, but accurate ordering of these values), together

with a guess of what the boundary of the set of quantumly

realizable resources within this 2-parameter family might be

(dotted blue curves). In (b), we also depict the values of the

two monotones for the set of convexly extremal, quantumly

realizable resources which are self-tested by the tilted Bell

inequalities (smooth black curve).

One can explicitly check that all extremal

(

2 2

)

-type resources

are mapped to the free set by any deterministic operation

which is not a symmetry, which implies by convexity that all

(

2 2

)

-type resources are also mapped to the free set by these

operations.

image of any

(

2 2

)

-scenario resource is free under any

deterministic operation which is not a symmetry!

Put another way, there is no preimage of any nonfree

(

2 2

)

-scenario resource among

(

2 2

)

-scenario resources

under deterministic nonsymmetry operations. This

means that the only

τ ∈ LDO

(

2 2

)

→

(

2 2

)

such that con-

ceivably

τ ◦ R

are symmetry operations. As

such, if

is a nonfree extremal quantumly realizable

resource of type

(

2 2

)

, the only quantumly realizable

resources (of the same type) which can be converted

are symmetries of

. Since resources related by

a symmetry operation are in the same equivalence

class, there are no

(

2 2

)

-type quantumly realizable

resources strictly above R in the partial order.

Lemma 27 allows us to conclude the following:

Proposition 28.

There exists a continuous set of

resources that are at the top of the pre-order of quan-

tumly realizable

(

2 2

)

resources, and wherein each re-

source is incomparable to every other resource in the

set.

Proof.

Lemma 27 states that any subset of resources

which are extremal in

(

2 2

)

are at the top of the

pre-order of quantumly realizable

(

2 2

)

resources. The

fact that one can ﬁnd a continuous set of such re-

sources follows from the well-known fact that

(

2 2

)

is not a polytope. By furthermore choosing such

a set of extremal resources for which

CHSH

takes

a distinct value for every resource in the set, one

additionally guarantees that no two of these top-of-

the-order resources are in the same equivalence class,

and hence each must be incomparable to every other

in the set. Refs. [

] provide some explicit

sets of resources satisfying these criteria.

As one concrete example, consider the one-

parameter family of quantumly realizable resources

which are self-tested by the tilted Bell inequalities.

We denote this family by {R

Tilt

(

) :

θ ∈

, π/

The deﬁnition of R

Tilt

(

)

is given in Table 4. These

resources are related to a corresponding family of

tilted Bell functionals [84, 85, 88, 89], parametrized

by β ∈ [0, 2], namely,

TiltedCHSH

(R)

= βhA

i + hA

+ hA

i + hA

i − hA

where max

∈S

free

(

2 2

)

TiltedCHSH

) = 2 + β,

and where max

∈S

(

2 2

)

TiltedCHSH

) =

8 + 2β

Note that the only value of β for which the maxi-

mum value of this function over the quantumly real-

izable set S

(

2 2

)

coincides with the maximum value

over the free set S

free

(

2 2

)

is β

= 2

. Whenever β <

, the

resource R

Tilt

(

)

for θ deﬁned implicitly by the equa-

tion β

√

1+2 tan

(θ)

is the unique maximizer over

(

2 2

)

of the corresponding tilted Bell functional.

Formally,

β =

1 + 2 tan

(θ)

< 2 implies

TiltedCHSH

) < TiltedCHSH



Tilt

(θ)



for any R

∈ S

(

2 2

)

Tilt

(

)

. It follows that every

resource R

Tilt

(

)

is convexly extremal in the set

of quantumly realizable resources, and its extremal-

ity is exposed by the corresponding tilted Bell func-

tional.

In fact, every resource in this family is incompa-

rable to every other in the family, as can be shown

directly by considering the values of M

CHSH

and

NPR

. In Fig. 9(b), we show a plot of the values

of the two monotones evaluated on this family. The

points form a continuous antichain, shown in black.

Note that the family of resources {R

Tilt

(

) :

θ ∈

, π/

} does not lie in any plane in the linear space

of resources, and as such we do not attempt to plot

the family directly (rather we only plot its valua-

tions with respect to the two monotones).

9 Conclusions and outlook

We have conceptualized Bell experiments as

common-cause ‘box-type’ processes: bipartite or

multipartite processes with classical variables as in-

puts and outputs, the internal causal structure of

which is a common-cause acting on all of the wings

of the experiment. We have argued in favour of this

conceptualization by appeal to the fact that Bell’s

theorem can be regarded as implying the need for

nonclassicality in the causal model that underlies

the process. We have begun to quantify the nonclas-

sicality of such common-cause box-type processes by

developing a resource theory thereof. We have ar-

gued in favour of a particular choice of the free oper-

ations for this resource theory, namely, those which

can be achieved by embedding the resource into a cir-

cuit consisting of box-type processes realizable with

a classical common cause, and we have shown that

this set is equivalent to the set of local operations

and shared randomness.

We have focused here on characterizing the pre-

order deﬁned by single-copy deterministic conver-

sion of resources under the free operations. We have

provided a linear program that decides how any two

resources are ordered. By leveraging a pair of func-

tions that we have proven to be monotones, we have

also established a number of properties of this pre-

order, such as the fact that it contains incompara-

ble resources, that it has inﬁnite width and height,

that it is locally inﬁnite, and that the incompara-

bility relation is not transitive. Moreover, despite

the fact that the values of the facet-deﬁning Bell

functionals are necessary and suﬃcient for witness-

ing the nonclassicality of a common-cause box, we

have shown that they are not suﬃcient for quanti-

fying the nonclassicality of a common-cause box. In

other words, there are aspects of the nonclassical-

ity of such boxes relevant to resource conversions

that are not captured by the degree of violation of

the facet-deﬁning Bell inequalities. For the particu-

lar case of resources with two binary inputs and two

binary outputs, we moreover showed that at least

eight continuous monotones are required to fully

specify the pre-order among resources. We have also

derived some interesting facts about the pre-order of

resources when one restricts attention to common-

cause boxes that can be realized in quantum theory.

In particular, we have shown that for quantumly re-

alizable resources of type

(

2 2

)

, all convexly extremal

resources are at the top of the pre-order of such re-

sources, and that there are an inﬁnite number of

incomparable resources at the top of this pre-order.

There is much scope for advancing and generaliz-

ing our work, some examples of which we now de-

scribe.

One of the most fundamental problems that is

yet to be solved is that of characterizing the equiv-

alence classes of resources in the pre-order induced

by single-copy deterministic conversion. That is, one

would like a compressed representation of each re-

source that includes all and only information that

is relevant to determining its equivalence class in

this pre-order. Finding such a representation would

be the analogue within our resource theory of prov-

ing that the equivalence classes of pure bipartite en-

tangled states under LOCC [90] are given by the

Schmidt coeﬃcients of the state. All resource mono-

tones could then be eﬃciently expressed in terms of

this compressed representation, while all other pa-

rameters of a resource could be safely ignored.

Even among resources of type

(

2 2

)

(much less for

resources of arbitrary type), we do not have a com-

plete set of monotones for this pre-order.

Another

interesting open problem is to connect the existing

monotones to ﬁgures of merit for interesting opera-

tional tasks. E.g., does the value of the monotone

CHSH

determine the extent to which a given re-

source can be used for key distribution or random-

ness generation [6–14]? Since the monotone M

NPR

is maximized for high-bias boxes from the R

Tilt

(

)

family (and by the Hardy box) as opposed to by the

Tsirelson box, M

NPR

is likely a ﬁgure of merit for

operational tasks where the advantage is provided

by such correlations [88, 91].

Note that in deriving our results about properties

of this pre-order, we have not needed to consider any

types of resource beyond

(

2 2

)

, that is, it has suﬃced

to consider Bell experiments of the CHSH type. It

may be that more nuanced features of this pre-order

only become apparent for more general types of re-

sources.

An obvious generalization of our work is to con-

sider the pre-order induced by diﬀerent sorts of

conversion relations, such as indeterministic single-

copy conversion

, multi-copy conversion, asymp-

Although considerations of the examples given in Section 7.2

might provide the intuition necessary to ﬁnd such a complete

set for resources of type

(

2 2

)

Indeterministic single-copy conversion is single-copy conver-

sion that makes use of a post-selection. Therefore, to contem-

plate this notion of conversion for our resource theory is to

contemplate expanding the set of free operations from LOSR

to LOSR with post-selection. However, LOSR with postse-

lection can map a correlation

XY |ST

that satisﬁes the Bell

inequalities to one that violates them, and even to one that

violates the no-signalling condition. (This is in contrast to

the situation with LOCC, where allowing postselection does

not change the set of states that one can prepare for free.)

Consequently, what sort of correlation is consistent with a

classical common cause—and hence what should be deemed

free in a resource theory of nonclassicality of common cause

boxes—becomes contingent on what sort of postselection was

implemented. For example, in a Bell experiment wherein de-

totic conversion, and conversion in the presence of

a catalyst (see Refs. [26, 92, 93] for a discussion of

these diﬀerent notions, and Refs. [94–98] for relevant

examples of such generalized conversions).

Other generalizations require changes to the en-

veloping theory of resources one is considering. We

have noted that our deﬁnition of the free opera-

tions can easily be extended to deﬁne a resource

theory of nonclassicality for box-type processes in

more general causal structures, distinct from that

of a Bell experiment. For example, as discussed

in Appendix. A.3, it can be extended to a sce-

nario we term the triangle-with-settings scenario [59,

Fig. 8], of which the much-studied ‘triangle sce-

nario’ [99–102] is a special case. Another example

would be to extend our deﬁnition to the ‘bilocality

scenario’ [59, 103–107]. The analysis of such cases

is complicated by the fact that our proposal im-

plies that the set of free operations is not convex

for them. Another such generalization would be to

causal structures wherein there are cause-eﬀect re-

lations between diﬀerent parts of the experiment,

for instance, experiments involving sequences of non-

destructive measurements on parts of a shared re-

source, such as the causal structure known as the

‘instrumental scenario’ [45, 108–112].

A generalization of our resource theory in a diﬀer-

ent direction is to consider processes whose inputs

and outputs are not classical (i.e., processes that

are not ‘box-type’), but rather describe quantum or

post-quantum systems. For the case of the common-

cause structure which we focused on here, a quan-

tum resource theory of this sort would subsume en-

tanglement theory, but where quantum correlation

is deﬁned relative to the set of local operations and

shared randomness (LOSR) rather than local oper-

ations and classical communication (LOCC).

10 Acknowledgments

The authors acknowledge useful discussions with

Jonathan Barrett, Tobias Fritz, Tom´aˇs Gonda and

tectors are not perfectly eﬃcient, postselecting on detection

can induce Bell inequality violations even in the absence of

a nonclassical common cause. However, for a given value of

the detection eﬃciency, this might only be able to explain a

particular degree of violation, while any higher violation would

still attest to the presence of a nonclassical common cause. In

such a context, the boundary between the correlations that

are consistent with a classical common cause and those that

are not would no longer coincide with the facets of the Bell

polytope. Consequently, even deﬁning the free set of resources

becomes quite complicated when postselection is allowed.

Denis Rosset. D.S. is supported by a Vanier Canada

Graduate Scholarship. R.K. is supported by the

Program of Concerted Research Actions (ARC) of

the Universit´e libre de Bruxelles. This research

was supported by Perimeter Institute for Theo-

retical Physics. Research at Perimeter Institute is

supported in part by the Government of Canada

through the Department of Innovation, Science and

Economic Development Canada and by the Province

of Ontario through the Ministry of Colleges and

Universities. This publication was made possible

through the support of a grant from the John Tem-

pleton Foundation. The opinions expressed in this

publication are those of the authors and do not

necessarily reﬂect the views of the John Temple-

ton Foundation. ABS acknowledges support by the

Foundation for Polish Science (IRAP project, IC-

TQT, contract no. 2018/MAB/5, co-ﬁnanced by EU

within Smart Growth Operational Programme).

Appendices

A Comparing our framework with prior work

Correlations that violate Bell inequalities have become an important object of study, not only for their rele-

vance in foundational aspects of quantum theory, but also for their role as a resource in quantum information-

processing tasks [6–14]. Hence, particular eﬀort has been devoted to the formulation of a resource theory

describing them [15, 16, 18, 19]. Two sets of free operations have previously been proposed to deﬁne such

a resource theory, namely LOSR [16–19] which we have developed in the main text, but also wirings and

prior-to-input classical communication (WPICC) [15].

In this section, we assess WPICC from the lens of our resource theory, and we identify an inconsistency

among previous proposals for the deﬁnition of LOSR. The primary diﬀerences between our approach and

previous approaches become most evident when one considers the question of how to develop such a resource

theory for more general causal structures, as we discuss further on in Appendix A.3.

A.1 WPICC versus LOSR as the set of free operations

The set of WPICC operations allows for classical causal inﬂuences among the wings prior to when the parties

receive their inputs. An example of a free operation in the WPICC approach is depicted in Fig. (10). If one

seeks to understand the resource as nonclassicality of common-cause processes, as we do here, then it is

clear that the free operations should not include any cause-eﬀect inﬂuences between the wings, and therefore

should not include any classical communication between the wings. In other words, in our approach, WPICC

is not a viable choice for the set of free operations, as wirings that connect diﬀerent wings of the experiment

cannot be part of any free operation.

One might think that the choice to take WPICC or LOSR to be the set of free operations is not a

particularly consequential one, since WPICC and LOSR deﬁne the same partial order for boxes [18] (see

the discussion of this point in Sec. A.2.1). This equivalence breaks down, however, when one considers more

general resources, e.g., bipartite quantum states. Since bipartite quantum states have no inputs, allowing

classical communication prior to inputs means allowing arbitrary classical communication. Hence, WPICC

coincides with LOCC in this case, and LOCC deﬁnes a partial order on bipartite quantum states that is

distinct from the partial order deﬁned by LOSR [72].

A.2 An oversight in the literature concerning how to formalize LOSR

As we noted in the Introduction and in Section 2, the intuitive notion that the set of free operations should

constitute local operations supplemented by shared randomness is widely agreed upon in previous work [15–

22]. Nonetheless, some prior work seems to have formalized this intuitive notion incorrectly. Speciﬁcally, the

set of free operations deﬁned in Ref. [18] (and repeated in Refs. [20, 21]) does not coincide with the set

of free operations deﬁned in Refs. [16, 17] and which we endorse here as the correct choice. Rather, it is a

nonconvex subset thereof, as we will show here. (Note that Ref. [18] referred to their set of free operations

as “LOSR” but we will here reserve that term for the set of operations described in Deﬁnition 5.)

We suspect that the discrepancy in the deﬁnitions introduced in these papers was merely an oversight,

and in particular, that none of the authors of these articles would advocate for this nonconvex subset over

the full set. Nonetheless, we think that it is important to highlight this oversight, so that it may be avoided

in future work.

It is easiest to see the diﬀerence between the deﬁnition of the free operations given in Ref. [18] and the

one endorsed here (which coincides with the deﬁnitions of Refs. [16, 17]) by considering the diagrammatic

representation of a generic operation in each case. The most general free operation proposed by Ref. [18]

is depicted in their Fig. 1(a), which we reproduce here as Fig. 11, which should be compared with Figs. 3

and 4 of our article. The diﬀerence is that in Fig. 11, the side-channels on each wing that carry information

forward from the pre-processing to the post-processing are limited to carry information only about the setting

Figure 10: An example of a free operation in the WPICC approach, using the diagrammatic conventions of this article.

(Compare with Fig. 1(b) of Ref. [

].) Here, we see an example in which there is communication from the left wing to the

right wing, which (in contrast to our approach) is allowed for free in the WPICC approach, for all times prior to when the

wings receive the inputs S

and T

Figure 11: A depiction of Fig. 1(a) of Ref. [

] using the diagrammatic conventions of this article. The set of operations

having this form is not as general as those depicted in our Fig. 4 because the post-processing does not have complete access

to the shared randomness available at the pre-processing. One can explicitly show that the set of operations having this

form is not convex. It is only after taking the convex closure of the set of operations depicted here that one recovers LOSR.

variables (S, S

, T and T

), while in Figs. 3 and 4, they can also carry information about the common cause

that acts on the local pre-processings.

This diﬀerence is also reﬂected in the equations. The most general free operation proposed by Ref. [18] is

deﬁned via their Eq. (7). In terms of the notation of this article, their Eq. (7) asserts that

ST |XY S

= P

|XY ST S

ST |S

, (47)

where P

ST |S

(denoted I

(L)

in Ref. [18]) and P

|XY ST S

(denoted O

(L)

in Ref. [18]) represent, respec-

tively, the pre- and post-processings (depicted in Fig. 11). Consistently with their Fig. 1(a), the expression

for the post-processing stipulates that the side-channel between the pre- and post-processings only carries

information about the setting variables S, S

, T and T

. The analogue of this equation for the proposal

endorsed here is

ST |XY S

|XY Z

ST Z

, (48)

where Z

and Z

represent the variables propagated along the side-channels in Fig. 4. This is more general,

given that Z

and Z

can encode information about the common cause in the pre-processing.

To see the diﬀerence more explicitly, consider how these expressions appear if one includes the common

causes. Because the pre- and the post-processings in the proposal of Ref. [18] must depend on independent

sources of shared randomness (by virtue of the restriction on the side-channels), we distinguish the common

causes notationally using primed and unprimed variables. The post-processing is given by

|XY ST S

|XSS

|Y T T

, (49)

and the pre-processing is given by

ST |S

S|S

T |T

. (50)

Putting these together, we have

ST |XY S

ΛΛ

|XSS

|Y T T

S|S

T |T

(51)

By contrast, the proposal endorsed here distributes a single source of shared randomness between the pre-

and post-processings. If we consider the circuit depicted in Fig. 4 and note that the side-channels can now

feed forward not just S, S

, T and T

, but the common cause as well, we see that we can express the most

general free operation as follows (which is equivalent to Eq. (9))

ST |XY S

|XSS

|Y T T

S|S

T |T

. (52)

The operational discrepancy between the two proposals is a consequence of the fact that Eq. (51) is strictly

less general than Eq. (52).

One can intuitively expect a failure of convex closure for the set of operations depicted in Fig. 11 and

described in Eq. (51), since the pre-processing and the post-processing have access to independent sources of

shared randomness, and these two sources cannot generally be subsumed into a single source. To explicitly

demonstrate the failure of convexity, we consider the following operations:

= P

ST |XY S

= δ

S,0

T,0

= P

ST |XY S

= δ

S,1

T,0

(τ

+ τ

While τ

and τ

are each free operations which can be realized using the circuit in Fig. 11, the transformation

deﬁned by their mixture cannot be realized using this circuit. To see this, note that in Fig. 11, any

correlations between S and Y

can only be mediated by T , since the only variable in the causal past of both

S and Y

is the variable acting as the common cause of the pre-processing, which we will denote by

, and

the only means by which the value of

could be communicated via the side channel is through T . But in

this example, T does not vary and so cannot mediate any correlations; the point distribution on T screens oﬀ

any correlation between S and Y

. Hence, τ

, which exhibits perfect correlation between S and Y

, cannot

be realized in a circuit of the form of Fig. 11. It follows that the set of operations depicted in Fig. 11 is not

convexly closed.

Despite the deﬁnition of the free operations given in Ref. [18], in Appendix A of that article, the authors

avail themselves of convex mixtures of operations of the sort described by their Fig. 1(a) and Eq. (7). However,

a mixing operation is only allowed if the shared randomness required to implement it is present, and given

that the shared randomness available for the pre-selection is independent from that which is available for the

post-selection, an arbitrary mixing operation is not allowed under the free operations proposed by Ref. [18].

The use of convex mixtures in Appendix A of Ref. [18] is therefore inconsistent with the deﬁnition of the

free operations provided therein.

The mistake of deﬁning the free operations as this nonconvex subset of LOSR is repeated in Ref. [21]:

Fig. 1 and Eq. (13) therein are reproductions of Fig. 1(a) and Eq. (7) of Ref. [18], and, like the latter,

limits the side-channels to carry information only about the setting variables. It is also repeated in Ref. [20],

where the formalization of a “noncontextual wiring” per Eq. (9) there utilizes a post-processing with random-

ness independent from that of pre-processing, again restricting the side-channels to exclusively information

pertaining to the setting variables.

The above discussion has highlighted the fact that if one wishes the set of free operations to include

arbitrary convex mixtures of some smaller set, it is important that it be stipulated precisely how the shared

randomness is distributed in order to ensure the possibility of such mixing. In this regard, although de

Vicente [16] provided a deﬁnition of the free operations that is equivalent to LOSR, the physical justiﬁcation

for this choice was wanting. Speciﬁcally, the deﬁnition in Ref. [16] proceeds by enumerating a long list

of nominally ‘elementary’ operations and then stating (in Section 4.1 of that article) that any mixture of

these operations is also allowed. No discussion is provided of why the type of shared randomness necessary

for achieving arbitrary mixtures should be considered freely available. The work of Geller and Piani [17], by

contrast, does stipulate the physical structure of the circuit that deﬁnes the free operations, thereby providing

a physical justiﬁcation for taking LOSR as the set of free operations.

A.2.1 Previous results in light of this oversight

Given that some previous work [18, 20, 21] formally deﬁned the set of free operations by Eq. (47), which

yields a nonconvex subset of LOSR, one might wonder to what extent the results reached by those works

still hold for LOSR proper, as deﬁned in Eq. (48). In the following, we will brieﬂy comment on some results

described in Refs. [18] and [21].

Lemma 6 of Ref. [18] purports to demonstrate that if a function is a monotone relative to a set of operations

that the authors term “LOSR”, then it is also a monotone relative to WPICC. If one interprets the set of

operations termed “LOSR” by the authors of Ref. [18] in the manner of the deﬁnition stipulated by their

Fig. 1(a) or their Eq. (7) (which is equivalent to Eq. (47) above), namely, as a nonconvex subset of LOSR

proper, as deﬁned in Eq. (48), then the question would arise as to whether an analogous lemma holds for

LOSR proper rather than simply the nonconvex subset thereof. In fact, however, the proof of Lemma 6 in

Ref. [18] assumes that the set of free operations can map a resource R to a convex combination of R with

any local box. This is not possible if the set of free operations is the one deﬁned by their Fig. 1(a) or Eq. (7)

(or equivalently, by Eq. (47) above). Hence, the proof of Lemma 6 holds only if the set of free operations

termed “LOSR” in the statement of the lemma is taken to be LOSR proper, as deﬁned in Eq. (48), and not

the nonconvex subset of LOSR deﬁned by Eq. (47). This fact provides yet another piece of evidence that the

nonconvexity of the formal deﬁnition of the free operations in Ref. [18] was merely an oversight. The bottom

line is that the proofs provided in Ref. [18] do establish that monotonicity relative to LOSR proper implies

monotonicity relative to WPICC.

Kaur et al. [21] state in their Proposition 6 that their proposed “intrinsic non-locality” measure is mono-

tonically nonincreasing under the set of free operations they term “LOSR”. But given that their deﬁnition

of this term is precisely the same as the deﬁnition provided in Ref. [18], the set of operations in question is

the nonconvex subset of LOSR deﬁned by Eq. (47). This prompts the question of whether this proposition

holds if one considers LOSR proper, as deﬁned in Eq (48), rather than this nonconvex subset thereof.

The answer is that it does. Establishing this is nontrivial, however, as an arbitrary monotone relative to

the nonconvex subset of LOSR deﬁned by Eq. (47) need not be a monotone relative to LOSR. Note, however,

that if

(i) a function f is a monotone relative to LDO, and

(ii) f happens to be a convex function,

then f is also a monotone relative to LOSR, as a consequence of Proposition 15. Since LDO is contained

within the nonconvex subset of LOSR deﬁned by Eq. (47), convex monotones relative to those limited

operations are also valid monotones relative to LOSR proper. Finally, we can use this implication to recover

Ref. [21]’s Proposition 6 by leveraging Proposition 7 there regarding the convexity of “intrinsic non-locality”

over box-type resources.

A.3 Generalizing from Bell scenarios to more general causal structures

In the introduction, we contrasted our approach to deﬁning a resource theory, which we termed the causal

modelling paradigm, with a pre-existing approach, which we termed the strictly operational paradigm. Con-

sidering causal scenarios beyond Bell scenarios helps to clarify the diﬀerences between these two approaches.

Consider, for instance, a tripartite box-type process, with setting variables for the three wings denoted S,

T , and U , and outcome variables for the three wings denoted X, Y , and Z respectively. One can distinguish

two distinct causal structures that could underlie this sort of process: (i) the tripartite Bell scenario, where

there is a common cause acting on all the three wings, depicted in Fig. 12, and (ii) the triangle-with-settings

scenario [59, Fig. 8], where there is a common cause for each pair of wings, depicted in Fig. 13.

(a) (b)

Figure 12: The distinction between (a) a generic box in the tripartite Bell scenario and (b) a classical box in this scenario.

(a) (b)

Figure 13: The distinction between (a) a generic box in the triangle-with-settings scenario and (b) a classical box in this

scenario.

Consider the case of a generic box in the tripartite Bell scenario, depicted in Fig. 12(a), and label the

systems distributed to the three wings by A, B and C respectively. Let us denote by r

x|s

the GPT represen-

tation of the X

x outcome of the S

s measurement on system A, and similarly deﬁne r

y|t

and r

z|u

. If

ABC

denotes the GPT state of the composite ABC, then the conditional probability distribution associated

to this box is

XY Z|ST U

(xyz|stu) = (r

x|s

⊗ r

y|t

⊗ r

z|u

) · s

ABC

. (53)

When the GPT is classical probability theory, we obtain the classically-realizable box shown in Fig. 12(b),

and the conditional probability distribution associated to it is

XY Z|ST U

(xyz|stu)

,λ

X|SΛ

(x|sλ

Y |T Λ

(y|tλ

Z|U Λ

(z|uλ

(λ

, λ

)

X|SΛ

(x|sλ)P

Y |T Λ

(y|tλ)P

Z|U Λ

(z|uλ)P

(λ). (54)

Now consider a generic box in the triangle-with-settings scenario, depicted in Fig. 13(a). Instead of an

arbitrary joint GPT state s

ABC

on the triple of systems associated to the three wings, each system is

composed of two parts—A is composed of A

and A

, and similarly for B and C—and the joint GPT state

has the form s

⊗ s

. The conditional probability distribution associated to this box is

XY Z|ST U

(xyz|stu) = (r

x|s

⊗ r

y|t

⊗ r

z|u

) · (s

⊗ s

). (55)

When the GPT is classical probability theory, we obtain the classically-realizable box shown in Fig. 13(b).

Taking Λ

= (Λ

, Λ

), and similarly for Λ

and Λ

, we have

XY Z|ST U

(xyz|stu) =

,λ

X|SΛ

(x|sλ

Y |T Λ

(y|tλ

Z|U Λ

(z|uλ

)

× P

(λ

, λ

(λ

, λ

(λ

, λ

)

λ,λ

,λ

X|SΛΛ

(x|sλλ

Y |T ΛΛ

(y|tλλ

Z|U Λ

(z|uλ

)

× P

(λ)P

(λ

). (56)

Figure 14: The free operations for a tripartite Bell scenario,

ST U|XY ZS

, taking a tripartite common-cause box

XY Z|ST U

to a new such box P

As we see, the form of the GPT-realizable boxes in the tripartite Bell scenario diﬀers from the form of the

GPT-realizable boxes in the triangle-with-settings scenario. Similarly for the form of the classically realizable

boxes. These diﬀerences have consequences when one compares the strictly operational paradigm with our

causal modelling paradigm, as we argue next.

Figure 15: The free operations for the so-called triangle-with-settings scenario,

ST U|XY ZS

, taking a triangle-

with-settings box P

XY Z|ST U

to a new such box P

We begin by considering what each paradigm implies for the deﬁnitions of the free and enveloping sets of

resources for each scenario.

For the tripartite Bell scenario, the deﬁnitions of both the enveloping process theory and the free subtheory

of processes that are natural from the perspective of the causal modelling paradigm can also be expressed

in a way that is natural within the strictly operational paradigm. Speciﬁcally, the boxes in the enveloping

theory, which we take to be those that are realizable in a GPT causal model of this scenario (formalized in

Eq. (53)), can also be characterized as those that are nonsignalling between the wings. Similarly, the boxes

in the free subtheory, which we take to be those that are realizable in a classical causal model of this scenario

(formalized in Eq. (54)), can also be characterized as those that are mixtures of deterministic boxes which

are nonsignalling between the wings.

For the triangle-with-settings scenario, on the other hand, the set of boxes realizable in a GPT causal model

for that scenario (formalized in Eq. (55)), is a strict subset of the boxes that are nonsignalling between the

wings, and the set of boxes that are realizable in a classical causal model for that scenario (formalized in

Eq. (56)) is a strict subset of the set of boxes that are mixtures of deterministic boxes that are nonsignalling

between the wings. In both the enveloping theory and the free subtheory, the set of boxes is characterized via

nontrivial inequalities in addition to merely the equalities that represent the no-signalling constraints. See

Ref. [100] for a discussion of these inequalities in the special case of trivial setting variables. Consequently,

within the causal modelling paradigm, the resource theory associated to the triangle-with-settings scenario

and the resource theory associated to the tripartite Bell scenario diﬀer in both the choice of enveloping

theory and free subhteory. Within the strictly operational paradigm, however, it is unclear whether there is

any natural way to pick out the enveloping theory and free subtheory that the causal modelling paradigm

dictates for the triangle-with-settings scenario because it is unclear whether there is any natural way of

picking these out by referring merely to the input-output functionality of the boxes.

Now, we shift our attention to what each paradigm implies for the deﬁnitions of the free operations in each

scenario. We will show that the deﬁnitions that are natural within the causal modelling paradigm cannot be

easily motivated within the strictly operational paradigm.

The free operations prescribed by the causal modelling paradigm for the tripartite Bell scenario are depicted

in Fig. 14. They are of the form

ST U |XY ZS

stu|xyzs

)

S|XS

s|xs

λ)P

T |Y T

t|yt

λ)P

U|ZU

u|zu

λ)P

(λ), (57)

which is clearly a convex set. This, we believe, is the appropriate deﬁnition of local operations and shared

randomness for three parties.

This scenario does not show much diﬀerence with what would be natural in the strictly operational

paradigm, because one can motivate taking this set of operations to be free on the grounds that they take

nonsignalling boxes to nonsignalling boxes (even though, as in the case with the bipartite Bell scenario, the

set of WPICC operations between the three wings can also be motivated in this way).

It is the free operations in the triangle-with-settings scenario that really distinguishes the causal modelling

paradigm from the strictly operational paradigm.

The free operations prescribed by the causal modelling paradigm for the triangle-with-settings scenario

are depicted in Fig. 15. They are of the form

ST U |XY ZS

stu|xyzs

)

λ,λ

,λ

S|XS

ΛΛ

s|xs

λλ

T |Y T

t|yt

U|ZU

ΛΛ

u|zu

λλ

)

× P

(λ)P

(λ

). (58)

Note that this is not a convex set. Furthermore, since a triple of pairwise common causes can be simulated by

a triplewise common cause, the free operations deﬁned in Eq. (58) are a strict subset of the tripartite LOSR

operations deﬁned in Eq. (57). It follows that, just as we saw for the free boxes in the triangle-with-settings

scenario, one cannot motivate the free operations deﬁned in Eq. (58) by appeal to the no-signalling principle.

And, again just as we noted for the free boxes, it is unclear how such a choice could ever be motivated by a

principle that appealed only to the input-output functionality of the operation.

The triangle-with-settings scenario also illustrates why one should not mathematically impose convex

closure of the set of free operations, as was done in Refs. [18]. Rather, whether or not the set of free

operations is convexly closed depends on the causal structure, which speciﬁes precisely how randomness is

shared among the parties. For Bell scenarios, the set of free operations is convex by construction, whereas for

other causal structures, such as the triangle-with-settings scenario, it is not. Mathematically imposing convex

closure in the triangle-with-settings scenario would be equivalent to asserting that there was a common cause

for all three wings, which would constitute a change in the causal structure being considered. In other words,

imposing convexity in an ad-hoc manner contradicts the foundations of the causal modelling paradigm,

where it is the causal structure that speciﬁes how randomness is shared among the parties, and consequently

speciﬁes whether or not convexity holds.

Note ﬁnally that the lack of convexity in general causal structures (such as the triangle-with-settings

scenario) implies that the project of quantifying nonclassicality in these cases will be much more complicated

than it was in the Bell scenario.

B Proofs

B.1 Proof of Proposition 19: closed-form expression for M

NPR

(R)

In this section, we present some arguments that aid in justifying Proposition 19, the proof of which is given

at the end of this appendix. Recall Proposition 19:

Proposition 19.

For any free resource

of type

(

2 2

)

NPR

(

) = 2. For any nonfree resource

of type

(

2 2

)

, there is a unique

k ∈ {

, . . . ,

}

for which

CHSH

(

)

2. Within this region, if

R ∈ C

NPR,k

, then we

have simply M

NPR

(R) = CHSH

(R). If, on the other hand, R 6∈ C

NPR,k

, we have

NPR

(R) = 2α+2,

where

is the value appearing in the decomposition

γ L

+ (1

−γ

)

(

), where

(

)

∈ C

NPR,k

∈ L

and

γ ∈

1]. This value of

is unambiguous because there exists a unique resource

∈ L

and a unique choice of γ ∈ [0, 1] and of α ∈ [0, 1] such that R = γ L

+ (1−γ)C

(α).

The (unique) relevant decomposition is shown in Fig. 6 (for the case where k = 0).

We ﬁrst demonstrate the equivalence of three statements which pertain to the value of M

NPR

(

)

for the

subset of resources that satisfy CHSH(R) ≥ 2:

Proposition 29.

For any resource

of type

(

2 2

)

such that

CHSH

(

)

≥

2, the following deﬁnitions are

equivalent to M

NPR

(R):

min

0≤α≤1

{CHSH(C(α)) such that C(α) 7−→R}, (59a)

min

CHSH(C(α)) such that ∃γ ≥ 0 and ∃L

∈ L

with R = γ L

+ (1−γ)C(α)

, (59b)

(

if R ∈ C

NPR

: CHSH(R), else

if R 6∈ C

NPR

: 2α+2, where α, γ ≥ 0, and L

∈ L

are all unique

in the decomposition R = γ L

+ (1−γ)C(α).

(59c)

Proof of Eq. (59a).

Eq.

(59a)

is directly equivalent to the deﬁnition of the

NPR

monotone given in Eq.

(36)

. Hence, we take

that as our starting point, and prove the implications of the subequations in Proposition 29.

Proof that Eq. (59a) ⇔ Eq. (59b).

Section 5 guarantees that

(

)

7−→R

if and only if we can generate R by convex mixtures of

(

) with the

images of

(

) under LDO operations. For any

R /∈ C

NPR

such that

CHSH

(

)

≥

2, we simplify the situation

by proving that if

can be generated by mixing

(

) with its images under LDO, then

can alternatively

be generated by mixing

(

) with a local point which saturates the CHSH inequality; namely, a point in

as stated in Eq. (59b). To prove this, it is useful to deﬁne the notion of a screening-oﬀ inequality.

Deﬁnition 30.

The inequality

(

)

≥ b

is said to

screen-oﬀ

the ﬁxed-type set of resources which satisfy it,

i.e.,

R : f(R) ≥ b , [R] =

(

|X| |Y |

|S | |T |

)

, if the ﬁxed-type set of resources which saturate it is a free set, i.e., if

R : f(R) = b , [R] =

(

|X| |Y |

|S | |T |

)

consists only of classically realizable common-cause boxes.

For example, the inequality

CHSH

(

)

≥

2 screens-oﬀ the set



R : CHSH(R) ≥ 2 , [R] =

(

2 2

)



since



R : CHSH(R) = 2 , [R] =

(

2 2

)



⊂ S

free

(

2 2

)

Screening-oﬀ inequalities are useful when making statements about resource convertibility, as follows.

Consider the case where we ask whether

7−→R

: if

lies inside some screened-oﬀ region, then, given

Proposition 15,

∈ P

LOSR

]

(

) if and only if

is in the convex hull of those images of

under LDO

inside the screened-oﬀ region, together with the boundary (where the inequality is saturated). Formally, if

f(R) ≥ b is a screening-oﬀ inequality for resources of type [R

], then, given Proposition 15,

7−→R

iﬀ ∃L

such that f(L

) = b and R

∈ ConvexHull



, V

LDO

]

)

{R : f(R) > b}

| {z }

the LDO images of R

interior to screened-oﬀ region



Since

CHSH

(

)

≥

2 is a screening-oﬀ inequality whose saturation-boundary is given by

, and since the

only image in V

LDO

(

2 2

)

(C(α)) which violates the CHSH inequality is C(α) itself, it follows that

C(α) 7−→R if and only if ∃γ ≥ 0 and ∃L

∈ L

such that R = γ L

+ (1−γ)C(α). (60)

The equivalence Eq.

(59a) ⇔

Eq.

(59b)

follows. As a ﬁnal comment, notice that this characterization of

convertibility in terms of the existence of a geometric decomposition involves arbitrary points which saturate

the CHSH inequality, and there are typically many such decompositions.

Proof that Eq. (59b) ⇔ Eq. (59c).

Recall that Eq.

(59b)

involves a minimization under the constraint that

is such that

R = γ L

+ (1−γ)C(α)

We can formally recast it as a constrained optimization problem, as follows:

min

0≤α≤1

CHSH



C(α)



such that L

R − (1−γ)C(α)

, under the constraint that

all conditional probabilities in the expression of

are nonnegative,

where γ is an implicit function of α according to

CHSH(R) − (1−γ) CHSH



C(α)



= 2, as implied by the fact that CHSH(L

) = 2.

Essentially, this is a constrained optimization problem with a linear objective subject to one nonlinear

constraint; namely, that the smallest conditional probability in the expression

must be nonnegative.

For such optimization problems, it is always the case that the objective is maximized when the constraint is

not merely satisﬁed but saturated. Put another way, the set of achievable

arise from points

wherein

all conditional probabilities are nonnegative, but the optimal

arises for some unique

where the

smallest conditional probability in

is precisely zero.

Proof of Proposition 19.

Proposition 19 for arbitrary resources follows from Proposition 29 by the symmetry noted in Proposition 16d.

Namely, the argument can be repeated unchanged in each of the eight spaces of resources generated by the

images under LSO of the set of resources satisfying

CHSH

(

)

≥

2. Together with the trivial observation that

free resources (which do not violate any of the eight CHSH inequalities) always have value of

NPR

equal to

2, Proposition 19 follows.

B.2 Proof of Proposition 21: when the two monotones are complete

We now prove Proposition 21, repeated here:

Proposition 21.

Consider a two-parameter family

R(α,γ) = γ L

+ (1−γ)C(α)

, marked by any ﬁxed

—

that is, a point which saturates the CHSH inequality and is on the boundary of the no-signaling set. The pair

of monotones {M

CHSH

, M

NPR

} is complete relative to such a family of resources.

Proof.

A set of monotones is complete relative to a family of resources if and only if every candidate conversion

among resources in the family which is not ruled out by any of the monotones in the set is in fact possible for

free, as per Eq. (21).

In Fig. 16(a), we depict in blue the set of candidate conversions (from a generic resource

(

, γ

) to

another resource in the family) which are not ruled out by

CHSH

, M

NPR

}

; namely, the blue shaded region

contains all resources which have a value for each of the two monotones that is equal to or lower than that

(

, γ

). To prove the proposition, we argue that

(

, γ

) can indeed be converted to any resource

in the blue region. By convexity, it suﬃces to prove that

(

, γ

) can be converted to each of the four

extreme points of the blue region. Since

and

NPR

are free resources,

(

, γ

) can freely be converted

to either of them, and the resource

(

, γ

) can obviously be ‘converted’ to itself, as the identity is free. Our

proof, therefore, focuses on demonstrating that

(

, γ

) can indeed be converted to the fourth extreme point

(

0), shown as a green star. We now give the explicit free operation which takes a generic initial resource

denotes the representation of

as a vector whose components are the conditional probabilities

XY |ST

(

xy|st

) :

x, y, s, t ∈

{0, 1}}.

NPR

R(α

, γ

)

R(α

, 0)

= R



(1−γ

), 0



(a)

P R

NPR

CHSH

R(α

, γ

)

(b)

Figure 16: (a) and (b) provide the same pair of depictions of the two-parameter family of resources

(

2 2

)

as were introduced

in Fig. 7. We consider a two-parameter family of resources. A generic such resource, speciﬁed by

and

, is marked by a

red diamond. Also depicted are some of the level curves of the two monotones

CHSH

and

NPR

. The solid dark blue

region denotes the set of all resources within this family which have values for both monotones less than or equal to their

values for

(

, γ

). To prove Proposition 21, one must show that

(

, γ

) can be converted to any resource in the solid

blue region. The critical step in this proof is the demonstration that it is possible to convert any resource to one lying on

the line connecting

and

NPR

without changing the value of

CHSH

. Graphically, this corresponds to converting the

generic resource R(α

, γ

) to the resource R(α

, 0) marked by a green star.

(

, γ

) and projects it onto the chain leftwards in the two-dimensional coordinate system of Fig. 16(a), i.e.,

to the target resource R(α

, 0), where α

= α

(1 − γ

)

We denote the free operation which enacts this conversion by

erase-γ

; it is the operation which projects

any resource into the subspace of resources that are invariant under the

456

subgroup of

LSO

(

2 2

)

(

456

deﬁned in Proposition 16c on page 23), i.e., onto the chain

NPR

This operation is indeed free, as it can

be constructed by a uniform mixture of all the elements of

456

, each of which is free. Recall that

456

is the

subgroup of

LSO

(

2 2

)

which stabilizes

CHSH

, and therefore clearly does not modify the value of the

CHSH

monotone.

It remains only to show that the

456

-invariant subspace of resources within the set of all

(

2 2

)

-type resources

for which

CHSH

(

)

≥

2 is the chain

NPR

, i.e., the line of points between

and

NPR

. This is evident

by conﬁrming that

erase-γ

leaves

invariant, but maps each of the 8 deterministic

CHSH

-saturating

boxes to

NPR

. Those 1

8 resources are the extreme points of the set of all

(

2 2

)

-type resources such that

CHSH

(

)

≥

2; since the extreme points map to the line under the action of

erase-γ

, by convex linearity it

follows that the chain is the only space invariant under G

456

within the two-parameter family.

B.3 Proof of Proposition 23: all nonfree resources of type

(

2 2

)

are orbital

We now prove Proposition 23, repeated here:

Proposition 23. All nonfree resources of type

(

2 2

)

are orbital.

The particular relation between

and (

, γ

) follows from Eq.

(45)

, by noticing that

(

0) and

(

, γ

) must have the

same value for M

CHSH

Equivalently, τ

erase-γ

is the Reynold’s operator of the subgroup G

456

Before presenting the proof, we introduce some additional concepts and a few lemmas on which our proof

relies. Throughout the following, we are focused on sets of resources of ﬁxed type, and on type-preserving

operations. Hence, we use slightly abbreviated notation; e.g. V

LDO

(

)

is used as shorthand for V

LDO

[R]

(

)

and so on.

Deﬁnition 31.

The set of

local deterministic type-preserving nonsymmetry operations

, denoted

LDTNO

, contains all type-preserving operations in LDO which are not in LSO. The image of a resource

under LDTNO constitutes a discrete set of resources denoted

LDTNO

(

). Moreover, we use

HullLDTNO

(

)

to indicate the set of all resources in the convex hull of the image of a resource

under LDTNO, i.e.,

HullLDTNO(R)

= ConvexHull



LDTNO

(R)



Deﬁnition 32.

A resource

is said to be

sensitive

if every element of LDTNO removes

from its

equivalence class; i.e., if for all

τ∈LDTNO

it holds that

τ ◦R Y7−→R

. Equivalently, a resource

is sensitive

if and only if

is not in the convex hull of its images under LDTNO, i.e., if

R 6∈ HullLDTNO

(

). A set

of resources is called sensitive if every resource in the set is sensitive.

We bring up the property of sensitivity because (i) it is straightforward to test if a given resource is

sensitive or not by means of a linear program, and (ii) eventually we will argue that if a resource is sensitive,

then it is also orbital. Furthermore, we now prove that sensitive resources never appear in isolation. That is,

a single sensitive resource can be used to construct a set of sensitive resources, as follows:

Lemma 33.

For any resource

, every resource

which is below

in the pre-order and which cannot be

generated from

by mixtures of LDTNO operations is Formally: the set of resources

sens

LOSR

(

)

HullLDTNO(R) is always sensitive.

Proof. First, note two related, useful facts:

(1) The composition of any deterministic operation (invertible or not) followed by some deterministic

nonsymmetry operation is precisely some (other) deterministic nonsymmetry operation. Formally, if

LDTNO

∈

LDTNO

and

LDO

∈ LDO

, and deﬁning

LDTNO

◦τ

LDO

, then

∈ LDTNO

. A consequence of this is that

the entire set

LOSR

(

) is mapped to the set

HullLDTNO

(

) under

LDTNO

and convex mixtures thereof.

To see this, recall that the image of any convex set of resources under any convex set of operations is identically

the convex hull of the images of the extremal resources under the extremal operations (in the respective sets).

We use this fact to eﬀectively replace

LOSR

(

) with

LDO

(

) and to replace convex mixtures of LDTNO

with LDTNO itself, without loss of generality. In summary:

HullLDTNO

(

LOSR

(

)) =

HullLDTNO

(

)

by virtue of the fact that LDTNO ◦ LDO = LDTNO.

(2) The composition of any deterministic nonsymmetry operation followed by some deterministic operation

(invertible or not) is some (other) deterministic nonsymmetry operation. Formally, if

1−LDTNO

∈ LDTNO

and

2−LDO

∈ LDO

, and deﬁning

2−LDO

◦τ

1−LDTNO

, then

∈ LDTNO

. A consequence of this is that the

entire set

HullLDTNO

(

) is mapped to itself under

LOSR

. To see this, we reuse the shortcut of considering

only extremal resources and extremal operations. Specializing to our objects of interest, we eﬀectively replace

the operations-set

LOSR

by its extremal operations — namely

LDO

— and the resources-set

HullLDTNO

(

)

LDTNO

(

) without loss of generality. In summary:

LOSR

(

HullLDTNO

(

)) =

HullLDTNO

(

) by

virtue of the fact that LDO ◦ LDTNO = LDTNO.

Now we are in position to prove Lemma 33. The set of resources

below

in the partial order is

identically

LOSR

(

). The set of resources which can be generated from

by mixtures of deterministic

nonsymmetry operations is identically

HullLDTNO

(

). So, a resource

is below

in the partial

order and cannot be generated from

by mixtures of deterministic nonsymmetry operations if and only if

∈ P

LOSR

(R) \ HullLDTNO(R) =: S

sens

Now, consider any

τ ∈ LDTNO

and any

∈ S

sens

, and deﬁne

τ ◦ R

. Since we have estab-

lished that the entirety of

LOSR

(

) is mapped to

HullLDTNO

(

) under LDTNO, it follows that

∈ HullLDTNO

(

). However, since we have also established that the entirety of

HullLDTNO

(

)

is mapped only to itself under LOSR, and since

/∈ HullLDTNO

(

), it further follows that

Y7−→R

Evidently, any

∈ S

sens

is removed from its equivalence class by every deterministic nonsymmetry operation,

i.e., S

sens

is sensitive. This proves the Lemma.

Note that Lemma 33 implies that if R is sensitive, and R

is equivalent to R, then R

is also sensitive.

Lemma 34.

If a resource is sensitive, then it is also orbital. That is, if two sensitive resources are

interconvertible under type-preserving LOSR, then they are interconvertible under LSO.

Proof.

Let

and

be distinct sensitive resources that are interconvertible under type-preserving LOSR,

i.e.,

R 6

but

R ←→R

and [

] = [

]. Any operation which preserves the equivalence class of a sensitive

resource can be expressed as a convex combination of elements of LSO. The assumption of sensitivity thus

dictates that

is in the convex hull of the images of

under LSO, and vice versa. We proceed to show

that this sort of relationship must imply that

R ∈ V

LSO

(

) and

∈ V

LSO

(

), that is, that

and

are

LSO-equivalent.

This can be seen by recognizing that the 2-norm is a convex function invariant under LSO, meaning

∈ ConvexHull



LSO

(R)



implies

≤ k

By symmetry under exchange of

and

, it holds that

≤ k

, and hence

. The 2-norm, moreover, strictly decreases under nontrivial stochastic

mixing;

hence all interconversions between equivalent sensitive resources must be mediated by deterministic

symmetries. Formally: If

i=1

◦ R

and

R =

i=1

◦ R

, where

i=1

= 1

and

, ..., w

, w

, ..., w

} ≥ 0, then k

= k

and w

, w

∈ {0, 1}.

Lemma 35. R

is a sensitive resource, and

LOSR

(

)

\HullLDTNO

(

) is the entire eight dimen-

sional set of all nonfree resources of type

(

2 2

)

Proof by inspection.

One can readily verify that τ ◦ R

∈ S

free

(

2 2

)

for all type-preserving LDTNO operations τ.

Proof of Proposition 23.

Lemma 35 together with Lemma 33 immediately imply that all nonfree resources of type

(

2 2

)

are sensitive.

Lemma 34 then directly implies that all these resources are orbital.

A ﬁnal comment: consider generalizing Proposition 23 in light of the discussion just given. If one desires to

construct an orbital set of resources beyond

(

2 2

)

-type, one needs only to ﬁnd some single sensitive resource

R of the desired type. From Lemmas 33 and 34, it then follows that the set of resources P

LOSR

(

)

HullLDTNO

(

)

constitutes an orbital set. It might be the case, for instance, that for any nontrivial choice

of resource type, there is at least one convexly extremal resource that is sensitive, analogous to how the

PR-box is a sensitive resource for type

(

2 2

)

B.4 Proof of Proposition 25: lower bound on the number of monotones in any complete set

Recall that a resource is termed orbital if and only if its LOSR-equivalence class of resources of the same

type is equal to its LSO-equivalence class. We now prove Proposition 25, recalled below:

Proposition 25.

For any compact set

of resources that are all orbital, the intrinsic dimension of the set

is a lower bound on the cardinality of a complete set of continuous monotones for

(and for any superset

of S).

The fact that

is in the convex hull of the images of

under permutations of

’s probabilities is equivalent to stating that

vector majorizes

. The relationship is reﬂexive, however. Readers familiar with vector majorization may recall that two

vectors are equivalent under the majorization order if and only if they are related by some reordering, i.e., a (not necessarily

physical) symmetry operation.

Recall that

is shorthand for the representation of the resource in terms of a real-valued vector consisting of all possible

conditional probabilities, i.e.,

R =



XY |ST

(xy|st) : x, y, s, t ∈ {0, 1}



Consider the hypersphere consisting of all resources with 2-norm in common with

. All the images of

under LSO lie on

the surface of this hypersphere. Stochastic mixing of symmetry operations (applied to

) is equivalent to convexly combining

diﬀerent points from the surface of the hypersphere. Any convex combination of points from the surface of a hypersphere results

in a ﬁnal point strictly interior to the sphere. Strictly interior points are closer to the center, in precisely the sense of having a

strictly smaller 2-norm.

Proof.

The set of local symmetry operations for a given type has ﬁnite cardinality, and hence there are a

ﬁnite number of resources in the LSO-equivalence class of any resource. For an orbital resource

, this

implies that the LOSR-equivalence class of

(over resources of type [

]) is precisely

LSO

(

), which is a

ﬁnite set. If a compact set

of orbital resources has intrinsic dimension

, and the LOSR-equivalence class of

every resource in the set is ﬁnite and hence zero-dimensional, then it follows that one can ﬁnd

-dimensional

compact subsets of resources in S in which no two resources are equivalent.

Hence, no two resources in such a subset are assigned the same tuple of values by any complete set of

monotones. In other words, a complete set of

continuous monotones maps the subset of resources injectively

. But this map can only be injective if

n ≥ d

, which guarantees that the number of continuous monotones

required to identify a resource in the set is at least as large as the intrinsic dimension

of the set

. Finally,

note that the number of continuous monotones required to identify a resource in any superset of

must be

at least as large as for the set S itself, which completes the proof.

References

[1] J. S. Bell, “On the Einstein-Podolsky-Rosen paradox,” Physics 1, 195 (1964).

[2] J. S. Bell, “On the Problem of Hidden Variables in Quantum Mechanics,” Rev. Mod. Phys. 38, 447 (1966).

[3]

B. Hensen

et al.

, “Loophole-free Bell inequality violation using electron spins separated by 1.3 kilometres,”

Nature 526, 682 EP (2015).

[4]

M. Giustina

et al.

, “Signiﬁcant-Loophole-Free Test of Bell’s Theorem with Entangled Photons,” Phys. Rev.

Lett. 115, 250401 (2015).

[5] L. Shalm et al., “Strong Loophole-Free Test of Local Realism,” Phys. Rev. Lett. 115, 250402 (2015).

[6]

J. Barrett, L. Hardy, and A. Kent, “No Signaling and Quantum Key Distribution,” Phys. Rev. Lett.

, 010503

(2005).

[7]

A. Ac´ın, N. Gisin, and L. Masanes, “From Bell’s Theorem to Secure Quantum Key Distribution,” Phys. Rev.

Lett. 97, 120405 (2006).

[8]

V. Scarani, N. Gisin, N. Brunner, L. Masanes, S. Pino, and A. Ac´ın, “Secrecy extraction from no-signaling

correlations,” Phys. Rev. A 74, 042339 (2006).

[9]

A. Ac´ın, N. Brunner, N. Gisin, S. Massar, S. Pironio, and V. Scarani, “Device-Independent Security of Quantum

Cryptography against Collective Attacks,” Phys. Rev. Lett. 98, 230501 (2007).

[10] R. Colbeck and R. Renner, “Free randomness can be ampliﬁed,” Nat. Phys. 8, 450 EP (2012).

[11]

S. Pironio, A. Ac´ın, S. Massar, A. B. de la Giroday, D. N. Matsukevich, P. Maunz, S. Olmschenk, D. Hayes,

L. Luo, T. A. Manning, and C. Monroe, “Random numbers certiﬁed by Bell’s theorem,” Nature

464

, 1021 EP

(2010).

[12]

C. Dhara, G. Prettico, and A. Ac´ın, “Maximal quantum randomness in Bell tests,” Phys. Rev. A

, 052116

(2013).

[13]

U. Vazirani and T. Vidick, “Fully Device-Independent Quantum Key Distribution,” Phys. Rev. Lett.

113

140501 (2014).

[14]

J. Kaniewski and S. Wehner, “Device-independent two-party cryptography secure against sequential attacks,”

New J. Phys. 18, 055004 (2016).

[15]

R. Gallego, L. E. W¨urﬂinger, A. Ac´ın, and M. Navascu´es, “Operational Framework for Nonlocality,” Phys. Rev.

Lett. 109, 070401 (2012).

[16]

J. I. de Vicente, “On nonlocality as a resource theory and nonlocality measures,” J. Phys. A

, 424017 (2014).

[17]

J. Geller and M. Piani, “Quantifying non-classical and beyond-quantum correlations in the uniﬁed operator

formalism,” J. Phys. A 47, 424030 (2014).

[18] R. Gallego and L. Aolita, “Nonlocality free wirings and the distinguishability between Bell boxes,” Phys. Rev.

A 95 (2017).

Not all compact subsets will necessarily have this property, but some will. For example, consider a nonfree resource

asym

which

is not invariant under any LSO operation. Every LSO operation maps such a resource to a distinct resource not in the original

neighborhood for a suitably small neighborhood. Because LSO operations are linear (and hence continuous), they map compact

subspaces to compact subspaces. Hence, every LSO operation takes the entire neighborhood of nonfree resources around

asym

to some other nonfree neighborhood; if the original neighborhood is chosen to be small enough, these two neighborhoods will not

intersect. Hence, no two resources in the original neighborhood are interconvertible by LSO.

[19]

K. Horodecki, A. Grudka, P. Joshi, W. K lobus, and J.

odyga, “Axiomatic approach to contextuality and

nonlocality,” Phys. Rev. A 92, 032104 (2015).

[20]

B. Amaral, A. Cabello, M. T. Cunha, and L. Aolita, “Noncontextual wirings,” Phys. Rev. Lett.

120

, 130403

(2018).

[21]

E. Kaur, M. M. Wilde, and A. Winter, “Fundamental limits on key rates in device-independent quantum key

distribution,” arXiv:1810.05627 (2018).

[22]

S. G. A. Brito, B. Amaral, and R. Chaves, “Quantifying Bell nonlocality with the trace distance,” Phys. Rev.

A 97, 022111 (2018).

[23]

D. Schmid, D. Rosset, and F. Buscemi, “Type-independent resource theory of local operations and shared

randomness,” arXiv:1909.04065 (2019).

[24]

D. Rosset, D. Schmid, and F. Buscemi, “Characterizing nonclassicality of arbitrary distributed devices,”

arXiv:2004.09194 (2020).

[25]

D. Schmid, T. C. Fraser, R. Kunjwal, A. B. Sainz, E. Wolfe, and R. W. Spekkens, “Why standard entanglement

theory is inappropriate for the study of Bell scenarios,” arXiv:1911.12462 (2019).

[26]

B. Coecke, T. Fritz, and R. W. Spekkens, “A mathematical theory of resources,” Info. & Comp.

250

, 59 (2016).

[27]

J. F. Clauser, M. A. Horne, A. Shimony, and R. A. Holt, “Proposed experiment to test local hidden-variable

theories,” Phys. Rev. Lett. 23, 880 (1969).

[28] A. Shimony, “Bell’s Theorem,” in The Stanford Encyclopedia of Philosophy (2017).

[29] B. d’Espagnat, “The Quantum Theory and Reality,” Scientiﬁc American 241, 158 (1979).

[30] H. M. Wiseman, “The two Bell’s theorems of John Bell,” J. Phys. A 47, 424001 (2014).

[31] R. F. Werner, “Comment on ‘What Bell did’,” J. Phys. A 47, 424011 (2014).

[32] V. Scarani, “The Device-Independent Outlook on Quantum Physics,” Acta Physica Slovaca 62, 347 (2012).

[33]

T. Maudlin, Quantum Non-Locality and Relativity : Metaphysical Intimations of Modern Physics (Blackwell

Publishers, 2002).

[34] T. Norsen, “Bell Locality and the Nonlocal Character of Nature,” Found. Phys. Lett. 19, 633 (2006).

[35]

R. Chaves, R. Kueng, J. B. Brask, and D. Gross, “Unifying Framework for Relaxations of the Causal Assumptions

in Bell’s Theorem,” Phys. Rev. Lett. 114, 140403 (2015).

[36]

R. Chaves, D. Cavalcanti, and L. Aolita, “Causal hierarchy of multipartite Bell nonlocality,” Quantum

, 23

(2017).

[37]

T. Maudlin, “Bell’s Inequality, Information Transmission, and Prism Models,” in Philosophy of Science Associa-

tion, 1 (1992) pp. 404–417.

[38]

B. F. Toner and D. Bacon, “Communication Cost of Simulating Bell Correlations,” Phys. Rev. Lett.

, 187904

(2003).

[39] G. Hooft, “The Fate of the Quantum,” arXiv:1308.1007 (2013), report numbers: ITP-UU-13/22, SPIN-13/15.

[40]

M. J. W. Hall, “Local Deterministic Model of Singlet State Correlations Based on Relaxing Measurement

Independence,” Phys. Rev. Lett. 105, 250404 (2010).

[41]

J. Barrett and N. Gisin, “How Much Measurement Independence Is Needed to Demonstrate Nonlocality?” Phys.

Rev. Lett. 106, 100406 (2011).

[42] J. Pearl, Causality: Models, Reasoning, and Inference (Cambridge University Press, 2009).

[43] C. J. Wood and R. W. Spekkens, “The lesson of causal discovery algorithms for quantum correlations: causal

explanations of Bell-inequality violations require ﬁne-tuning,” New J. Phys. 17, 033002 (2015).

[44]

J.-M. A. Allen, J. Barrett, D. C. Horsman, C. M. Lee, and R. W. Spekkens, “Quantum Common Causes and

Quantum Causal Models,” Phys. Rev. X 7, 031021 (2017).

[45]

J. Henson, R. Lal, and M. F. Pusey, “Theory-independent limits on correlations from generalized Bayesian

networks,” New J. Phys. 16, 113043 (2014).

[46] T. Fritz, “Beyond Bell’s theorem: correlation scenarios,” New J. Phys. 14, 103001 (2012).

[47] L. Hardy, “Quantum Theory From Five Reasonable Axioms,” quant-ph/0101012 (2001).

[48] J. Barrett, “Information processing in generalized probabilistic theories,” Phys. Rev. A 75, 032304 (2007).

[49]

P. Janotta and H. Hinrichsen, “Generalized probability theories: what determines the structure of quantum

theory?” J. Phys. A 47, 323001 (2014).

[50]

G. Chiribella, G. M. D’Ariano, and P. Perinotti, “Probabilistic theories with puriﬁcation,” Phys. Rev. A

062348 (2010).

[51]

G. M. Ariano, Quantum Theory from First Principles: An Informational Approach (Cambridge University

Press, 2019).

[52] F. Costa and S. Shrapnel, “Quantum causal modelling,” New J. Phys 18, 063032 (2016).

[53] J. Barrett, R. Lorenz, and O. Oreshkov, “Quantum Causal Models,” arXiv:1906.10726 (2019).

[54]

D. Schmid, H. Du, M. Mudassar, G. C. de Wit, D. Rosset, and M. J. Hoban, “Postquantum common-cause

channels: the resource theory of local operations and shared entanglement,” arXiv:2004.06133 (2020).

[55]

G. Chiribella, G. M. D’Ariano, and P. Perinotti, “Quantum Circuit Architecture,” Phys. Rev. Lett.

101

060401 (2008).

[56]

G. Chiribella, G. M. D’Ariano, and P. Perinotti, “Theoretical framework for quantum networks,” Phys. Rev. A

80, 022339 (2009).

[57] S. Popescu and D. Rohrlich, “Quantum nonlocality as an axiom,” Found. Phys. 24, 379 (1994).

[58]

J. Selby

et al.

, “Contextuality Quantiﬁed: A Resource Theory Encompassing Prepare-and-Measure Processes,”

Forthcoming.

[59]

C. Branciard, D. Rosset, N. Gisin, and S. Pironio, “Bilocal versus nonbilocal correlations in entanglement-

swapping experiments,” Phys. Rev. A 85, 032119 (2012).

[60]

A. Ac´ın, R. Augusiak, D. Cavalcanti, C. Hadley, J. K. Korbicz, M. Lewenstein, L. Masanes, and M. Piani,

“Uniﬁed Framework for Correlations in Terms of Local Quantum Observables,” Phys. Rev. Lett.

104

, 140404

(2010).

[61]

S. W. Al-Saﬁ and A. J. Short, “Simulating all Nonsignaling Correlations via Classical or Quantum Theory with

Negative Probabilities,” Phys. Rev. Lett. 111, 170403 (2013).

[62]

J.-D. Bancal, S. Pironio, A. Ac´ın, Y.-C. Liang, V. Scarani, and N. Gisin, “Quantum non-locality based on

ﬁnite-speed causal inﬂuences leads to superluminal signalling,” Nat. Phys. 8, 867 (2012).

[63]

J. S. Bell, “La nouvelle cuisine,” in Quantum Mechanics, High Energy Physics And Accelerators: Selected Papers

Of John S Bell (With Commentary) (World Scientiﬁc, 1995) pp. 910–928.

[64]

O. Oreshkov, F. Costa, and

C. Brukner, “Quantum correlations with no causal order,” Nat. Comm.

, 1092 EP

(2012).

[65] O. Oreshkov and C. Giarmatzi, “Causal and causally separable processes,” New J. Phys. 18, 093020 (2016).

[66]

D. Rosset, J.-D. Bancal, and N. Gisin, “Classifying 50 years of Bell inequalities,” J. Phys. A

, 424022 (2014).

[67] A. Seress, Permutation Group Algorithms (Cambridge University Press, 2003).

[68] S. Pironio, “Lifting Bell inequalities,” J. Math. Phys. 46, 062112 (2005).

[69]

D. Rosset,

Amin Baumeler, J.-D. Bancal, N. Gisin, A. Martin, M.-O. Renou, and E. Wolfe, “Algebraic and

geometric properties of local transformations,” arXiv:2004.09405 (2020).

[70] A. Fine, “Hidden Variables, Joint Probability, and the Bell Inequalities,” Phys. Rev. Lett. 48, 291 (1982).

[71] T. Gonda and R. W. Spekkens, “Monotones in General Resource Theories,” arXiv:1912.07085 (2019).

[72] F. Buscemi, “All Entangled Quantum States Are Nonlocal,” Phys. Rev. Lett. 108, 200401 (2012).

[73]

S. Beigi and A. Gohari, “Monotone Measures for Non-Local Correlations,” IEEE T. Inform. Theory

, 5185

(2015).

[74]

P. Bierhorst, “Geometric decompositions of Bell polytopes with practical applications,” J. Phys. A

, 215301

(2016).

[75]

D. Cavalcanti and P. Skrzypczyk, “Quantitative relations between measurement incompatibility, quantum

steering, and nonlocality,” Phys. Rev. A 93, 052112 (2016).

[76]

K. T. Goh, J. Kaniewski, E. Wolfe, T. V´ertesi, X. Wu, Y. Cai, Y.-C. Liang, and V. Scarani, “Geometry of the

set of quantum correlations,” Phys. Rev. A 97, 022104 (2018).

[77]

M. W. Girard and G. Gour, “Computable entanglement conversion witness that is better than the negativity,”

New J. Phys. 17, 093013 (2015).

[78]

N. Brunner, D. Cavalcanti, S. Pironio, V. Scarani, and S. Wehner, “Bell nonlocality,” Rev. Mod. Phys.

, 419

(2014).

[79]

J. Barrett, N. Linden, S. Massar, S. Pironio, S. Popescu, and D. Roberts, “Nonlocal correlations as an

information-theoretic resource,” Phys. Rev. A 71, 022101 (2005).

[80]

N. Brunner, D. Cavalcanti, S. Pironio, V. Scarani, and S. Wehner, “Bell nonlocality,” Rev. Mod. Phys.

, 419

(2014).

[81]

J. Barrett and S. Pironio, “Popescu-Rohrlich Correlations as a Unit of Nonlocality,” Phys. Rev. Lett.

140401 (2005).

[82] V. L. Popov, Algebraic Geometry IV (Springer-Verlag, 1994) Chap. 4: Quotients.

[83]

D. Collins and N. Gisin, “A relevant two qubit Bell inequality inequivalent to the CHSH inequality,” J. Phys. A

37, 1775 (2004).

[84]

T. H. Yang and M. Navascu´es, “Robust self-testing of unknown quantum systems into any entangled two-qubit

states,” Phys. Rev. A 87, 050102(R) (2013).

[85]

C. Bamps and S. Pironio, “Sum-of-squares decompositions for a family of Clauser-Horne-Shimony-Holt-like

inequalities and their application to self-testing,” Phys. Rev. A 91, 052111 (2015).

[86]

L. Masanes, “Necessary and suﬃcient condition for quantum-generated correlations,” quant-ph/0309137 (2003).

[87]

J. Allcock, N. Brunner, M. Pawlowski, and V. Scarani, “Recovering part of the boundary between quantum and

nonquantum correlations from information causality,” Phys. Rev. A 80, 040103(R) (2009).

[88]

A. Ac´ın, S. Massar, and S. Pironio, “Randomness versus Nonlocality and Entanglement,” Phys. Rev. Lett.

108

100402 (2012).

[89]

E. Wolfe and S. F. Yelin, “Quantum bounds for inequalities involving marginal expectation values,” Phys. Rev.

A 86, 012123 (2012).

[90] M. A. Nielsen, “Conditions for a class of entanglement transformations,” Phys. Rev. Lett. 83, 436 (1999).

[91]

C. Bamps, S. Massar, and S. Pironio, “Device-independent randomness generation with sublinear shared quantum

resources,” Quantum 2, 86 (2018).

[92]

G. Gour, M. P. M¨uller, V. Narasimhachar, R. W. Spekkens, and N. Y. Halpern, “The resource theory of

informational nonequilibrium in thermodynamics,” Phys. Rep. 583, 1 (2015).

[93]

T. Fritz, “Resource convertibility and ordered commutative monoids,” Math. Struct. Comp. Sci.

, 850–938

(2017).

[94]

N. Brunner and P. Skrzypczyk, “Nonlocality Distillation and Postquantum Theories with Trivial Communication

Complexity,” Phys. Rev. Lett. 102, 160403 (2009).

[95]

B. Lang, T. V´ertesi, and M. Navascu´es, “Closed sets of correlations: answers from the zoo,” J. Phys. A

424029 (2014).

[96]

Y. R. Sanders and G. Gour, “Necessary conditions for entanglement catalysts,” Phys. Rev. A

, 054302

(2009).

[97]

D. Jonathan and M. B. Plenio, “Entanglement-Assisted Local Manipulation of Pure Quantum States,” Phys.

Rev. Lett. 83, 3566 (1999).

[98]

W. van Dam and P. Hayden, “Universal entanglement transformations without communication,” Phys. Rev. A

67, 060302 (2003).

[99] B. Steudel and N. Ay, “Information-Theoretic Inference of Common Ancestors,” Entropy 17, 2304 (2015).

[100]

E. Wolfe, R. W. Spekkens, and T. Fritz, “The Inﬂation Technique for Causal Inference with Latent Variables,”

J. Causal Inference 7 (2019).

[101]

N. Gisin, “The Elegant Joint Quantum Measurement and some conjectures about N-locality in the Triangle and

other Conﬁgurations,” arXiv:1708.05556 (2017).

[102]

T. C. Fraser and E. Wolfe, “Causal compatibility inequalities admitting quantum violations in the triangle

structure,” Phys. Rev. A 98, 022113 (2018).

[103]

C. Branciard, N. Gisin, and S. Pironio, “Characterizing the Nonlocal Correlations Created via Entanglement

Swapping,” Phys. Rev. Lett. 104, 170401 (2010).

[104]

F. Andreoli, G. Carvacho, L. Santodonato, R. Chaves, and F. Sciarrino, “Maximal violation of

-locality

inequalities in a star-shaped quantum network,” New J. Phys. 19, 113020 (2017).

[105]

A. Tavakoli, P. Skrzypczyk, D. Cavalcanti, and A. Ac´ın, “Nonlocal correlations in the star-network conﬁguration,”

Phys. Rev. A 90, 062109 (2014).

[106]

D. Rosset, C. Branciard, T. J. Barnea, G. P¨utz, N. Brunner, and N. Gisin, “Nonlinear Bell inequalities tailored

for quantum networks,” Phys. Rev. Lett. 116, 010403 (2016).

[107] A. Tavakoli, “Bell-type inequalities for arbitrary noncyclic networks,” Phys. Rev. A 93, 030101(R) (2016).

[108]

J. Pearl, “On the Testability of Causal Models with Latent and Instrumental Variables,” in Proc. 11th Conf.

Uncertainty in Artiﬁcial Intelligence (1995) pp. 435–443.

[109]

B. Bonet, “Instrumentality Tests Revisited,” in Proc. 17th Conf. Uncertainty in Artiﬁcial Intelligence (2001)

pp. 48–55.

[110]

R. J. Evans, “Graphical methods for inequality constraints in marginalized DAGs,” in IEEE International

Workshop on Machine Learning for Signal Processing (2012).

[111]

R. Chaves, G. Carvacho, I. Agresti, V. D. Giulio, L. Aolita, S. Giacomini, and F. Sciarrino, “Quantum violation

of an instrumental test,” Nat. Phy. 14, 291 (2017).

[112]

T. Van Himbeeck, J. Bohr Brask, S. Pironio, R. Ramanathan, A. Bel´en Sainz, and E. Wolfe, “Quantum violations

in the Instrumental scenario and their relations to the Bell scenario,” Quantum 3, 186 (2019).

Comments

Products

Project