Fermat's Library | q-2020-06-25-286 annotated/explained version.

On the Entanglement Cost of One-Shot Compression

Shima Bab Hadiashar and Ashwin Nayak

Department of Combinatorics and Optimization, and Institute for Quantum Computing, University of Waterloo,

200 University Ave. W., Waterloo, ON, N2L 3G1, Canada.

We revisit the task of visible compression of an ensemble of quantum states

with entanglement assistance in the one-shot setting. The protocols achieving

the best compression use many more qubits of shared entanglement than the

number of qubits in the states in the ensemble. Other compression protocols,

with potentially larger communication cost, have entanglement cost bounded

by the number of qubits in the given states. This motivates the question as to

whether entanglement is truly necessary for compression, and if so, how much

of it is needed.

Motivated by questions in communication complexity, we lift certain restric-

tions that are placed on compression protocols in tasks such as state-splitting

and channel simulation. We show that an ensemble of the form designed by

Jain, Radhakrishnan, and Sen (ICALP’03) saturates the known bounds on the

sum of communication and entanglement costs, even with the relaxed compres-

sion protocols we study.

The ensemble and the associated one-way communication protocol have

several remarkable properties. The ensemble is incompressible by more than a

constant number of qubits without shared entanglement, even when constant

error is allowed. Moreover, in the presence of shared entanglement, the commu-

nication cost of compression can be arbitrarily smaller than the entanglement

cost. The quantum information cost of the protocol can thus be arbitrarily

smaller than the cost of compression without shared entanglement. The en-

semble can also be used to show the impossibility of reducing, via compression,

the shared entanglement used in two-party protocols for computing Boolean

functions.

1 Introduction

1.1 Visible compression

Compression of quantum states is a fundamental task in information processing. In the

simplest setting, we have two spatially separated parties, commonly called Alice and Bob,

Shima Bab Hadiashar: sbabhadi@uwaterloo.ca

Ashwin Nayak: ashwin.nayak@uwaterloo.ca

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 1

arXiv:1905.02110v3 [quant-ph] 19 Jun 2020

who can communicate with each other by exchanging quantum states. They have in mind

an ensemble of m-dimensional quantum states

((p

, ρ

) : x ∈ S, ρ

∈ D(C

)) , (1.1)

where S is some non-empty ﬁnite set, and p is a probability distribution over S. Alice

gets an input x ∈ S with probability p

, and would like to send a message, i.e., a quantum

state σ

∈ D(C

) to Bob so that he can recover the state ρ

, or even an approximation to

it. Since the input x completely speciﬁes the corresponding state ρ

, this variant of the

task is called visible compression. The communication cost of the protocol is log d, the

length of the message in qubits. Their goal is to accomplish this with as short a message

as possible, i.e., to minimize the dimension d. A central question in quantum information

theory is whether there is a simple characterization of the optimal communication cost in

terms of the “information content” of the ensemble.

An additional resource that Alice and Bob may use in compression is a shared entangled

state. In other words, the two parties may start with their qubits initialized to a ﬁxed pure

quantum state independent of the input received by Alice. The local quantum operations

performed for compression and decompression then also involve the respective parts of the

shared state. This is depicted in Figure 1, and the protocol (or channel) is said to be with

shared entanglement or entanglement assisted. As we may expect, the communication cost

may decrease due to the availability of this additional resource. The entanglement cost of

a protocol is the minimal dimension of the support of either party’s share of the initial

state (measured in qubits) required to achieve some communication cost. (We discuss the

notion of entanglement cost in detail in Section 4.) We would also like to characterize the

entanglement cost in this setting, in addition to the communication cost.

|x

<latexit sha1_base64="Kjn82NshDbErSur1Q2qfY5QZjsY=">AAAB5nicbVDLSgMxFL1TX219Vd0IboJFcFVm6sIui25cVrAPaUvJpGkbmskMyR2x1oJf4EbEjYLf4Ff4C4IfY/rYtPXAhcM555J74kdSGHTdHyexsrq2vpFMpTe3tnd2M3v7FRPGmvEyC2Woaz41XArFyyhQ8lqkOQ18yat+/3LsV++4NiJUNziIeDOgXSU6glG00u3jfUNT1ZW8lcm6OXcCsky8GckWDx9+U09fF6VW5rvRDlkccIVMUmPqnhthc0g1Cib5KN2IDY8o69MuH07OHJETK7VJJ9R2FJKJOpejgTGDwLfJgGLPLHpj8T+vHmOn0BwKFcXIFZs+1IklwZCMO5O20JyhHFhCmRb2QsJ6VFOG9mfStrq3WHSZVPI57yyXv/ayxQJMkYQjOIZT8OAcinAFJSgDgwBe4B0+nJ7z7Lw6b9NowpntHMAcnM8/GAGPag==</latexit>

|x

<latexit sha1_base64="WpbDVNFLu2kgMGufoa4z4RgwjdA=">AAAB3nicbVBNS0JBFL3Pvsy+rJZBDEnQSt7TRe4S2rRU6KmgYvPG+3Rw3gcz8wIRl7WJaFPQT3HVur/Qb+hPNH5s1A5cOJxzLnPPeLHgStv2j5Xa2Nza3knvZvb2Dw6PsscnNRUlkqHLIhHJhkcVCh6iq7kW2Igl0sATWPcGt1O//ohS8Si818MY2wHthdznjGojVd1ONmfn7RnIOnEWJHfzNan+Pp1PKp3sd6sbsSTAUDNBlWo6dqzbIyo1ZwLHmVaiMKZsQHs4mp03JpdG6hI/kmZCTWbqUo4GSg0DzyQDqvtq1ZuK/3nNRPul9oiHcaIxZPOH/EQQHZFpV9LlEpkWQ0Mok9xcSFifSsq0+ZGMqe6sFl0ntULeKeYLVSdXLsEcaTiDC7gCB66hDHdQARcYILzAO3xYD9az9Wq9zaMpa7FzCkuwPv8AQoKMxw==</latexit>

<latexit sha1_base64="53u3biTACZG1ED1tQtmUQuW5bV0=">AAAB3nicbVC7SkNBFDzrM8ZX1FKQxSBYhXtjYToDNpYJmAckIe7d7E2W7H2we64QQkptRGwU/JRU1v6C3+BPuHk0SRw4MMzMYc+sFytp0HF+yNr6xubWdmonvbu3f3CYOTqumijRXFR4pCJd95gRSoaighKVqMdasMBToub1byd+7VFoI6PwHgexaAWsG0pfcoZWKlfbmayTc6agq8Sdk+zN17j8+3Q2LrUz381OxJNAhMgVM6bhOjG2hkyj5EqM0s3EiJjxPuuK4fS8Eb2wUof6kbYTIp2qCzkWGDMIPJsMGPbMsjcR//MaCfqF1lCGcYIi5LOH/ERRjOikK+1ILTiqgSWMa2kvpLzHNONofyRtq7vLRVdJNZ9zr3L5spstFmCGFJzCOVyCC9dQhDsoQQU4CHiBd/ggD+SZvJK3WXSNzHdOYAHk8w9D/IzI</latexit>

|

<latexit sha1_base64="yZzoUMAFcvdEmaleQRzLli6veEE=">AAAB6XicbVDLSgMxFL1TX219Vd0IboJFcFVm6sIui25cVnDaQqeUTJppQzOZkGSEWgv+ghsRNwr+gV/hLwh+jOlj09YDFw7nnEvuSSg508Z1f5zM2vrG5lY2l9/e2d3bLxwc1nWSKkJ9kvBENUOsKWeC+oYZTptSURyHnDbCwfXEb9xTpVki7sxQ0naMe4JFjGBjpeAxkH0WKCx6nHYKRbfkToFWiTcnxerxw2/u6euq1il8B92EpDEVhnCsdctzpWmPsDKMcDrOB6mmEpMB7tHR9NIxOrNSF0WJsiMMmqoLORxrPYxDm4yx6etlbyL+57VSE1XaIyZkaqggs4eilCOToElt1GWKEsOHlmCimL0QkT5WmBj7OXlb3Vsuukrq5ZJ3USrfesVqBWbIwgmcwjl4cAlVuIEa+EBAwgu8w4czcJ6dV+dtFs04850jWIDz+QdDWJCt</latexit>

Alice

Bob

<latexit sha1_base64="Q31/2tHi4snRqoI3kUyheTcJZcg=">AAAAAHicbVDLSgMxFM34rPXRUcGNm2ARXJWZIuiy1o3LFuwD2mHIpJk2NMkMSUaow3yJGxeKuHDjN/gF7tz4LWbaLrT1QOBwzr3k3BPEjCrtOF/Wyura+sZmYau4vbO7V7L3D9oqSiQmLRyxSHYDpAijgrQ01Yx0Y0kQDxjpBOPr3O/cEaloJG71JCYeR0NBQ4qRNpJvl678tM+RHkmeUpFlvl12Ks4UcJm4c1KuHTW/6Vv9o+Hbn/1BhBNOhMYMKdVznVh7KZKaYkayYj9RJEZ4jIakZ6hAnCgvnQbP4KlRBjCMpHlCw6n6eyNFXKkJD8xknlEtern4n9dLdHjpmYPiRBOBZx+FCYM6gnkLcEAlwZpNDEFYUpMV4hGSCGvTVdGU4C6evEza1Yp7Xqk23XKtDmYogGNwAs6ACy5ADdyABmgBDBLwAJ7As3VvPVov1utsdMWa7xyCP7DefwBDQ5cw</latexit>

out

<latexit sha1_base64="N7Qkz3aHHdGhuuJGOBeyglQghOk=">AAAAAHicbVDLSgMxFM3UV62vUcGNm2ARXJWZIuiy1o3LFuwD2mHIpJk2NJkMSaZQhvkTNy4UEVz5C36BOzd+i5m2C209EDiccy/35AQxo0o7zpdVWFvf2Nwqbpd2dvf2D+zDo7YSicSkhQUTshsgRRiNSEtTzUg3lgTxgJFOML7N/c6ESEVFdK+nMfE4GkY0pBhpI/m2feOnfY70SPJUJDrLfLvsVJwZ4CpxF6RcO2l+07f6R8O3P/sDgRNOIo0ZUqrnOrH2UiQ1xYxkpX6iSIzwGA1Jz9AIcaK8dJY8g+dGGcBQSPMiDWfq740UcaWmPDCTeUi17OXif14v0eG1l9IoTjSJ8PxQmDCoBcxrgAMqCdZsagjCkpqsEI+QRFibskqmBHf5y6ukXa24l5Vq0y3X6mCOIjgFZ+ACuOAK1MAdaIAWwGACHsATeLZS69F6sV7nowVrsXMM/sB6/wEyXJe7</latexit>

<latexit sha1_base64="QRa0BeN8pMF5pYUkDA8FwbemFgI=">AAAAAHicbVDLSgNBEOyNrxhfUY9eBoPgKeyGgB6DXrwICZgHJEuYnfQmY2Znl5lZIYR8gRcPinj1k7z5N06SPWhiQUNR1U13V5AIro3rfju5jc2t7Z38bmFv/+DwqHh80tJxqhg2WSxi1QmoRsElNg03AjuJQhoFAtvB+Hbut59QaR7LBzNJ0I/oUPKQM2qs1LjvF0tu2V2ArBMvIyXIUO8Xv3qDmKURSsME1brruYnxp1QZzgTOCr1UY0LZmA6xa6mkEWp/ujh0Ri6sMiBhrGxJQxbq74kpjbSeRIHtjKgZ6VVvLv7ndVMTXvtTLpPUoGTLRWEqiInJ/Gsy4AqZERNLKFPc3krYiCrKjM2mYEPwVl9eJ61K2auWK41qqXaTxZGHMziHS/DgCmpwB3VoAgOEZ3iFN+fReXHenY9la87JZk7hD5zPH6avjNY=</latexit>

<latexit sha1_base64="dRboNTlmw86S8PRMM9oopeBs1wg=">AAAAAHicbVDLSgMxFL1TX7W+qoIbN4NFcFVmiqDLWhFctmAf0I4lk2ba0CQzJBmlDP0PNy4UcSf+hF/gzo3fYqbtQlsPBA7n3Ms9OX7EqNKO82VllpZXVtey67mNza3tnfzuXkOFscSkjkMWypaPFGFUkLqmmpFWJAniPiNNf3iZ+s07IhUNxY0eRcTjqC9oQDHSRrq96iYdjvRA8uRiPO7mC07RmcBeJO6MFMoHtW/6VvmodvOfnV6IY06Exgwp1XadSHsJkppiRsa5TqxIhPAQ9UnbUIE4UV4yST22j43Ss4NQmie0PVF/bySIKzXivplMI6p5LxX/89qxDs69hIoo1kTg6aEgZrYO7bQCu0clwZqNDEFYUpPVxgMkEdamqJwpwZ3/8iJplIruabFUcwvlCkyRhUM4ghNw4QzKcA1VqAMGCQ/wBM/WvfVovViv09GMNdvZhz+w3n8Aw06WYw==</latexit>

<latexit sha1_base64="Cpyqr1s2i2MeGJQ4QGs8m7cn1e8=">AAAAAHicbVDLSgMxFL3js9ZX1aWbYBFclZlS0GWpCC4r2Ae0Y8mkmTY0yQxJRinD/IcbF4q49V/c+Tem7Sy09UDgcM693JMTxJxp47rfztr6xubWdmGnuLu3f3BYOjpu6yhRhLZIxCPVDbCmnEnaMsxw2o0VxSLgtBNMrmd+55EqzSJ5b6Yx9QUeSRYygo2VHm4GaV9gM1YibWTZoFR2K+4caJV4OSlDjuag9NUfRiQRVBrCsdY9z42Nn2JlGOE0K/YTTWNMJnhEe5ZKLKj203nqDJ1bZYjCSNknDZqrvzdSLLSeisBOziLqZW8m/uf1EhNe+SmTcWKoJItDYcKRidCsAjRkihLDp5ZgopjNisgYK0yMLapoS/CWv7xK2tWKV6tU72rleiOvowCncAYX4MEl1OEWmtACAgqe4RXenCfnxXl3Phaja06+cwJ/4Hz+AOXcksY=</latexit>

out

<latexit sha1_base64="jtZk7+SeBS1CR32lQkpPF26xkkA=">AAAAAHicbVDLSsNAFJ3UV62vqEs3g0VwVZJS0GWpG5cVbCu0IUymk3boPMLMpFBC/sSNC0Xc+ifu/BsnbRbaemDgcM693DMnShjVxvO+ncrW9s7uXnW/dnB4dHzinp71tUwVJj0smVRPEdKEUUF6hhpGnhJFEI8YGUSzu8IfzInSVIpHs0hIwNFE0JhiZKwUum4nzEYcmanimUxNnodu3Wt4S8BN4pekDkp0Q/drNJY45UQYzJDWQ99LTJAhZShmJK+NUk0ShGdoQoaWCsSJDrJl8hxeWWUMY6nsEwYu1d8bGeJaL3hkJ4uQet0rxP+8YWri2yCjIkkNEXh1KE4ZNBIWNcAxVQQbtrAEYUVtVoinSCFsbFk1W4K//uVN0m82/Faj+dCqtztlHVVwAS7BNfDBDWiDe9AFPYDBHDyDV/DmZM6L8+58rEYrTrlzDv7A+fwBVPeUHg==</latexit>

<latexit sha1_base64="oXBl/cNtkQP47D0Zv2NLv7a0V8o=">AAAAAHicbVBNS8NAEJ3Ur1q/qh69LBbBU0lKwR4rXjxWtLXQhrLZTtqlm03Y3Qgl9Cd48aCIV3+RN/+N2zYHbX0w8Hhvhpl5QSK4Nq777RQ2Nre2d4q7pb39g8Oj8vFJR8epYthmsYhVN6AaBZfYNtwI7CYKaRQIfAwmN3P/8QmV5rF8MNME/YiOJA85o8ZK99cDb1CuuFV3AbJOvJxUIEdrUP7qD2OWRigNE1Trnucmxs+oMpwJnJX6qcaEsgkdYc9SSSPUfrY4dUYurDIkYaxsSUMW6u+JjEZaT6PAdkbUjPWqNxf/83qpCRt+xmWSGpRsuShMBTExmf9NhlwhM2JqCWWK21sJG1NFmbHplGwI3urL66RTq3r1au2uXmk28jiKcAbncAkeXEETbqEFbWAwgmd4hTdHOC/Ou/OxbC04+cwp/IHz+QO2d41k</latexit>

<latexit sha1_base64="j/RHpYpwpOcqB4i4pSCy0Pc+Ah8=">AAAAAHicbZDLSgMxFIbPeK2j1apLN8FScFUmRbDLoiAuK9oLtEPJpJk2NJMZkoxQhj6CGxeKuBUfxEdw59uYXhba+kPg4//PIeecIBFcG8/7dtbWNza3tnM77u5efv+gcHjU1HGqKGvQWMSqHRDNBJesYbgRrJ0oRqJAsFYwuprmrQemNI/lvRknzI/IQPKQU2KsdXfZw71C0St7M6FVwAso1vKfaena/aj3Cl/dfkzTiElDBdG6g73E+BlRhlPBJm431SwhdEQGrGNRkohpP5uNOkEl6/RRGCv7pEEz93dHRiKtx1FgKyNihno5m5r/ZZ3UhFU/4zJJDZN0/lGYCmRiNN0b9bli1IixBUIVt7MiOiSKUGOv49oj4OWVV6FZKePzcuUWF2tVmCsHJ3AKZ4DhAmpwA3VoAIUBPMIzvDjCeXJenbd56Zqz6DmGP3LefwA9xZAB</latexit>

Figure 1: A one-message protocol for compression of quantum states, with shared entanglement. The

holds the input given to Alice, and E

contains Alice’s workspace and her part of the

initial shared state (the shared entanglement). The register E

contains Bob’s workspace and his part

of the initial shared state. The compression is implemented by the isometry U, and the register M

contains the compressed state and is sent as the message. The decompression is implemented by the

isometry V . Bob’s output is contained in the register B

out

Compression problems similar to the one above have been studied extensively in quantum

information theory, both in the one-shot setting (the one we described above), and in the

asymptotic setting (where the sender’s input consists of multiple samples picked indepen-

dently from the same distribution). The problem has been studied in early works such

as Ref. [5] in the setting of quantum communication without shared entanglement. It is

known as remote state preparation when allowed one-way communication over a classical

channel with shared entanglement. We refer the reader to Ref. [4, Table I] for a summary

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 2

of the work on remote state preparation; we describe the most relevant results—in the

one-shot setting—below.

Other tasks in the literature that come close to the one above are state splitting (see, e.g.,

Ref. [7]), and that of channel simulation in the context of the Quantum Reverse Shannon

Theorem [6, 7]. State splitting is the time reversal [9] of state merging [15, 16], and was

called the “fully quantum reverse Shannon protocol” in Ref. [9]. We explain the connection

to state splitting in detail in Section 2.3.

In both state splitting and channel simulation, the protocol is required to be “coherent” in

speciﬁc ways. In particular, in compressing an ensemble of states as in Eq. (1.1), at the

end of the protocol, Bob would be required to hold an approximation to the state ρ

and

Alice a puriﬁcation of this state. In contrast to these tasks, we do not require that the

compression protocol maintain such coherence. More precisely, the registers containing

a puriﬁcation of the output state may be shared by Alice and Bob. Such compression

protocols are more relevant in the context of two-party communication protocols studied

in complexity theory, especially in the context of direct sum and direct product results (see

e.g., Refs. [17, 19, 28] and the references therein). In communication complexity, a typical

goal is to compute a bivariate Boolean function when the inputs are distributed between

two parties. The parties communicate with each other, alternating messages with local

computation, and at the end, one party produces the output of the protocol from the part

of the ﬁnal state in her possession. As a result, the output of the protocol does not depend

on the part of the state held by the other party (i.e., on the puriﬁcation of her part of the

ﬁnal joint state). A compression scheme for the ﬁnal state then need only focus on the

part being measured for the output.

1.2 Entanglement cost of compression

Jain, Radhakrishnan, and Sen [18, 19] gave a one-shot protocol for compressing an en-

semble of states as in Eq. (1.1), and bounded its communication cost by O(I(A : B)

/

where I(A : B)

is the mutual information between registers A and B in the state τ

x∈[n]

|xihx|

⊗ ρ

, and  is the average approximation error (cf. Section 2.3 for a pre-

cise deﬁnition of average error). Using a more reﬁned application of their technique, Bab

Hadiashar, Nayak, and Renner [4] tightly characterized the communication cost of the

task in terms of the smooth max-information, a one-shot entropic analogue of mutual in-

formation. Their results are stated for entanglement-assisted classical channels and use

puriﬁed distance to quantify the approximation, but translate immediately to the setting

here through the use of superdense coding [29, Section 6.3.1] and the Fuchs and van de

Graaf Inequalities (Proposition 2.4). The upper bound so obtained is

/

√

max

(A : B)

+ O(log log(1/)) .

This is slightly better than that derived from protocols for state splitting in terms of the

approximation error; it has an additive term of O(log log



) for average error  versus the

additive term of O(log



) in Ref. [1, Corollary 5]. However, both these protocols use shared

entanglement that may be much longer than the message itself, namely O(k(log



) log m)

qubits and O((1+ 1/

) log

(m/)) qubits, respectively, where log

k = I(A : B)

, and m is

the dimension of the states in the ensemble. On the other hand, earlier protocols for state

splitting [7, Lemma 3.5], with potentially larger communication cost, have entanglement

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 3

cost bounded by log m. Since sharing entanglement also entails some communication, in

addition to the preparation and storage of a potentially delicate high dimensional state,

this motivates the question as to whether shared entanglement is truly necessary for com-

pression, and if so, how much of it is needed.

For the more restrictive task of state splitting, it follows from the proof of the converse

bound for one-shot entanglement consumption due to Berta, Christandl, and Touchette [8,

Proposition 10] that the sum of the communication and entanglement costs is at least the

min-entropy S

min

(ρ) of the ensemble average state ρ

. (Although the proof

is written assuming that the shared state consists of EPR pairs and some ancilla and an

auxiliary error parameter, it may be modiﬁed to give a bound when an arbitrary state is

shared and the auxiliary error is 0.) In this article, we show that there are ensembles for

which the min-entropy bound equals the number of qubits in the states, and the bound

holds up to an additive constant even with the more general compression protocols we

allow.

Theorem 1.1. There exist universal constants c

, c

> 0 such that for any  ∈ (0, 1), and

any k, m ∈ N with k ≥ 6/(1 − ) and m ≥ c

(ln k)/(1 − )

such that k divides m, there

exists an ensemble



, ρ



: x ∈ [n], ρ

∈ D(C

)



, where n depends on k, m, and , such

that

(i) I(A : B)

= I

max

(A : B)

= log

k, where τ

x∈[n]

|xihx|

⊗ ρ

;

(ii) there is a one-way protocol with shared entanglement for the visible compression of

the ensemble with average error /2 and with communication cost

log k+O(log log



);

and

(iii) the sum of communication and entanglement costs of any one-way protocol with

shared entanglement for visible compression of the ensemble, with average-error at

most /2, is at least

log m − 3 log

1 − 

− c

In particular, the theorem implies that in the absence of shared entanglement, the ensemble

may only be compressed by a constant number of qubits (independent of m), even if

constant average error



< 1/2 is allowed. Note also that the straightforward protocol that

prepares and sends the state ρ

on input x has sum of entanglement and communication

costs equal to log m. So the lower bound in the theorem is optimal up to an additive

universal constant term for constant  ∈ (0, 1).

Proposition 3.4 and Corollary 3.5 in Section 3 contain more precise statements of the

results stated in the theorem. As we explain in that section, I(A : B)

may be interpreted

as the “information content” of the ensemble; it is the quantum information cost [28] of

the protocol in which Alice simply prepares the state ρ

on input x and sends the state to

Bob.

The compression task we study is a relaxation of oblivious (or blind) compression, in which

the input to Alice is the state ρ

, rather than x. It is also a relaxation of state-splitting

(more generally, of state re-distribution [10, 23, 30]), and channel simulation. So the lower

bound in Theorem 1.1(ii) holds for these tasks as well.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 4

The ensemble mentioned in Theorem 1.1 is obtained via the probabilistic method, and is of

a form devised by Jain, Radhakrishnan, and Sen [17]. They showed the incompressibility

of such an ensemble when the decompression operation is unitary (i.e., via protocols as in

Figure 1 in which the register B

is trivial). We adapt their proof method to protocols which

allow a general quantum channel for decompression. A key step here is a technical lemma

(Lemma 3.2 in Section 3) which allows us to reason about general quantum channels, and

also yields a tighter lower bound on the sum of communication and entanglement costs.

1.3 Implications and related work

Jain et al. [18, 19], also used the same kind of ensemble as in Theorem 1.1 to design

a two-party one-way communication protocol with shared entanglement for the Equality

function. They showed that the initial shared state in the protocol cannot be replaced

by one with polynomially smaller dimension in a “black-box fashion” (i.e., when the local

operations of the two parties are not modiﬁed). Theorem 1.1 implies a similar impossibility

result for protocols in which the sender and receiver can deviate from the original protocol

arbitrarily, but they try to approximate the receiver’s state in the original protocol after

the message is sent. The impossibility holds even when the dimension of the initial shared

entangled state is reduced only by a constant factor.

A remarkable property of the ensemble posited by Theorem 1.1 is that the communication

cost of compression (with shared entanglement) may be arbitrarily smaller than the en-

tanglement cost. For constant error the communication cost is within an additive constant

of the quantum information cost [28] of the protocol that simply prepares and sends the

state. As a consequence, we infer that the quantum information cost of a protocol may

be arbitrarily smaller than the communication cost of any protocol without shared entan-

glement for compressing its messages. Anshu, Touchette, Yao, and Yu [3] had previously

proven a similar separation when the compression protocol is allowed to use shared entan-

glement. However, their separation is exponential: they exhibited an interactive protocol

for a Boolean function with quantum information cost that is exponentially smaller than

the communication cost of any interactive quantum protocol that computes the function.

(Observe that a protocol for compressing the ﬁnal state of the original protocol may also

be used to compute the function.) In contrast to that protocol, the one we present is

compressible to its quantum information cost, but requires an arbitrarily larger amount of

shared entanglement to do so.

In another related work, Liu, Perry, Zhu, Koh, and Aaronson [22] show that one-way

protocols cannot be compressed to their quantum information cost without using shared

entanglement. They consider a certain one-way protocol in which Alice gets an n-bit

input, Bob gets an m-bit input, with m ∈ o(n). The protocol has quantum informa-

tion cost O(nm

−2

log m). They show that the protocol cannot be compressed by a one-

way protocol without shared entanglement into a message of length o(log n) with error at

most (n + 1)

−m

. Thus the separation is limited, and only holds for exponentially small

error (in the length of the inputs).

It is believed that the communication in any interactive quantum protocol which has a

constant number of rounds and computes a function of classical inputs may be compressed,

with constant error, to an amount proportional to the quantum information cost of the

protocol. For one-way protocols such a result was shown by Jain, Radhakrishnan, and

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 5

Sen [18, 19]. This was later re-proven by Anshu, Jain, Mukhopadhyay, Shayeghi, and

Yao [2] using diﬀerent techniques. A similar result for protocols with a larger constant

number of rounds of communication was claimed by Touchette [28], but the proof has

an error. The compression protocols achieving quantum information cost all rely on the

presence of shared entanglement. Theorem 1.1 shows that even for the simplest protocols,

such compression is not possible in the absence of shared entanglement. Moreover, it shows

that the entanglement cost may be necessarily within an additive constant of the length

of the message to be compressed, even when the quantum information cost is arbitrarily

smaller than the message length.

In a recent independent work, Khanian and Winter [20] analyse the communication and

entanglement costs of a variant of compression in the asymptotic setting. They study pure

state ensembles with quantum side information in the form of pure states. In the case of vis-

ible compression with shared entanglement, they show that the asymptotic (per-instance)

communication cost is at least

S(ρ), i.e., half the entropy of the ensemble average state ρ.

So this cost may be at most a factor of 1/2 smaller than that of compression without shared

entanglement. Moreover, the asymptotic sum of communication and entanglement costs

is at least the entropy S(ρ). Thus the kind of separation we show does not hold for pure

states even in the asymptotic setting.

Organization. The rest of this article is organized as follows. In Section 2, we review

basic concepts and notation from quantum information and communication. In section 3,

we prove the main result and discuss its implications.

Acknowledgements. We thank Milán Mosonyi for extensive, thoughtful feedback on

earlier versions of this article. This research is supported in part by NSERC Canada. SBH

is also supported by an Ontario Graduate Scholarship.

2 Preliminaries

2.1 Mathematical notation and background

We refer the reader to the book Watrous [29] for a thorough introduction to basics of

quantum information. We brieﬂy review the notation and some results that we use in the

article.

For the sake of brevity, we denote the set {1, 2, . . . , k} by [k]. We denote physical quantum

systems (“registers”) with capital letters, like X, Y and Z. The state space corresponding

to a register is a ﬁnite-dimensional Hilbert space. We denote (ﬁnite dimensional) Hilbert

spaces either by capital script letters like H and K, or as C

where m is the dimension. We

denote the the dimension of a Hilbert space corresponding to a register X as |X|. We use

the Dirac notation, i.e., “ket” and “bra”, for unit vectors and their adjoints, respectively.

We denote the set of all unit vectors in a Hilbert space H by Sphere(H). For a Hilbert

space H

= C

for some non-empty ﬁnite set S, we call {|xi : x ∈ S} its canonical basis.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 6

A subset N of Sphere(H) is called -dense if for every vector |ui ∈ Sphere(H), there exists

a vector in the set N at Euclidean distance at most  from |vi. Such a set is also called

an “-net” in the literature. The following proposition states that every ﬁnite dimensional

Hilbert space has a relatively small -dense set.

Proposition 2.1 ([24], Lemma 13.1.1, Chapter 13). Let  ∈ (0, 1], and m be a positive

integer. The Hilbert space C

has an -dense set N of size |N| ≤







A slightly better bound



1 +





on the size of an -dense set is given in Ref. [26, Lemma

2.6].

We denote the set of all linear operators on Hilbert space H by L(H), the set of all positive

semi-deﬁnite operators by Pos(H), the set of all unitary operators by U(H), and the set of

all quantum states (or “density operators”) over H by D(H). The identity operator on H is

denoted by 1

. We denote quantum states or sub-normalized states (positive semi-deﬁnite

operators with trace at most 1) by lowercase Greek letters like ρ, σ. We use notation such

as ρ

to indicate that register X is in state ρ, and may omit the superscript when the

operator if M ≤ 1. We usually denote quantum channels, i.e., completely positive trace-

preserving linear maps from the space of linear operators on a Hilbert space to another

such space, by capital Greek letters like Ψ. The partial trace over a Hilbert space K is

denoted as Tr

We denote the operator norm (Schatten ∞ norm) of an operator M ∈ L(H) by kMk,

the Frobenius norm (Schatten 2 norm) by kMk

, and the trace norm (Schatten 1 norm)

by kMk

. Recall that kMk

= Tr

√

∗

M is the sum of the singular values of M , kMk

is the largest singular value, and kMk

Tr(M

∗

M) is the `

-norm of the singular

values with multiplicity. All of these norms are invariant under composition with a unitary

operator.

We consider random unitary operators chosen according to the Haar measure η on U(H),

where H is a ﬁnite dimensional Hilbert space. The Haar measure is the unique unitarily

invariant probability measure over U(H).

Let f : U(H) → R be a continuous function. Suppose f is κ-Lipschitz, i.e., for all U, V ∈

U(H), we have

|f(U) − f (V )| ≤ κ kU − V k

for some κ ≥ 0. If κ is small enough as compared to the dimension of H, with high

probability, the random variable f(U ) is close to its expectation, where U ∈ U(H) is a

Haar-random unitary operator. This concentration of measure property is formalized by

the following theorem, which is a special case of Theorem 5.17 in Ref. [25].

Theorem 2.2 ([25], Theorem 5.17, page 159). Let η be the Haar measure on U(H),

where H is a Hilbert space with ﬁnite dimension m, and let U ∈ U(H) be a random unitary

operator chosen according to η. For every function f : U(H) → R that is κ-Lipschitz with

respect to the Frobenius norm (with κ > 0), and every positive real number t, we have



{U ∈ U(H) : f(U) − E [f(U )] ≥ t}



≤ exp

−

(m − 2)t

24κ

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 7

The ﬁdelity between two sub-normalized states ρ and σ is deﬁned as

F(ρ, σ)

= Tr

√

ρ σ

√

ρ +

(1 − Tr(ρ))(1 − Tr(σ)) .

Fidelity can be used to deﬁne a useful metric called the puriﬁed distance [12, 27] between

sub-normalized states:

P(ρ, σ)

1 − F(ρ, σ)

For a quantum state ρ ∈ D(H) and  ∈ [0, 1], we deﬁne



(ρ)

= {

ρ ∈ Pos (H) : P(ρ,

ρ) ≤ , Tr

ρ ≤ 1}

as the ball of sub-normalized states that are within puriﬁed distance  of ρ.

The trace distance between quantum states is induced by the trace norm. The following

property is well known (see, e.g., Ref. [29, Theorem 3.4, page 128]).

Proposition 2.3 (Holevo-Helstrom Theorem [13, 14]). For any pair of quantum states ρ, σ ∈

D(H),

kρ − σk

= 2 max { |Tr(Mρ) − Tr(Mσ)| : M is a measurement operator on H} .

Puriﬁed distance and trace distance are related to each other as follows (see, e.g., Ref. [29,

Theorem 3.33, page 161]):

Proposition 2.4 (Fuchs and van de Graaf Inequalities [11]). For any pair of quantum

states ρ, σ ∈ D(H),

1 −

1 − P(ρ, σ)

≤

kρ − σk

≤ P(ρ, σ) .

Unless speciﬁed, we take the base of the logarithm function to be 2.

Let H, K, and M be the state spaces corresponding to registers X, Y , and M, respectively.

For a register X in quantum state ρ ∈ D(H), the von Neumann entropy of X is deﬁned as

S(ρ)

= −Tr (ρ log ρ) .

This coincides with the Shannon entropy of the spectrum of ρ. The relative entropy of two

quantum states ρ, σ ∈ D(H) is deﬁned as

S(ρkσ)

= Tr (ρ log ρ − ρ log σ) ,

when supp(ρ) ⊆ supp(σ), and is ∞ otherwise. The max-relative entropy of ρ with respect

to σ is deﬁned as

max

(ρkσ)

= min{λ : ρ ≤ 2

σ} ,

when supp(ρ) ⊆ supp(σ), and is ∞ otherwise. The min-entropy of ρ is deﬁned as

min

(ρ)

= −log kρk .

Suppose that the registers X, Y are in joint state ρ

∈ D(H⊗K). The mutual information

of X and Y is deﬁned as

I(X : Y )

= S



kρ

⊗ ρ



Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 8

When the state is clear from the context, the subscript ρ may be omitted from the nota-

tion. When ρ is a classical-quantum state, i.e., ρ

|xihx|

⊗ ρ

with p being a

probability distribution, {|xi} the canonical orthonormal basis for H, and ρ

∈ D(K), we

have

I(X : Y ) =

S(ρ

kρ) ,

where ρ =

. Suppose the registers X, Y, M are in joint (tripartite) state ρ

XYM

∈

D(H ⊗ K ⊗ M). The conditional mutual information of X and M given Y is deﬁned as

I(X : M |Y )

= I(XY : M ) − I(Y : M) .

When ρ

XYM

is a tensor product of the states ρ

and ρ

, we have

I(X : M |Y ) = I(XY : M ) = I(X : M ) .

For any state ρ

∈ D(H ⊗ K), the max-information register Y has about register X [7]

is deﬁned as

max

(X : Y )

= min

σ∈D(K)

max



kρ

⊗ σ



For a parameter  ∈ [0, 1], the smooth max-information register Y has about register X is

deﬁned as



max

(X : Y )

= min

eρ∈B



(ρ)

max

(X : Y )

eρ

2.2 Quantum communication protocols

We ﬁrst describe a two-party quantum communication protocol informally and then give a

formal deﬁnition for the special case of interest to us. We refer the reader to, e.g., Ref. [28]

for a formal deﬁnition of the general case.

In a two-party quantum communication protocol, there are two parties, Alice and Bob,

each of whom may get some input in registers designated for this purpose. Alice and Bob’s

inputs may be entangled with each other, and also with a “reference” system, which puriﬁes

it. Alice and Bob’s goal is to accomplish an information processing task by communicating

with each other.

Each party possesses some “work” (or “private”) qubits (or registers) in addition to the

input registers. The work qubits are initialized to a ﬁxed pure state in tensor product with

the input state. This ﬁxed state may be entangled across the work registers of Alice and

Bob, and may be used as a computational resource. In this case, we say the protocol or

the channel is with shared entanglement or with entanglement assistance. If the ﬁxed state

is a tensor product state across Alice and Bob’s registers, we say it is a protocol or channel

without shared entanglement or simply unassisted.

The protocol proceeds in some number of “rounds”. In each round, the sender applies an

isometry to the qubits in her possession, and sends a sub-register (the message) to the

other party. The length of the message (in qubits) is the base 2 logarithm of the dimension

of the message register. After the last round, the recipient of the last message applies an

isometry to his registers. The output of the protocol is the state of a pair of designated

registers of the two parties at the end.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 9

We are often interested in minimizing the total length of the messages over all the rounds,

i.e., the communication cost (or complexity) of the protocol. The idea is to accomplish

the task at hand with minimum communication. In protocols with shared entanglement,

we are also interested in the amount of shared entanglement needed in the protocol, i.e.,

the minimum dimension of the support of the initial state of either party’s work space.

This latter quantity, measured in number of qubits, is called the entanglement cost of the

protocol.

In this article, we study only one-way protocols, i.e., protocols with one round, and there-

fore one message, (say) from Alice to Bob. We describe these more formally here. Alice

and Bob initially hold registers A

and B

, respectively. The input registers A

are initialized to some state ρ

whose puriﬁcation is held in register R with a third

party, the referee. Alice and Bob’s work registers E

and E

are initialized to a pure

state |φi

, which may be entangled across the partition E

. The local opera-

tions in the protocol are speciﬁed by two isometries U and V . The isometry U acts on

registers A

and maps them to registers A

out

M. The isometry V acts on regis-

ters B

M and maps them to registers B

out

. First, Alice applies U to the regis-

ters A

and E

and sends the register M to Bob. Then, Bob applies V on his initial

registers B

and the message M. The output of the protocol is the state of Alice

and Bob’s registers A

out

. The communication cost of this protocol is log |M| and the

entanglement cost is the logarithm of the Schmidt rank of the state |φi across the parti-

tion E

. We say it is a protocol with shared entanglement if the Schmidt rank of |φi is

more than 1, and say that it is without shared entanglement otherwise. Such protocols are

also called entanglement-assisted and unassisted, respectively, in the literature.

We say that the input is “classical” when there are non-empty ﬁnite sets S

, S

(the

sets of classical inputs) such that the Hilbert spaces corresponding to the input registers

are C

, C

, respectively, and the initial joint quantum state in the input registers A

is diagonal in the canonical basis {|xi|yi : x ∈ S

, y ∈ S

}. In the case that the inputs to

Alice and Bob are classical, we assume without loss of generality that the input registers A

and B

are “read-only”, i.e., the isometries U and V are of the form

x∈S

|xihx|

⊗U

and

y∈S

|yihy|

⊗V

, where S

, S

are sets as above. A one-way protocol in which

Alice gets a classical input and Bob does not have any input is depicted in Figure 1.

Let Π be a one-way quantum protocol (with or without shared entanglement) with a

single message from Alice to Bob, in which Alice gets a classical input and Bob does not

have any input. The register R with the referee puriﬁes Alice’s input so that |ρi

x∈S

√

|xxi

, where p

is a probability distribution over the input set S

. Let M

be the quantum register corresponding to the message in Π. The quantum information

cost (or quantum information complexity) of the protocol Π is deﬁned as

QIC(Π)

I(R : M |E

) ,

where the registers are in the state immediately after Alice sends the message register M

to Bob. This expression simpliﬁes to I(R : ME

) as the registers R, E

are in a tensor

product state at this point. It is intended to measure the information Bob gains about

Alice’s input from the message. This notion requires a nuanced deﬁnition for protocols

with more general inputs and with multiple rounds of communication. As it is not central

to our work, we refer the reader to Ref. [28] for the deﬁnition for general protocols.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 10

2.3 Compression of quantum states

We study one-way protocols for non-oblivious or visible compression of quantum states,

which is typical for tasks of this nature (see, e.g., Ref. [1]). The protocol may be with

or without shared entanglement. Suppose we wish to compress states chosen from an

ensemble ((p

, ρ

) : x ∈ S) for some ﬁnite set S, where p is a probability distribution

over S and ρ

∈ D(H). The ensemble is known to both parties. The sender, say Alice, is

given a classical input x ∈ S chosen according to the distribution p. Alice and Bob execute

a one-way protocol with a message from Alice to Bob in order to prepare an approximation

of ρ

on Bob’s side. Following the notation from Section 2.2, we interpret the state of the

message register M of this protocol as a compression of ρ

. Suppose the state of the output

out

. We say that the average error of the compression protocol is  ∈ [0, 2]

if the output state

is -close in trace distance to the ideal state ρ

on average over the

inputs x:

kρ

−

≤  .

It is sometimes desirable to express the error in terms of the puriﬁed distance. For simplic-

ity, we state error bounds in terms of trace distance; we may express the bounds in terms

of puriﬁed distance via Proposition 2.4.

Note that a protocol for visible compression without shared entanglement may be charac-

terized by a sequence of quantum states (σ

: x ∈ S) and a quantum channel Ψ. We let σ

be the state of the message register M sent by Alice to Bob on input x. We deﬁne Ψ as the

channel resulting from the application of the isometry V followed by the tracing out of the

. The average error of the protocol is then

kρ

− Ψ(σ

. Conversely,

any choice of states (σ

: x ∈ S, σ

∈ D(K)) and quantum channel Ψ : L(K) → L(H) for

some Hilbert space K deﬁnes a valid visible compression protocol.

An essentially equivalent formulation of the task of visible compression is the following

(with the notation from Section 2.2). Consider the state τ over the registers RXA

x∈S

√

|xi

|φ

where |φ

is a puriﬁcation of ρ

, register R is held by the referee, and registers XA

together constitute Alice’s input register A

. Alice and Bob both know the full description

of τ. Their goal is to run a one-way quantum communication protocol with a message from

Alice to Bob, with or without shared entanglement, such that at the end, the state

τ of

registers RB

out

is close to τ



out

− τ



≤  .

The diﬀerence from state-splitting is that for a ﬁxed state |xi of register R, the puriﬁcation

of the state in register B

out

may be shared arbitrarily between Alice and Bob (while in

state splitting, it is required to be held by Alice, in register A

). A protocol for state-

splitting can thus be used for this task, and conversely lower bounds on communication or

entanglement costs derived for the above task applies to state-splitting as well.

3 The main result

In this section, we prove the main result of this article.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 11

3.1 Two useful lemmas

We begin with two lemmas that we need for the result. The ﬁrst allows us to focus

on a ﬁnite number of subspaces of a ﬁnite dimensional Hilbert space, in the context of

measurements. For an operator M ∈ L(H), and a subspace A of H, deﬁne the semi-norm

kMk

= max

|wi ∈ Sphere(A)

|hw|M|wi| .

Lemma 3.1 ([17], Lemma 6). Let d and q be positive integers with q ≥ d, δ > 0 be a real

number, and H be an q-dimensional Hilbert space. There exists a set T of subspaces of H

of dimension at most d such that

1. |T| ≤



√



2qd

, and

2. for every d-dimensional subspace A ⊆ H, there is a subspace B ∈ T such that for

every measurement operator M ∈ Pos(H),



kMk

− kMk



≤ δ .

The set T in the lemma is obtained as follows. We ﬁx an -dense subset S of Sphere(H) for

a suitably small value of , as given by Proposition 2.1. For any d-dimensional subspace A,

we consider an orthonormal basis, and the d vectors in S closest to the respective elements

in the basis. We include in T the subspace B spanned by the d vectors from S so obtained.

By a uniformly random subspace of dimension ` of an m-dimensional Hilbert space H,

with ` ≤ m, we mean the image of a ﬁxed `-dimensional subspace under a Haar-random

unitary operator on H. The next lemma is similar to Lemma 7 from Ref. [17], and is

stronger in several respects. It enables the generalization of the incompressibility result in

Ref. [17] that we prove, and helps us derive tighter bounds for compression. Informally,

the lemma states that every state in a “small enough” subspace of a bi-partite space has,

with high probability, a small projection onto a “small enough” random subspace of one

part.

Lemma 3.2. Let m, d, `, and p be positive integers such that ` ≤ m. Let W be a

ﬁxed d-dimensional subspace of C

⊗ C

. Let Z be a uniformly random subspace of C

of dimension `, and M be the orthogonal projection operator onto Z . Then for any real

number α > 2, there is a real number α

> 0 that depends only on α such that



kM ⊗ 1

≥

α`



≤ exp

−

(m − 2)

provided

(α − 2)

(m − 2) ≥ (4 × 384)dm



α`



We may take α

(α−2)

768

in the above statement.

Proof: The subspace W is isomorphic to C

as it is d-dimensional. By Proposition 2.1,

there is a set N with |N| ≤



α`



that is a

α`

-dense set of Sphere(W).

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 12

Note that for any two vectors |ui, |vi ∈ Sphere(C

⊗ C

), we have

|hu|(M ⊗ 1)|ui − hv|(M ⊗ 1)|vi| = |Tr (M |uihu| − M |vihv|)|

≤

k|uihu| − |vihv|k

(by Proposition 2.3)

≤

k(|ui − |vi)hv|k

k|ui(hu| − hv|)k

= k|uikk|ui − |vik = k|ui − |vik .

This implies that if kM ⊗ 1

≥

α`

, there is a vector |vi ∈ N such that hv|(M ⊗1)|vi ≥

α`

. By the Union Bound, we get



kM ⊗ 1

≥

α`



≤ |N| × max

|vi∈N



hv|(M ⊗ 1)|vi ≥

α`



. (3.1)

Consider any ﬁxed vector |vi ∈ N and let P ∈ Pos(C

) be a ﬁxed orthogonal projection

of rank `. Consider the function f : U(C

) → R deﬁned as

f(U)

= hv|(UP U

∗

⊗ 1

) |vi .

For any U, W ∈ U(C

), we have

|f(U) − f (W )| =





((UP U

∗

− W P W

∗

) ⊗ 1) |vihv|





≤ kUP U

∗

− W P W

∗

≤ kUP U

∗

− W P U

∗

k + kW P U

∗

− W P W

∗

≤ kU − W k + kU

∗

− W

∗

≤ 2 kU − W k

So f is 2-Lipschitz.

Let U ∈ U(C

) be a Haar-random unitary operation. The expectation of f(U ) is:

E[f(U )] = hv|



E[U P U

∗

] ⊗ 1



|vi

= hv|



⊗ 1



|vi

Since UP U

∗

and M have the same distribution, by Theorem 2.2 we get



hv|(M ⊗ 1) |vi ≥

α`



= Pr



hv|(UP U

∗

⊗ 1) |vi ≥

α`



≤ exp

−

(m − 2)(α − 2)

384m

By Eq. (3.1), we get



kM ⊗ 1

≥

α`



≤



α`



exp

−

(m − 2)(α − 2)

384m

≤ exp

−

(m − 2)(α − 2)

768m

provided the m, `, d, α satisfy the stated condition.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 13

3.2 The ensemble and its compressibility

We study an ensemble of the same form as in Ref. [17]. For positive integers n, m, k such

that k divides m and n, let B

= (|b

i, |b

i, . . . , |b

i) be a suitably chosen orthonormal

basis for C

, for each i ∈





. Let (B

: j ∈ [k]) be a partition of B

into k equal size

sets. Deﬁne ρ

|vi∈B

|vihv|. We show that there is a choice of bases such that the

ensemble





, ρ



: i ∈





, j ∈ [k]



(3.2)

cannot be compressed signiﬁcantly in the absence of shared entanglement. The following

theorem, which we prove along the same lines as Theorem 5 in Ref. [17], contains the crux

of the argument.

Theorem 3.3. Let β ∈ (0, 1),  ∈ (0, 2), and ν ∈ (0, 1 − /2). Let k, m, n, d be positive

integers such that k divides m and n. There exists an ensemble of n quantum states (ρ

) of

the form in Eq. (3.2) such that for any sequence of quantum states



: σ

∈ D(C

), i ∈





, j ∈ [k]



, and for all quantum channels Ψ : L(C

) → L(C

), we have





(i, j) : kρ

− Ψ(σ

> 





> βn ,

when

k ≥

1 − /2 − ν

m > max





1 − β



ln k, 2 +



1 − /2 − ν



, and

n >

6kd

γ(1 − β)

√

where γ

(1−/2−ν)

8×768

Proof: We use the Probabilistic Method to show the existence of an ensemble with the

claimed property. We ﬁrst derive a simpler property that suﬃces.

For i ∈





and j ∈ [k], let τ

∈ D(C

) be m-dimensional quantum states and M

be the

orthogonal projection onto the support of τ

. By Proposition 2.3, the condition



Tr (M

) − Tr (M

Ψ(σ

))





(3.3)

implies that kτ

− Ψ(σ

> . Since Tr(M

) = 1, Eq. (3.3) is equivalent to

Tr (M

Ψ(σ

)) < 1 −



. (3.4)

Consider the following Stinespring representation [29, Corollary 2.27, Sec. 2.2] of the quan-

tum channel Ψ : L(C

) → L(C

) in terms of a unitary operation U ∈ U(A ⊗ B ⊗C) and a

ﬁxed pure state |

0i ∈ B ⊗ C, with A = C

, B = C = C

Ψ(ω) = Tr

A⊗B

U(ω ⊗ |

0ih

0|)U

∗

∀ω ∈ L(C

) .

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 14

So we have

Tr (M

Ψ(σ

)) = Tr



A⊗B



U(σ

⊗ |

0ih

0|)U

∗





= Tr



⊗ 1

A⊗B

) U(σ

⊗ |

0ih

0|)U

∗



and Eq. (3.4) is equivalent to



⊗ 1

A⊗B

) U(σ

⊗ |

0ih

0|)U

∗



< 1 −



. (3.5)

For a ﬁxed unitary operator U, for any i, j, the state U(σ

⊗ |

0ih

0|)U

∗

belongs to D(X)

where X

= U(A⊗|

0i) is a ﬁxed d-dimensional subspace of A⊗B⊗C. Thus, the expression

on the left in Eq. (3.5) is bounded by kM

⊗ 1

A⊗B

for every i, j. So it suﬃces to exhibit

an ensemble such that for all d-dimensional subspaces W ⊆ A ⊗ B ⊗ C,





(i, j) : kM

⊗ 1

A⊗B

< 1 −







> βn .

By Lemma 3.1, for any ν > 0, there is a collection T of subspaces of A⊗B⊗C of dimension

at most d, such that size |T| ≤ (8

√

d/ν)

, and for all subspaces W as above, there is

a subspace Y ∈ T such that for all i, j,



⊗ 1

A⊗B

− kM

⊗ 1

A⊗B



≤ ν .

Taking ν < 1 −



, it suﬃces to produce an ensemble such that for all subspaces Y ∈ T,





(i, j) : kM

⊗ 1

A⊗B

< 1 −



− ν





> βn . (3.6)

We pick bases B

independently and uniformly at random, i.e., for each i, independently

pick a Haar-random unitary operator on C

, and let B

be the basis deﬁned by its columns.

Partition B

into k sets (B

: j ∈ [k]) of equal size. We then deﬁne an ensemble of the

form in Eq. (3.2) with ρ

|vi∈B

|vihv|, and the corresponding projection opera-

tors M

|vi∈B

|vihv|. We show that with non-zero probability, the operators M

satisfy Eq. (3.6) for all Y ∈ T, by bounding the probability of the complementary event.

Suppose the operators M

do not satisfy Eq. (3.6) for some subspace Y ∈ T. Then





(i, j) : kM

⊗ 1

A⊗B

< 1 −



− ν





≤ βn . (3.7)

Equivalently, there are at least (1 − β)n pairs i, j such that kM

⊗ 1k

≥ 1 −/2 −ν. In

particular, there are at least (1 − β)n/k indices i such that there is at least one j ∈ [k]

with kM

⊗ 1k

≥ 1 −/2 −ν. For convenience, by E

(Y) we denote the event that there

is some j ∈ [k] with kM

⊗ 1k

≥ 1 − /2 − ν, and by I(Y), we denote the subset of

indices i ∈





such that E

(Y) occurs.

Let q

= d(1 − β)

e. By the above reasoning, it suﬃces to bound the probability that for

some subspace Y ∈ T, the subset I(Y) has at least q indices.

By Lemma 3.2, for a ﬁxed subspace Y and pair i, j,

⊗ 1k

≥ 1 − /2 − ν

≤ exp

−

((1 − /2 − ν)k − 2)

(m − 2)

768k

≤ exp(−γm) ,

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 15

with γ

(1−/2−ν)

8×768

, when (1 − /2 − ν)k ≥ 4 and

m − 2 ≥

(16 × 384)d

(1 − /2 − ν)



1 − /2 − ν



So by the Union Bound

(Y)

≤ k exp(−γm) ,

and by the Union Bound and the independence of M

for distinct indices i,

|I(Y)| ≥ q

≤

× (k exp(−γm))

Finally, we get

∃Y ∈ T : Eq. (3.7) holds

≤ |T| × max

Y∈T

|I(Y)| ≥ q

≤

√

(k exp(−γm))

< 1 ,

when m > max



1−β



ln k

, and

γ(1 − β)n > 6kd

m ln

√

This proves the theorem.

Note that the above proof considers an arbitrary choice of states σ

and quantum channel Ψ

after the ensemble is chosen randomly. Together, the sequence (σ

) and the channel Ψ

constitute a compression protocol. The proof shows that no matter how (σ

) and Ψ are

chosen, the error due to the corresponding compression protocol is large if the dimension d

is much smaller than m (provided n is chosen properly).

3.3 Application to entanglement cost

Consider a one-way protocol Π in which with probability 1/n, Alice gets input (i, j), pre-

pares state ρ

as in an ensemble given by Theorem 3.3, and sends it to Bob. The ensemble

average ρ is the completely mixed state

over C

. By construction, we have S(ρ

kρ) =

log k, and therefore QIC(Π) =

log k. In fact, we have S

max

(ρ

kρ) = log k. Theo-

rem I.1(1) of Ref. [4] gives us a protocol for the visible compression of any such ensemble

of states using classical communication and shared entanglement, with error . The com-

munication cost of this protocol is

/

√

max

(A : B)

+ O(log log(1/)) ,

where τ

|ijihij|

⊗ ρ

and we have used Proposition 2.4 to translate between

puriﬁed and trace distance. This expression is bounded from above by log k + O(log log



since S

max

(ρ

kρ) (and therefore I

max

(A : B)

) equals log k. Using superdense coding [29,

Section 6.3.1], we get a bound on the quantum communication cost of compressing the

ensemble with entanglement assistance.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 16

Proposition 3.4. For any positive integers k, m, n such that k divides m and n, and

error parameter  > 0, any ensemble of n equally likely quantum states in D(C

) of the

form in Eq. (3.2) there is a one-shot one-way protocol with shared entanglement for

compressing the states with quantum communication at most

log k + O(log log



) ,

with average error at most  in trace distance.

This bound is an additive term of O(log log



) more than QIC(Π). Theorem I.1(1) in

Ref. [4] also gives a lower bound of (1/2) I

√



max

(A : B)

on the communication cost, which is

at least (1/2) log k−2 for  ≤ 1/81 (see Proposition A.1 in the appendix). So for constant ,

the upper bound in Proposition 3.4 is close to optimal as a function of k. It is slightly better

than those obtained from protocols for state splitting (see, e.g., Ref. [1, Corollary 5]), which

have an additive term of order log



. However, the protocol from Ref. [4] has entanglement

cost of order k(log



) log m, which is exponential in the communication cost, while the

protocol for state splitting with the least known communication cost [1, Corollary 5] has

entanglement cost of order (1 + 1/

) log(m/).

Next we consider how small the entanglement cost of the visible compression of an ensem-

ble (ρ

) given by Theorem 3.3 may be. By choosing the parameters in the statement of

Theorem 3.3 appropriately, we get the following lower bound on the sum of communication

and entanglement costs of any compression protocol.

Corollary 3.5. There exist universal constants c

, c

> 0 such that for any  ∈ (0, 1)

and any positive integers k, m, n with m and n divisible by k, there is an ensemble of n

equally likely quantum states in D(C

) of the form in Eq. (3.2) for which any (one-shot)

one-way protocol for compressing the states with average error at most



, the sum of the

communication and entanglement costs is at least

log m − 2 log

1 − 

− log ln

1 − 

− c

, (3.8)

when k ≥ 6/(1 − ), m ≥ c

(ln k)/(1 − )

, and

n ≥

(1 − )

√



In particular, the entanglement cost of any such protocol with optimal communication

cost is at least

log m −

log k −O



log

1 − 



− O(1) ,

and the communication cost of any such protocol without entanglement is at least the

bound given in Eq. (3.8).

We defer the proof of this corollary to the appendix.

Note that the parameter m may be chosen arbitrarily larger than k, provided the number

of states n in the ensemble is chosen large enough. Thus, we see that there are ensembles

with m-dimensional states for which communication-optimal compression protocols with

shared entanglement and with constant average error, say 1/4, have entanglement cost

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 17

almost as large as log m. In particular, the number of qubits of shared entanglement needed

may be arbitrarily larger than the quantum information cost of the original protocol.

We also see that in the absence of shared entanglement, there are ensembles with m-

dimensional states that cannot be compressed to states with dimension smaller than cm

with average error less than 1/4, where c is a universal positive constant. In particular,

the optimally compressed message may be arbitrarily longer than the quantum information

cost of the protocol Π.

Corollary 3.5 shows that the number of qubits of shared entanglement used by protocol

with the smallest known communication cost, due to Anshu and Jain [1, Corollary 5], is

optimal up to a constant multiplicative factor and an additive log k term (for constant

error in compression). The lower bound on entanglement cost given in the corollary may

be achieved by protocols derived from those for state splitting, up to an additive term

log k + O(1), again for constant error (see, e.g., Ref. [7, Lemma 3.3]). However, the

communication cost of these protocols may not be optimal.

The probabilistic construction in the results above gives us ensembles with a number

of states n that is polynomial in m and k. Note that in the compression protocol Π

Alice may send the input (i, j) as her message, in which case the message register has

dimension n. Similarly, she may send the state ρ

itself, and this has dimension m.

So in order to study how much compression is truly possible (i.e., how much smaller the

dimension of the message register may be as compared with m), we have to study ensembles

with n ≥ m states, and compression protocols with message registers with dimension at

most m. Further, consider any protocol Γ (similar to Π) in which Alice receives a random

input x out of n possibilities according to some distribution, prepares a state ω

and sends

it to Bob. The quantum information cost of such a protocol Γ is at most

log n. So

the polynomial dependence of n on the dimension of the states in the ensemble (m in the

construction above) and the exponential dependence of n on the quantum information cost

of the corresponding protocol (

log k in the construction) is inevitable.

4 Concluding remarks

In this article, we revisited one-shot compression of an ensemble of quantum states. We

proved that there are ensembles which cannot be compressed by more than a few qubits

in the absence of shared entanglement, when allowed constant error. In the presence of

shared entanglement, the ensemble can be compressed to many fewer qubits. However,

the entanglement cost may not be smaller than the number of qubits being compressed by

more than a constant, for constant error. Since we study compression protocols that are

allowed to make some error, the bounds we establish are robust to perturbations to the

shared entangled state that are suﬃciently small relative to the error.

Entanglement and quantum communication are distinct resources in the context of in-

formation processing. Sharing entanglement involves the generation, distribution, and

storage of a state that is independent of the input for the task at hand. Communication

also involves the same steps, but may be dynamic, i.e., may depend on the input and the

prior history of the communication protocol. Consequently, any physical implementation

of these resources is likely to incur diﬀerent costs for these steps. In this work, we focused

on the cost of distributing quantum states, and as a ﬁrst stab, assumed that the cost of

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 18

distribution for shared entanglement or for communication is proportional to the number of

qubits involved. Formally, this corresponds to the notion of smooth 0-Rényi entropy. The

motivation for this focus comes largely from the area of communication complexity [21], in

which the interaction between multiple processors takes centre stage, but shared entangle-

ment is often taken for granted. Our result shows that entanglement plays a crucial role in

important communication tasks and highlights the need for considering entanglement cost

in addition to communication cost.

A question of interest, from a theoretical perspective, is the degree or strength of entan-

glement required for diﬀerent information processing tasks. Several diﬀerent measures of

entanglement have been studied in the literature, depending on the context. Smooth 0-

Rényi entropy is a very coarse measure in this respect, as it may be the same for states

that are regarded as having widely diﬀerent degrees of entanglement. A natural question

is whether results such as the ones we derived also hold for other deﬁnitions of entangle-

ment cost that capture the degree of entanglement more satisfactorily. We conjecture that

analogous results hold also for other measures, and leave this to future work.

Many other questions surrounding compression remain open. For instance, we do not have

tight characterizations for the communication and entanglement costs of one-shot state

re-distribution. Even lesser is known for the one-shot compression of interactive quantum

protocols. Progress on these questions might hold the key to resolving important questions

in communication complexity as well.

References

[1] Anurag Anshu and Rahul Jain. Eﬃcient methods for one-shot quan-

tum communication. Technical Report arXiv:1809.07056 [quant-ph], arXiv.org,

https://arxiv.org/abs/1809.07056, September 2018.

[2] Anurag Anshu, Rahul Jain, Priyanka Mukhopadhyay, Ala Shayeghi, and Penghui

Yao. New one shot quantum protocols with application to communication com-

plexity. IEEE Transactions on Information Theory, 62(12):7566–7577, 2016. DOI:

10.1109/TIT.2016.2616125.

[3] Anurag Anshu, Dave Touchette, Penghui Yao, and Nengkun Yu. Exponential sep-

aration of quantum communication and classical information. In Proceedings of

the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017,

pages 277–288, New York, NY, USA, 2017. ACM. ISBN 978-1-4503-4528-6. DOI:

10.1145/3055399.3055401.

[4] Shima Bab Hadiashar, Ashwin Nayak, and Renato Renner. Communication complex-

ity of one-shot remote state preparation. IEEE Transactions on Information Theory,

64(7):4709–4728, July 2018. DOI: 10.1109/TIT.2018.2811509.

[5] Howard Barnum, Carlton M. Caves, Christopher A. Fuchs, Richard Jozsa, and Ben-

jamin Schumacher. On quantum coding for ensembles of mixed states. Journal

of Physics A: Mathematical and General, 34(35):6767–6785, August 2001. DOI:

10.1088/0305-4470/34/35/304.

[6] Charles H. Bennett, Igor Devetak, Aram W. Harrow, Peter W. Shor, and Andreas

Winter. The quantum reverse Shannon theorem and resource tradeoﬀs for simulating

quantum channels. IEEE Transactions on Information Theory, 60(5):2926–2959, May

2014. ISSN 0018-9448. DOI: 10.1109/TIT.2014.2309968.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 19

[7] Mario Berta, Matthias Christandl, and Renato Renner. The quantum reverse Shannon

theorem based on one-shot information theory. Communications in Mathematical

Physics, 306(3):579–615, August 2011. ISSN 1432-0916. DOI: 10.1007/s00220-011-

1309-7.

[8] Mario Berta, Matthias Christandl, and Dave Touchette. Smooth entropy bounds on

one-shot quantum state redistribution. IEEE Transactions on Information Theory,

62(3):1425–1439, March 2016. DOI: 10.1109/TIT.2016.2516006.

[9] Igor Devetak. Triangle of dualities between quantum communication protocols. Phys-

ical Review Letters, 97:140503, Oct 2006. DOI: 10.1103/PhysRevLett.97.140503.

[10] Igor Devetak and Jon Yard. Exact cost of redistributing multipartite quantum

states. Physical Review Letters, 100(230501), June 2008. DOI: 10.1103/Phys-

RevLett.100.230501.

[11] Christopher A. Fuchs and Jeroen van de Graaf. Cryptographic distinguishability

measures for quantum-mechanical states. IEEE Transactions on Information Theory,

45(4):1216–1227, May 1999. ISSN 1557-9654. DOI: 10.1109/18.761271.

[12] Alexei Gilchrist, Nathan K. Langford, and Michael A. Nielsen. Distance measures to

compare real and ideal quantum processes. Physical Review A, 71:062310, Jun 2005.

DOI: 10.1103/PhysRevA.71.062310.

[13] Carl W. Helstrom. Detection theory and quantum mechanics. Information and Con-

trol, 10(3):254–291, 1967. DOI: 10.1016/S0019-9958(67)90302-6.

[14] Alexander S. Holevo. An analogue of statistical decision theory and noncommutative

probability theory. Trudy Moskovskogo Matematicheskogo Obshchestva, 26:133–149,

1972.

[15] Michał Horodecki, Jonathan Oppenheim, and Andreas Winter. Partial quantum in-

formation. Nature, 436(7051):673–676, August 2005. DOI: 10.1038/nature03909.

[16] Michał Horodecki, Jonathan Oppenheim, and Andreas Winter. Quantum state merg-

ing and negative information. Communications in Mathematical Physics, 269(1):107–

136, January 2007. DOI: 10.1007/s00220-006-0118-x.

[17] Rahul Jain, Jaikumar Radhakrishnan, and Pranab Sen. A direct sum theorem in

communication complexity via message compression. In Jos C. M. Baeten, Jan Karel

Lenstra, Joachim Parrow, and Gerhard J. Woeginger, editors, Automata, Languages

and Programming, volume 2719 of Lecture Notes in Computer Science, pages 300–315,

Berlin, Heidelberg, 2003. Springer Berlin Heidelberg. ISBN 978-3-540-45061-0. DOI:

10.1007/3-540-45061-0_26.

[18] Rahul Jain, Jaikumar Radhakrishnan, and Pranab Sen. Prior entanglement, message

compression and privacy in quantum communication. In Proceedings of the 20th An-

nual IEEE Conference on Computational Complexity, pages 285–296. IEEE Computer

Society, 2005. DOI: 10.1109/CCC.2005.24.

[19] Rahul Jain, Pranab Sen, and Jaikumar Radhakrishnan. Optimal direct sum and pri-

vacy trade-oﬀ results for quantum and classical communication complexity. Technical

Report arXiv:0807.1267v1 [cs.DC], arXiv.org, https://arxiv.org/abs/0807.1267,

July 2008.

[20] Zahra B. Khanian and Andreas Winter. Entanglement-assisted quantum data com-

pression. In 2019 IEEE International Symposium on Information Theory (ISIT),

pages 1147–1151, 2019. DOI: 10.1109/ISIT.2019.8849352.

[21] Troy Lee and Adi Shraibman. Lower bounds in communication complexity. Founda-

tions and Trends in Theoretical Computer Science, 3(4):263–399, 2009. ISSN 1551-

305X. DOI: 10.1561/0400000040.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 20

[22] Zi-Wen Liu, Christopher Perry, Yechao Zhu, Dax Enshan Koh, and Scott Aaronson.

Doubly inﬁnite separation of quantum information and communication. Phys. Rev.

A, 93:012347, Jan 2016. DOI: 10.1103/PhysRevA.93.012347.

[23] Zhicheng Luo and Igor Devetak. Channel simulation with quantum side information.

IEEE Transactions on Information Theory, 55(3):1331–1342, March 2009. ISSN 0018-

9448. DOI: 10.1109/TIT.2008.2011424.

[24] Jiří Matoušek. Lectures on Discrete Geometry, volume 212 of Graduate Texts in

Mathematics. Springer-Verlag New York, 1st edition, 2002. ISBN 978-0-387-95373-1.

DOI: 10.1007/978-1-4613-0039-7.

[25] Elizabeth S. Meckes. The Random Matrix Theory of the Classical Compact Groups,

volume 218 of Cambridge Tracts in Mathematics. Cambridge University Press, July

2019. DOI: 10.1017/9781108303453.

[26] Vitali D. Milman and Gideon Schechtman. Asymptotic Theory of Finite Dimensional

Normed Spaces, volume 1200 of Lecture notes in mathematics. Springer-Verlag Berlin

Heidelberg, 1986. DOI: 10.1007/978-3-540-38822-7.

[27] Marco Tomamichel, Roger Colbeck, and Renato Renner. Duality between smooth

min- and max-entropies. IEEE Transactions on Information Theory, 56(9):4674–4681,

September 2010. ISSN 0018-9448, 1557-9654. DOI: 10.1109/TIT.2010.2054130.

[28] Dave Touchette. Quantum information complexity. In Proceedings of the Forty-

seventh Annual ACM Symposium on Theory of Computing, STOC ’15, pages

317–326, New York, NY, USA, 2015. ACM. ISBN 978-1-4503-3536-2. DOI:

10.1145/2746539.2746613.

[29] John Watrous. The Theory of Quantum Information. Cambridge University Press,

May 2018. DOI: 10.1017/9781316848142.

[30] Jon T. Yard and Igor Devetak. Optimal quantum source coding with quantum side

information at the encoder and decoder. IEEE Transactions on Information Theory,

55(11):5339–5351, November 2009. ISSN 0018-9448. DOI: 10.1109/TIT.2009.2030494.

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 21

A Proofs of some claims

In this section, we include the proofs of some statements from the main body of the article.

Proof of Corollary 3.5: We invoke Theorem 3.3 with  ∈ (0, 1), ν = /2, β = 1/2

and k, m, n satisfying the conditions stated in the corollary. Then γ as in Theorem 3.3

equals (1 − )

/(8 ×768). We take c

= (24 ×768) + 1, so that m > (3/γ) ln k. Since k ≥

6/(1 − ), we have k > 6 > 2e = e/(1 − β), and m > (3/γ) ln(e/(1 − β)). We take c

(6 × 2 × 8 × 768) + 1 so that n > (6km

/γ(1 − β)) ln(8

√

m/ν).

Now we consider an ensemble (ρ

) given by Theorem 3.3. Let Π

be any one-way proto-

col, possibly with shared entanglement, for the visible compression of the ensemble (ρ

)

with average error at most /2. Following the notation from Section 2.2, suppose that

Bob holds registers M E

just after he receives the message M from Alice in Π

. If the

entanglement cost of Π

is e, we may assume that the register E

may be partitioned

into sub-registers E

with |E

| = e, and that the state of register E

is of the

form ω ⊗ |0ih0|, where E

is in state ω and E

in state |0ih0|, and |0i is a pure state.

(We may achieve this by applying a suitable isometry to register E

Let d

= |M E

|, so that the sum of the communication and entanglement costs of Π

is log d, and let σ

be the state of the registers ME

with Bob when Alice is given

input (i, j). If d ≥ m, the bound in Eq. (3.8) holds, so consider the case when d < m.

Then the choice of n above implies that n > (6kd

m/γ(1 − β)) ln(8

√

d/ν).

Since the average error of Π

is at most /2, by the Markov Inequality we have





(i, j) : kρ

− Ψ(σ

> 





= βn ,

where Ψ is the quantum channel corresponding to Bob’s decompression operation in Π

Theorem 3.3 then implies that

2 +



1 − /2 − ν



≥ m .

Since m −2 ≥ m/2, this gives us the bound stated in Eq. (3.8) with c

= log(16 ×768).

Proposition A.1. Let (ρ

) be an ensemble of the form in Eq. (3.2), and let the state τ

be deﬁned as

|ijihij|

⊗ ρ

. For any ζ ∈ [0, 1/8), we have

max

(A : B)

≥ log k − log



3 − 12ζ

1 − 8ζ



Proof: As shown in Ref. [4, Proposition II.5], there is a classical-quantum state τ

within

puriﬁed distance ζ of τ such that I

max

(A : B)

= I

max

(A : B)

. Let τ

|ijihij| ⊗

By Proposition 2.4, we have



τ − τ



≤ 2ζ . (A.1)

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 22

Let ξ

= 2ζ. By monotonicity of trace distance under measurements [29, Proposition 3.5],

we further get

− p

| ≤ ξ .

If q

> 3/2n or q

< 1/2n, we have |q

− p

| > 1/2n. So for at least (1−2ξ)n pairs (i, j),

we have 1/2n ≤ q

≤ 3/2n, and we call such pairs (i, j) typical.

Eq. (A.1) may be written as

− p

≤ ξ ,

so, by monotonicity of trace distance,

|vi∈B



hv|

|vi−



≤ ξ ,

where B

is as in the deﬁnition of the ensemble (ρ

). In particular,

typical ij

|vi∈B



hv|

|vi−



≤ ξ . (A.2)

There are at least (1 − 2ξ)n/k indices i ∈ [n/k] such that there is a typical pair (i, j) for

some j ∈ [k]. Let S be the set of such indices i. Let η ∈ (0, 1). If for all indices i ∈ S,

there are less than (1 − η)m pairs (j, v) with (i, j) typical, |vi ∈ B

, and

2nm

≤ q

hv|

|vi ≤

2nm

, (A.3)

then we would have

typical ij

|vi∈B



hv|

|vi−



> (1 − 2ξ)

× ηm ×

2nm

= (1 − 2ξ)

Taking η

= 2ξ/(1 − 2ξ), we see that this is in contradiction with Eq. (A.2). So there is

an index i ∈ S such that there are at least (1 − η)m pairs (j, v) with j ∈ [k] and |vi ∈ B

such that (i, j) is typical, and (i, j, v) satisfy Eq. (A.3). Denote such an index i by i

, and

let

(j, v) : j ∈ [k], |vi ∈ B

, (i

, j) typical , (i, j, v) satisfy Eq. (A.3)

We have that for all the pairs (j, v) ∈ T ,

2nm

≤ q

hv|

|vi ≤

hv|

|vi ,

so that

≤ hv|

|vi . (A.4)

Let σ ∈ D(C

) be a state that achieves I

max

(A : B)

, and let λ denote this max-

information. For typical pairs (i, j), since q

> 0, we have

≤ 2

σ. By Eq. (A.4), we

also have k/3m ≤ 2

hv|σ|vi for all pairs (j, v) ∈ T . Summing up over all pairs (j, v) ∈ T ,

we get (1 −η)k/3 ≤ 2

, as the sets B

are a partition of an orthonormal basis, and σ has

trace at most 1. So λ ≥ log k −log(3/(1 − η)).

Accepted in Quantum 2020-06-10, click title to verify. Published under CC-BY 4.0. 23

Comments

Products

Project