Fermat's Library | Sandwiched Renyi Convergence for Quantum Evolutions annotated/explained version.

Sandwiched R

´

enyi Convergence

for Quantum Evolutions

Alexander M

¨

uller-Hermes

1

and Daniel Stilck Franc¸a

2

1

Department of Mathematical Sciences, University of Copenhagen, 2100 Copenhagen, Denmark

2

Department of Mathematics, Technische Universit

¨

at M

¨

unchen, 85748 Garching, Germany

February 27, 2018

We study the speed of convergence of a primitive quantum time evolution

towards its ﬁxed point in the distance of sandwiched R´enyi divergences. For

each of these distance measures the convergence is typically exponentially fast

and the best exponent is given by a constant (similar to a logarithmic Sobolev

constant) depending only on the generator of the time evolution. We establish

relations between these constants and the logarithmic Sobolev constants as

well as the spectral gap. An important consequence of these relations is the

derivation of mixing time bounds for time evolutions directly from logarithmic

Sobolev inequalities without relying on notions like l

p

-regularity. We also derive

strong converse bounds for the classical capacity of a quantum time evolution

and apply these to obtain bounds on the classical capacity of some examples,

including stabilizer Hamiltonians under thermal noise.

Alexander M

¨

uller-Hermes: muellerh@posteo.net

Daniel Stilck Franc¸a: dsfranca@mytum.de

Accepted in Quantum 2018-02-01, click title to verify 1

arXiv:1607.00041v3 [quant-ph] 26 Feb 2018

Contents

1 Introduction 3

2 Notation and Preliminaries 4

2.1 Noncommutative l

p

-spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2.2 Quantum dynamical semigroups . . . . . . . . . . . . . . . . . . . . . . . . 6

2.3 Logarithmic Sobolev inequalities and the spectral gap . . . . . . . . . . . . 7

3 Convergence rates for sandwiched R´enyi divergences 9

3.1 R´enyi-entropy production . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

3.2 Sandwiched R´enyi convergence rates . . . . . . . . . . . . . . . . . . . . . . 11

3.3 Computing β

p

in simple cases . . . . . . . . . . . . . . . . . . . . . . . . . . 12

4 Comparison with similar quantities 13

4.1 Comparison with spectral gap . . . . . . . . . . . . . . . . . . . . . . . . . . 13

4.2 Comparison with logarithmic Sobolev constants . . . . . . . . . . . . . . . . 15

5 Mixing times 16

5.1 Mixing in Continuous Time . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

5.2 Mixing in Discrete Time . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19

6 Strong converse bounds for the classical capacity 21

7 Examples of bounds for the classical capacity of Semigroups 23

7.1 Depolarizing Channels . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

7.2 Stabilizer Hamiltonians . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

7.3 2D Toric Code . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

7.4 Truncated harmonic oscillator . . . . . . . . . . . . . . . . . . . . . . . . . . 26

8 Conclusion and open questions 28

Acknowledgments 29

A Taylor Expansion of the Dirichlet Form 29

B Interpolation Theorems and Proof of Theorem 5.3 31

References 33

Accepted in Quantum 2018-02-01, click title to verify 2

1 Introduction

Consider a quantum system aﬀected by Markovian noise modeled by a quantum dynamical

semigroup T

t

(with time parameter t ∈

+

) driving every initial state towards a unique

full rank state σ. Using the framework of logarithmic Sobolev inequalities as introduced

in [1, 2] the speed of the convergence towards the ﬁxed point can be studied. Speciﬁcally,

the α

1

-logarithmic Sobolev constant (see [1, 2]) is the optimal exponent α ∈

+

such that

the inequality

D(T

t

(ρ)kσ) ≤ e

−2αt

D (ρkσ) (1)

holds for the quantum Kullback-Leibler divergence, given by D (ρkσ) = tr [ρ(ln(ρ) − ln(σ))],

for all t ∈

+

and all states ρ.

The framework of logarithmic Sobolev constants is closely linked to properties of non-

commutative l

p

-norms, and speciﬁcally to hypercontractivity [1, 2]. Noncommutative l

p

-

norms also appeared recently in the deﬁnition of generalized R´enyi divergences (so called

“sandwiched R´enyi divergences” [3, 4]). It is therefore natural to study the relationship

between logarithmic Sobolev inequalities and noncommutative l

p

-norms more closely. The

approach used here is to deﬁne constants (which we call β

p

for a parameter p ∈ [1, ∞)),

which resemble the logarithmic Sobolev constants, but where the distance measure is a

sandwiched R´enyi divergence instead of the quantum Kullback-Leibler divergence. More

speciﬁcally, the constants β

p

will be the optimal exponents such that inequalities of the

form (1) hold for the sandwiched R´enyi divergences D

p

, given by

D

p

(ρkσ) =











1

p−1

ln



tr



σ

1−p

2p

ρσ

1−p

2p



p



if ker (σ) ⊆ ker (ρ) or p ∈ (0, 1)

+∞, otherwise,

(2)

instead of the quantum Kullback-Leibler divergence D.

Our main results are two-fold:

• We derive inequalities between the new β

p

and other quantities such as logarithmic

Sobolev constants and the spectral gap of the generator of the time evolution. These

inequalities not only reveal basic properties of the β

p

, but can also be used as a

technical tool to strengthen results involving logarithmic Sobolev constants.

• We apply our framework to derive bounds on the mixing time of quantum dynam-

ical semigroups. Using the interplay between the β

p

and the logarithmic Sobolev

constants we show how to derive a mixing time bound with the same scaling as that

of the one derived in [2] directly from a logarithmic Sobolev constant. Previously,

this was only known under the additional assumption of l

p

-regularity (see [2]) of the

generator or for the α

1

-logarithmic Sobolev constant. It is still an open question

whether l

p

-regularity holds for all primitive generators.

As an additional application of our methods we derive time-dependent strong converse

bounds on the classical capacity of a quantum dynamical semigroup. We apply these to

some examples of systems under thermal noise. These include stabilizer Hamiltonians,

such as the 2D toric code, and a truncated harmonic oscillator. To the best of our

knowledge, these are the ﬁrst bounds available on the classical capacity of these channels.

We also apply our bound to depolarizing channels, whose classical capacity is known [5],

to benchmark our ﬁndings.

Accepted in Quantum 2018-02-01, click title to verify 3

2 Notation and Preliminaries

Throughout this paper M

d

will denote the space of d × d complex matrices. We will

denote by D

d

the set of d-dimensional quantum states, i.e. positive semi-deﬁnite matrices

ρ ∈ M

d

with trace 1. By M

+

d

we denote the set of positive deﬁnite matrices and by

D

+

d

= M

+

d

∩ D

d

the set of full rank states.

In [3, 4] the following deﬁnition of sandwiched quantum R´enyi divergences was pro-

posed:

Deﬁnition 2.1 (Sandwiched p-R´enyi divergence). Let ρ, σ ∈ D

d

. For p ∈ (0, 1) ∪ (1, ∞),

the sandwiched p-R´enyi divergence is deﬁned as:

D

p

(ρkσ) =











1

p−1

ln



tr



σ

1−p

2p

ρσ

1−p

2p



p



if ker (σ) ⊆ ker (ρ) or p ∈ (0, 1)

+∞, otherwise

(3)

where ker (σ) is the kernel of σ.

Note that we are using a diﬀerent normalization than in [3, 4], which is more convenient

for our purposes. The logarithm in our deﬁnition is in base e, while theirs is in base 2.

When we write log in later sections we will mean the logarithm in base 2.

Taking the limit p → 1 gives the usual quantum Kullback-Leibler divergence [6]

lim

p→1

D

p

(ρkσ) = D (ρkσ) :=

(

tr[ρ (ln(ρ) − ln(σ))] if ker (σ) ⊆ ker (ρ)

+∞, otherwise

.

Similarly by taking the limit p → ∞ we obtain the max-relative entropy [3, Theorem 5]

lim

p→∞

D

p

(ρkσ) = D

∞

(ρkσ) = ln



kσ

−

1

2

ρσ

−

1

2

k

∞



.

The sandwiched R´enyi divergences increase monotonically in the parameter p ≥ 1 (see [7,

Theorem 7]) and we have

D (ρkσ) = D

1

(ρkσ) ≤ D

p

(ρkσ) ≤ D

q

(ρkσ) ≤ D

∞

(ρkσ) . (4)

for any q ≥ p ≥ 1 and all ρ, σ ∈ D

d

. Next we state two simple consequences of this

ordering, which will be useful later.

Lemma 2.1. For σ ∈ D

+

d

and p ∈ [1, +∞)

sup

ρ∈D

d

D

p

(ρkσ) = ln



kσ

−1

k

∞



. (5)

Proof. Using (4) for ρ ∈ D

d

we have

D

p

(ρkσ) ≤ D

∞

(ρkσ) = ln



kσ

−

1

2

ρσ

−

1

2

k

∞



≤ ln



kσ

−1

k

∞



.

Here we used that any quantum state ρ ∈ D

d

fulﬁlls ρ ≤ 1

d

. Clearly, choosing ρ =

|v

min

ihv

min

| for an eigenvector |v

min

i ∈

d

corresponding to the eigenvalue kσ

−1

k

∞

of σ

−1

achieves equality in the previous bound.

Accepted in Quantum 2018-02-01, click title to verify 4

Using (4) together with the well-known Pinsker inequality [8, Theorem 3.1] for the

quantum Kullback-Leibler divergence we have

1

2

kσ − ρk

2

1

≤ D (ρkσ) ≤ D

p

(ρkσ) (6)

for any p ≥ 1 and all ρ, σ ∈ D

d

. The constant

1

2

has been shown to be optimal in the

classical case (see [9]), i.e. restricting to ρ that commute with σ, and is therefore also

optimal here.

2.1 Noncommutative l

p

-spaces

In the following σ ∈ D

+

d

will denote a full rank reference state. For p ≥ 1 we deﬁne the

noncommutative p-norm with respect to σ as

kXk

p,σ

=



tr





σ

1

2p

Xσ

1

2p



p



1

p

(7)

for any X ∈ M

d

. The space (M

d

, k · k

p,σ

) is called a (weighted) noncommutative l

p

-

space. For a linear map Φ : M

d

→ M

d

and p, q ≥ 1 we deﬁne the noncommutative

p → q-norm with respect to σ as

kΦk

p→q,σ

= sup

Y ∈M

d

kΦ(Y )k

q,σ

kY k

p,σ

.

We introduce the weighting operator Γ

σ

: M

d

→ M

d

as

Γ

σ

(X) = σ

1

2

Xσ

1

2

.

For powers of the weighting operator we set

Γ

p

σ

(X) = σ

p

2

Xσ

p

2

for p ∈ and X ∈ M

d

. We deﬁne the so called power operator I

p,q

: M

d

→ M

d

as

I

p,q

(X) = Γ

−

1

p

σ



Γ

1

q

σ

(X)



q

p

!

(8)

for X ∈ M

d

. It can be veriﬁed that

kI

p,q

(X)k

p

p,σ

= kXk

q

q,σ

for any X ∈ M

d

. As in the commutative theory, the noncommutative l

2

-space turns out

to be a Hilbert space, where the weighted scalar product is given by

hX, Y i

σ

= tr

h

Γ

σ



X

†



Y

i

(9)

for X, Y ∈ M

d

. With the above notions we can express the sandwiched p-R´enyi divergence

(3) for p > 1 in terms of a noncommutative l

p

-norm as

D

p

(ρkσ) =

1

p − 1

ln



kΓ

−1

σ

(ρ) k

p

p,σ



. (10)

For a state ρ ∈ D

d

the positive matrix Γ

−1

σ

(ρ) ∈ M

d

is called the relative density of ρ

with respect to σ. Note that any X ≥ 0 with kXk

1,σ

= 1 can be written as X = Γ

−1

σ

(ρ)

for some state ρ ∈ D

d

. We will simply call operators X ≥ 0 that satisfy kXk

1,σ

= 1

relative densities when the reference state is clear.

We refer to [1, 2] and references therein for proofs and more details about the concepts

introduced in this section.

Accepted in Quantum 2018-02-01, click title to verify 5

2.2 Quantum dynamical semigroups

A family of quantum channels, i.e. trace-preserving completely positive maps, {T

t

}

t∈

+

0

,

T

t

: M

d

→ M

d

, parametrized by a non-negative parameter t ∈

+

0

is called a quantum

dynamical semigroup if T

0

= id

d

(the identity map in d dimensions), T

t+s

= T

t

◦T

s

for

any s, t ∈

+

0

and T

t

depends continuously on t. Any quantum dynamical semigroup can

be written as T

t

= e

tL

(see [10, 11]) for a Liouvillian L : M

d

→ M

d

of the form

L(X) = S(X) − κX −Xκ

†

,

where κ ∈ M

d

and S : M

d

→ M

d

is completely positive such that S

∗

(1

d

) = κ+κ

†

, where

S

∗

is the adjoint of S with respect to the Hilbert-Schmidt scalar product. We will also

deal with tensor powers of semigroups. For a quantum dynamical semigroup {T

t

}

t∈

+

with Liouvillian L we denote by L

(n)

the Liouvillian of the quantum dynamical semigroup

{T

⊗n

t

}

t∈

+

.

In the following we will consider quantum dynamical semigroups having a full rank

ﬁxed point σ ∈ D

+

d

, i.e. the Liouvillian generating the semigroup fulﬁlls L(σ) = 0 (implying

that e

tL

(σ) = σ for any time t ∈

+

0

). We call a quantum dynamical semigroup (or the

Liouvillian generator) primitive if it has a unique full rank ﬁxed point σ. In this case for

any initial state ρ ∈ D

d

we have ρ

t

= e

tL

(ρ) → σ as t → ∞ (see [12, Theorem 14]).

The notion of primitivity can also be deﬁned for discrete semigroups of quantum chan-

nels. For a quantum channel T : M

d

→ M

d

we will sometimes consider the discrete

semigroup {T

n

}

n∈

. Similar to the continuous case we will call this semigroup (or the

channel T ) primitive if there is a unique full rank state σ ∈ D

+

d

with lim

n→∞

T

n

(ρ) = σ for

any ρ ∈ D

d

. We refer to [12] for other characterizations of primitive channels and suﬃcient

conditions for primitivity.

To study the convergence of a primitive semigroup to its ﬁxed point σ we introduce the

time evolution of the relative density X

t

= Γ

−1

σ

(ρ

t

). For any Liouvillian L : M

d

→ M

d

with full rank ﬁxed point σ ∈ D

+

d

deﬁne

ˆ

L = Γ

−1

σ

◦ L ◦ Γ

σ

(11)

to be the generator of the time evolution of the relative density. Indeed it can be checked

that

X

t

= Γ

−1

σ



e

tL

(ρ)



= e

t

ˆ

L

(X)

for any state ρ ∈ D

d

and relative density X = Γ

−1

σ

(ρ). Note that kX

t

k

1,σ

= kXk

1,σ

= 1

for all t ∈

+

0

. Clearly the semigroup generated by

ˆ

L is completely positive and unital,

but it is not trace-preserving in general. In the special case where

Γ

−1

σ

◦ L ◦ Γ

σ

= L

∗

, (12)

the map

ˆ

L generates the adjoint of the initial semigroup, i.e. the corresponding time

evolution in Heisenberg picture. A semigroup fulﬁlling (12) is called reversible (or said

to fulﬁll detailed balance), and in this case the Liouvillian

ˆ

L is a Hermitian operator

w.r.t. the σ-weighted scalar product. We again refer to [2, 1] for more details on these

topics. For discrete semigroups we similarly set

ˆ

T = Γ

−1

σ

◦ T ◦ Γ

σ

.

One important class of semigroups are Davies generators, which describe a system

weakly coupled to a thermal bath under an appropriate approximation [13]. Describing

Accepted in Quantum 2018-02-01, click title to verify 6

them in detail goes beyond the scope of this article and here we will only review their

most basic properties. We refer to [14, 15, 16] for more details.

Suppose that we have a system of dimension d weakly coupled to a thermal bath

of dimension d

B

at inverse inverse temperature β > 0. Consider a Hamiltonian H

tot

∈

M

d

⊗ M

d

B

of the system and the bath of the form

H

tot

= H ⊗ 1

B

+ 1

S

⊗ H

B

+ H

I

,

where H ∈ M

d

is the Hamiltonian of the system, H

B

∈ M

d

B

of the bath and

H

I

=

X

α

S

α

⊗ B

α

∈ M

d

⊗ M

d

B

(13)

describes the interaction between the system and the bath. Here the operators S

α

and

B

α

are self-adjoint. Let {λ

k

}

k∈[d]

be the spectrum of the Hamiltonian H. We then

deﬁne the Bohr-frequencies ω

i,j

to be given by the diﬀerences of eigenvalues of H, that

is, ω

i,j

= λ

i

− λ

j

for diﬀerent values of λ. We will drop the indices on ω from now on to

avoid cumbersome notation, as is usually done. Moreover, we introduce operators S

α

(ω)

which are the Fourier components of the coupling operators S

α

and satisfy

e

iHt

S

α

e

−iHt

=

X

ω

S

α

(ω)e

iωt

.

The canonical form of the Davies generator at inverse temperature β > 0 in the Heisenberg

picture, L

∗

β

, is then given by

L

∗

β

(X) = i[H, X] +

X

ω,α

L

∗

ω,α

(X),

where

L

∗

ω,α

(X) = G

α

(ω)



S

α

(ω)

†

XS

α

(ω) −

1

2

{S

α

(ω)

†

S

α

(ω), X}



.

Here {X, Y } = XY + Y X is the anticommutator and G

α

: → are the transition rate

functions. Their form depends on the choice of the bath model [15]. For our purposes it will

be enough to assume that these are functions that satisfy the KMS condition [17], that is,

G

α

(−ω) = G

α

(ω)e

−βω

. Although this presentation of the Davies generators is admittedly

very short, for our purposes it will be enough to note that under some assumptions on

the operators S

α

(ω) [18, 19] and on the transition rate functions, the semigroup generated

by L

β

converges to the thermal state

e

−βH

tr

(

e

−βH

)

and is reversible [17]. In the examples

considered here this will always be the case.

2.3 Logarithmic Sobolev inequalities and the spectral gap

To study hypercontractive properties and convergence times of primitive quantum dy-

namical semigroups the framework of logarithmic Sobolev inequalities has been developed

in [1, 2]. Here we will brieﬂy introduce this theory. For more details and proofs see [1, 2]

and the references therein.

We deﬁne the operator valued relative entropy (for p > 1) of X ∈ M

+

d

as

S

p

(X) = −p

d

ds

I

p+s,p

(X) |

s=0

. (14)

With this we can deﬁne the p-relative entropy:

Accepted in Quantum 2018-02-01, click title to verify 7

Deﬁnition 2.2 (p-relative entropy). For any full rank σ ∈ M

+

d

and p > 1 we deﬁne the

p-relative entropy of X ∈ M

+

d

as

Ent

p,σ

(X) = hI

q,p

(X) , S

p

(X)i

σ

− kXk

p

p,σ

ln (kXk

p,σ

) , (15)

where

1

q

+

1

p

= 1. For p = 1 we can consistently deﬁne

Ent

1,σ

(X) = tr[Γ

σ

(X) (ln (Γ

σ

(X)) − ln(σ))].

by taking the limit p → 1.

The p-relative entropy is not a divergence in the information-theoretic sense (e.g. it

is not contractive under quantum channels). It was originally introduced to study hyper-

contractive properties of semigroups in [1], where they also show it is positive for positive

operators. There is however a connection to the quantum relative entropy as

Ent

p,σ



I

p,1



Γ

−1

σ

(ρ)



=

1

p

D (ρkσ) .

As a special case of the last equation we have

Ent

1,σ



Γ

−1

σ

(ρ)



= D(ρkσ).

We may also use it to obtain an expression for Ent

2,σ

:

Ent

2,σ

(X) = tr







Γ

1

2

σ

(X)



2

ln





Γ

1

2

σ

(X)

kXk

2,σ









−

1

2

tr

"



Γ

1

2

σ

(X)



2

ln(σ)

#

.

We also need Dirichlet forms to deﬁne logarithmic Sobolev inequalities:

Deﬁnition 2.3 (Dirichlet form). Let L : M

d

→ M

d

be a Liouvillian with full rank ﬁxed

point σ ∈ D

+

d

. For p > 1 we deﬁne the p-Dirichlet form of X ∈ M

+

d

as

E

L

p

(X) = −

p

2(p − 1)

D

I

q,p

(X),

ˆ

L(X)

E

σ

where

1

p

+

1

q

= 1 and

ˆ

L = Γ

−1

σ

◦ L ◦ Γ

σ

denotes the generator of the time evolution of the

relative density (cf. (11)). For p = 1 we may take the limit p → 1 and consistently deﬁne

the 1-Dirichlet form by

E

L

1

(X) = −

1

2

tr

h

Γ

σ



ˆ

L(X)



(ln (Γ

σ

(X)) − ln(σ))

i

.

Formally, by making this choice we introduce the logarithmic Sobolev framework for

ˆ

L (i.e. the generator of the time-evolution of the relative density) instead of L

∗

. While

this is a slightly diﬀerent deﬁnition compared to [2], where the Heisenberg picture is used,

they are the same for reversible Liouvillians.

In [1] the Dirichlet forms were introduced to study hypercontractive properties of

semigroups. As we will see in Theorem 3.1, they appear naturally when we compute the

entropy production of the Sandwiched R´enyi divergences. From Corollary 3.1 we will be

able to infer that the Dirichlet form is positive for positive operators, a fact already proved

in [1]. Both the Ent

p,σ

and the Dirichlet form are intimately related to hypercontractive

Accepted in Quantum 2018-02-01, click title to verify 8

properties of semigroups, as we have for a relative density X, some constant α > 0 and

p(t) = 1 + e

2αt

that

d

dt

ln



kX

t

k

p(t),σ



=

αe

αt

(1 + e

αt

) kX

t

k

p(t)

p(t),σ



Ent

p(t),σ

(X

t

) −

1

α

E

p(t)

(X

t

)



,

as shown in [1].

Notice that when working with E

L

2

we may always suppose the Liouvillian is reversible

without loss of generality. This follows from the fact that

E

L

2

(X) = −

D

X,

ˆ

L(X)

E

σ

is invariant under the additive symmetrization

ˆ

L 7→

1

2



L

∗

+ Γ

−1

σ

◦ L ◦ Γ

σ



for X ≥ 0.

We can now introduce the logarithmic Sobolev constants:

Deﬁnition 2.4 (Logarithmic Sobolev constants). For a Liouvillian L : M

d

→ M

d

with

full rank ﬁxed point σ ∈ D

+

d

and p ≥ 1 the p-logarithmic Sobolev constant is deﬁned

as

α

p

(L) = sup{α ∈

+

: αEnt

p,σ

(X) ≤ E

L

p

(X) for all X > 0} (16)

As Ent

2,σ

does not depend on L and, as remarked before, E

L

2

is invariant under an addi-

tive symmetrization, we may always assume without loss of generality that the Liouvillian

is reversible when working with α

2

.

For any X ∈ M

+

d

we can deﬁne its variance with respect to σ ∈ D

+

d

as

Var

σ

(X) = kXk

2

2,σ

− kXk

2

1,σ

. (17)

This deﬁnes a distance measure to study the convergence of the semigroup. Given a

Liouvillian L : M

d

→ M

d

with ﬁxed point σ ∈ D

+

d

we deﬁne its spectral gap as

λ(L) = sup

n

λ ∈

+

: λVar

σ

(X) ≤ E

L

2

(X) for all X > 0

o

(18)

where

ˆ

L : M

d

→ M

d

is given by (11). We can always assume the Liouvillian to be

reversible when dealing with the spectral gap, as it again depends on E

L

2

.

The spectral gap can be used to bound the convergence in the variance (see [20]), as

for any X ∈ M

+

d

we have

d

dt

Var

σ

(X

t

) = 2

D

ˆ

L(X) , X

E

σ

(19)

and so

Var

σ

(X

t

) ≤ e

−2λt

Var

σ

(X) . (20)

3 Convergence rates for sandwiched R

´

enyi divergences

In this section we consider the sandwiched R´enyi divergences of a state evolving under

a primitive quantum dynamical semigroup and the ﬁxed point of this semigroup. It is

clear that these quantities converge to zero as the time-evolved state approaches the ﬁxed

point. To study the speed of this convergence we introduce a diﬀerential inequality, which

can be seen as an analogue of the logarithmic Sobolev inequalities for sandwiched R´enyi

divergences.

Accepted in Quantum 2018-02-01, click title to verify 9

3.1 R

´

enyi-entropy production

In [18] the entropy production for the quantum Kullback-Leibler divergence of a Liouvillian

was computed. We will now derive a similar expression for the entropy production for the

p-R´enyi divergences for p > 1.

Theorem 3.1 (Derivative of the sandwiched p-R´enyi divergence). Let L : M

d

→ M

d

be

a Liouvillian with full rank ﬁxed point σ ∈ D

+

d

. For any ρ ∈ D

d

and p > 1 we have

d

dt

D

p

(e

tL

(ρ)kσ)



t=0

=

p

p − 1

tr

"



σ

1−p

2p

ρσ

1−p

2p



p−1

σ

1−p

2p

L(ρ) σ

1−p

2p

#

tr



σ

1−p

2p

ρσ

1−p

2p



p



. (21)

Using the relative density X = Γ

−1

σ

(ρ) and (11) this expression can be written as:

d

dt

D

p

(e

tL

(ρ)kσ)



t=0

=

p

p − 1

kXk

−p

p,σ

D

I

q,p

(X),

ˆ

L(X)

E

σ

(22)

with

1

p

+

1

q

= 1.

Proof. Rewriting the p-R´enyi divergence in terms of the relative density X = Γ

−1

σ

(ρ) and

the corresponding generator

ˆ

L = Γ

−1

σ

◦ L ◦ Γ

σ

(see (11)) we have

D

p

(e

tL

(ρ)kσ) =

1

p − 1

ln



ke

t

ˆ

L

(X) k

p

p,σ



. (23)

By the chain rule

d

dt

D

p

(e

tL

ρkσ)



t=0

=

1

p − 1

kXk

−p

p,σ



d

dt

ke

t

ˆ

L

(X)k

p

p,σ





t=0

.

Deﬁne the curve γ :

+

0

→ M

d

as γ(t) = σ

1

2p

e

t

ˆ

L

(X) σ

1

2p

and observe that

ke

t

ˆ

L

(X) k

p

p,σ

= tr[γ(t)

p

].

As the diﬀerential of the function X 7→ X

p

at A ∈ M

+

d

is given by pA

p−1

, another

application of the chain rule yields

d

dt

ke

t

ˆ

L

(X)k

p

p,σ



t=0

= p



γ(0)

p−1

,

dγ

dt

(0)



.

It is easy to check that

dγ

dt

(0) = σ

1

2p

ˆ

L(X) σ

1

2p

. Inserting this in the above equations and

writing it in terms of the power operator (8) we ﬁnally obtain

d

dt

D

p

(e

tL

(ρ)kσ)



t=0

=

p

p − 1

kXk

−p

p,σ

D

I

q,p

(X),

ˆ

L(X)

E

σ

with

1

p

+

1

q

= 1. Expanding this formula gives (21).

By recognizing the p-Dirichlet form in the previous theorem we get:

Accepted in Quantum 2018-02-01, click title to verify 10

Corollary 3.1. Let L : M

d

→ M

d

be a Liouvillian with full rank ﬁxed point σ ∈ D

+

d

.

For any ρ ∈ D

d

and p > 1 we have

d

dt

D

p

(e

tL

(ρ)kσ)



t=0

= −2kXk

−p

p,σ

E

L

p

(X) ≤ 0, (24)

where we used the relative density X = Γ

−1

σ

(ρ).

As we remarked before, Corollary 3.1 implies that the Dirichlet form is always positive

for relative densities. To see this, recall that the divergences contract under quantum

channels [7] and therefore we have that

d

dt

D

p

(e

tL

(ρ)kσ)



t=0

≤ 0. As E

L

p

(λX) = λ

p

E

L

p

(X)

for λ > 0, this shows that it is positive for all positive operators by properly normalizing

X.

3.2 Sandwiched R

´

enyi convergence rates

For any p > 1 we introduce the functional κ

p

: M

+

d

→ as

κ

p

(X) =

1

p − 1

kXk

p

p,σ

ln

kXk

p

p,σ

kXk

p

1,σ

!

(25)

for X ∈ M

+

d

. For p = 1 we may again take the limit p → 1 and obtain κ

1

(X) :=

lim

p→1

κ

p

(X) = Ent

1,σ

(X). Note that κ

p

is well-deﬁned and non-negative as kXk

p,σ

≥

kXk

1,σ

for p ≥ 1. Strictly speaking the deﬁnition also depends on a reference state σ ∈ D

+

d

,

which we usually omit as it is always the ﬁxed point of the primitive Liouvillian under

consideration.

Given a Liouvillian L : M

d

→ M

d

with full rank ﬁxed point σ ∈ D

+

d

it is a simple

consequence of Corollary 3.1 that for ρ 6= σ

d

dt

D

p

(e

tL

(ρ)kσ)



t=0

D

p

(ρkσ)

= −2

E

L

p

(X)

κ

p

(X)

, (26)

where we used the relative density X = Γ

−1

σ

(ρ), which fulﬁlls kXk

1,σ

= 1. This motivates

the following deﬁnition.

Deﬁnition 3.1 (Entropic convergence constant for p-R´enyi divergence). For any primitive

Liouvillian L : M

d

→ M

d

and p ≥ 1 we deﬁne

β

p

(L) = sup{β ∈

+

: βκ

p

(X) ≤ E

L

p

(X) for all X > 0}. (27)

Note that as a special case we have α

1

(L) = β

1

(L). It should be also emphasized that

the supremum in the previous deﬁnition goes over any positive deﬁnite X ∈ M

+

d

and not

only over relative densities. However, it is easy to see that we can equivalently write

β

p

(L) = inf

n

E

L

p

(X)

κ

p

(X)

: X > 0

o

= inf

n

E

L

p

(X)

κ

p

(X)

: X > 0, kXk

1,σ

= 1

o

(28)

as replacing X 7→ X/kXk

1,σ

does not change the value of the quotient E

L

p

(X)/κ

p

(X).

Therefore, to compute β

p

it is enough to optimize over relative densities (i.e. X > 0

fulﬁlling kXk

1,σ

= 1). By inserting β

p

into (26) we have

d

dt

D

p

(e

tL

(ρ)kσ) ≤ −2β

p

(L)D

p



e

tL

(ρ)kσ



Accepted in Quantum 2018-02-01, click title to verify 11

for any ρ ∈ D

d

and Liouvillian L : M

d

→ M

d

with full rank ﬁxed point σ ∈ D

+

d

. By

integrating this diﬀerential inequality we get

Theorem 3.2. Let L : M

d

→ M

d

be a Liouvillian with full rank ﬁxed point σ ∈ D

+

d

. For

any p ≥ 1 and ρ ∈ D

d

we have

D

p



e

tL

(ρ)kσ



≤ e

−2β

p

(L)t

D

p

(ρkσ) (29)

where β

p

(L) is the constant deﬁned in (27).

3.3 Computing β

p

in simple cases

In general it is not clear how to compute β

p

and it does not depend on spectral data

of L alone. This is not surprising, as the computation of the usual logarithmic Sobolev

constants α

2

or α

1

is also challenging and the exact values are only known for few Liou-

villians [21, 2, 22]. In the following we compute β

2

for the depolarizing semigroups.

Theorem 3.3 (β

2

for the depolarizing Liouvillian). Let L

σ

: M

d

→ M

d

denote the

depolarizing Liouvillian given by L

σ

(ρ) = tr (ρ) σ − ρ with ﬁxed point σ ∈ D

+

d

. Then

β

2

(L

σ

) =

1 −

1

kσ

−1

k

∞

ln (kσ

−1

k

∞

)

. (30)

Proof. Without loss of generality we can restrict to X > 0 with kXk

1,σ

= 1 in the

minimization (28). Observe that the generator of the time evolution of the relative density

(see (11)) for the depolarizing Liouvillian is

ˆ

L

σ

(X) = tr



σ

1

2

Xσ

1

2



1 − X.

An easy computation yields E

L

σ

2

(X) = kXk

2

2,σ

− 1 and so

E

L

σ

2

(X)

κ

2

(X)

=

1 −

1

kXk

2

2,σ

ln



kXk

2

2,σ



.

As the function x 7→

1−

1

x

ln(x)

is monotone decreasing for x ≥ 1, we have

inf

X>0

E

L

σ

2

(X)

κ

2

(X)

=

1 −

1

kσ

−1

k

∞

ln (kσ

−1

k

∞

)

, (31)

where we used

sup

X≥0,kXk

1,σ

=1

kXk

2

2,σ

= kσ

−1

k

∞

,

which easily follows from Lemma 2.1 by exponentiating both sides of Equation (5) and

using the correspondence between relative densities and states.

The exact value of α

2

(L

σ

) is open to the best of our knowledge, but in the case of σ =

1

d

we have α

2



L

1

d



=

2(1−2/d)

ln(d−1)

[2, Theorem 24], which is of the same order of magnitude as

β

2

for these semigroups.

Computing β

p

for p 6= 2 seems not to be straightforward even for depolarizing channels,

but for the semigroup depolarizing to the maximally mixed state we can at least provide

upper and lower bounds.

Accepted in Quantum 2018-02-01, click title to verify 12

Theorem 3.4 (β

p

for the Liouvillian depolarizing to the maximally mixed state). Let

L(ρ) = tr(ρ)

1

d

− ρ. For p ≥ 2 we have

p

2(p − 1)

1

ln(d)

≥ β

p

(L) ≥

p

2(p − 1)

d

p−1

p

− 1

d

p−1

p

ln(d)

.

Proof. The Dirichlet Form of this Liouvillian for X > 0 with kXk

1,

1

d

= 1 is given by

E

L

p

(X) =

p

2(p − 1)

(kXk

p

p,

1

d

− kXk

p−1

p−1,

1

d

).

Dividing this expression by κ

p

(X) we get

E

L

p

(X)

κ

p

(X)

=

1 −

kXk

p−1

p−1,

1

d

kXk

p

p,

1

d

2 ln



kXk

p,

1

d



. (32)

By the monotonicity of the weighted norms, we have

kXk

p−1

p−1,

1

d

kXk

p

p,

1

d

≤

1

kXk

p,

1

d

and so

E

L

p

(X)

κ

p

(X)

≥

kXk

p,

1

d

− 1

2kXk

p,

1

d

ln



kXk

p,

1

d



(33)

The expression on the right-hand side of (33) is monotone decreasing in kXk

p,

1

d

and so

the inﬁmum is attained at

sup

kXk

1,

1

d

=1

kXk

p,

1

d

= d

p−1

p

,

which again easily follows from Lemma 2.1. The upper bound follows from (32) as

E

L

p

(X)

κ

p

(X)

≤

1

2 ln



kXk

p,

1

d



.

which is again monotone decreasing in kXk

p,

1

d

.

From the relations between LS constants [2, Proposition 13], it follows that for the LS

constants of the depolarizing channels we have α

p



L

1

d



≥ α

2



L

1

d



=

2(1−2/d)

ln(d−1)

for p ≥ 1.

The constants β

p

and α

p

are therefore of the same order in this case for small p ≥ 2.

4 Comparison with similar quantities

4.1 Comparison with spectral gap

Here we show how β

p

, see (27), compares to the spectral gap (18) of a Liouvillian. This is

motivated by similar results for logarithmic Sobolev constants, where it was shown [2, The-

orem 16] that α

1

(L) ≤ λ(L) for reversible semigroups, a result we recover and generalize

here.

Accepted in Quantum 2018-02-01, click title to verify 13

Theorem 4.1 (Upper bound spectral gap). Let L : M

d

→ M

d

be a primitive and

reversible Liouvillian with full rank ﬁxed point σ ∈ D

+

d

and p ≥ 1. Then

β

p

(L) ≤ λ (L) . (34)

Proof. Let (s

i

)

d

i=1

denote the spectrum of σ

1/p

and choose a unitary U such that

σ

1/p

= Udiag (s

1

, s

2

, . . . , s

d

) U

†

.

As L is reversible, there is a self-adjoint eigenvector X ∈ M

d

of

ˆ

L corresponding to the

spectral gap, i.e.

ˆ

L(X) = −λ(L)X. Let 

0

> 0 be small enough such that Y



= 1

d

+ X

is positive for any || ≤ 

0

. For || ≤ 

0

we use Lemma A.1 of the appendix to show

β

p

(L) ≤

E

L

p

(Y



)

κ

p

(Y



)

=

λ(L)

p

2(p−1)



2

2

P

1≤i≤j≤d

f

p

(s

i

, s

j

)b

ij

b

ji

+ O(

3

)





2

p−1



p

P

1≤i≤j≤d

f

p

(s

i

, s

j

)b

ij

b

ji



+ O(

3

)

(35)

where b

ij

= (U

†

σ

1/2p

Xσ

1/2p

U)

ij

and

f

p

(x, y) =

(

(p − 1)x

p−2

if x = y

x

p−1

−y

p−1

x−y

else.

(36)

Observe that f

p

(s

i

, s

j

) > 0 for s

i

, s

j

> 0. Moreover, as U

†

σ

1/2p

Xσ

1/2p

U is non-zero and

self-adjoint we have b

ij

b

ji

≥ 0 for all i, j and this inequality is strict for at least one choice

of i, j. Therefore, the terms of second order in  in the numerator and denominator of

(35) are strictly positive, and we obtain λ(L) as the limit of the quotient as  → 0.

A similar argument as the one given in the previous proof shows that all real, nonzero

elements of the spectrum of

ˆ

L are upper bounds to β

p

without invoking reversibility.

Note that in the case of p = 2 (see the discussion after (16)) we may assume that the

Liouvillian is reversible without loss of generality and drop the requirement of reversibility

in the previous theorem. Alternatively, we can obtain the same statement directly from a

simple functional inequality. In this case we can also give a lower bound on β

2

in terms

of the spectral gap.

Theorem 4.2 (Upper and lower bound for β

2

). Let L : M

d

→ M

d

be a primitive

Liouvillian with full rank ﬁxed point σ ∈ D

+

d

. Then

λ (L)

1 −

1

kσ

−1

k

∞

ln (kσ

−1

k

∞

)

≤ β

2

(L) ≤ λ (L) . (37)

To prove Theorem 4.2 we need the following Lemma.

Lemma 4.1. For any X ∈ M

d

we have

Var

σ

(X) ≤ κ

2

(X).

Accepted in Quantum 2018-02-01, click title to verify 14

Proof. For X > 0 dividing both sides of the inequality by kXk

2

1,σ

yields

kXk

2

2,σ

kXk

2

1,σ

− 1 ≤

kXk

2

2,σ

kXk

2

1,σ

ln

kXk

2

2,σ

kXk

2

1,σ

!

.

This follows from the elementary inequality x − 1 ≤ x ln(x) for x ≥ 1, where we use the

ordering kXk

2,σ

≥ kXk

1,σ

for any X ∈ M

d

.

Proof of Theorem 4.2. Using the deﬁnition of β

2

(see (27)) and Lemma 4.1 yields

β

2

Var

σ

(X) ≤ β

2

κ

2

(X) ≤ E

L

2

(X).

Now the variational deﬁnition of λ(L) (see (18)) implies the second inequality of (37).

To prove the ﬁrst inequality of (37) consider the depolarizing Liouvillian

L

σ

(X) = tr(X)σ − X.

By Theorem 3.3 we have

1 −

1

kσ

−1

k

∞

ln (kσ

−1

k

∞

)

κ

2

(X) ≤ E

L

σ

2

(X)

As E

L

σ

2

(X) = Var

σ

(X), we have E

L

σ

2

(X) ≤

1

λ(L)

E

L

2

(X) by the variational deﬁnition of

λ(L) (see (18)). Inserting this in the above inequality ﬁnishes the proof.

4.2 Comparison with logarithmic Sobolev constants

Here we show how β

p

, see (27), compares to the logarithmic Sobolev constant α

p

.

Theorem 4.3. Let L : M

d

→ M

d

be a primitive Liouvillian with full rank ﬁxed point

σ ∈ D

+

d

. Then for any p ≥ 1 we have

β

p

(L) ≥

α

p

(L)

p

. (38)

We will need the following Lemma.

Lemma 4.2. For any full rank state σ ∈ D

+

d

, any p > 1 and X ∈ M

+

d

with kXk

1,σ

= 1

we have

Ent

p,σ

(X) ≥

κ

p

(X)

p

. (39)

Proof. The function p 7→ D

p

(ρkσ) is monotonically increasing [3, 7] and diﬀerentiable

(as the noncommutative l

p

-norm is diﬀerentiable in p [1, Theorem 2.7]). Thus, with

f :

+

→ given by f(t) = t + p we have

0 ≤ kXk

p

p,σ

d

dt



D

f(t)

(ρkσ)





t=0

= −

1

(p − 1)

2

kXk

p

p,σ

ln



kXk

p

p,σ



+

1

p − 1

d

dt



kXk

f(t)

f(t),σ





t=0

.

Accepted in Quantum 2018-02-01, click title to verify 15

where we used the relative density X = Γ

−1

σ

(ρ). The remaining derivative in the above

equation has been computed in [1, Theorem 2.7] and we have

d

dt



kXk

f(t)

f(t),σ





t=0

= hI

q,p

(X), S

p

(X)i

σ

with the operator valued entropy S

p

deﬁned in (14) and

1

p

+

1

q

= 1. Inserting this expression

in the above equation we obtain

1

p − 1

kXk

p

p,σ

ln



kXk

p

p,σ



≤ hI

q,p

(X), S

p

(X)i

σ

. (40)

for any X ∈ M

+

d

with kXk

1,σ

= 1, i.e. for any X = Γ

−1

σ

(ρ) for some state ρ ∈ D

d

. Now

we get

pEnt

p,σ

(X) = p hI

q,p

(X), S

p

(X)i

σ

− kXk

p

p,σ

ln(kXk

p

p,σ

)

≥

p

p − 1

kXk

p

p,σ

ln(kXk

p

p,σ

) − kXk

p

p,σ

ln(kXk

p

p,σ

)

= κ

p

(X)

where we used (40).

Proof of Theorem 4.3. There is nothing to show for p = 1 as α

1

(L) = β

1

(L) and we can

assume p > 1. For X ∈ M

+

d

with kXk

1,σ

= 1 we can use Lemma 4.2 and the deﬁnition of

α

p

(L) to compute

α

p

(L)

p

κ

p

(X) ≤ α

p

(L) Ent

p,σ

(X) ≤ E

L

p

(X).

By the variational deﬁnition (27) of β

p

the claim follows.

Theorem 4.3 will be applied in Section 5 to obtain bounds on the mixing time of a

Liouvillian with a positive logarithmic Sobolev constant without invoking any form of

l

p

-regularity (see [2]). As usually a logarithmic Sobolev is implied by a hypercontractive

inequality [1], we would like to remark that one can also make a similar statement as that

of Theorem 4.3 from a hypercontractive inequality. One can easily show that

||e

t

ˆ

L

||

p(t)→p,σ

≤ 1 (41)

for p(t) = (p − 1)e

−α

p

t

+ 1 implies that β

p

(L) ≥

α

p

.

5 Mixing times

In this section we will introduce the quantities of interest and prove the building blocks

to prove mixing times from the entropy production inequalities of the last sections, dis-

tinguishing between continuous and discrete time semigroups. We will mostly focus on

β

2

, as this seems to be the most relevant constant for mixing time applications. This is

justiﬁed by the fact that the underlying Dirichlet form is a quadratic form and the entropy

related to it stems from a Hilbert space norm. Moreover, as the same Dirichlet form is

also involved in computations of the spectral gap, it could be easier to adapt existing

techniques, such as the ones developed in [23, 19].

Accepted in Quantum 2018-02-01, click title to verify 16

Deﬁnition 5.1 (Mixing times). For either I =

+

or I = let {T

t

}

t∈I

be a primitive

semigroup of quantum channels with ﬁxed point σ ∈ D

+

d

. We deﬁne the l

1

mixing time

for  > 0 as

t

1

() = inf{t ∈ I : kT

t

(ρ) − σk

1

≤  for all ρ ∈ D

d

}.

Similarly we deﬁne the l

2

mixing time for  > 0 as

t

2

() = inf{t ∈ I : Var

σ



ˆ

T

t

(X)



≤  for all X ∈ M

+

d

with kXk

1,σ

= 1}.

In the continuous case I =

+

we will often speak of the mixing times of the Liouvillian

generator of a quantum dynamical semigroup which we identify with the mixing times of

the semigroup according to the above deﬁnition.

5.1 Mixing in Continuous Time

It is now straightforward to get mixing times from the previous results.

Theorem 5.1 (Mixing time from entropy production). Let L : M

d

→ M

d

be a primitive

Liouvillian with ﬁxed point σ ∈ D

+

d

. Then

t

1

() ≤

1

2β

p

(L)

ln

2 ln



kσ

−1

k

∞





2

!

.

Proof. From (6) and Lemma 2.1 we have

ln



kσ

−1

k

∞



e

−2β

p

(L)t

≥

1

2

ke

tL

(ρ) − σk

2

1

. (42)

for any ρ ∈ D

d

. The claim follows after rearranging the terms.

Using Theorem 4.3 we can lower bound β

p

in terms of the usual logarithmic Sobolev

constant α

p

. Combining this with Theorem 5.1 shows the following Corollary.

Corollary 5.1 (Mixing time bound from logarithmic Sobolev inequalities). Let L : M

d

→

M

d

be a primitive Liouvillian with ﬁxed point σ ∈ D

+

d

. Then

t

1

() ≤

p

2α

p

(L)

ln

2 ln



kσ

−1

k

∞





2

!

. (43)

By Corollary 5.1 a nonzero logarithmic Sobolev constant always implies a nontrivial

mixing time bound. One should say that the same bound was showed in [2] for p = 2,

however under additional assumptions (speciﬁcally l

p

-regularity [2]) on the Liouvillian in

question. While these assumptions have been shown for certain classes of Liouvillians (in-

cluding important examples like Davies generators and doubly stochastic Liouvillians [2])

they have not been shown in general. Moreover, the bound in Theorem 5.1 clearly does not

depend on p and one could in principle optimize over all β

p

. However, as the computations

in subsection 3.3 already indicate, it does not seem to be feasible to compute or bound β

p

for p 6= 2 even in simple cases and one will probably only work with β

2

in applications.

The bound from Corollary 5.1 also has the right scaling properties needed in recent

applications of rapid mixing, such as the results in [24, 25]. In particular, together with

the results in [26], the last Corollary shows that the hypothesis of Theorem 4.2 in [24] is

Accepted in Quantum 2018-02-01, click title to verify 17

always satisﬁed for product evolutions and not only for the special classes considered in

[2].

One may also use these techniques to get mixing times in the l

2

norms which are

stronger than the ones obtained just by considering that β

2

is a lower bound to the

spectral gap.

Theorem 5.2 (l

2

-mixing time bound). Let L : M

d

→ M

d

be a Liouvillian with ﬁxed

point σ ∈ D

+

d

. Then

t

2

() ≤

1

2β

2

(L)

ln



kσ

−1

k

∞



ln(1 + )

!

. (44)

Proof. For X > 0 with kXk

1,σ

= 1 we have Var

σ

(X) = kX −1k

2

2,σ

= kXk

2

2,σ

−1 and thus

κ

2

(X) = (1 + Var

σ

(X)) ln (1 + Var

σ

(X)) .

In the following let X

t

= e

t

ˆ

L

(X) denote the time evolution of the relative density X.

Using (19) and the deﬁnition of β

2

(L) (see (27)) we obtain

d

dt

Var

σ

(X

t

) = −2E

L

2

(X

t

) ≤ −2β

2

(L)(1 + Var

σ

(X

t

)) ln(1 + Var

σ

(X

t

)).

Integrating this diﬀerential inequality we obtain

ln



ln(1 + Var

σ

(X))

ln(1 + )



≤

t

2

()

Z

0

1

(1 + Var

σ

(X

t

)) ln (1 + Var

σ

(X

t

))



d

dt

Var

σ

(X

t

)



dt

≤ −2β

2

t

2

().

As 1 + Var

σ

(X) ≤ kσ

−1

k

∞

, the claim follows after rearranging the terms.

In the remaining part of the section we will discuss a converse to the previous mixing

time bounds, i.e. a lower bound on the logarithmic Sobolev constant in terms of a mixing

time. This excludes the possibility of a reversible semigroup with both small β

2

and short

mixing time with respect to the l

2

distance. For this we generalize [21, Corollary 3.11] to

the noncommutative setting.

Theorem 5.3 (LS inequality from l

2

mixing time). Let L : M

d

→ M

d

be a primitive,

reversible Liouvillian with ﬁxed point σ ∈ D

+

d

. Then

1

2

≤ α

2

(L) t

2



e

−1



≤ 2β

2

(L) t

2



e

−1



. (45)

Moreover, this inequality is tight.

Proof. We refer to Appendix B for a proof.

As remarked in [21], even the classical result does not hold anymore if we drop the

reversibility assumption. Therefore, this assumption is also needed in the noncommutative

setting. By considering a completely depolarizing channel it is also easy to see that no

such bound can hold in discrete time.

Theorem 5.3 implies that for reversible Liouvillians β

2

and α

2

cannot diﬀer by a large

factor. More speciﬁcally we have the following corollary.

Accepted in Quantum 2018-02-01, click title to verify 18

Corollary 5.2. Let L : M

d

→ M

d

be a primitive, reversible Liouvillian with ﬁxed point

σ ∈ D

+

d

. Then

2β

2

(L) ≥ α

2

(L) ≥ β

2

(L) ln

ln



kσ

−1

k

∞



ln(1 + e

−1

)

!

. (46)

Proof. We showed the ﬁrst inequality in Theorem 4.3. The second inequality follows by

combining (45) and (44).

5.2 Mixing in Discrete Time

In this section we will obtain mixing time bounds and also entropic inequalities for discrete-

time quantum channels T : M

d

→ M

d

. We will then use these techniques to derive mixing

times for random local channels, which we will deﬁne next. These include channels that

usually appear in quantum error correction scenarios, such as random Pauli errors on

qubits [27, Chapter 10]. They will be based on the following quantity:

Deﬁnition 5.2. For a primitive quantum channel T : M

d

→ M

d

with full rank ﬁxed

point σ ∈ D

+

d

, we deﬁne

β

D

(T ) = β

2

(T

∗

ˆ

T − id

d

). (47)

Here we used

ˆ

T = Γ

−1

σ

◦ T ◦ Γ

σ

.

The deﬁnition of β

D

(T ) can be motivated by the following improved data-processing

inequality for the 2-sandwiched R´enyi divergence.

Theorem 5.4. Let T : M

d

→ M

d

be a primitive quantum channel with full rank ﬁxed

point σ ∈ D

+

d

. Then for all ρ ∈ D

d

we have

D

2

(T (ρ)kσ) ≤ (1 − β

D

(T ))D

2

(ρkσ) . (48)

Proof. Let X = σ

−

1

2

ρσ

−

1

2

denote the relative density of ρ with respect to σ. Observe that

the 2-Dirichlet form (see Deﬁnition 2.3) of the semigroup L = T

∗

ˆ

T − id

d

can be written

as

E

L

2

(X) = kXk

2

2,σ

− k

ˆ

T (X)k

2

2,σ

.

From the deﬁnition of β

2

(L) (see (27)) it follows that

kXk

2

2,σ

− k

ˆ

T (X)k

2

2,σ

≥ β

2

κ

2

(X),

which is equivalent to

ln(k

ˆ

T (X)k

2

2,σ

) − ln(kXk

2

2,σ

) ≤ ln(1 − β

2

ln(kXk

2

2,σ

).

Using the elementary inequality ln(1−β

2

ln(kXk

2

2,σ

) ≤ −β

2

ln(kXk

2

2,σ

), that ln(k

ˆ

T (X)k

2

2,σ

) =

D

2

(T (ρ)kσ) and ln(kXk

2

2,σ

) = D

2

(ρkσ) hold, the statement of the theorem follows after

rearranging the terms.

One should note that, unlike in Theorem 3.2, the constant β

D

is not optimal in (48). As

an example take T (ρ) = tr[ρ]

1

d

for which β

D

(T ) =

1−d

−1

ln(d)

, but D

2



T (ρ)k

1

d



= 0. Also,

β

D

(T ) > 0 is not a necessary condition for primitivity, as there are primitive quantum

Accepted in Quantum 2018-02-01, click title to verify 19

channels that are not strict contractions with respect to D

2

. To see this, consider the map

T : M

2

→ M

2

which acts as follows on Pauli operators:

T (1) = 1, T (σ

x

) = 0, T(σ

y

) = 0 and T (σ

z

) = σ

x

.

One can check that this is a a primitive quantum channel with T

2

(ρ) =

1

2

for any state

ρ ∈ D

d

. However, T maps the pure state

1

2

(1 + σ

z

) to the pure state

1

2

(1 + σ

x

), which

implies that D

2

does not strictly contract under T . We can now prove the following bound

on the discrete mixing time.

Theorem 5.5 (Discrete mixing time). Let T : M

d

→ M

d

be a primitive quantum channel

with full rank ﬁxed point σ ∈ D

+

d

and β

D

(T ) > 0. Then

t

1

() ≤ −

1

ln(1 − β

D

(T ))

ln

2 ln



kσ

−1

k

∞





2

!

.

Proof. By Theorem 5.4 we have

D

2

(T

n

(ρ)kσ) ≤ (1 − β

D

(T ))

n

D

2

(ρkσ).

for any ρ ∈ D

d

. The claim then follows from (6) and Lemma 2.1.

Convergence results for primitive continuous-time semigroups can often be lifted to

their tensor powers. In discrete time a similar result holds for the following class of

channels:

Deﬁnition 5.3 (Random Local Channels). For a quantum channel T : M

d

→ M

d

and

probabilities p = (p

1

, . . . , p

n

) with p

i

≥ 0 and

P

i

p

i

= 1 we deﬁne a random local

channel T

(n)

p

: M

⊗n

d

→ M

⊗n

d

by

T

(n)

p

=

n

X

i=1

p

i

id

⊗i−1

d

⊗ T ⊗ id

⊗n−i

d

. (49)

The previous deﬁnition can be generalized to the case where not all local channels are

identical, i.e. if we have T

i

: M

d

→ M

d

acting on the ith system in the expression (49).

As long as the local channels are all primitive our results also hold for this more general

class of channels. However, for simplicity we will restrict here to the above deﬁnition.

Theorem 5.6. Let T : M

d

→ M

d

be a primitive quantum channel with full rank ﬁxed

point σ ∈ D

+

d

such that the Liouvillian

ˆ

L = T

∗

ˆ

T −id

d

fulﬁlls β

2



L

(n)



≥ q for some q > 0

and all n ∈ . Then for any n ∈ and probabilities p = (p

1

, . . . , p

n

) with p

i

≥ 0 and

P

i

p

i

= 1 we have

D

2

(T

(n)

p

(ρ)kσ

⊗n

) ≤ (1 − qp

2

min

)D

2

(ρkσ

⊗n

), (50)

for any ρ ∈ D

d

n

and where p

min

= min p

i

.

Proof. By Theorem 5.4 it is enough to show that β

D

(T

p

) ≥ qp

2

min

.

Observe that the Dirichlet form of (T

(n)

p

)

∗

ˆ

T

(n)

p

− id

d

is given by

E

L

2

(X) =

X

i6=j

p

i

p

j

D

X −T

∗

i

ˆ

T

j

(X), X

E

σ

⊗n

+

X

i

p

2

i

D

X −T

∗

i

ˆ

T

i

(X), X

E

σ

⊗n

Accepted in Quantum 2018-02-01, click title to verify 20

where the map T

∗

i

ˆ

T

j

acts as T

∗

on the i-th system,

ˆ

T on the j-th and as the identity

elsewhere. As T

∗

i

ˆ

T

j

≤ id

d

with respect to h·, ·i

σ

⊗n

we have

E

L

2

(X) ≥

X

i

p

2

i

D

X −T

∗

i

ˆ

T

i

(X), X

E

σ

⊗n

≥ p

2

min

E

L

(n)

2

(X).

From the comparison inequality E

L

2

≥ p

2

min

E

L

(n)

2

and the assumption β

2



L

(n)



≥ q it then

follows that β

D

(Φ) ≥ qp

2

min

.

As an application we can bound the entropy production and the mixing time in a

system of n qubits aﬀected (uniformly) by random Pauli errors. The time evolution of

this system is given by the channel T

n

: M

⊗n

2

→ M

⊗n

2

given by

T

n

=

1

n

X

i=1

id

⊗i−1

2

⊗ T ⊗ id

⊗n−i

2

(51)

with T (ρ) = tr(ρ)

1

2

.

Theorem 5.7. For T

n

deﬁned as in equation (51) we have

D

2



T

n

(ρ)k

1

2

n

2

n



≤



1 −

1

2n

2



D

2



ρk

1

2

n

2

n



.

for any ρ ∈ D

2

n

.

Proof. From [2] it is known that

α

2



L

(n)

1

2



= 1.

Now combining Theorem 4.3 and Theorem 5.6 gives

β

D

(T

n

) ≥

1

2n

2

α

2



L

(n)

1

2



.

Corollary 5.3. Let T

n

be deﬁned as in (51). Then we have

t

1

() ≤ −

1

ln



1 −

1

2n

2



ln



n



2



. (52)

Proof. This follows directly from the previous theorem and Theorem 5.5.

6 Strong converse bounds for the classical capacity

When classical information is sent via a quantum channel, the classical capacity is the

supremum of transmission rates such that the probability for a decoding error vanishes in

the limit of inﬁnite channel uses. In general it is not possible to retrieve the information

perfectly when it is sent over a ﬁnite number of uses of the channel, and the probability

for successful decoding will be smaller than 1. Here we want to derive bounds on this

probability for quantum dynamical semigroups. More speciﬁcally we are interested in

strong converse bounds on the classical capacity. An upper bound on the capacity is

Accepted in Quantum 2018-02-01, click title to verify 21

called a strong converse bound if whenever a transmission rate exceeds the bound the

probability of successful decoding goes to zero in the limit of inﬁnite channel uses.

We refer to [27, Chapter 12] for the exact deﬁnition of the classical capacity and to

[28, 29, 30, 4, 31] for more details on strong converses and strong converse bounds.

In [4] the following quantity was used to study strong converses.

Deﬁnition 6.1 (p-information radius). Let T : M

d

→ M

d

be a quantum channel. The

p-information radius T is deﬁned as

1

K

p

(T ) =

1

ln(2)

min

σ∈D

d

max

ρ∈D

d

D

p

(T (ρ)kσ).

We will often refer to a (m, n, p)-coding scheme for classical communication using a

quantum channel T . By this we mean a coding-scheme for the transmission of m classical

bits via n uses of the channel T for which the probability of successful decoding is p (see

again [27, Chapter 12] for an exact deﬁnition). The following theorem shown in [4, Section

6] relates the information radius and the probability of successful decoding.

Theorem 6.1 (Bound on the success probability in terms of information radius). Let

T : M

d

→ M

d

be a quantum channel, n ∈ and R ≥ 0. For any (nR, n, p

succ

)-coding

scheme for classical communication via T we have

p

succ

≤ 2

−n



p−1

p



(

R−

1

n

K

p

(

T

⊗n

))

. (53)

We will now apply the methods developed in the last sections to obtain strong converse

bounds on the capacity of quantum dynamical semigroups.

Theorem 6.2. Let L : M

d

→ M

d

be a primitive Liouvillian with full rank ﬁxed point

σ ∈ D

+

d

such that for some p ∈ (1, ∞) there exists c > 0 fulﬁlling β

p

(L

(n)

) ≥ c for all

n ∈ . Then for any (nR, n, p

such

)-coding scheme for classical communication via the

quantum dynamical semigroup T

t

= e

tL

we have

p

succ

≤ 2

−n



p−1

p



(

R−e

−2ct

log(kσ

−1

k

∞

)

.

Proof. Using Theorem 3.2 and Lemma 2.1 we have

K

p

(T

⊗n

t

) ≤

1

ln(2)

max

ρ∈D

d

n

D

p

(T

⊗n

t

(ρ)kσ) ≤ ne

−2β

p

(

L

(n)

)

t

log(kσ

−1

k

∞

).

Now Theorem 6.1 together with the assumption β

p

(L

(n)

) ≥ c ﬁnishes the proof.

Together with Theorem 4.3 the previous theorem shows that a quantum memory can

only reliably store classical information for small times when it is subject to noise described

by a quantum dynamical semigroup with “large” logarithmic Sobolev constant, as we will

see more explicitly later in Section 7. Moreover, we can use the results from [26] to give

a universal lower bound to the decay of the capacity in terms of the spectral gap and the

ﬁxed point.

1

The ln(2) factor is due to our diﬀerent choice of normalization for the divergences.

Accepted in Quantum 2018-02-01, click title to verify 22

Corollary 6.1. Let L : M

d

→ M

d

be a primitive Liouvillian with full rank ﬁxed point

σ ∈ D

+

d

and spectral gap λ(L). Then for any (nR, n, p

such

)-coding scheme for classical

communication via the quantum dynamical semigroup T

t

= e

tL

we have

p

such

≤ 2

−

n

2

(

R−e

−k(λ,σ)t

log(kσ

−1

k

∞

)

where k(λ, σ) :=

λ

2(ln(d

4

kσ

−1

k

∞

)+11)

.

Proof. It was shown in [26, Theorem 9] that α

2

(L

(n)

) ≥ 2k(λ(L), σ) for all n ∈ . Using

Theorem 4.3 we have β

2

(L

(n)

) ≥ k(λ(L), σ) for all n ∈ . Together with Theorem 6.2

this gives the claim.

For unital semigroups, i.e. for σ =

1

d

, one can improve the bound from the previous

theorem slightly using (see [32, Theorem 3.3])

k



λ,

1

d



=

λ(1 − 2d

−2

)

2(ln(3) ln(d

2

− 1) + 2(1 − 2d

−2

)

(54)

For d = 2 we even have k(λ,

1

d

2

) =

λ

2

(see [2]).

7 Examples of bounds for the classical capacity of Semigroups

We will now apply the estimate on the capacity given by Corollary 6.1 to some examples

of semigroups. Here C(T ) will denote the classical capacity of a quantum channel T .

7.1 Depolarizing Channels

In [5] it is shown that for L

1

d

(X) = tr(X)

1

d

− X we have

C



e

tL

1

d



= log(d) +



e

−t

+ c(t, d)



log



e

−t

+ c(t, d)



+ (d − 1)c(t, d) log (c(t, d)) (55)

with c(t, d) = (1 − e

−t

)d

−1

. In [30] the strong converse property was established. The

semigroup generated by L

1

d

is therefore a natural candidate to evaluate the quality of our

bounds, as determining its classical capacity can be considered a solved problem. As L

1

d

is just the diﬀerence of a projection and the identity, it is easy to see that the spectral gap

of L

1

d

is 1, which gives us the upper bound

C



e

tL

1

d



≤ log(d)e

−

(1−2d

−2

)

2(ln(3) ln(d

2

−1)+2(1−2d

−2

)

t

(56)

for d > 2 and

C



e

tL

1

2



≤ e

−

t

2

(57)

for d = 2.

Accepted in Quantum 2018-02-01, click title to verify 23

Figure 1: Comparison of the capacity of the depolarizing channel, given in Equation (55), and the

bound obtained by our methods, given in Equation (56).

7.2 Stabilizer Hamiltonians

Estimates on the spectral gap of Davies generators of stabilizer Hamiltonians were obtained

in [19]. In the following we will make the same assumptions as in [19] on the coupling of

the system to the bath. That is, we assume that the operators S

α

(see (13)) are given by

single qubit Pauli operators σ

x

, σ

y

or σ

z

. For the transition rates G

α

(ω) we only assume

that they satisfy the KMS condition [17], that is, G

α

(−ω) = G

α

(ω)e

−βω

. This condition

implies that the semigroup is reversible. Recall that for Davies generators at inverse

temperature β > 0, which we will denote by L

β

, the stationary state is always given by

the thermal state

e

−βH

tr

(

e

−βH

)

.

We will not discuss stabilizer Hamiltonians and groups and their connection to error-

correcting codes, but refer to [27, Section 10.5] for more details. Given some stabilizer

group S ⊂ P

n

, where P

n

is the group generated by the tensor product of n Pauli matrices,

with commutative generators S =< P

1

, . . . , P

k

>, we deﬁne the stabilizer Hamiltonian to

be given by

H

S

= −

k

X

i=1

P

i

.

We then have:

Lemma 7.1. Let H

S

be the stabilizer Hamiltonian of the stabilizer group S =< P

1

, . . . , P

k

>

on n qubits. Denote by σ

β

=

e

−βH

S

tr

(

e

−βH

S

)

the corresponding thermal state at inverse inverse

temperature β > 0. Then

kσ

−1

β

k

∞

≤ 2

n

e

2kβ

.

Proof. The eigenvalues of each P

i

are contained in {1, −1}, as they are just tensor products

of Pauli matrices. From this we have

−k1 ≤ H

S

≤ k1, (58)

as H

S

is just the sum of k terms such that −1 ≤ P

i

≤ 1. From (58) it follows that

tr



e

−βH

S



≤ 2

n

e

βk

, (59)

Accepted in Quantum 2018-02-01, click title to verify 24

as we have 2

n

eigenvalues, including multiplicities. Moreover, it also follows that

ke

βH

S

k

∞

≤ e

βk

. (60)

As kσ

−1

β

k

∞

= ke

βH

S

k

∞

tr



e

−βH

S



, the claim follows by putting (59) and (60) together.

In [19, Theorem 15] they show

λ ≥

h

∗

4η

∗

e

−2β¯

for the spectral gap λ of the Davies generators of stabilizer Hamiltonians at inverse tem-

perature β > 0. Here ¯ is the generalized energy barrier, h

∗

is the smallest transition

rate and η

∗

the longest path in Pauli space. We refer to [19] for the exact deﬁnition of

these parameters. It is important to stress that in general η

∗

will scale with the number

of qubits, so our estimate on the capacity will not be very good as the number of qubits

increases.

However, in [19, Theorem 15] they also show the estimate

λ ≥

h

∗

4

e

−2β¯

,

for the special case in which the generalized energy barrier can be evaluated with canonical

paths Γ

1

. We again refer to [19] for the exact deﬁnition. For these cases the gap does not

scale with the dimension and our estimate is much better. Summing up we obtain:

Theorem 7.1. Let H

S

be the stabilizer Hamiltonian of the stabilizer group S =< P

1

, . . . , P

k

>

on n qubits. Moreover, let L

β

be its Davies generator at inverse temperature β > 0. Then

the classical capacity C(e

tL

β

) is bounded by

C(e

tL

β

) ≤ (n + 2βk log(e)) e

−r(β,n,k)t

,

with

r(β, n, k) = e

−2β¯

h

∗

8η

∗

(2kβ + 5n ln (2) + 11)

and

r(β, n, k) = e

−2β¯

h

∗

8 (2kβ + 5n ln (2) + 11)

in case the generalized energy barrier can be evaluated with canonical paths Γ

1

. Moreover,

this is a bound in the strong converse sense.

Proof. The claim follows immediately after inserting the bounds from Lemma 7.1 and [19,

Theorem 15,16] into Corollary 6.1.

In [19] one can ﬁnd more explicit bounds for the parameters ¯, η

∗

and h

∗

for some

stabilizer groups. To the best of our knowledge this is the ﬁrst bound available for the

classical capacity of this class of quantum channels. To make the bound in Theorem 7.1

more concrete, we show what we obtain for the 2D toric code.

Accepted in Quantum 2018-02-01, click title to verify 25

7.3 2D Toric Code

Here we consider the 2D toric code as originally introduced in [33], which is a stabilizer

code. We consider only square lattices: We take an N × N lattice with N

2

vertical and

(N +1)

2

horizontal edges; associating a qubit to each edge gives a total of n = 2N

2

+2N +1

physical qubits. The stabilizer operators are N(N + 1) plaquette operators (including the

“open” plaquettes along the rough boundary) and N(N + 1) vertex operators, all of which

are independent. It goes beyond the scope of this article to explain the 2D toric code in

detail and we refer to [34, Section 19.4] for a discussion. But from the previous observations

we obtain that we have k = 2N(N + 1) generators for the stabilizer group of the 2D toric

code on n = 2N

2

+ 2N + 1 qubits. We will make the same assumptions on the the Davies

generators at inverse temperature β > 0 for the toric code as in [19]. These are discussed

in the beginning of Subsection 7.2.

In [35] it was proved that the spectral gap for the Davies generators for the 2D toric

code at inverse temperature β satisﬁes λ ≥

1

3

e

−8β

, a result which was reproved in [19]

using diﬀerent techniques. We therefore obtain:

Corollary 7.1. Let H be the stabilizer Hamiltonian of the 2D toric code on a N × N

lattice and L

β

be its Davies generator at inverse temperature β > 0. Then the classical

capacity C(e

tL

β

) is bounded by

C(e

tL

β

) ≤



2N

2

+ 2N + 1 + log(2)4βN(N + 1)



e

−r(β,L)t

, (61)

with

r(β, N) =

e

−8β

6 ((10N

2

+ 10N + 5) ln(2) + 4βN(N + 1)) + 66

.

Moreover, this is a bound in the strong converse sense.

Proof. The claim follows immediately from Lemma 7.1 and the spectral gap estimate of

[35] for the toric code.

From Figure 2 it becomes evident that we cannot retain information in the 2D toric

for long times at small inverse temperatures and that we can get nontrivial estimates even

for very high dimensions, as the size of the gap does not scale with the size of the lattice.

It is conjectured that if the spectral gap of the Davies generators of a Hamiltonian with

local, commuting terms satisﬁes a lower bound which is independent of the size of the

lattice, then the logarithmic Sobolev 2 constant also satisﬁes such a bound [36]. As the

Hamiltonian of the 2D toric code is of this form, proving this conjecture would lead to a

bound similar to the one in Corollary 7.1, but with a rate r(β, N) independent of the size

of the lattice. This would of course lead to much better bounds for large lattice sizes.

7.4 Truncated harmonic oscillator

Consider the Hamiltonian of a truncated harmonic oscillator

H =

d

X

n=0

n|nihn| ∈ M

d+1

.

Accepted in Quantum 2018-02-01, click title to verify 26

Figure 2: Plot for a 5 ×5 lattice of the minimum of the bound in Equation (61) and the trivial bound

C(e

tL

β

) ≤ 2N

2

+ 2N + 1 = 61 as a function of the inverse temperature and time for the Davies

generator of the 2D toric code.

Suppose that the systems couples to the bath via S = (a + a

†

), with

a

†

=

d

X

n=1

√

n |nihn − 1| (62)

and the transition rate function G(x) = (1+e

−xβ

)

−1

. Let σ

β

=

e

−βH

tr(e

−βH

)

. As the eigenvalues

of e

−βH

are just a geometric sequence, we have

kσ

−1

β

k

∞

=

1 − e

−β(d+1)

1 − e

−β

e

βd

. (63)

In [23, Section V, Example 1] they show

λ ≥

1

2

min{((1 + e

−β

)d)

−1

,

h

(G(1)(

√

d − 1 −

√

d)

2

+ G(−1)(

√

d − 2 −

√

d − 1)

2

)

i

}, (64)

for the spectral gap λ of the Davies generator L

β

of the truncated harmonic oscillator at

inverse temperature β > 0. We will denote the value of the lower bound in Equation (64)

by µ(d, β). As we can compute kσ

−1

β

k exactly and have a bound on the spectral gap from

we can apply Corollary 7.1 to these semigroups.

Note that in this case the bound scales with the dimension. Putting these inequalities

together with the bound given in Corollary 6.1 for the capacity, we have for the classical

capacity of this semigroup:

C(e

tL

) ≤

log

1 − e

−β(d+1)

(1 − e

−β

)

!

+ βd log(e)

!

e

−r(d,β)t

, (65)

with

r(d, β) =

8 ln (d + 1) + 2 ln

1 − e

−β(d+1)

1 − e

−β

!

+ 2βd + 22

!

−1

µ(d, β). (66)

Accepted in Quantum 2018-02-01, click title to verify 27

Figure 3: Plot of the minimum of the bound in Equation (65) and the trivial bound C(e

tL

β

) ≤ log(10) '

3.32 as a function of the inverse temperature and time for the classical capacity of the Davies generator

of the truncated harmonic oscillator with d + 1 = 10.

In this example we see that, as the estimate available on the gap scales with the

dimension, our estimates are not much better than the trivial log(d+1) for high dimensions

unless we are looking at large times.

8 Conclusion and open questions

We have introduced a framework similar to logarithmic Sobolev inequalities to study

the convergence of a primitive quantum dynamical semigroup towards its ﬁxed point in

the distance measure of sandwiched R´enyi divergences. These techniques can be used

to obtain mixing time bounds and strong converse bounds on the classical capacity of a

quantum dynamical semigroup. Moreover, these results show that a logarithmic Sobolev

inequality or hypercontractive inequality always implies a mixing time bound without the

assumption of l

p

-regularity (which is still not known to hold for general Liouvillians [2]).

Although we have some structural results concerning the constants β

p

, some questions

remain open. For logarithmic Sobolev inequalities it is known that α

2

≤ α

p

for p ≥ 1 under

the assumption of l

p

-regularity (see [2]). It would be interesting to investigate if a result

of similar ﬂavor also holds for the β

p

. In all examples discussed here, β

2

and α

2

are of the

same order and it would be interesting to know if this is always the case. The framework of

logarithmic Sobolev inequalities has recently been extended to the nonprimitive case [37].

It should be possible to develop a similar theory for the sandwiched R´enyi divergences to

get rid of regularity assumptions present in their main results, as we did here for the usual

logarithmic Sobolev constants.

We restricted our analysis to the sandwiched R´enyi divergences, as they can be ex-

pressed in terms of relative densities and noncommutative l

p

-norms. This allowed us to

connect the convergence under the sandwiched divergences to the theory of hypercontrac-

tivity and to use tools from interpolation theory which were vital to prove estimates on

capacities. There are however other noncommutative generalizations of the R´enyi diver-

Accepted in Quantum 2018-02-01, click title to verify 28

gences that are known to contract under quantum channels, such as the one discussed

in [38, p. 113]. It would be interesting to explore the entropy production and convergence

under semigroups for this and other families of divergences in future work.

In a similar vein, it would be interesting to investigate the entropy production or

convergence rate for the range

1

2

< p < 1, as the sandwiched R´enyi divergences are known

to contract under quantum channels for all p >

1

2

[39]. However, looking closely at the

proof of Theorem 3.1, we see that for p < 1 the sandwiched R´enyi divergence is only

diﬀerentiable at t = 0 if the initial state has full rank. The study of the convergence of

these divergences for p < 1 therefore requires a diﬀerent technical approach than that

of this work. Finally, it would of course be relevant to obtain bounds on the β

p

for

more examples without relying on the estimate based on the spectral gap, such as Davies

generators.

Acknowledgments

We thank David Reeb and Oleg Szehr for interesting and engaging discussions, which

initiated this project. D.S.F acknowledges support from the graduate program TopMath of

the Elite Network of Bavaria, the TopMath Graduate Center of TUM Graduate School at

Technische Universit¨at M¨unchen, the Deutscher Akademischer Austauschdienst(DAAD)

and by the Technische Universit¨at M¨unchen Institute for Advanced Study, funded by the

German Excellence Initiative and the European Union Seventh Framework Programme

under grant agreement no. 291763.

A.M-H acknowledges ﬁnancial support from the European Research Council (ERC

Grant Agreement no 337603), the Danish Council for Independent Research (Sapere

Aude), the Swiss National Science Foundation (project no PP00P2 150734), and the VIL-

LUM FONDEN via the QMATH Centre of Excellence (Grant No. 10059).

This work was supported by the German Research Foundation (DFG) and the Tech-

nical University of Munich within the funding programme Open Access Publishing.

This work was initiated at the BIRS workshop 15w5098, Hypercontractivity and Log

Sobolev Inequalities in Quantum Information Theory. We thank BIRS and the Banﬀ

Centre for their hospitality.

A Taylor Expansion of the Dirichlet Form

In order to compute the Taylor expansions of the Dirichlet forms and of the noncommu-

tative l

p

-norms we deﬁne f

p

:

2

→ and g

p

:

2

→ for p > 1 as

f

p

(x, y) =

(

(p − 1)x

p−2

if x = y

x

p−1

−y

p−1

x−y

else

(67)

and

g

p

(x, y) =







p(p−1)

2

x

p−2

if x = y

(p−1)x

p

−px

p−1

y+y

p

(x−y)

2

else.

(68)

Note that the following identity holds

g

p

(x, y) + g

p

(y, x) = pf

p

(x, y) (69)

for any x, y ∈ .

Accepted in Quantum 2018-02-01, click title to verify 29

Lemma A.1 (Taylor expansion). Consider a primitive, reversible Liouvillian L : M

d

→

M

d

with full rank ﬁxed point σ ∈ D

+

d

. Let X ∈ M

d

be an eigenvector of

ˆ

L = Γ

−1

σ

◦L◦Γ

σ

with corresponding eigenvalue λ ∈ (i.e.

ˆ

L(X) = λX) and Y



= 1

d

+ X. Then we have

E

L

p

(Y



) =

p

2(p − 1)





2

2

X

1≤i≤j≤d

f

p

(s

i

, s

j

)b

ij

b

ji

+ O(

3

)





. (70)

and

κ

p

(Y



) =

p

2

p − 1

X

1≤i≤j≤d

f

p

(s

i

, s

j

)b

ij

b

ji

+ O(

3

). (71)

Where σ

1/p

= Udiag (s

1

, s

2

, . . . , s

d

) U

†

and b

ij

= (U

†

σ

1/2p

Xσ

1/2p

U)

ij

.

Proof. Using that X ∈ M

d

is an eigenvector of

ˆ

L a simple computation gives

E

L

p

(Y



) =

pσ

2(p − 1)

tr



(A + B)

p−1

B



for A = σ

1/p

and B = σ

1/2p

Xσ

1/2p

. Note that

d

k

d

k

tr



(A + B)

p−1

B





=0

= tr



D

k

F (A)(B, B, . . . , B)B



for the matrix power F : M

d

→ M

d

given by F (X) = X

p−1

. We apply the Daleckii-Krein

formula (see [40] and [41, Theorem 2.3.1.] for the version used here) and obtain

d

d

tr



(A + B)

p−1

B





=0

=

d

X

i=1

d

X

j=1

f

p

(s

i

, s

j

)b

ij

b

ji

.

Using that

tr



(A + B)

p−1

B





=0

= h1

d

, Xi

σ

= 0

by the orthogonality of eigenvectors, and that f

p

(x, y) = f

p

(y, x) for any x, y ∈ we

obtain (70).

To obtain (71) we write

kY



k

p

p,σ

= tr ((A + B)

p

)

with A = σ

1/p

and B = σ

1/2p

Xσ

1/2p

as above. Again it is easy to see that

d

k

d

k

tr ((A + B)

p

)



=0

= tr



D

k

G(A)(B, B, . . . , B)



for the matrix power G : M

d

→ M

d

given by G(X) = X

p

. Using the Daleckii-Krein

formulas we obtain the derivatives

d

d

tr ((A + B)

p

)



=0

= p h1

d

, Xi

σ

= 0

d

2

d

2

tr ((A + B)

p

)



=0

=

d

X

i=1

d

X

j=1

g

p

(σ

i

, σ

j

)b

ij

b

ji

= p

X

1≤i≤j≤d

f

p

(σ

i

, σ

j

)b

ij

b

ji

Accepted in Quantum 2018-02-01, click title to verify 30

where we used the identity (69) in the last step. The above shows that

kY



k

p

p,σ

= 1 + 

2

p

X

1≤i≤j≤d

f

p

(σ

i

, σ

j

)b

ij

b

ji

+ O(

3

). (72)

With the well-known expansion ln(1 + x) = x −

x

2

+ O(x

3

) we obtain

κ

p

(Y



) = κ

p

(Y



) =

p

2

p − 1

X

1≤i≤j≤d

f

p

(λ

i

, λ

j

)b

ij

b

ji

+ O(

3

).

which is (71).

B Interpolation Theorems and Proof of Theorem 5.3

In order to prove Theorem 5.3 we will need the following special case of the Stein-Weiss

interpolation theorem [42, Theorem 1.1.1]. This classic result from interpolation spaces

has been applied recently to solve problems from quantum information theory, such as in

[7, Section III].

Theorem B.1 (Hadamard Three Line Theorem). Let S = {z ∈ C : 0 ≤ z ≤ 1} and

F : S → B(M

d

) be an operator-valued function holomorphic in the interior of S and

uniformly bounded and continuous on the boundary. Let σ ∈ D

+

d

and assume 1 ≤ p ≤ q ≤

∞. For 0 < θ < 1 deﬁne p

0

≤ p

θ

≤ p

1

by

1

p

θ

=

1 − θ

p

0

+

θ

p

1

Then for 0 ≤ y ≤ x ≤ 1 we have

kF (y) k

2→p

θ

,σ

≤ sup

a,b∈

kF (ia) k

1−θ

2→p

0

,σ

kF (x + ib) k

θ

2→p

1

,σ

(73)

One important consequence of the Stein-Weiss interpolation theorem is the following

interpolation result. We again refer to [42, Theorem 1.1.1] for a proof.

Theorem B.2 (Riesz-Thorin Interpolation Theorem). Let L : M

d

→ M

d

be a linear

map, 1 ≤ p

0

≤ p

1

≤ +∞ and 1 ≤ q

0

≤ q

1

≤ +∞. For θ ∈ [0, 1] deﬁne p

θ

to satisfy

1

p

θ

=

θ

p

0

+

1 − θ

p

1

and q

θ

analogously. Then for σ ∈ D

+

d

we have:

kLk

p

θ

→q

θ

,σ

≤ kLk

θ

p

0

→q

0

,σ

kLk

1−θ

p

1

→q

1

,σ

With these tools at hand we can ﬁnally prove Theorem 5.3:

Proof. Deﬁne E : M

d

→ M

d

by E (X) = tr (σX) 1

d

and set τ = t

2

() for some  > 0. In

the following we use T

z

= e

τzL

for z ∈ S = {z ∈ C : 0 ≤ Rez ≤ 1}. We will show that for

s ∈ [0, 1]:

k(T

s

− E) (X) k

2

1−s

,σ

≤ 

s

kXk

2,σ

. (74)

Accepted in Quantum 2018-02-01, click title to verify 31

The family of operators T

z

− E clearly satisﬁes the assumptions of the Stein-Weiss inter-

polation theorem. We therefore have

kT

s

− Ek

2→

2

1−s

,σ

≤ sup

a,b∈

kT

ia

− Ek

1−s

2→2,σ

kT

1+ib

− Ek

s

2→∞,σ

. (75)

Observe that by reversibility of L the map T

ia

is a unitary operator with respect to h·, ·i

σ

.

We also have T

ia

◦ E = E, as T

ia

(1

d

) = 1

d

. This gives

k(T

ia

− E) (X) k

2,σ

= kT

ia

(X −E (X)) k

2,σ

= kX −E (X) k

2,σ

≤ kXk

2,σ

,

where the last equality follows from kX −tr (σX) 1

d

k

2,σ

= min

c∈

kX −c1

d

k

2,σ

. We therefore

have

||T

ia

− E||

1−s

2→2,σ

≤ 1. (76)

Furthermore, by the unitarity of T

ib

we can compute

k(T

1+ib

− E) (X) k

∞,σ

= kT

ib

◦ (T

1

− E) (X) k

∞,σ

≤ kT

1

− Ek

2→∞,σ

kXk

2,σ

Using duality of the norms and that both T

1

and E are self-adjoint we have

kT

1+ib

− Ek

2→∞,σ

≤ kT

1

− Ek

2→∞,σ

= kT

1

− Ek

1→2,σ

=  (77)

using the deﬁnition of τ in the last equality. Inserting (76) and (77) into (75) we get



−s

k(T

s

− E) (X) k

2

1−s

,σ

≤ kX − E (X) k

2,σ

, (78)

as k(T

s

− E) (X) k

2

1−s

,σ

= k(T

s

− E) (X − E (X)) k

2

1−s

,σ

.

Taking the derivative of (78) with respect to s on both sides at s = 0 we get

1

2kX −E (X) k

2,σ



−2kX −E (X) k

2

2,σ

ln () + Ent

2,σ

(|X −E (X) |) − 2τ E (X)



≤ 0.

(79)

Rearranging the terms in (79) we obtain

Ent

2,σ

(|X −E (X) |) ≤ 2τE (X) + 2Var (X) ln () . (80)

In [1, Theorem 4.2] the following inequality (known as Rothaus’ inequality) was shown

Ent

2,σ

(X) ≤ Ent

2,σ

(|X −E (X) |) + 2Var (X) . (81)

Combining inequalities (81) with (80) and setting  =

1

e

we get t

2



1

e



α

2

(L) ≥

1

2

by the

deﬁnition of the LS constant.

To prove that the inequality is tight, consider the depolarizing Liouvillian L

σ

(X) =

tr (X) σ −X for some full rank σ ∈ D

+

d

. It is easy to see that Var

σ



e

t

ˆ

L

σ

X



= e

−t

Var

σ

(X)

and so t

2



e

−1



= 1 + ln



kσ

−1

k

∞

− 1



. Restricting to operators commuting with σ, it

follows from [21, Theorem A.1] that

α

2

(L

σ

) ≤ (1 − 2kσ

−1

k

−1

∞

)

1

2 ln (kσ

−1

k

∞

− 1)

.

Thus, for a sequence σ

n

∈ D

+

d

converging to a state that is not full rank we have

lim

n→∞

t

2



e

−1



α

2

(L

σ

n

) =

1

2

.

Accepted in Quantum 2018-02-01, click title to verify 32

References

[1] R. Olkiewicz and B. Zegarlinski. Hypercontractivity in noncommutative lp spaces. J.

Funct. Anal., 161(1):246 – 285, 1999. ISSN 0022-1236. DOI: 10.1006/jfan.1998.3342.

[2] M. J. Kastoryano and K. Temme. Quantum logarithmic Sobolev inequalities and

rapid mixing. J. Math. Phys., 54(5):052202, May 2013. DOI: 10.1063/1.4804995.

[3] M. M¨uller-Lennert, F. Dupuis, O. Szehr, S. Fehr, and M. Tomamichel. On quantum

R´enyi entropies: A new generalization and some properties. J. Math. Phys., 54(12):

122203, 2013. DOI: 10.1063/1.4838856.

[4] M. M. Wilde, A. Winter, and D. Yang. Strong Converse for the Classical Capacity

of Entanglement-Breaking and Hadamard Channels via a Sandwiched R´enyi Relative

Entropy. Commun. Math. Phys., 331:593–622, October 2014. DOI: 10.1007/s00220-

014-2122-x.

[5] C. King. The capacity of the quantum depolarizing channel. IEEE Trans. Inf. The-

ory., 49(1):221–229, 2003. DOI: 10.1109/TIT.2002.806153.

[6] H. Umegaki. Conditional expectation in an operator algebra. IV. Entropy and infor-

mation. Kodai Math. Sem. Rep., 14(2):59–85, 1962. DOI: 10.2996/kmj/1138844604.

[7] S. Beigi. Sandwiched R´enyi divergence satisﬁes data processing inequality. J. Math.

Phys., 54(12):122202, December 2013. DOI: 10.1063/1.4838855.

[8] F. Hiai, M. Ohya, and M. Tsukada. Suﬃciency, KMS condition and rela-

tive entropy in von Neumann algebras. Paciﬁc J. Math., 96(1):99–109. DOI:

10.1142/9789812794208˙0030.

[9] G. L. Gilardoni. On Pinsker’s and Vajda’s type inequalities for Csiszar’s f -divergences.

IEEE Trans. Inf. Theory., 56(11):5377–5386, Nov 2010. ISSN 0018-9448. DOI:

10.1109/TIT.2010.2068710.

[10] G. Lindblad. On the generators of quantum dynamical semigroups. Commun. Math.

Phys., 48(2):119–130. DOI: 10.1007/BF01608499.

[11] V. Gorini, A. Kossakowski, and E. C. G. Sudarshan. Completely positive dynam-

ical semigroups of N-level systems. J. Math. Phys., 17:821–825, May 1976. DOI:

10.1063/1.522979.

[12] D. Burgarth, G. Chiribella, V. Giovannetti, P. Perinotti, and K. Yuasa. Ergodic and

mixing quantum channels in ﬁnite dimensions. New J. Phys., 15(7):073045, July 2013.

DOI: 10.1088/1367-2630/15/7/073045.

[13] H. Spohn and J. L. Lebowitz. Irreversible thermodynamics for quantum sys-

tems weakly coupled to thermal reservoirs. Adv. Chem. Phys, 38:109–142. DOI:

10.1002/9780470142578.ch2.

[14] E.B. Davies. Quantum Theory of Open Systems. Academic Press, 1976. ISBN

9780122061509.

[15] H.P. Breuer and F. Petruccione. The Theory of Open Quantum Systems. OUP Oxford,

2007. ISBN 9780199213900. DOI: 10.1093/acprof:oso/9780199213900.001.0001.

[16] E.B. Davies. Generators of dynamical semigroups. J. Funct. Anal., 34(3):421 – 432,

1979. ISSN 0022-1236. DOI: 10.1016/0022-1236(79)90085-5.

[17] A. Kossakowski, A. Frigerio, V. Gorini, and M. Verri. Quantum detailed balance and

KMS condition. Commun. Math. Phys., 57(2):97–110, Jun 1977. ISSN 1432-0916.

DOI: 10.1007/BF01625769.

[18] H. Spohn. Entropy production for quantum dynamical semigroups. J. Math. Phys.,

19(5):1227–1230, 1978. DOI: 10.1063/1.523789.

Accepted in Quantum 2018-02-01, click title to verify 33

[19] K. Temme. Thermalization time bounds for Pauli stabilizer hamiltonians. Commun.

Math. Phys., 350(2):603–637, Mar 2017. ISSN 1432-0916. DOI: 10.1007/s00220-016-

2746-0.

[20] K. Temme, M. J. Kastoryano, M. B. Ruskai, M. M. Wolf, and F. Verstraete. The

χ

2

-divergence and mixing times of quantum Markov processes. J. Math. Phys., 51

(12):122201, December 2010. DOI: 10.1063/1.3511335.

[21] P. Diaconis and L. Saloﬀ-Coste. Logarithmic Sobolev inequalities for ﬁnite Markov

chains. Ann. Appl. Probab., 6(3):695–750, 08 1996. DOI: 10.1214/aoap/1034968224.

[22] A. M¨uller-Hermes, D. Stilck Fran¸ca, and M. M. Wolf. Relative entropy convergence

for depolarizing channels. J. Math. Phys., 57(2), 2016. DOI: 10.1063/1.4939560.

[23] K. Temme. Lower bounds to the spectral gap of Davies generators. J. Math. Phys.,

54(12):122110, December 2013. DOI: 10.1063/1.4850896.

[24] T. S. Cubitt, A. Lucia, S. Michalakis, and D. Perez-Garcia. Stability of local quantum

dissipative systems. Commun. Math. Phys., 337(3):1275–1315, 2015. ISSN 1432-0916.

DOI: 10.1007/s00220-015-2355-3.

[25] F. G. S. L. Brand˜ao, T. S. Cubitt, A. Lucia, S. Michalakis, and D. Perez-Garcia. Area

law for ﬁxed points of rapidly mixing dissipative quantum systems. J. Math. Phys.,

56(10):102202, October 2015. DOI: 10.1063/1.4932612.

[26] K. Temme, F. Pastawski, and M. J. Kastoryano. Hypercontractivity of quasi-

free quantum semigroups. J. Phys. A., 47(40):405303, 2014. DOI: 10.1088/1751-

8113/47/40/405303.

[27] M.A. Nielsen and I.L. Chuang. Quantum Computation and Quantum Information.

Cambridge Series on Information and the Natural Sciences. Cambridge University

Press, 2000. ISBN 9780521635035. DOI: 10.1017/cbo9780511976667.

[28] A. Winter. Coding theorem and strong converse for quantum channels. IEEE Trans.

Inf. Theory., 45(7):2481–2485, Nov 1999. ISSN 0018-9448. DOI: 10.1109/18.796385.

[29] T. Ogawa and H. Nagaoka. Strong converse to the quantum channel coding theo-

rem. IEEE Trans. Inf. Theory., 45(7):2486–2489, Nov 1999. ISSN 0018-9448. DOI:

10.1109/18.796386.

[30] R. K¨onig and S. Wehner. A Strong Converse for Classical Channel Coding Using

Entangled Inputs. Phys. Rev. Lett, 103(7):070504, August 2009. DOI: 10.1103/Phys-

RevLett.103.070504.

[31] M. Tomamichel, M. M. Wilde, and A. Winter. Strong converse rates for quantum com-

munication. In 2015 IEEE International Symposium on Information Theory (ISIT),

pages 2386–2390. IEEE, 2015. DOI: 10.1109/TIT.2016.2615847.

[32] A. M¨uller-Hermes, D. Stilck Fran¸ca, and M. M. Wolf. Entropy production of

doubly stochastic quantum channels. J. Math. Phys., 57(2):022203, 2016. DOI:

10.1063/1.4941136.

[33] S. B. Bravyi and A. Y. Kitaev. Quantum codes on a lattice with boundary. arXiv

preprint quant-ph/9811052, 1998.

[34] D. A. Lidar and T. A. Brun. Quantum error correction. Cambridge University Press,

2013. DOI: 10.1017/CBO9781139034807.

[35] R. Alicki, M. Fannes, and M. Horodecki. On thermalization in Kitaev’s 2D model.

J. Phys. A., 42(6):065303, 2009. DOI: 10.1088/1751-8113/42/6/065303.

[36] M. J. Kastoryano and F. G. S. L. Brand˜ao. Quantum Gibbs samplers: The commuting

case. Commun. Math. Phys., 344(3):915–957, Jun 2016. ISSN 1432-0916. DOI:

10.1007/s00220-016-2641-8.

Accepted in Quantum 2018-02-01, click title to verify 34

[37] I. Bardet. Estimating the decoherence time using non-commutative Functional In-

equalities. ArXiv preprint quant-ph/1710.01039, October 2017.

[38] M. Ohya and D. Petz. Quantum Entropy and Its Use. Theoretical and Mathematical

Physics. Springer, 2004. ISBN 9783540208068. DOI: 10.1007/978-3-642-57997-4.

[39] R. L. Frank and E. H. Lieb. Monotonicity of a relative R´enyi entropy. J. Math. Phys.,

54(12):122201, 2013. DOI: 10.1063/1.4838835.

[40] J. L. Daletskii and S. G. Krein. Integration and diﬀerentiation of functions of her-

mitian operators and applications to the theory of perturbations. AMS Translations

(2), 47:1–30, 1965. DOI: 10.1090/trans2/047/01.

[41] F. Hiai. Matrix analysis: matrix monotone functions, matrix means, and ma-

jorization. Interdisciplinary Information Sciences, 16(2):139–246, 2010. DOI:

10.4036/iis.2010.139.

[42] J. Bergh and J. Lofstrom. Interpolation spaces: an introduction, volume 223. Springer,

2012. DOI: 10.1007/978-3-642-66451-9.

Accepted in Quantum 2018-02-01, click title to verify 35

Comments

Products

Project