Tensors: Geometry and Applications J.M. Landsberg - Texas A&M ...
Tensors: Geometry and Applications J.M. Landsberg - Texas A&M ... Tensors: Geometry and Applications J.M. Landsberg - Texas A&M ...
22 1. Introduction throwing i for the first and s for the second is simply piqs. We may form a 6 ×20 matrix x = (xi,s) = (piqs) recording all the possible throws with their probabilities. Note xi,s ≥ 0 for all i,s and � i,s xi,s = 1. The matrix x has an additional property: x has rank one. Were the events not independent we would not have this additional constraint. Consider the set {T ∈ R6⊗R20 | Ti,s ≥ 0, � i,s Ti,s = 1}. This is the set of all discrete probability distributions on R6⊗R20 , and the set of the previous paragraph is this set intersected with the set of rank one matrices. Now say some gamblers were cheating with κ sets of dice, each with different probabilities. They watch to see how bets are made and then choose one of the sets accordingly. Now we have probabilities pi,u, qs,u, and a 6 × 20 × κ array zi,s,u with rank(z) = 1, in the sense that if we consider z as a bilinear map, it has rank one. Say that we cannot observe the betting. Then, to obtain the probabilities of what we can observe, we must sum over all the κ possibilities. We end up with an element of R 6 ⊗R 20 , with entries ri,s = � u pi,uqi,u. That is, we obtain a 6×20 matrix of probabilities of rank (at most) κ, i.e., an element of ˆσ κ,R 6 ⊗R 20. The set of all such distributions is the set of matrices of R 6 ⊗R 20 of rank at most κ intersected with PD6,20. This is an example of a Bayesian network . In general, one associates a graph to a collection of random variables having various conditional dependencies and then from such a graph, one defines sets (varieties) of distributions. More generally an algebraic statistical model is the intersection of the probability distributions with a closed subset defined by some dependence relations. Algebraic statistical models are discussed in Chapter 14. A situation discussed in detail in Chapter 14 are algebraic statistical models arising in phylogeny: Given a collection of species, say humans, monkeys, gorillas, orangutans, and ...., all of which are assumed to have evolved from some common ancestor, ideally we might like to reconstruct the corresponding evolutionary tree from sampling DNA. Assuming we can only measure the DNA of existing species, this will not be completely possible, but it might be possible to, e.g., determine which pairs are most closely related. One might imagine, given the numerous possibilities for evolutionary trees, that there would be a horrific amount of varieties to find equations for. A major result of E. Allmann and J. Rhodes states that this is not the case: Theorem 1.5.1.1. [10, 9] Equations for the algebraic statistical model associated to any bifurcating evolutionary tree can be determined explicitly from equations for ˆσ 4,C 4 ⊗C 4 ⊗C 4.
1.5. Algebraic Statistics and tensor networks 23 Figure 1.5.1. Evolutionary tree, extinct ancestor gave rise to 4 species See §14.2.3 for a more precise statement. Moreover, motivated by Theorem 1.5.1.1 (and the promise of a hand smoked alaskan salmon), equations for ˆσ 4,C 4 ⊗C 4 ⊗C 4 were recently found by S. Friedland, see §3.9.3 for the equations and salmon discussion. The geometry of this field has been well developed and exposed by geometers [257, 165]. I include a discussion in Chapter 14. 1.5.2. Tensor network states. Tensors describe states of quantum mechanical systems. If a system has n particles, its state is an element of V1⊗ · · · ⊗ Vn where Vj is a Hilbert space associated to the j-th particle. In many-body physics, in particular solid state physics, one wants to simulate quantum states of thousands of particles, often arranged on a regular lattice (e.g., atoms in a crystal). Due to the exponential growth of the dimension of V1⊗ · · · ⊗ Vn with n, any naïve method of representing these tensors is intractable on a computer. Tensor network states were defined to reduce the complexity of the spaces involved by restricting to a subset of tensors that is physically reasonable, in the sense that the corresponding spaces of tensors are only “locally” entangled because interactions (entanglement) in the physical world appear to just happen locally. Such spaces have been studied since the 1980’s. These spaces are associated to graphs, and go under different names: tensor network states, finitely correlated states (FCS), valence-bond solids (VBS), matrix product states (MPS), projected entangled pairs states (PEPS), and multi-scale entanglement renormalization ansatz states (MERA), see, e.g., [276, 121, 170, 120, 318, 87] and the references therein. I will use the term tensor network states. The topology of the tensor network is often modeled to mimic the physical arrangement of the particles and their interactions.
- Page 1: Tensors: Geometry and Applications
- Page 4 and 5: vi Contents §2.6. Symmetric and sk
- Page 6 and 7: viii Contents §8.6. The Chow varie
- Page 8 and 9: x Contents Index 429
- Page 10 and 11: xii Preface (5) Similarly I have us
- Page 12 and 13: xiv Preface This chapter may be dif
- Page 14 and 15: xvi Preface Chapter 12: Tensor deco
- Page 16 and 17: xviii Preface this and similar inst
- Page 18 and 19: xx Preface 0.5.3. Acknowledgments.
- Page 21 and 22: Introduction Chapter 1 The workhors
- Page 23 and 24: 1.1. The complexity of matrix multi
- Page 25 and 26: 1.2. Definitions from multilinear a
- Page 27 and 28: 1.2. Definitions from multilinear a
- Page 29 and 30: 1.3. Tensor decomposition 11 In pri
- Page 31 and 32: 1.3. Tensor decomposition 13 them t
- Page 33 and 34: 1.3. Tensor decomposition 15 source
- Page 35 and 36: 1.4. P v. NP and algebraic variants
- Page 37 and 38: 1.4. P v. NP and algebraic variants
- Page 39: 1.5. Algebraic Statistics and tenso
- Page 43: 1.6. Geometry and representation th
- Page 46 and 47: 28 2. Multilinear algebra I work ov
- Page 48 and 49: 30 2. Multilinear algebra symmetric
- Page 50 and 51: 32 2. Multilinear algebra 2.2.3. Ex
- Page 52 and 53: 34 2. Multilinear algebra The numbe
- Page 54 and 55: 36 2. Multilinear algebra (3) Show
- Page 56 and 57: 38 2. Multilinear algebra point for
- Page 58 and 59: 40 2. Multilinear algebra the actio
- Page 60 and 61: 42 2. Multilinear algebra where Sd
- Page 62 and 63: 44 2. Multilinear algebra There is
- Page 64 and 65: 46 2. Multilinear algebra (7) Show
- Page 66 and 67: 48 2. Multilinear algebra 2.6.13. T
- Page 68 and 69: 50 2. Multilinear algebra which is
- Page 70 and 71: 52 2. Multilinear algebra (2.7.1) (
- Page 72 and 73: 54 2. Multilinear algebra occur wit
- Page 74 and 75: 56 2. Multilinear algebra Definitio
- Page 76 and 77: 58 2. Multilinear algebra where ψ1
- Page 78 and 79: 60 2. Multilinear algebra Input: el
- Page 80 and 81: 62 2. Multilinear algebra V ⊗V
- Page 82 and 83: 64 2. Multilinear algebra V ⊗V
22 1. Introduction<br />
throwing i for the first <strong>and</strong> s for the second is simply piqs. We may form a<br />
6 ×20 matrix x = (xi,s) = (piqs) recording all the possible throws with their<br />
probabilities. Note xi,s ≥ 0 for all i,s <strong>and</strong> �<br />
i,s xi,s = 1. The matrix x has<br />
an additional property: x has rank one.<br />
Were the events not independent we would not have this additional<br />
constraint. Consider the set {T ∈ R6⊗R20 | Ti,s ≥ 0, �<br />
i,s Ti,s = 1}. This is<br />
the set of all discrete probability distributions on R6⊗R20 , <strong>and</strong> the set of the<br />
previous paragraph is this set intersected with the set of rank one matrices.<br />
Now say some gamblers were cheating with κ sets of dice, each with<br />
different probabilities. They watch to see how bets are made <strong>and</strong> then<br />
choose one of the sets accordingly. Now we have probabilities pi,u, qs,u, <strong>and</strong><br />
a 6 × 20 × κ array zi,s,u with rank(z) = 1, in the sense that if we consider z<br />
as a bilinear map, it has rank one.<br />
Say that we cannot observe the betting. Then, to obtain the probabilities<br />
of what we can observe, we must sum over all the κ possibilities. We end<br />
up with an element of R 6 ⊗R 20 , with entries ri,s = �<br />
u pi,uqi,u. That is, we<br />
obtain a 6×20 matrix of probabilities of rank (at most) κ, i.e., an element of<br />
ˆσ κ,R 6 ⊗R 20. The set of all such distributions is the set of matrices of R 6 ⊗R 20<br />
of rank at most κ intersected with PD6,20.<br />
This is an example of a Bayesian network . In general, one associates a<br />
graph to a collection of r<strong>and</strong>om variables having various conditional dependencies<br />
<strong>and</strong> then from such a graph, one defines sets (varieties) of distributions.<br />
More generally an algebraic statistical model is the intersection of the<br />
probability distributions with a closed subset defined by some dependence<br />
relations. Algebraic statistical models are discussed in Chapter 14.<br />
A situation discussed in detail in Chapter 14 are algebraic statistical<br />
models arising in phylogeny: Given a collection of species, say humans,<br />
monkeys, gorillas, orangutans, <strong>and</strong> ...., all of which are assumed to have<br />
evolved from some common ancestor, ideally we might like to reconstruct the<br />
corresponding evolutionary tree from sampling DNA. Assuming we can only<br />
measure the DNA of existing species, this will not be completely possible,<br />
but it might be possible to, e.g., determine which pairs are most closely<br />
related.<br />
One might imagine, given the numerous possibilities for evolutionary<br />
trees, that there would be a horrific amount of varieties to find equations<br />
for. A major result of E. Allmann <strong>and</strong> J. Rhodes states that this is not the<br />
case:<br />
Theorem 1.5.1.1. [10, 9] Equations for the algebraic statistical model<br />
associated to any bifurcating evolutionary tree can be determined explicitly<br />
from equations for ˆσ 4,C 4 ⊗C 4 ⊗C 4.