12.07.2015 Views

Chapter 8 Vector Spaces in Quantum Mechanics

Chapter 8 Vector Spaces in Quantum Mechanics

Chapter 8 Vector Spaces in Quantum Mechanics

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Chapter</strong> 8<strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong>We have seen <strong>in</strong> the previous <strong>Chapter</strong> that there is a sense <strong>in</strong> which the state of a quantumsystem can be thought of as be<strong>in</strong>g made up of other possible states. The aim here is to usethe example of the Stern-Gerlach experiment to develop this idea further, and to show thatthe states of a quantum system can be represented by vectors <strong>in</strong> a complex vector space.To beg<strong>in</strong>, we review some of the basic ideas of vectors, us<strong>in</strong>g the example of vectors <strong>in</strong>real two dimensional space.8.1 <strong>Vector</strong>s <strong>in</strong> Two Dimensional SpaceBelow is a summary of the important properties of vectors <strong>in</strong> physical space based on their<strong>in</strong>terpretation as mathematical objects that have both magnitude and direction. Position,velocity, force and so on are examples of such vectors. The <strong>in</strong>tention is not to give acomplete discussion, but to highlight a number of important properties of such vectors thathave analogues <strong>in</strong> the case of quantum states, <strong>in</strong>clud<strong>in</strong>g the property that two vectors canbe comb<strong>in</strong>ed to produce another vector, and that ‘how much’ of one vector is conta<strong>in</strong>ed<strong>in</strong> another can be measured via the <strong>in</strong>ner product of two vectors.8.1.1 L<strong>in</strong>ear Comb<strong>in</strong>ations of <strong>Vector</strong>s – <strong>Vector</strong> AdditionConsider two non-coll<strong>in</strong>ear vectors ˆv 1 and ˆv 2 , as illustrated <strong>in</strong> Fig. (8.1). Multiples ofthese pair of vectors can be added together <strong>in</strong> the fashion illustrated <strong>in</strong> Fig. (8.1) to formanother vector. Conversely, any other vector v can be expressed <strong>in</strong> terms of ˆv 1 and ˆv 2us<strong>in</strong>g appropriate values for the components a and b of v, i.e.v = aˆv 1 + bˆv 2 . (8.1)The right hand side of this equation is known as a l<strong>in</strong>ear comb<strong>in</strong>ation of the vectors ˆv 1 andˆv 2 . The particular po<strong>in</strong>ts to take away from this is that comb<strong>in</strong><strong>in</strong>g vectors produces othervectors, analogous to our observation above that comb<strong>in</strong><strong>in</strong>g states produces other states,and that the components a and b are a measure of how much of ˆv 1 and ˆv 2 respectively gotowards mak<strong>in</strong>g up the vector v.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 78These vectors are real vectors <strong>in</strong> that the coefficients a and b are real numbers. We canreadily generalize the ideas above by allow<strong>in</strong>g a and b to be complex numbers. Thismakes it difficult to draw figures such as Fig. (8.1), but the idea can be still reta<strong>in</strong>ed thatthe absolute values |a| and |b| will then represent the extent to which v is made up of thetwo vectors v 1 and v 2 .vv 1v 2û 22û 11.5û 2vû 1(a) (b) (c)Figure 8.1: Examples of vector addition. (a) Two arbitrary basis vectors v 1 and v 2 comb<strong>in</strong><strong>in</strong>gto give v; (b) A pair of orthonormal basis vectors; (c) The vector v expressed as a l<strong>in</strong>earcomb<strong>in</strong>ation of the basis vectors û 1 and û 2 : v = 2û 1 + 1.5û 2 .The two vectors ˆv 1 and ˆv 2 <strong>in</strong>troduced above are arbitrary except <strong>in</strong>sofar as they are notcoll<strong>in</strong>ear. They are known as basis vectors <strong>in</strong> the sense that any vector can be writtenas a l<strong>in</strong>ear comb<strong>in</strong>ation of them. There is effectively an <strong>in</strong>f<strong>in</strong>ite number of choices forthe basis vectors, and <strong>in</strong> fact it is possible to choose three or more vectors to be basisvectors. But the m<strong>in</strong>imum number is two, if we wish to be able to describe any vector <strong>in</strong>the plane as a l<strong>in</strong>ear comb<strong>in</strong>ation of basis vectors. The collection of all the vectors thatcan be constructed by tak<strong>in</strong>g l<strong>in</strong>ear comb<strong>in</strong>ations of these basis vectors us<strong>in</strong>g any complexnumbers a and b as components is known as a complex vector space, and s<strong>in</strong>ce two basisvectors are needed, the vector space is said to be of dimension two.This vector space possess more structure than that implied by simply form<strong>in</strong>g variousl<strong>in</strong>ear comb<strong>in</strong>ations. <strong>Vector</strong>s that can be drawn <strong>in</strong> a plane, as <strong>in</strong> Fig. (8.1), i.e. for whichthe coefficients a and b are real, can be clearly seen to have different lengths and relativeorientations. These properties of vectors are encompassed <strong>in</strong> the def<strong>in</strong>ition of the <strong>in</strong>ner,scalar or dot product of pairs of vectors.8.1.2 Inner or Scalar ProductsThe <strong>in</strong>ner or scalar product of two real vectors v 1 and v 2 is def<strong>in</strong>ed byv 1 · v 2 = v 1 v 2 cos θ (8.2)where v 1 and v 2 are the lengths of v 1 and v 2 respectively, and θ is the angle between them.Us<strong>in</strong>g the idea of an <strong>in</strong>ner or scalar product v 1 · v 2 it is possible to <strong>in</strong>troduce a particularlyuseful pair of basis vectors. To this end, consider two vectors û 1 and û 2 that satisfyû 1 · û 1 = û 2 · û 2 = 1 (8.3)


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 79i.e. they have unit length, hence they are known as unit vectors, andû 1 · û 2 = 0 (8.4)i.e. they are orthogonal. This pair of vectors û 1 and û 2 is said to be orthonormal. Asbefore, they form a basis <strong>in</strong> that multiples of each vector can be added together <strong>in</strong> thefashion illustrated <strong>in</strong> Fig. (8.1) to form another vector. Conversely, any vector v can beexpressed us<strong>in</strong>g appropriate values for the components of v, i.e.v = aû 1 + bû 2 . (8.5)The components a and b, which represent ‘how much’ of the vector v is made up of thevectors û 1 and û 2 are given bya = û 1 · v and b = û 2 · v (8.6)A well known example of a vector expressed as a l<strong>in</strong>ear comb<strong>in</strong>ation of a pair of unitvectors is given by the position vector r written with respect to the unit vectors î and ĵ:r = xî + yĵ. (8.7)Once aga<strong>in</strong>, if we allow for the possibility of complex vectors, then the def<strong>in</strong>ition of the<strong>in</strong>ner product is changed to allow for this. Thus the <strong>in</strong>ner product of two vectors v 1 andv 2 is now v ∗ 1 · v 2, and orthonormal vectors satisfyu ∗ 1 · u 1 = u ∗ 2 · u 2 = 1u ∗ 1 · u 2 = u ∗ 2 · u 1 = 0.(8.8)Complex basis vectors are not exotic mathematical entities. A simple example is3i + 4iju 1 =54i − 3iju 2 =5(8.9)though it is true to say that they are not often encountered <strong>in</strong> mechanics, except at an advancedlevel, but are, as we shall see, rout<strong>in</strong>ely found <strong>in</strong> quantum mechanics. Because ofthat, it is convenient to <strong>in</strong>troduce a new notation for <strong>in</strong>ner product (one that is commonlyused <strong>in</strong> pure mathematics) which is to write the <strong>in</strong>ner product of two vectors v 1 and v 2 as(v 1 , v 2 ), so that we would have here(v 1 , v 2 ) = v ∗ 1 · v 2. (8.10)The value of this notation is simply that it is more general and does not tie us to the geometricalnotion of <strong>in</strong>ner product that we are used to us<strong>in</strong>g with position vectors, velocityvectors and the like.8.2 Sp<strong>in</strong> Half <strong>Quantum</strong> States as <strong>Vector</strong>sWe now need to exam<strong>in</strong>e the way <strong>in</strong> which the quantum states for a sp<strong>in</strong> half system can beseen to fit <strong>in</strong> with the idea of be<strong>in</strong>g considered as vectors. To see how this comes about, we


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 80will show that there is a perfect analogy between Eqs. (7.43) and (7.44) and correspond<strong>in</strong>grelationships for ord<strong>in</strong>ary (complex) vectors. Return<strong>in</strong>g to the Stern-Gerlach examplediscussed <strong>in</strong> the preced<strong>in</strong>g <strong>Chapter</strong> we obta<strong>in</strong>ed there an expression Eq. (7.44)|S 〉 = |+〉〈+|S 〉 + |−〉〈−|S 〉.for the state of a sp<strong>in</strong> half atom expressed <strong>in</strong> terms of the states |±〉, which are statesfor which the atom has a z component of sp<strong>in</strong> equal to ± 1 , and 〈±|S 〉 are probability2amplitudes (complex numbers) whose magnitude <strong>in</strong> some sense tells us ‘how much’ ofthe states |±〉 are to be found <strong>in</strong> the state |S 〉. This result was obta<strong>in</strong>ed by ‘cancell<strong>in</strong>g’ thecommon factor ‘〈S ′ |’ from the result〈S ′ |S 〉 = 〈S ′ |+〉〈+|S 〉 + 〈S ′ |−〉〈−|S 〉.What we should bear <strong>in</strong> m<strong>in</strong>d here is that we can recover this relationship between probabilityamplitudes by re<strong>in</strong>troduc<strong>in</strong>g ‘〈S ′ |’ <strong>in</strong>to Eq. (7.44) for any chosen f<strong>in</strong>al state, yield<strong>in</strong>gan expression for the probability amplitudes as needed. Thus, as has been said before,Eq. (7.44) effectively represents a ‘template’ <strong>in</strong>to which we <strong>in</strong>sert the appropriate <strong>in</strong>formation<strong>in</strong> order to recover the required probability amplitudes. We can also note that thereis noth<strong>in</strong>g sacred about choos<strong>in</strong>g to cancel the the common factor |S 〉 – we could equallyas well cancel the factor |S 〉, yield<strong>in</strong>g〈S ′ | = 〈S ′ |+〉〈+| + 〈S ′ |−〉〈−|. (8.11)Hav<strong>in</strong>g carried out this cancellation procedure, what has reappeared is the state of a quantumsystem i.e. |S 〉 which was <strong>in</strong>troduced earlier <strong>in</strong> a different context, specifically asbe<strong>in</strong>g noth<strong>in</strong>g more than a way of writ<strong>in</strong>g down all that we knew about the state of aquantum system. There, the notation had no mathematical significance, but <strong>in</strong> the manner<strong>in</strong> which it appears here, it seems to have acquired a mathematical mean<strong>in</strong>g of some k<strong>in</strong>d.The aim is to see what this mean<strong>in</strong>g might be, and <strong>in</strong> do<strong>in</strong>g so, we will show that theexpression for |S 〉 has many of the properties that we associate with express<strong>in</strong>g a vectoras a sum of its components.We beg<strong>in</strong> by consider<strong>in</strong>g the probability amplitudes 〈S ′ |S 〉 themselves. These are complexnumbers <strong>in</strong> general for arbitrary sp<strong>in</strong> directions, (but they were real <strong>in</strong> the particularStern-Gerlach example used above), such that their modulus squared |〈S ′ |S 〉| 2 is the probabilityP(S ′ |S ) of observ<strong>in</strong>g the sp<strong>in</strong> to be <strong>in</strong> the state |S ′ 〉 given that it was <strong>in</strong> the state |S 〉.In particular, 〈S |S 〉 is the probability amplitude of observ<strong>in</strong>g the sp<strong>in</strong> to be <strong>in</strong> the state |S 〉given that it was <strong>in</strong> the state |S 〉. This will have to be unity, i.e. P(S |S ) = |〈S |S 〉| 2 = 1.Thus we can conclude that〈S |S 〉 = e iη (8.12)where η is an arbitrary phase. It turns out that this phase always cancels out <strong>in</strong> anycalculation of observable quantities, so it is conventionally set to zero, and hence〈S |S 〉 = 1. (8.13)The state |S 〉 is said to be normalized to unity. As a particular case, this last result impliesthat〈+|+〉 = 1. (8.14)


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 81We can now consider the probability amplitude 〈+|S 〉 obta<strong>in</strong>ed by replac<strong>in</strong>g S ′ by + <strong>in</strong>the above expression for 〈S ′ |S 〉:We have seen that we can put 〈+|+〉 = 1, so we have〈+|S 〉 = 〈+|+〉〈+|S 〉 + 〈+|−〉〈−|S 〉. (8.15)〈+|−〉〈−|S 〉 = 0 (8.16)which has to be true no matter what the state |S 〉 happens to be, i.e. no matter what valuethe probability amplitude 〈−|S 〉 is. Thus we conclude thatSimilarly we can show thatThus we can set up a comparison:〈+|−〉 = 0. (8.17)〈−|−〉 = 1 and 〈−|+〉 = 0. (8.18)〈+|+〉 = 1←→û ∗ 1 · û 1 = 1〈+|−〉 = 0〈−|−〉 = 1←→←→û ∗ 2 · û 1 = 0û ∗ 2 · û 2 = 1(8.19)〈−|+〉 = 0←→û ∗ 2 · û 1 = 0where we have chosen to make the comparison between the probability amplitudes and the<strong>in</strong>ner product of complex unit vectors as we are deal<strong>in</strong>g with probability amplitudes thatare, <strong>in</strong> eneral, complex numbers. This comparison implies the follow<strong>in</strong>g correspondences:|+〉 ←→ û 1 |−〉 ←→ û 2〈+| ←→ û ∗ 1 〈−| ←→ û ∗ 2 . (8.20)We know that 〈+|S 〉 and 〈−|S 〉 are both just complex numbers, so call them a and brespectively. If we now write|S 〉 = a|+〉 + b|−〉 (8.21)we establish a perfect correspondence with the expressionv = a û 1 + b û 2 . (8.22)On the basis of this result, we are then tempted to <strong>in</strong>terpret the ket |S 〉 as a vector expressedas a l<strong>in</strong>ear comb<strong>in</strong>ation of two orthonormal basis vectors |±〉. We can push the analogyfurther if we once aga<strong>in</strong> use the fact that 〈S |S 〉 = 1, so that〈S |S 〉 = 1 = 〈S |−〉〈−|S 〉 + 〈S |+〉〈+|S 〉 (8.23)On the other hand, the total probability of observ<strong>in</strong>g the system <strong>in</strong> either of the states |±〉must add up to unity, which means thatP(+|S ) + P(−|S ) = |〈+|S 〉| 2 + |〈−|S 〉| 2 = 1. (8.24)By compar<strong>in</strong>g the last two equations, and not<strong>in</strong>g that|〈±|S 〉| 2 = 〈±|S 〉〈±|S 〉 ∗ (8.25)


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 82we conclude thatIf we now consider〈±|S 〉 = 〈S |±〉 ∗ . (8.26)〈S ′ | = 〈S ′ |+〉〈+| + 〈S ′ |−〉〈−|and use the result, Eq. (8.26), 〈±|S ′ 〉 = 〈S ′ |±〉 ∗ , we can write this as〈S ′ | = 〈+|S ′ 〉 ∗ 〈+| + 〈−|S ′ 〉 ∗ 〈−| (8.27)or, expressed <strong>in</strong> terms of a ′ = 〈+|S ′ 〉 and b ′ = 〈−|S ′ 〉, we have〈S ′ | = a ′∗ 〈+| + b ′∗ 〈−| (8.28)which has a perfect correspondence with an ord<strong>in</strong>ary vector ˆv ′ <strong>in</strong> the formv ′∗ = a ′∗ û ∗ 1 + b′∗ û ∗ 2 . (8.29)So the bra 〈S ′ | is itself a vector, a bra vector, which can be thought of as be<strong>in</strong>g justthe complex conjugate of the correspond<strong>in</strong>g ket vector |S ′ 〉. But while it is occasionallyuseful to th<strong>in</strong>k this way, it is not strictly true mathematically, and this way of view<strong>in</strong>g a bravector will not be employed here. Instead, as will be shown shortly, an <strong>in</strong>terpretation of ketvectors as column vectors leads to the <strong>in</strong>terpretation of bra vectors as row vectors. A moremathematical standpo<strong>in</strong>t also leads to <strong>in</strong>terpretation of bra vectors as ‘l<strong>in</strong>ear functionals’,that is, a bra is a mathematical operator that acts on a ket vector to produce a complexnumber.F<strong>in</strong>ally, to complete the correspondence, we note that the probability amplitude 〈S ′ |S 〉can be written〈S ′ |S 〉 = a ′∗ a + b ′∗ b (8.30)which can be compared with the <strong>in</strong>ner product v ′∗·v, or written <strong>in</strong> the more formal notation:(v ′ , v) = a ′∗ a + b ′∗ b (8.31)which tells us that the probability amplitude can be considered as be<strong>in</strong>g simply the <strong>in</strong>nerproduct of the two vectors |S ′ 〉 and |S 〉, i.e.〈S ′ |S 〉 = (|S ′ 〉, |S 〉). (8.32)In other words, we have a perfect analogy between the two dimensional complex vectorspace formed by l<strong>in</strong>ear comb<strong>in</strong>ations of the unit vectors û 1 and û 2 discussed <strong>in</strong> Section8.1 and a complex vector space consist<strong>in</strong>g of all the l<strong>in</strong>ear comb<strong>in</strong>ations of the states |±〉.The ket vectors |±〉 are referred to as basis states, analogous to û 1 and û 2 be<strong>in</strong>g referredto as basis vectors.Different sp<strong>in</strong> states can be constructed by form<strong>in</strong>g l<strong>in</strong>ear comb<strong>in</strong>ations |S 〉 = a|+〉 + b|−〉of these basis states, with a and b be<strong>in</strong>g allowed to range over all the complex numbers,though we have only been look<strong>in</strong>g at l<strong>in</strong>ear comb<strong>in</strong>ations with real coefficients. By limit<strong>in</strong>ga and b to be real numbers, we are construct<strong>in</strong>g states whose measured componentof sp<strong>in</strong> all lie <strong>in</strong> the same plane, which, with the system of axes we have been us<strong>in</strong>g here,is the XZ plane. If the coefficients a and b are complex, the state |S 〉 represents a state<strong>in</strong> which the measured component S = S · n is along a direction ˆn that po<strong>in</strong>ts out of this


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 83plane, though we will not attempt to prove this here (see Eq. (8.46)). So, any l<strong>in</strong>ear comb<strong>in</strong>ation<strong>in</strong> which a and b are any complex numbers also def<strong>in</strong>es a possible sp<strong>in</strong> state ofthe sp<strong>in</strong> half system. Thus all the possible l<strong>in</strong>ear comb<strong>in</strong>ations of |±〉, i.e. comb<strong>in</strong>ationsof the form a|+〉 + b|−〉 where a and b are complex numbers form a complex vector spaceknown as the state space of the system.The quantum state vectors can also be ‘multiplied’ together – the <strong>in</strong>ner product of the twovectors |S ′ 〉 and |S 〉 is just the probability amplitude 〈S ′ |S 〉. In particular, the basis statesare normalized to unity, i.e. they are unit vectors, and they are orthogonal to each other,i.e. they form a pair of orthonormal basis states.The term<strong>in</strong>ology often adopted is to say that the state vector |S 〉 = a|+〉 + b|−〉 is a ‘l<strong>in</strong>earsuperposition’ of the two states |±〉. The probability amplitudes a = 〈+|S 〉 and b = 〈−|S 〉represent ‘how much’ of the states |±〉 are conta<strong>in</strong>ed with<strong>in</strong> the state |S 〉 to the extent that|〈±|S 〉| 2 is the probability of the z component of sp<strong>in</strong> be<strong>in</strong>g found to have the values ± 1 2 .One difference between ord<strong>in</strong>ary vectors and quantum state vectors is the importance ofthe ‘normalization condition’, i.e. the requirement that 〈S |S 〉 = 1, which must hold truegiven the <strong>in</strong>terpretation of the <strong>in</strong>ner product as a probability amplitude. But how can thisbe reconciled with the statement above that any l<strong>in</strong>ear comb<strong>in</strong>ation of the basis states is apossible state of the system? How can a state vector such as |˜S 〉 = |+〉 + |−〉 which has theproperty〈˜S |˜S 〉 = 2 (8.33)be a physically acceptable state vector as it seems to be say<strong>in</strong>g that the probability off<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> the state |˜S 〉 given that it is <strong>in</strong> the state |˜S 〉 is 4, which does not makesense. But, if we def<strong>in</strong>e a new vector |S 〉 by|S 〉 =|˜S 〉√〈˜S |˜S 〉= 1 √2|˜S 〉 (8.34)then automatically |S 〉 will have the required normalization property – it is said to benormalized to unity. So, rather than abandon<strong>in</strong>g giv<strong>in</strong>g a physical <strong>in</strong>terpretation of statevectors which are not normalized to unity, the approach adopted is that we can multiplyany state vector by any factor and say that it still represents the same physical state,i.e. |S 〉 and |˜S 〉 = a|S 〉, where a is any complex number, represent the same physical state.However, it is only the normalized state |S 〉 that should be used <strong>in</strong> any calculations <strong>in</strong>order to be certa<strong>in</strong> that probability is properly accounted for.This can be summarized <strong>in</strong> a table:


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 84Classical vector<strong>Quantum</strong> state vector for sp<strong>in</strong>half systemBasis <strong>Vector</strong>s û 1 , û 2 |+〉, |−〉Inner Product (v 1 , v 2 ) = v ∗ 1 · v 2 (|S ′ 〉, |S 〉) = 〈S ′ |S 〉Orthonormality û ∗ 1 · û 1 = û ∗ 2 · û 2 = 1û ∗ 1 · û 2 = û ∗ 2 · û 1 = 0〈+|+〉 = 〈−|−〉 = 1〈+|−〉 = 〈−|+〉 = 0L<strong>in</strong>earcomb<strong>in</strong>ationˆv = aû 1 + bû 2a and b complex numbers|S 〉 = a|+〉 + b|−〉a and b complex numbersNormalisation — 〈S |S 〉 = 1A More Detailed Analysis One of the important properties of vectors is that two ormore of them can be comb<strong>in</strong>ed as a ‘l<strong>in</strong>ear comb<strong>in</strong>ation’ to produce a third. If we are toconsider quantum states as vectors, then this very basic property must also be possessedby quantum states. In the above, we have not really shown that any l<strong>in</strong>ear comb<strong>in</strong>ation ofthe basis states |±〉, does <strong>in</strong>deed represent a possible state of the system, even if we restrictourselves to the case of the measur<strong>in</strong>g sp<strong>in</strong> components <strong>in</strong> the XZ plane. But a moredetailed argument than that just presented can be used to strengthen this conclusion. Tosee how this comes about, we return to the above expression Eq. (7.18) for the probabilityamplitude 〈S f = 1|S 2 i = 1〉:2〈S f = 1 2 |S i = 1 2 〉 =〈S f = 1 2 |S I = 1 2 〉〈S I = 1 2 |S i = 1 2 〉+ 〈S f = 1 2 |S I = − 1 2 〉〈S I = − 1 2 |S i = 1 2 〉. (8.35)We can ‘cancel’ 〈S f = 1 | from this expression and write2|S i = 1〉 = |S 2 I = 1〉〈S 2 I = 1|S 2 i = 1〉 + |S 2 I = − 1〉〈S 2 I = − 1|S 2 i = 1 〉 (8.36)2In Section 7.3, Eq. (7.33), the quantity 〈S f = 1|S 2 i = 1 〉 was shown to be given by2〈S f = 1 2 |S i = 1 2 〉 = cos[(θ f − θ i )/2]with the phase factor exp(iΦ) put equal to unity, and with ˆn and ˆm, the directions of themagnetic fields, conf<strong>in</strong>ed to the XZ plane. The probability amplitudes for the system topass through the <strong>in</strong>termediate states |S I = ± 1〉, that is, 〈S 2 I = ± 1|S 2 i = 1 〉 are likewise2given by〈S I = 1 2 |S i = 1 2 〉 = cos[ 1 2 (θ I − θ i )]〈S I = − 1 2 |S i = 1 2 〉 = cos[ 1 2 (θ I + π − θ i )] = − s<strong>in</strong>[ 1 2 (θ I − θ i )].(8.37)so that Eq. (8.36) becomes|S i = 1〉 = cos[ 1(θ 2 2 I − θ i )]|S I = + 1〉 − s<strong>in</strong>[ 1(θ 2 2 I − θ i )]|S I = − 1 〉. (8.38)2


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 85To further simplify th<strong>in</strong>gs, we will assume that θ I = 0, i.e. the ˆl vector is <strong>in</strong> the z direction.In addition, we will make the notational simplification already used above def<strong>in</strong>ed by|S i = 1 〉 →|S 〉2|S I = ± 1 〉 →|±〉2〈S I = ± 1|S 2 i = 1 〉 →〈±|S 〉2so that |±〉 = |S z = ± 1 〉, and Eq. (8.36) becomes2or, us<strong>in</strong>g Eq. (8.37)|S 〉 = |+〉〈+|S 〉 + |−〉〈−|S 〉. (8.39)|S 〉 = cos( 1 2 θ i)|+〉 + s<strong>in</strong>( 1 2 θ i)|−〉. (8.40)We are now at the po<strong>in</strong>t at which we can beg<strong>in</strong> to supply an <strong>in</strong>terpretation to this equation.What this equation is say<strong>in</strong>g is that the comb<strong>in</strong>ation cos( 1θ 2 i)|+〉+s<strong>in</strong>( 1θ 2 i)|−〉, and |S 〉, bothrepresent the same th<strong>in</strong>g – the atomic sp<strong>in</strong> is <strong>in</strong> a state for which S i = 1 . In other words,2if we were presented with the comb<strong>in</strong>ation:1√2|+〉 + 1 √2|−〉 (8.41)we immediately see that cos( 1 2 θ i) = 1/ √ 2 and s<strong>in</strong>( 1 2 θ i) = 1/ √ 2, and hence θ i = 90 ◦ . Thusthe magnetic field is po<strong>in</strong>t<strong>in</strong>g <strong>in</strong> the direction 90 ◦ to the z direction, i.e. <strong>in</strong> the x direction,and hence the sp<strong>in</strong> state of the atom is the state |S 〉 = |S x = 1 2 〉.But what if we were presented with the comb<strong>in</strong>ation 2|+〉 + 2|−〉? Here, we cannot f<strong>in</strong>dany angle θ i , so it appears that this comb<strong>in</strong>ation is not a possible state of the atomic sp<strong>in</strong>.But we can write this as2 √ 1√22[|+〉 + √ 1 ]|−〉(8.42)2which we can now understand as represent<strong>in</strong>g 2 √ 2|S x = 1〉. Is 2 √ 2|S2 x = 1 〉 a different2physical state of the system to |S x = 1 〉? Well, it is our notation, so we can say what we2like, and what turns out to be preferable is to say that α|S 〉 describes the same physicalstate as |S 〉, for any value of the constant α. Thus, we can say that 2|+〉+2|−〉 is also a stateof the system, namely 2 √ 2|S x = 1 〉, which represents the same physical <strong>in</strong>formation2about the state of the system as |S x = 1〉.2Thus any comb<strong>in</strong>ation C + |+〉 + C − |−〉 where C ± are real numbers will always representsome state of the system, <strong>in</strong> general given by√C+ 2 + C− 2 |S i = 1 〉 (8.43)2whereS i = S · ˆn (8.44)and where ˆn is a unit vector <strong>in</strong> the direction def<strong>in</strong>ed by the angleθ i = 2 tan −1 (C−C +). (8.45)


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 86Conversely, given any state of the system, we can work out how to write it <strong>in</strong> the formC + |+〉 + C − |−〉. Further, we can repeat the whole of the above discussion for any otherchoice of the <strong>in</strong>termediate states |S I = ± 1 2 〉.It is this last fact that a state |S 〉 can be written as the l<strong>in</strong>ear comb<strong>in</strong>ation or l<strong>in</strong>ear superpositionC + |+〉 + C − |−〉 of two other states, analogous to Eq. (8.1) for the arbitrary vector v,and conversely that any l<strong>in</strong>ear superposition of states is itself another state is the essentialproperty that these states need to possess <strong>in</strong> order for them to be <strong>in</strong>terpreted as vectorsbelong<strong>in</strong>g to some vector vector space, specifically here a real vector space.In the more general case of the magnetic fieldsvectors not all be<strong>in</strong>g <strong>in</strong> the XZ plane, the probabilityamplitudes 〈±|S 〉 will be, <strong>in</strong> general,complex numbers. For <strong>in</strong>stance, it can beshown that the state |S 〉 = |S n = 1 〉 where2S n = S · ˆn, and where ˆn = s<strong>in</strong> θ cos φ î +s<strong>in</strong> θ s<strong>in</strong> φ ĵ + cos θ ˆk is a unit vector oriented<strong>in</strong> a direction def<strong>in</strong>ed by the spherical polarangles θ, φ, is given byZφθˆnY|S 〉 = cos( 1θ)|+〉 + 2 eiφ s<strong>in</strong>( 1 θ)|−〉 (8.46)2XNevertheless, the arguments presented aboveFigure 8.2: Polar angles for def<strong>in</strong><strong>in</strong>g directionof unit vector ˆncont<strong>in</strong>ue to apply. In particular any l<strong>in</strong>earcomb<strong>in</strong>ation C − |−〉+C + |+〉, where C ± are complexnumbers, will be a possible sp<strong>in</strong> state of an atom. But s<strong>in</strong>ce the coefficients arecomplex numbers, the vector space, or state space, is a complex vector space. This moregeneral case, is what is usually encountered <strong>in</strong> quantum mechanics and below, we willassume that the probability amplitudes are, <strong>in</strong> general, complex.8.3 The General Case of Many Intermediate StatesThe above results were based on the particular case of a sp<strong>in</strong> half system which couldpass through two possible <strong>in</strong>termediate sp<strong>in</strong> states, and was developed, at least <strong>in</strong> part,by analogy with the two slit experiment where a particle could pass through two possible<strong>in</strong>termediate states def<strong>in</strong>ed by the positions of the slits. Both these examples can bereadily generalized. For <strong>in</strong>stance, we could consider an <strong>in</strong>terference experiment <strong>in</strong> whichthere are multiple slits <strong>in</strong> the first barrier. Clearly the arguments used <strong>in</strong> the two slit casewould apply aga<strong>in</strong> here, with the f<strong>in</strong>al result for the probability amplitude of observ<strong>in</strong>g anelectron strik<strong>in</strong>g the screen at position x after hav<strong>in</strong>g set out from a source S be<strong>in</strong>g givenbyN∑〈x|S 〉 = 〈x|n〉〈n|S 〉 (8.47)n=1where we have supposed that there are N slits <strong>in</strong> the first barrier, so that the electron canpass through N <strong>in</strong>termediate states. Likewise, if we repeat the Stern-Gerlach experimentwith sp<strong>in</strong> 1 atoms, we would f<strong>in</strong>d that the atoms would emerge <strong>in</strong> one or the other of threebeams correspond<strong>in</strong>g to S z = −, 0, . More generally, if the atoms have sp<strong>in</strong> s (where


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 87s = 0 or 1 or 1 or 3 , . . . then they will emerge <strong>in</strong> one of a total of 2s + 1 different possible2 2beams correspond<strong>in</strong>g to S z = −s, or (−s + 1), or . . . , or (s − 1), or s. We can thenwrites∑〈S ′ |S 〉 = 〈S ′ |n〉〈n|S 〉 (8.48)n=−si.e. the atom can pass through 2s + 1 <strong>in</strong>termediate states, where we have written |n〉 forthe state <strong>in</strong> which the atom has a z component of sp<strong>in</strong> S z = n 1 . What we want to do isextract from these two examples a general prescription that we can apply to any system<strong>in</strong> order to build up a quantum mechanical description of that system. In order to set thescene for this development, we need to reconsider aga<strong>in</strong> the general features that apply tothe <strong>in</strong>termediate states appear<strong>in</strong>g <strong>in</strong> the above two expressions. The important propertiesof these <strong>in</strong>termediate states are as follows:1. Each <strong>in</strong>termediate state represents a mutually exclusive possibility, that is, if thesystem is observed to be <strong>in</strong> one of the <strong>in</strong>termediate states, it is def<strong>in</strong>itely the casethat it will not also be observed <strong>in</strong> any of the others: the electron can be observed topass through one slit only, or a sp<strong>in</strong> s atom would be seen to emerge from a Stern-Gerlach apparatus <strong>in</strong> one beam only. It is this mutual exclusiveness that allows usto write such th<strong>in</strong>gs as 〈−|+〉 = 0 or 〈+|+〉 = 1 and so on.2. The list of <strong>in</strong>termediate states covers all possibilities: if there are N slits <strong>in</strong> thebarrier, then there will be N <strong>in</strong>termediate states |n〉 – there are no other slits for theelectron to pass through. Likewise, if an atom with sp<strong>in</strong> s enters a Stern-Gerlachapparatus, it emerges <strong>in</strong> one of 2s + 1 possible beams. Thus the states |n〉, n =−s, −s + 1, . . . , s − 1, s account for all the possible states that the atom can emerge<strong>in</strong>.3. F<strong>in</strong>ally, the claim is made that the probability amplitude for f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong>the state 〈φ| given that it was <strong>in</strong> the state |ψ〉 can be written aswhich allows us to write the arbitrary state |ψ〉 as∑〈φ|ψ〉 = 〈φ|n〉〈n|ψ〉 (8.49)n∑|ψ〉 = |n〉〈n|ψ〉 (8.50)nwhere 〈n|ψ〉, the probability amplitude of f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> the state |n〉, giventhat it was <strong>in</strong> the state |ψ〉, is now viewed as a ‘weight<strong>in</strong>g’ for the state |ψ〉 to beobserved <strong>in</strong> the state |n〉.The above three properties of <strong>in</strong>termediate states shows us how to generalize the resultsthat were obta<strong>in</strong>ed for the two slit and sp<strong>in</strong> half examples used to develop the ideas.However, the ideas developed ought to be applicable to systems other than these twopossibilities. Thus, we need to extract out of what has been said so far ideas that can be1 Note, n is not necessarily an <strong>in</strong>teger here. If s = 1 2for <strong>in</strong>stance, then n will have the two possible valuesn = ± 1 2 , and the states | ± 1 2〉 are just the states we have earlier called |±〉.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 88applied to any physical system so as to generate a quantum description of such a system.We do this by not<strong>in</strong>g the s<strong>in</strong>gular role played <strong>in</strong> the above discussion of the ’<strong>in</strong>termediatestates’. If we can identify such <strong>in</strong>termediate states for any physical system, then we are <strong>in</strong>a position to formulate a quantum description of such a system.The characteristics of the <strong>in</strong>termediate states that are of greatest importance are the follow<strong>in</strong>g:1. They represent mutually exclusive alternatives, that is, if a measurement is made, asystem is observed <strong>in</strong> only one of these possible <strong>in</strong>termediate states;2. The <strong>in</strong>termediate states cover all possibilities, that is, if a measurement is made, thesystem is always observed to be <strong>in</strong> one of these <strong>in</strong>termediate states;3. If the system is <strong>in</strong>itially prepared <strong>in</strong> some <strong>in</strong>itial state |ψ〉, then the probability amplitudeof observ<strong>in</strong>g it <strong>in</strong> some arbitrary f<strong>in</strong>al state |φ〉 is given by a sum of probabilityamplitudes of pass<strong>in</strong>g through each of these <strong>in</strong>termediate states, <strong>in</strong> a manneranalogous to Eq. (8.48).We will now see how these ideas can be applied to more−a +a−general k<strong>in</strong>ds of physical systems. To beg<strong>in</strong>, we haveOO to set up these <strong>in</strong>termediate states for a given system.So suppose we perform an exhaustive series of measurementsof some observable property of the system| + a〉– call it Q. For example, we could determ<strong>in</strong>e through−a +aO −which slits it is possible for an electron to pass <strong>in</strong> theO two slit experiment above, or the possible values of theZ component of the sp<strong>in</strong> of a particle, or the possible| − a〉values of the position of an electron on an O − 2ion, as <strong>in</strong>the adjacent figure. We will give many more examplesFigure 8.3: O − 2ion with the two later. Whatever the system, what we mean by exhaustiveis that we determ<strong>in</strong>e all the possible values that thepossible positions of the electron, correspond<strong>in</strong>gto the two states | ± a〉. observed quantity Q might have. For <strong>in</strong>stance, we determ<strong>in</strong>ethat, <strong>in</strong> the two slit <strong>in</strong>terference experiment, theelectron can pass through, (that is, be observed to be at) the position of one or the otherof the two slits, or that a sp<strong>in</strong> half particle would be observed to have either of the valuesS z = ± 1, and no other value. Or for the 2 O− 2ion, the electron can be found either on theatom at position x = −a or on the atom at position x = +a. In general, for an arbitraryobservable Q, let us represent these observed values by q 1 , q 2 , q 3 , . . . all of which will bereal numbers. Of course, there might be other observable properties of the system that wemight be able to measure, but for the present we will suppose that we only need concernourselves with just one.In keep<strong>in</strong>g with the way that we developed the idea of state earlier, we then say that as aresult of the measurement produc<strong>in</strong>g, for <strong>in</strong>stance, the outcome q 1 , we then say that thesystem ends up <strong>in</strong> the state |q 1 〉, and similarly for all the others possibilities. Moreover,there is now the possibility, if the system is <strong>in</strong>itially prepared <strong>in</strong> the <strong>in</strong>itial state |ψ〉, ofobserv<strong>in</strong>g it <strong>in</strong> any one of these states |q n 〉, but once aga<strong>in</strong>, we are unable to predict withcerta<strong>in</strong>ty which of these states we will observe the system to be <strong>in</strong> – we can only assign


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 89a probability P(q n |ψ) of f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> the state |q n 〉 given it was <strong>in</strong> the state |ψ〉.Likewise, there will be a probability P(φ|q n ) of f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> the f<strong>in</strong>al state |φ〉given that it was <strong>in</strong> the state |q n 〉. It is at this po<strong>in</strong>t that we reach the result that, <strong>in</strong> a sense,def<strong>in</strong>es quantum mechanics. If the possibility exists of f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> any of thestates |q 1 〉, |q 2 〉, . . . , then accord<strong>in</strong>g to classical ideas of probability, we would be safe <strong>in</strong>assum<strong>in</strong>g the <strong>in</strong>tuitive formula∑P(φ|ψ) = P(φ|q n )P(q n |ψ) — a result of classical probability theoryni.e. the system starts <strong>in</strong> the state |ψ〉, and has the probability P(q n |ψ) that it will end up<strong>in</strong> the state |q n 〉, and once it ‘arrives there’ we then have the probability P(φ|q n ) that itwill then end up <strong>in</strong> the state |φ〉 given that it was <strong>in</strong> the state |q n 〉. Multiply<strong>in</strong>g thesetwo probabilities will then give the probability of the system start<strong>in</strong>g from |ψ〉 and pass<strong>in</strong>gthrough |q n 〉 on its way to the f<strong>in</strong>al state |φ〉. We then sum over all the possible <strong>in</strong>termediatestates |q n 〉 to give the total probability of arriv<strong>in</strong>g <strong>in</strong> the state |φ〉. However, what is foundexperimentally is that if the system is never observed <strong>in</strong> any of the <strong>in</strong>termediate states |q n 〉,this probability is not given by this classical result – the measurements show evidence of<strong>in</strong>terference effects, which can only be understood if this probability is given as the squareof a ‘probability amplitude’.Thus, we f<strong>in</strong>d we must write P(φ|ψ) = |〈φ|ψ〉| 2 , where 〈φ|ψ〉 is a complex number, generallyreferred to as the probability amplitude of f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> state |φ〉 given that itwas <strong>in</strong> state |ψ〉, and this probability amplitude is then given by∑〈φ|ψ〉 = 〈φ|q n 〉〈q n |ψ〉 (8.51)nwhere the sum is over the probability amplitudes of the system pass<strong>in</strong>g through all thepossible states associated with all the possible observed values of the quantity Q.If we square this result we f<strong>in</strong>d that∑P(φ|ψ) = |〈φ|ψ〉| 2 = |〈φ|q n 〉| 2 |〈q n |ψ〉| 2 + cross termsn∑= P(φ|q n )P(q n |ψ) + cross terms. (8.52)nWe note that this expression consists, <strong>in</strong> part, of the classical result Eqn. (8.3), but there is,<strong>in</strong> addition, cross terms. It is these terms that are responsible for the <strong>in</strong>terference effectsobserved <strong>in</strong> the two slit experiment, or <strong>in</strong> the Stern-Gerlach experiment. If we observewhich state |q n 〉 the system ‘passes through’ it is always found that these <strong>in</strong>terferenceterms are washed out, reduc<strong>in</strong>g the result to the classical result Eqn. (8.3). This we haveseen <strong>in</strong> the case of two slit <strong>in</strong>terference where<strong>in</strong> if the slit through which the particlepasses is observed, then the <strong>in</strong>terference pattern on the observation screen is washed out.What rema<strong>in</strong>s to be found is ways to calculate these ‘probability amplitudes’. We can goa long way towards do<strong>in</strong>g this by recogniz<strong>in</strong>g, from a purely physical perspective, that theprobability amplitudes 〈q m |q n 〉 must have the follow<strong>in</strong>g properties:• 〈q m |q n 〉 = 0 if m n. This amounts to stat<strong>in</strong>g that if the system is <strong>in</strong> the state|q n 〉, i.e. where<strong>in</strong> the observable Q is known to have the value q n , then there is zeropossibility of f<strong>in</strong>d<strong>in</strong>g it <strong>in</strong> the state |q m 〉.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 90• 〈q n |q n 〉 = 1. This asserts that if the system is <strong>in</strong> the state for which the quantity Qhas the value q n , then it is certa<strong>in</strong> to be found <strong>in</strong> the state <strong>in</strong> which it has the valueq n . Thus the states {|q n 〉; n = 1, 2, . . .} are mutually exclusive.• The states {|q n 〉; n = 1, 2, . . .} are also exhaustive <strong>in</strong> that they cover all the possiblevalues that could be observed of the observable Q. These states are said to becomplete – simply because they cover all possibilities – and the result Eqn. (8.51)as the ‘closure relation’ for these states.Then we propose the follow<strong>in</strong>g set of ideas and concepts:1. 〈φ|ψ〉 = the probability amplitude of the system be<strong>in</strong>g found <strong>in</strong> the state |φ〉given that it was <strong>in</strong> state |ψ〉.2. |〈φ|ψ〉| 2 = the probability of the system be<strong>in</strong>g found <strong>in</strong> the state |φ〉 given thatit was <strong>in</strong> state |ψ〉.3. The states {|q n 〉; n = 1, 2, . . . } are mutually exclusive i.e.〈q m |q n 〉 = δ mn = 1 n = m= 0 n m (8.53)where δ mn is known as the Kronecker delta. The states {|q n 〉; n = 1, 2, . . . } are saidto be orthonormal.This condition satisfied by these states is a mathematical expression of the physicalfact that if the system is <strong>in</strong> one of the states |q n 〉, then there is no possibility off<strong>in</strong>d<strong>in</strong>g it <strong>in</strong> some other state |q m 〉, m n, hence 〈q m |q n 〉 = 0 while the probabilityof f<strong>in</strong>d<strong>in</strong>g it <strong>in</strong> the state |q n 〉 must be unity.4. The fundamental law of quantum mechanics, otherwise known as the closure relation:∑〈φ|ψ〉 = 〈φ|q n 〉〈q n |ψ〉 (8.54)ntells us that the total probability amplitude of f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> the f<strong>in</strong>al state|φ〉 is just the sum of the probability amplitudes of the system ‘pass<strong>in</strong>g through’ anyof the states {|q n 〉; n = 1, 2, . . . }. This is the ‘sum over paths’ idea alluded to above.It is at this po<strong>in</strong>t that we perform the ‘cancellation’ trick which results <strong>in</strong> the states be<strong>in</strong>gexpressed <strong>in</strong> terms of other states as <strong>in</strong> the follow<strong>in</strong>g, which ultimately leads us to<strong>in</strong>terpret<strong>in</strong>g these states as vectors.5. The completeness relations:∑|ψ〉 = |q n 〉〈q n |ψ〉 (8.55)n∑〈φ| = 〈φ|q n 〉〈q n | (8.56)nwhich tells us that either the <strong>in</strong>itial or f<strong>in</strong>al states can be expressed as a l<strong>in</strong>ear comb<strong>in</strong>ationof the <strong>in</strong>termediate states {|q n 〉; n = 1, 2, . . . }. This is a statement that thesestates are complete <strong>in</strong> that there is enough of them such that any state of the systemcan be written <strong>in</strong> the above form <strong>in</strong> terms of these states.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 916. 〈φ|ψ〉 = 〈ψ|φ〉 ∗ .7. The normalization condition:∑∑〈ψ|ψ〉 = 〈ψ|q n 〉〈n|ψ〉 = |〈q n |ψ〉| 2 = 1. (8.57)nThis last result tells us that the probability of f<strong>in</strong>d<strong>in</strong>g the system <strong>in</strong> any of the states |n〉 addsup to unity. Moreover, we know that 〈q m |q n 〉 = 0 if n m so that a system <strong>in</strong> state |q n 〉 cannever be found <strong>in</strong> some other state |q m 〉. Consequently, the set of states {|q n 〉; n = 1, 2, . . . }represent a complete set of possible alternative f<strong>in</strong>al states, complete <strong>in</strong> the sense that thetotal probability of end<strong>in</strong>g up <strong>in</strong> any of the mutually exclusive possible f<strong>in</strong>al states |q n 〉adds up to unity – there is nowhere else for the system to be found.By a simple extension of the arguments presented <strong>in</strong> Section 8.2 <strong>in</strong> the case of sp<strong>in</strong> halfquantum states it is now possible to show how the above properties 1 – 7 are completelyanalogous to the properties of vectors <strong>in</strong> a complex vector space. This mathematical formalismwill be developed more fully <strong>in</strong> Section 8.5, but for the present we can summarizethe essential ideas based on what we have already put forward earlier. The importnatpo<strong>in</strong>ts then are as follows:1. The collection of all the possible state vectors of a quantum system forms a complexvector space known as the state space of the system.2. The probability amplitudes are identified as the <strong>in</strong>ner product of these state vectors.3. The <strong>in</strong>termediate states {|q n 〉; n = 1, 2, . . . } form a complete orthonormal set ofbasis states of this state space, i.e. any state vector |ψ〉 can be written as a l<strong>in</strong>earcomb<strong>in</strong>ation of these basis states.4. The number of basis states is known as the dimension of the state space.n8.4 Construct<strong>in</strong>g a State SpaceThe ideas developed above can now be applied to construct<strong>in</strong>g a state space for a physicalsystem. The basic idea is as discussed <strong>in</strong> Section 8.3 which enables us to def<strong>in</strong>e a set ofbasis states for the state space of the system. By establish<strong>in</strong>g a set of basis states, <strong>in</strong> asense, we ‘br<strong>in</strong>g the state space <strong>in</strong>to existence’, and once this is done, we are free to useall the mathematical mach<strong>in</strong>ery available for analys<strong>in</strong>g the properties of the state space soconstructed. The question can be asked as to whether or not the ideas presented <strong>in</strong> Section8.3, admittedly extracted from only a handful of examples, can be applied with successto any other system. This is a question that can only be answered by apply<strong>in</strong>g the rulesformulated there and consider<strong>in</strong>g the consequences. In Section 8.7 we will discuss wherethese ideas, if naively applied, fail to work. Otherwise, these ideas, when fully formed,constitute the basis of quantum physics.In accordance with the ideas developed <strong>in</strong> Section 8.3, construct<strong>in</strong>g a state space for aphysical system can be carried out by recogniz<strong>in</strong>g the <strong>in</strong>termediate states through whicha system can pass as it makes its way from some <strong>in</strong>itial state to some observed f<strong>in</strong>al state,


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 92as was done <strong>in</strong> the case of the two slit, or sp<strong>in</strong> half systems. Thus, <strong>in</strong> the two slit example,the two possible <strong>in</strong>termediate states are those for which the particle is to be found at theposition of either of the two slits. In the sp<strong>in</strong> half example, the two <strong>in</strong>termediate states arethose <strong>in</strong> which the sp<strong>in</strong> is observed to have either of the two values S z = ± 1 ; these are2the states we have been call<strong>in</strong>g |±〉. These <strong>in</strong>termediate states are states of the system thatcan be identified through an argument based on the idea that some physical property ofthe system can be exhaustively measured to yield a set of values that we then use to labela complete set of basis states for the state space of the system.Negatively Charged Ions Here the system is a molecule which has acquired an extraelectron, which can be assumed to found only on any one of the atoms mak<strong>in</strong>g up themolecule. This is, of course, an approximation. The electron could be found anywherearound the atoms, or <strong>in</strong> the space between the atoms, <strong>in</strong> a way that depends on the natureof the chemical bond between the atoms. Here we are mak<strong>in</strong>g use of a coarse notion ofposition, i.e. we are assum<strong>in</strong>g that the electron can be observed to reside on one atom orthe other, and we do not really care about exactly where on each atom the electron mightbe found. The idea is best illustrated by the simple example of the O − 2ion <strong>in</strong> which theelectron can be found on one or the other of the oxygen atoms (see Fig. (8.3)). If we letx = ±a be the positions of the two atomic nuclei with respect to a po<strong>in</strong>t midway betweenthem, then we can use these two possible positions of the electron to label the possiblestates of the system. The two states are then | − a〉 and | + a〉. If the electron is observedto be on the oxygen atom at x = +a, then it has unit probability of be<strong>in</strong>g observed on theatom at x = +a, but zero probability of be<strong>in</strong>g observed on the atom at x = −a. Thesetwo states represent mutually exclusive possibilities, and so they are dist<strong>in</strong>ct orthonormalstates:〈+a| + a〉 = 〈−a| − a〉 = 1〈+a| − a〉 = 〈−a| + a〉 = 0 (8.58)These two states are also complete <strong>in</strong> that, with<strong>in</strong> the limits of our model, there is nowhereelse for the electron to be found – it is either on the atom at x = +a or on the atom atx = −a. These two states form a complete set of orthonormal basis states for the statespace of the ion, so that any state of the ion can be expressed <strong>in</strong> the form|ψ〉 = c 1 | + a〉 + c 2 | − a〉 (8.59)where c 1 , c 2 are complex numbers – this is just the completeness relation for this system.Likewise, any l<strong>in</strong>ear comb<strong>in</strong>ation of {| + a〉, | − a〉} will represent a possible state of theion. As there are two basis states, the state space for the system has dimension 2.This k<strong>in</strong>d of model can be generalized to situations <strong>in</strong>volv<strong>in</strong>g different geometries, suchas atoms arranged <strong>in</strong> a r<strong>in</strong>g e.g. an ozone ion O − 3. In this case, the state space will bespanned by three basis states correspond<strong>in</strong>g to the three possible positions at which theelectron can be observed. This model (and its generalizations to an arbitrary number ofatoms arranged <strong>in</strong> a r<strong>in</strong>g) is valuable as it gives rise to results that serve as an approximatetreatment of angular momentum <strong>in</strong> quantum mechanics.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 93Ammonia molecule Here the system isthe ammonia molecule NH 3 <strong>in</strong> which thenitrogen atom is at the apex of a triangularpyramid with the three hydrogenatoms form<strong>in</strong>g an equilateral triangle asthe base. The nitrogen atom can be positionedeither above or below the planeof the hydrogen atoms, these two possibilitieswe take as two possible statesof the ammonia molecule. (The N atomcan move between these two positions by‘quantum ‘tunnell<strong>in</strong>g’ through the potentialbarrier ly<strong>in</strong>g <strong>in</strong> the plane of the hydrogenatoms.) Once aga<strong>in</strong>, this is a statespace of dimension 2.+lHNHHHHN−lFigure 8.4: Ammonia molecule <strong>in</strong> two statesdist<strong>in</strong>gusished by the position of the nitrogenatom, either above or below the plane of the hydrogenatoms, correspond<strong>in</strong>g to the states | + l〉and | − l〉 respectively.HSp<strong>in</strong> Flipp<strong>in</strong>g In this case, we have a sp<strong>in</strong> half particle (for <strong>in</strong>stance) <strong>in</strong> a constantmagnetic field, so the two possible states are the familiar sp<strong>in</strong> up or sp<strong>in</strong> down states. If,<strong>in</strong> addition, we add a rotat<strong>in</strong>g magnetic field at right angles to the constant field, therearises a time dependent probability of the sp<strong>in</strong> flipp<strong>in</strong>g from one orientation to the other.As the sp<strong>in</strong> up and sp<strong>in</strong> down states are of different energies, this represents a change <strong>in</strong>energy of the particle, a change that can be detected, and is the basis of the electron sp<strong>in</strong>and nuclear magnetic resonance imag<strong>in</strong>g much used <strong>in</strong> medical work. Obviously this is astate space of dimension two.The Qubit Any two state system can be used to represent the quantum version of ab<strong>in</strong>ary numbers: sp<strong>in</strong> up and sp<strong>in</strong> down, an atom excited or not, and so on. Overall thetwo states can be represented by |0〉 and |1〉, correspond<strong>in</strong>g to the two b<strong>in</strong>ary numbers 0and 1. A l<strong>in</strong>ear comb<strong>in</strong>ation|ψ〉 = c 0 |0〉 + c 1 |1〉.can be formed which represents the possibility of a memory register<strong>in</strong>g a bit of <strong>in</strong>formation,not as either a 0 or a 1, which is all that can happen classically, but simultaneously,register<strong>in</strong>g both possibilities of 0 or 1. Such a state is known as a qubit. Obviously, thestate space is of dimension two, and much that we have said above about sp<strong>in</strong> half systemsapplies. <strong>Quantum</strong> computation then <strong>in</strong>volves manipulat<strong>in</strong>g the whole state |ψ〉, which, <strong>in</strong>effect, amounts to perform<strong>in</strong>g two calculations at once, differ<strong>in</strong>g by the <strong>in</strong>itial sett<strong>in</strong>g ofthe memory bit. The idea <strong>in</strong>troduced above can be readily extended. Thus, if we havetwo two level atoms, we have such possibilities as |00〉, |01〉, |10〉, and |11〉 where, for<strong>in</strong>stance, |10〉 is the state <strong>in</strong> which the first atom is <strong>in</strong> its excited state and the second is<strong>in</strong> its ground state. Obviously, the state |00〉 represents the number zero, |01〉 the numberone, |10〉 the number two, and |11〉 the number three. We now have two qubits, and a statespace of dimension four, and we can set up l<strong>in</strong>ear comb<strong>in</strong>ations such as|ψ〉 = c 00 |00〉 + c 01 |01〉 + c 10 |10〉 + c 11 |11〉 (8.60)and we can then perform calculations mak<strong>in</strong>g use, simultaneously, of four different possiblevalues for whatever quantity the states are <strong>in</strong>tended to represent. With three atoms, or


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 94four and so on, the state space becomes much larger: of dimension 2 N <strong>in</strong> fact where N isthe number of qubits, and the basis states represent the numbers rang<strong>in</strong>g from 0 to 2 N − 1<strong>in</strong> b<strong>in</strong>ary notation.Benzene Molecule An example ofquite a different character is that of thebenzene molecule, illustrated <strong>in</strong> Fig.8.5. The two states of the molecule aredist<strong>in</strong>guished by the position<strong>in</strong>g of thedouble bonds between pairs of carbonatoms. The molecule, at least with regardto the arrangements of double bonds canbe found <strong>in</strong> two different states which,for want of a better name, we will call |α〉and |β〉. The state space is therefore ofdimension 2, and an arbitrary state of themolecule would be given by|ψ〉 = a|α〉 + b|β〉. (8.61)|α〉|β〉Figure 8.5: Two arrangements of thedouble bonds <strong>in</strong> a benzene moelcule correspond<strong>in</strong>gto two states |α〉 and |β〉.All the examples given above yield state spaces of f<strong>in</strong>ite dimension. Much the sameargument can be applied to construct state spaces of <strong>in</strong>f<strong>in</strong>ite dimension. A couple ofexamples follow.The Tight-B<strong>in</strong>d<strong>in</strong>g Model of a Crystall<strong>in</strong>e Metal The examples given above of anelectron be<strong>in</strong>g positioned on one of a (f<strong>in</strong>ite) number of atoms can be readily generalizedto a situation <strong>in</strong> which there are an <strong>in</strong>f<strong>in</strong>ite number of such atoms. This is not a contrivedmodel <strong>in</strong> any sense, as it is a good first approximation to modell<strong>in</strong>g the properties of theconduction electrons <strong>in</strong> a crystall<strong>in</strong>e solid. In the free electron model of a conduct<strong>in</strong>gsolid, the conduction electrons are assumed to be able to move freely (and without mutual<strong>in</strong>teraction) through the crystal, i.e. the effects of the background positive potentialsof the positive ions left is ignored. A further development of this model is to take <strong>in</strong>toaccount the fact that the electrons will experience some attraction to the periodically positionedpositive ions, and so there will be a tendency for the electrons to be found <strong>in</strong> theneighbourhood of these ions. The resultant model – with the basis states consist<strong>in</strong>g ofa conduction electron be<strong>in</strong>g found on any one of the ion sites – is obviously similar tothe one above for the molecular ion. Here however, the number of basis states is <strong>in</strong>f<strong>in</strong>ite(for an <strong>in</strong>f<strong>in</strong>ite crystal), so the state space is of <strong>in</strong>f<strong>in</strong>ite dimension. Represent<strong>in</strong>g the setof basis states by {|n〉, n = 0, ±1, ±2, . . . } where na is the position of the n th atom, and ais the separation between neighbour<strong>in</strong>g atoms, then any state of the system can then bewritten as|ψ〉 =+∞∑n=−∞c n |n〉. (8.62)By tak<strong>in</strong>g <strong>in</strong>to account the fact that the electrons can make their way from an ion to oneof its neighbours, much of the band structure of semiconduct<strong>in</strong>g solids can be obta<strong>in</strong>ed.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 95Free Particle We can generalize the preced<strong>in</strong>g model by suppos<strong>in</strong>g that the spac<strong>in</strong>gbetween the neighbour<strong>in</strong>g atoms is allowed to go to zero, so that the positions at whichthe electron can be found become cont<strong>in</strong>uous. This then acts as a model for the descriptionof a particle free to move anywhere <strong>in</strong> one dimension. In sett<strong>in</strong>g up this model, we f<strong>in</strong>dthat as well as there be<strong>in</strong>g an <strong>in</strong>f<strong>in</strong>ite number of basis states – someth<strong>in</strong>g we have alreadyencountered – we see that these basis states are not discrete, i.e. a particle at position xwill be <strong>in</strong> the basis state |x〉, and as x can vary cont<strong>in</strong>uously over the range −∞ < x < ∞,there will be a non-denumerably <strong>in</strong>f<strong>in</strong>ite, that is, a cont<strong>in</strong>uous range of such basis states.As a consequence, the completeness relation ought to be written as an <strong>in</strong>tegral:|ψ〉 =∫ +∞−∞|x〉〈x|ψ〉 dx. (8.63)The states |x〉 and |x ′ 〉 will be orthonormal if x x ′ , but <strong>in</strong> order to be able to reta<strong>in</strong> thecompleteness relation <strong>in</strong> the form of an <strong>in</strong>tegral, it turns out that these basis states have tohave an <strong>in</strong>f<strong>in</strong>ite norm. However, there is a sense <strong>in</strong> which we can cont<strong>in</strong>ue to work withsuch states, as will be discussed <strong>in</strong> Section 8.6.Particle <strong>in</strong> an Inf<strong>in</strong>itely Deep Potential Well We saw <strong>in</strong> Section 5.3 that a particleof mass m <strong>in</strong> an <strong>in</strong>f<strong>in</strong>itely deep potential well of width L can have the energies E n =n 2 π 2 2 /2mL 2 where n is a positive <strong>in</strong>teger. This suggests that the basis states of theparticle <strong>in</strong> the well be the states |n〉 such that if the particle is <strong>in</strong> state |n〉, then it hasenergy E n . The probability amplitude of f<strong>in</strong>d<strong>in</strong>g the particle at position x when <strong>in</strong> state |n〉is then 〈x|n〉 which, from Section 5.3 we can identify with the wave function ψ n , i.e.ψ n (x) = 〈x|n〉 =√2L s<strong>in</strong>(nπx/L) 0 < x < LThe state space is obviously of <strong>in</strong>f<strong>in</strong>ite dimension.= 0 x < 0, x > L. (8.64)It has been po<strong>in</strong>ted out before that a state space can have any number of sets of basisstates, i.e. the states |n〉 <strong>in</strong>troduced here do not form the sole possible set of basis statesfor the state space of this system. In this particular case, it is worthwhile not<strong>in</strong>g that wecould have used as the base states the states labelled by the position of the particle <strong>in</strong> thewell, i.e. the states |x〉.As we have seen, there are an <strong>in</strong>f<strong>in</strong>ite number of such states which <strong>in</strong> a way is to beexpected as we have already seen that the state space is of <strong>in</strong>f<strong>in</strong>ite dimension. But thedifference between this set of states and the set of states |n〉 is that <strong>in</strong> the latter case, thesestates are discrete, i.e. they can be labelled by the <strong>in</strong>tegers, while the states |x〉 are cont<strong>in</strong>uous,they are labelled by the cont<strong>in</strong>uous variable x. Thus, someth<strong>in</strong>g new emergesfrom this example: for state spaces of <strong>in</strong>f<strong>in</strong>ite dimension, it is possible to have a denumerably<strong>in</strong>f<strong>in</strong>ite number of basis states (i.e. the discrete states |n〉) or non-denumerably<strong>in</strong>f<strong>in</strong>ite number of basis states (i.e. the states |x〉.) This feature of state spaces of <strong>in</strong>f<strong>in</strong>itedimension, plus others, are discussed separately below <strong>in</strong> Section 8.6.A System of Identical Photons Many other features of a quantum system not related tothe position or energy of the system can be used as a means by which a set of basis states


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 96can be set up. An important example is one <strong>in</strong> which the system consists of a possiblyvariable number of identical particles. One example is a ‘gas’ of photons, all of the samefrequency and polarization. Such a situation is rout<strong>in</strong>ely achieved <strong>in</strong> the laboratory us<strong>in</strong>gsuitably constructed hollow superconduct<strong>in</strong>g metallic cavities designed to support justone mode (i.e. a s<strong>in</strong>gle frequency and polarization) of the electromagnetic field. The stateof the electromagnetic field can then be characterized by the number n of photons <strong>in</strong> thefield which can range from zero to positive <strong>in</strong>f<strong>in</strong>ity, so that the states of the field (knownas number states) can be written |n〉 with n = 0, 1, 2, . . .. The state |0〉 is often referred toas the vacuum state. These states will then constitute a complete, orthonormal set of basisstates (called Fock or number states), i.e.〈n|m〉 = δ nm (8.65)and as n can range up to <strong>in</strong>f<strong>in</strong>ity, the state space for the system will be <strong>in</strong>f<strong>in</strong>ite dimensional.An arbitrary state of the cavity field can be then be written∞∑|ψ〉 = c n |n〉 (8.66)n=0so that |c n | 2 will be the probability of f<strong>in</strong>d<strong>in</strong>g n photons <strong>in</strong> the field. In terms of these basisstates, it is possible to describe the processes <strong>in</strong> which particles are created or destroyed.For <strong>in</strong>stance if there is a s<strong>in</strong>gle atom <strong>in</strong> an excited energy state <strong>in</strong> the cavity, and the cavityis <strong>in</strong> the vacuum state |0〉, then the state of the comb<strong>in</strong>ed atom field system can be written|e, 0〉, where the e <strong>in</strong>dicates that the atom is <strong>in</strong> an excited state. The atom can later lose thisenergy by emitt<strong>in</strong>g it as a photon, so that at some later time the state of the system will bea|e, 0〉 + b|g, 1〉, where now there is the possibility, with probability |b| 2 , of the atom be<strong>in</strong>gfound <strong>in</strong> its ground state, and a photon hav<strong>in</strong>g been created.8.5 General Mathematical Description of a <strong>Quantum</strong> SystemIt was shown <strong>in</strong> preced<strong>in</strong>g Sections that the mathematical description of this sum of probabilityamplitudes admits an <strong>in</strong>terpretation of the state of the system as be<strong>in</strong>g a vector <strong>in</strong>a complex vector space, the state space of the system. It is this mathematical picture thatis summarized here <strong>in</strong> the general case <strong>in</strong>troduced <strong>in</strong> the immediately preced<strong>in</strong>g Section.This idea that the state of a quantum system is to be considered a vector belong<strong>in</strong>g to acomplex vector space, which we have developed here <strong>in</strong> the case of a sp<strong>in</strong> half system,and which has its roots <strong>in</strong> the sum over paths po<strong>in</strong>t of view, is the basis of all of modernquantum mechanics and is used to describe any quantum mechanical system. Belowis a summary of the ma<strong>in</strong> po<strong>in</strong>ts as they are used for a general quantum system whosestate spaces are of arbitrary dimension (<strong>in</strong>clud<strong>in</strong>g state spaces of <strong>in</strong>f<strong>in</strong>ite dimension). Theemphasis here is on the mathematical features of the theory.8.5.1 State SpaceWe have <strong>in</strong>dicated a number of times that <strong>in</strong> quantum mechanics, the state of a physicalsystem is represented by a vector belong<strong>in</strong>g to a complex vector space known as the state


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 97space of the system. Here we will give a list of the def<strong>in</strong><strong>in</strong>g conditions of a state space,though we will not be concern<strong>in</strong>g ourselves too much with the formalities. The follow<strong>in</strong>gdef<strong>in</strong>itions and concepts set up the state space of a quantum system.1. Every physical state of a quantum system is specified by a vector called a statevector, ket vector, or sometimes just state and written | . . .〉 where . . . is a labelspecify<strong>in</strong>g the physical <strong>in</strong>formation known about the state. An arbitrary state iswritten |ψ〉, or |φ〉 and so on. The set of all state vectors describ<strong>in</strong>g a given physicalsystem forms a complex vector space (actually a Hilbert space, see Sec. 8.5.2) Halso known as the state space or ket space for the system.2. Every l<strong>in</strong>ear superposition of two or more state vectors |φ 1 〉, |φ 2 〉, |φ 3 〉, . . . , is also astate of the quantum system i.e. the state |ψ〉 given by|ψ〉 = c 1 |φ 1 〉 + c 2 |φ 2 〉 + c 3 |φ 3 〉 + . . .is a state of the system for all complex numbers c 1 , c 2 , c 3 , . . . .These last two po<strong>in</strong>ts amount to say<strong>in</strong>g that every physical state of a system is representedby a vector <strong>in</strong> the state space of the system, and every vector <strong>in</strong> the state space representsa possible physical state of the system. To guarantee this, the follow<strong>in</strong>g condition is alsoimposed:3. If a state of the system is represented by a vector |ψ〉, then the same state is representedby the vector c|ψ〉 where c is any non-zero complex number.F<strong>in</strong>ally, we need the concept of a set of basis states, and of the dimension of the statespace.4. A set of vectors {|ϕ 1 〉, |ϕ 2 〉, |ϕ 3 〉, . . . } is said to form a basis for the state space ifevery state of the quantum system can be represented as a l<strong>in</strong>ear superposition ofthe |ϕ i 〉’s i.e. for any state |ψ〉 we can write∑|ψ〉 = c i |ϕ i 〉.iThe set of vectors {|ϕ i 〉, i = 1, 2, . . . } are said to span the vector space. The vectorsare also termed the base states for the vector space. They are also said to becomplete. What this means, mathematically, is that for every state |φ〉 say, at leastone of the <strong>in</strong>ner products 〈ϕ n |φ〉 will be non-zero, or conversely, there does not exista state |ξ〉 for which 〈ϕ n |ξ〉 = 0 for all the basis states |ϕ n 〉. Completeness clearlymeans that no more basis states are needed to describe any possible physical stateof a system.For example, return<strong>in</strong>g to the sp<strong>in</strong> half system, the two states |±〉 are all that is needed todescribe any state of the system, i.e. there are no sp<strong>in</strong> states that cannot be described <strong>in</strong>terms of these basis states. Thus, these states are said to be complete.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 985. The m<strong>in</strong>imum number of vectors needed to form a complete set of basis states isknown as the dimension of the state space. [In many, if not most cases of <strong>in</strong>terest<strong>in</strong> quantum mechanics, the dimension of the state space is <strong>in</strong>f<strong>in</strong>ite.]It should be noted that there is an <strong>in</strong>f<strong>in</strong>ite number of possible sets of basis states for anystate space. The arguments presented above by which we arrive at a set of basis states (or<strong>in</strong>termediate states) serves as a physically motivated start<strong>in</strong>g po<strong>in</strong>t to construct the statespace for the system. But once we have def<strong>in</strong>ed the state space <strong>in</strong> this way, there is noreason why we cannot, at least mathematically, construct other sets of basis states. Thesebasis states that we start with are particularly useful as they have an immediate physicalmean<strong>in</strong>g; this might not be the case for an arbitrary basis set. But there are other meansby which other physically mean<strong>in</strong>gful basis states can be determ<strong>in</strong>ed: often the choice ofbasis states is suggested by the physics (such as the set of eigenstates of an observable,see <strong>Chapter</strong> 9).8.5.2 Probability Amplitudes and the Inner Product of State <strong>Vector</strong>sWe obta<strong>in</strong>ed a number of properties of probability amplitudes when look<strong>in</strong>g at the caseof a sp<strong>in</strong> half system. Some of the results obta<strong>in</strong>ed there, and a few more that were not,are summarized <strong>in</strong> the follow<strong>in</strong>g.If |φ〉 and |ψ〉 are any two state vectors belong<strong>in</strong>g to the state space H, then1. 〈φ|ψ〉, a complex number, is the probability amplitude of observ<strong>in</strong>g the system tobe <strong>in</strong> the state |φ〉 given that it is <strong>in</strong> the state |ψ〉.2. The probability of observ<strong>in</strong>g the system to be <strong>in</strong> the state |φ〉 given that it is <strong>in</strong> thestate |ψ〉 is |〈φ|ψ〉| 2 .The probability amplitude 〈φ|ψ〉, can then be shown to have the properties3. 〈φ|ψ〉 = 〈ψ|φ〉 ∗ .4. (〈φ|{c 1 |ψ 1 〉 + c 2 |ψ 2 〉} = c 1 〈φ|ψ 1 〉 + c 2 〈φ|ψ 2 〉 where c 1 and c 2 are complex numbers.5. 〈ψ|ψ〉 ≥ 0. If 〈ψ|ψ〉=0 then |ψ〉 = 0, the zero vector.This last statement is related to the physically reasonable requirement that the probabilityof a system be<strong>in</strong>g found <strong>in</strong> a state |ψ〉 given that it is <strong>in</strong> the state |ψ〉 has to be unity,i.e. |〈ψ|ψ〉| 2 = 1 which means that 〈ψ|ψ〉 = exp(iη). We now choose η = 0 so that〈ψ|ψ〉 = 1, which is bigger than zero. But recall that any multiple of a state vector stillrepresents the same physical state of the system, i.e. |˜ψ〉 = a|ψ〉 still represents the samephysical state as |ψ〉. However, <strong>in</strong> this case, 〈˜ψ|˜ψ〉 = |a| 2 which is not necessarily unity,but is is certa<strong>in</strong>ly bigger than zero.6. The quantity √ 〈ψ|ψ〉 is known as the length or norm of |ψ〉.7. A state |ψ〉 is normalized, or normalized to unity, if 〈ψ|ψ〉 = 1.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 99Normalized states are states for which probability is properly taken <strong>in</strong>to account. It ismathematically convenient to permit the use of states whose norms are not equal to unity,but it is necessary <strong>in</strong> order to make use of the probability <strong>in</strong>terpretation to deal only withthat state which has norm of unity. Any state that cannot be normalized to unity (i.e. it isof <strong>in</strong>f<strong>in</strong>ite length) cannot represent a physically acceptable state.8. Two states |φ〉 and |ψ〉 are orthogonal if 〈φ|ψ〉 = 0.The physical significance of two states be<strong>in</strong>g orthogonal should be understood: for asystem <strong>in</strong> a certa<strong>in</strong> state, there is zero probability of it be<strong>in</strong>g observed <strong>in</strong> a state withwhich it is orthogonal. In this sense, two orthogonal states are as dist<strong>in</strong>ct as it is possiblefor two states to be.F<strong>in</strong>ally, a set of orthonormal basis vectors {|ϕ n 〉; n = 1, 2, . . . } will have the property9. 〈ϕ m |ϕ n 〉 = δ mn where δ mn is known as the Kronecker delta, and equals unity if m = nand zero if m n.All the above conditions satisfied by probability amplitudes were to a greater or lesserextent physically motivated, but it nevertheless turns out that these conditions are identicalto the conditions that are used to def<strong>in</strong>e the <strong>in</strong>ner product of two vectors <strong>in</strong> a complexvector space, <strong>in</strong> this case, the state space of the system, i.e. we could write, us<strong>in</strong>g theusual mathematical notation for an <strong>in</strong>ner product, 〈φ|ψ〉 = (|φ〉, |ψ〉). The state space ofa physical system is thus more than just a complex vector space, it is a vector spaceon which there is def<strong>in</strong>ed an <strong>in</strong>ner product, and so is more correctly termed a complex‘<strong>in</strong>ner product’ space. However, it is usually required <strong>in</strong> quantum mechanics that certa<strong>in</strong>convergency criteria, def<strong>in</strong>ed <strong>in</strong> terms of the norms of sequences of vectors belong<strong>in</strong>g tothe state space, must be satisfied. This is not of any concern for spaces of f<strong>in</strong>ite dimension,but are important for spaces of <strong>in</strong>f<strong>in</strong>ite dimension. If these criteria are satisfied then thestate space is said to be a Hilbert space. Thus rather than referr<strong>in</strong>g to the state space of asystem, reference is made to the Hilbert space of the system.It is important to recognize that all the vectors belong<strong>in</strong>g to a Hilbert space have f<strong>in</strong>itenorm, or, putt<strong>in</strong>g it another way, all the state vectors can be normalized to unity – thisstate of affairs is physically necessary if we want to be able to apply the probability <strong>in</strong>terpretation<strong>in</strong> a consistent way. However, as we shall see, we will encounter states whichdo not have a f<strong>in</strong>ite norm and hence neither represent physically realizable states, nor dothey belong to the state or Hilbert space of the system. Nevertheless, with proper care regard<strong>in</strong>gtheir use and <strong>in</strong>terpretation, such states turn out to be essential, and play a crucialrole throughout quantum mechanics. Recogniz<strong>in</strong>g that a probability amplitude is noth<strong>in</strong>gbut an <strong>in</strong>ner product on the state space of the system, leads to a more general way ofdef<strong>in</strong><strong>in</strong>g what is meant by a bra vector. The follow<strong>in</strong>g discussion emphasizes the fact thata bra vector, while it shares many characteristices of a ket vector, is actually a differentmathematical entity.Bra <strong>Vector</strong>sWe have consistently used the notation 〈φ|ψ〉 to represent a probability amplitude, but wehave just seen that this quantity is <strong>in</strong> fact noth<strong>in</strong>g more than the <strong>in</strong>ner product of two state


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 100vectors, which can be written <strong>in</strong> a different notation, (|φ〉, |ψ〉), that is more commonlyencountered <strong>in</strong> pure mathematics. But the <strong>in</strong>ner product can be viewed <strong>in</strong> another way,which leads to a new <strong>in</strong>terpretation of the expression 〈φ|ψ〉, and the <strong>in</strong>troduction of a newclass of state vectors. If we consider the equationand ‘cancel’ the |ψ〉, we get the result〈φ|ψ〉 = (|φ〉, |ψ〉) (8.67)〈φ| • = (|φ〉, •) (8.68)where the ‘•’ is <strong>in</strong>serted, temporarily, to rem<strong>in</strong>d us that <strong>in</strong> order to complete the equation,a ket vector has to be <strong>in</strong>serted. By carry<strong>in</strong>g out this procedure, we have <strong>in</strong>troduced anew quantity 〈φ| which is known as a bra or bra vector, essentially because 〈φ|ψ〉 lookslike quantities enclosed between a pair of ‘bra(c)kets’. It is a vector because, as can bereadily shown, the collection of all possible bras form a vector space. For <strong>in</strong>stance, by theproperties of the <strong>in</strong>ner product, ifthen|ψ〉 = a 1 |ϕ 1 〉 + a 2 |ϕ 2 〉 (8.69)(|ψ〉, •) = 〈ψ| • = (a 1 |ϕ 1 〉 + a 2 |ϕ 2 〉, •) (8.70)i.e., dropp<strong>in</strong>g the ‘•’ symbols, we have= a ∗ 1 (|ϕ 1〉, •) + a ∗ 2 (|ϕ 2〉, •) = a ∗ 1 〈ϕ 1| • + a ∗ 2 〈ϕ 2| • (8.71)〈ψ| = a ∗ 1 〈ϕ 1| + a ∗ 2 〈ϕ 2| (8.72)so that a l<strong>in</strong>ear comb<strong>in</strong>ation of two bras is also a bra, from which follows (after a bit morework check<strong>in</strong>g that the other requirements of a vector space are also satisfied) the resultthat the set of all bras is a vector space. Incidentally, this last calculation above shows,once aga<strong>in</strong>, that if |ψ〉 = a 1 |ϕ 1 〉 + a 2 |ϕ 2 〉 then the correspond<strong>in</strong>g bra is 〈ψ| = a ∗ 1 〈ϕ 1| + a ∗ 2 〈ϕ 2|.So, <strong>in</strong> a sense, the bra vectors are the ‘complex conjugates’ of the ket vectors.The vector space of all bra vectors is obviously closely l<strong>in</strong>ked to the vector space of allthe kets H, and is <strong>in</strong> fact usually referred to as the dual space, and represented by H ∗ . Toeach ket vector |ψ〉 belong<strong>in</strong>g to H, there is then an associated bra vector 〈ψ| belong<strong>in</strong>g tothe dual space H ∗ . However, the reverse is not necessarily true: there are bra vectors thatdo not necessarily have a correspond<strong>in</strong>g ket vector, and there<strong>in</strong> lies the difference betweenbras and kets. It turns out that the difference only matters for Hilbert spaces of <strong>in</strong>f<strong>in</strong>itedimension, <strong>in</strong> which case there can arise bra vectors whose correspond<strong>in</strong>g ket vector is of<strong>in</strong>f<strong>in</strong>ite length, i.e. has <strong>in</strong>f<strong>in</strong>ite norm, and hence cannot be normalized to unity. Such ketvectors can therefore never represent a possible physical state of a system. But these issueswill not arise here, so will not be of any concern. The po<strong>in</strong>t to be taken away from all thisis that a bra vector is not the same k<strong>in</strong>d of mathematical object as a ket vector. In fact,it has all the attributes of an operator <strong>in</strong> the sense that it acts on a ket vector to producea complex number, this complex number be<strong>in</strong>g given by the appropriate <strong>in</strong>ner product.This is <strong>in</strong> contrast to the more usual sort of operators encountered <strong>in</strong> quantum mechanicsthat act on ket vectors to produce other ket vectors. In mathematical texts a bra vectoris usually referred to as a ‘l<strong>in</strong>ear functional’. Nevertheless, <strong>in</strong> spite of the mathematical


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 101dist<strong>in</strong>ction that can be made between bra and ket vectors, the correspondence between thetwo k<strong>in</strong>ds of vectors is <strong>in</strong> most circumstances so complete that a bra vector equally wellrepresents the state of a quantum system as a ket vector. Thus, we can talk of a systembe<strong>in</strong>g <strong>in</strong> the state 〈ψ|.We can summarize all this <strong>in</strong> the general case as follows: The <strong>in</strong>ner product (|ψ〉, |φ〉)def<strong>in</strong>es, for all states |ψ〉, the set of functions (or l<strong>in</strong>ear functionals) (|ψ〉, ). The l<strong>in</strong>earfunctional (|ψ〉, ) maps any ket vector |φ〉 <strong>in</strong>to the complex number given by the <strong>in</strong>nerproduct (|ψ〉, |φ〉).1. The set of all l<strong>in</strong>ear functionals (|ψ〉, ) forms a complex vector space H ∗ , the dualspace of H.2. The l<strong>in</strong>ear functional (|ψ〉, ) is written 〈ψ| and is known as a bra vector.3. To each ket vector |ψ〉 there corresponds a bra vector 〈ψ| such that if |φ 1 〉 → 〈φ 1 |and |φ 2 〉 → 〈φ 2 | thenc 1 |φ 1 〉 + c 2 |φ 2 〉 → c ∗ 1 〈φ 1| + c ∗ 2 〈φ 2|.8.6 State <strong>Spaces</strong> of Inf<strong>in</strong>ite DimensionSome examples of physical systems with state spaces of <strong>in</strong>f<strong>in</strong>ite dimension were provided<strong>in</strong> the previous Section. In these examples, we were able to proceed, at least as far asconstruct<strong>in</strong>g the state space was concerned, largely as was done <strong>in</strong> the case of f<strong>in</strong>ite dimensionalstate spaces. However, further <strong>in</strong>vestigation shows that there are features ofthe mathematics, and the correspond<strong>in</strong>g physical <strong>in</strong>terpretation <strong>in</strong> the <strong>in</strong>f<strong>in</strong>ite dimensionalcase that do not arise for systems with f<strong>in</strong>ite dimensional state spaces. Firstly, it is possibleto construct state vectors that cannot represent a state of the system and secondly, thepossibility arises of the basis states be<strong>in</strong>g cont<strong>in</strong>uously <strong>in</strong>f<strong>in</strong>ite. This latter state of affairsis not at all a rare and special case – it is just the situation needed to describe the motionof a particle <strong>in</strong> space, and hence gives rise to the wave function, and wave mechanics.8.6.1 States of Inf<strong>in</strong>ite NormTo illustrate the first of the difficulties mentioned above, consider the example of a systemof identical photons <strong>in</strong> the state |ψ〉 def<strong>in</strong>ed by Eq. (8.66). As the basis states areorthonormal we have for 〈ψ|ψ〉∞∑〈ψ|ψ〉 = |c n | 2 (8.73)If the probabilities |c n | 2 form a convergent <strong>in</strong>f<strong>in</strong>ite series, then the state |ψ〉 has a f<strong>in</strong>itenorm, i.e. it can be normalized to unity. However, if this series does not converge, then itis not possible to supply a probability <strong>in</strong>terpretation to the state vector as it is not normalizableto unity. For <strong>in</strong>stance, if c 0 = 0 and c n = 1/ √ n, n = 1, 2, . . ., then〈ψ|ψ〉 =n=0∞∑n=11n(8.74)


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 102which is a divergent series, i.e. this state cannot be normalized to unity. In contrast, ifc n = 1/n, n = 1, 2, . . ., then∞∑ 1〈ψ|ψ〉 =n = π2(8.75)2 6n=0which means we can normalize this state to unity by def<strong>in</strong><strong>in</strong>g√6| ˜ψ〉 = |ψ〉. (8.76)πThis shows that there are some l<strong>in</strong>ear comb<strong>in</strong>ation of states that do not represent possiblephysical states of the system. Such states do not belong to the Hilbert space H of thesystem, i.e. the Hilbert space consists only of those states for which the coefficients c nsatisfy Eq. (8.73) 2 . This is a new feature: the possibility of construct<strong>in</strong>g vectors that donot represent possible physical states of the system. It turns out that some very usefulbasis states have this apparently undesirable property, as we will now consider.8.6.2 Cont<strong>in</strong>uous Basis StatesIf we now turn to the example <strong>in</strong> the previous section of the <strong>in</strong>f<strong>in</strong>ite crystal, we can considerwhat happens if we suppose that the separation between the neighbour<strong>in</strong>g atoms <strong>in</strong>the crystal goes to zero, so that the electron can be found anywhere over a range extend<strong>in</strong>gfrom −∞ to ∞. This, <strong>in</strong> effect, is the cont<strong>in</strong>uous limit of the <strong>in</strong>f<strong>in</strong>ite crystal model justpresented, and represents the possible positions that a particle free to move anywhere <strong>in</strong>one dimension, the X axis say, can have. In this case, we could label the possible statesof the particle by its X position, i.e. |x〉, where now, <strong>in</strong>stead of hav<strong>in</strong>g the discrete valuesof the crystal model, the position can now assume any of a cont<strong>in</strong>uous range of values,−∞ < x < ∞. It would seem that we could then proceed <strong>in</strong> the same way as we havedone with the discrete states above, but it turns out that such states cannot be normalizedto unity and hence do not represent (except <strong>in</strong> an idealised sense) physically allowablestates of the system.The aim here is to try to develop a description of the possible basis states for a particlethat is not limited to be<strong>in</strong>g found only at discrete positions on the X axis. After all, <strong>in</strong>pr<strong>in</strong>ciple, we would expect that a particle <strong>in</strong> free space could be found at any position x <strong>in</strong>the range −∞ < x < ∞. We will get at this description by a limit<strong>in</strong>g procedure which isnot at all mathematically rigorous, but nevertheless yields results that turn out to be valid.2 Note however, that we can still construct a bra vectorn=∞ ∑〈ψ| = c ∗ n〈n|n=0without plac<strong>in</strong>g any restrictions on the convergence of the c n ’s such as the one <strong>in</strong> Eq. (8.73). The correspond<strong>in</strong>gket cannot then represent a possible state of the system, but such <strong>in</strong>ner products as 〈ψ|φ〉 where |φ〉is a normalized ket can still be evaluated. The po<strong>in</strong>t be<strong>in</strong>g made here is that if H is of <strong>in</strong>f<strong>in</strong>ite dimension,the dual space H ∗ can also <strong>in</strong>clude bra vectors that do not correspond to normalized ket vectors <strong>in</strong> H, whichemphasizes the fact that H ∗ is def<strong>in</strong>ed as a set of l<strong>in</strong>ear functionals, and not simply as a ‘complex conjugate’version of H. The dist<strong>in</strong>ction is important <strong>in</strong> some circumstances, but we will not have to deal with suchcases.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 103Suppose we return to the completeness relation for the states |na〉 for the one dimensionalcrystal|ψ〉 =+∞∑n=−∞|na〉〈na|ψ〉. (8.77)If we now put a = ∆x and na = x n , BUT we write |na〉 = √ a|x n 〉, this becomes|ψ〉 =+∞∑n=−∞|x n 〉〈x n |ψ〉∆x (8.78)where now〈x n |x m 〉 = δ nm(8.79)ai.e. each of the states |x n 〉 is not normalized to unity, but we can nevertheless identify sucha state as be<strong>in</strong>g that state for which the particle is at position x n – recall if a state vector ismultiplied by a constant, it still represents the same physical state of the system.If we put to one side any concerns about the mathematical legitimacy of what follows, wecan now take the limit ∆x → 0, i.e. a → 0, then Eq. (8.78) can be written as an <strong>in</strong>tegral,i.e.|ψ〉 =∫ +∞−∞|x〉〈x|ψ〉 dx (8.80)We can identify the state |x〉 with the physical state of affairs <strong>in</strong> which the particle is at theposition x, and the expression Eq. (8.80) is consistent with the completeness requirementi.e. that the states {|x〉, −∞ < x < ∞} form a complete set of basis states, so that any stateof the one particle system can be written as a superposition of the states |x〉, though thefact that the label x is cont<strong>in</strong>uous has forced us to write the completeness relation as an<strong>in</strong>tegral. The difficulty with this procedure is that the states |x〉 are no longer normalizedto unity. This we can see from Eq. (8.79) which tells us that 〈x|x ′ 〉 will vanish if x x ′ ,but for x = x ′ we see that1〈x|x〉 = lima→0 a = ∞ (8.81)i.e. the state |x〉 has <strong>in</strong>f<strong>in</strong>ite norm! This means that there is a price to pay for try<strong>in</strong>g to set upthe mathematics <strong>in</strong> such a manner as to produce the completeness expression Eq. (8.80),which is that we are forced to <strong>in</strong>troduce basis states which have <strong>in</strong>f<strong>in</strong>ite norm, and hencecannot represent a possible physical state of the particle! Nevertheless, provided care istaken, it is still possible to work with these states as if they represent the state <strong>in</strong> whichthe particle is at a def<strong>in</strong>ite position. To see this, we need to look at the orthonormalityproperties of these states, and <strong>in</strong> do<strong>in</strong>g so we are lead to <strong>in</strong>troduce a new k<strong>in</strong>d of function,the Dirac delta function.8.6.3 The Dirac Delta FunctionWe have just seen that the <strong>in</strong>ner product 〈x|x ′ 〉 vanishes if x x ′ , but appears to be <strong>in</strong>f<strong>in</strong>iteif x = x ′ . In order to give some mathematical sense to this result, we return to Eq. (8.80)and look more closely at the properties that 〈x|x ′ 〉 must have <strong>in</strong> order for the completenessrelation also to make sense.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 104The probability amplitude 〈x|ψ〉 appear<strong>in</strong>g <strong>in</strong> Eq. (8.80) are functions of the cont<strong>in</strong>uousvariable x, and is often written 〈x|ψ〉 = ψ(x), which we identify as the wave function ofthe particle. If we now consider the <strong>in</strong>ner productor〈x ′ |ψ〉 =ψ(x ′ ) =∫ +∞−∞∫ +∞−∞〈x ′ |x〉〈x|ψ〉dx (8.82)〈x ′ |x〉ψ(x)dx (8.83)we now see that we have an <strong>in</strong>terest<strong>in</strong>g difficulty. We know that 〈x ′ |x〉 = 0 if x ′ x, soif 〈x|x〉 is assigned a f<strong>in</strong>ite value, the <strong>in</strong>tegral on the right hand side will vanish, so thatψ(x) = 0 for all x!! But if ψ(x) is to be a non-trivial quantity, i.e. if it is not to be zero forall x, then it cannot be the case that 〈x|x〉 is f<strong>in</strong>ite. In other words, 〈x ′ |x〉 must be <strong>in</strong>f<strong>in</strong>itefor x = x ′ <strong>in</strong> some sense <strong>in</strong> order to guarantee a non-zero <strong>in</strong>tegral. The way <strong>in</strong> which thiscan be done <strong>in</strong>volves <strong>in</strong>troduc<strong>in</strong>g a new ‘function’, the Dirac delta function, which hassome rather unusual properties.What we are after is a ‘function’ δ(x − x 0 ) with the property thatf (x 0 ) =for all (reasonable) functions f (x).∫ +∞−∞δ(x − x 0 ) f (x)dx (8.84)So what is δ(x − x 0 )? Perhaps the simplest way to get at what this function looks like isto exam<strong>in</strong>e beforehand a sequence of functions def<strong>in</strong>ed byD(x, ɛ) = ɛ −1 −ɛ/2 < x < ɛ/2= 0 x < −ɛ/2, x > ɛ/2. (8.85)


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 105What we first notice about this function isthat it def<strong>in</strong>es a rectangle whose area is alwaysunity for any (non-zero) value of ɛ, i.e.∫ +∞−∞D(x, ɛ)dx = 1. (8.86)Secondly, we note that as ɛ is made smaller,the rectangle becomes taller and narrower.Thus, if we look at an <strong>in</strong>tegralf (x)∫ +∞−∞∫ ɛ/2D(x, ɛ) f (x)dx = ɛ −1 f (x)dx−ɛ/2(8.87)where f (x) is a reasonably well behaved function(i.e. it is cont<strong>in</strong>uous <strong>in</strong> the neighbourhoodof x = 0), we see that as ɛ → 0, thistends to the limit f (0). We can summarizethis by the equationFigure 8.6: A sequence of rectangles of decreas<strong>in</strong>gwidth but <strong>in</strong>creas<strong>in</strong>g height, ma<strong>in</strong>ta<strong>in</strong><strong>in</strong>ga constant area of unity approaches an<strong>in</strong>f<strong>in</strong>itely high ‘spike’ at x = 0..∫ +∞lim D(x, ɛ) f (x)dx = f (0). (8.88)ɛ→0−∞Tak<strong>in</strong>g the limit <strong>in</strong>side the <strong>in</strong>tegral sign (an illegal mathematical operation, by the way),we can write this as∫ +∞−∞lim D(x, ɛ) f (x)dx =ɛ→0∫ +∞−∞δ(x) f (x) = f (0) (8.89)where we have <strong>in</strong>troduced the ‘Dirac delta function’ δ(x) def<strong>in</strong>ed as the limitδ(x) = limɛ→0D(x, ɛ), (8.90)a function with the unusual property that it is zero everywhere except for x = 0, where itis <strong>in</strong>f<strong>in</strong>ite.The above def<strong>in</strong>ed function D(x, ɛ) is but one ‘representation’ of the Dirac delta function.There are <strong>in</strong> effect an <strong>in</strong>f<strong>in</strong>ite number of different functions that <strong>in</strong> an appropriate limitbehave as the rectangular function here. Some examples are1 s<strong>in</strong> L(x − x 0 )δ(x − x 0 ) = limL→∞ π x − x 01 ɛ= lim(8.91)ɛ→0 π (x − x 0 ) 2 + ɛ 21= limλ→∞ 2 λe−λ|x−x 0| .In all cases, the function on the right hand side becomes narrower and taller as the limit istaken, while the area under the various curves rema<strong>in</strong>s the same, that is, unity.The first representation above is of particular importance. It arises by via the follow<strong>in</strong>g<strong>in</strong>tegral:∫1 +Le ik(x−x0) dk = eiL(x−x0) − e iL(x−x 0)= 1 s<strong>in</strong> L(x − x 0 )(8.92)2π −L2πi(x − x 0 ) π x − x 0x


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 106In the limit of L → ∞, this then becomes12π∫ +∞−∞e ik(x−x 0) dk = δ(x − x 0 ). (8.93)The delta function is not to be thought of as a function as it is usually def<strong>in</strong>ed <strong>in</strong> puremathematics, but rather it is to be understood that a limit of the k<strong>in</strong>d outl<strong>in</strong>ed above isimplied whenever the delta function appears <strong>in</strong> an <strong>in</strong>tegral 3 . However, such mathematicalniceties do not normally need to be a source of concern <strong>in</strong> most <strong>in</strong>stances. It is usuallysufficient to be aware of the basic property Eq. (8.84) and a few other rules that can beproven us<strong>in</strong>g the limit<strong>in</strong>g process, such as∫ +∞−∞∫ +∞−∞δ(x) = δ(−x)δ(ax) = 1|a| δ(x)xδ(x) = 0δ(x − x 0 )δ(x − x 1 ) = δ(x 0 − x 1 )f (x)δ ′ (x − x 0 )dx = − f ′ (x 0 ).The limit<strong>in</strong>g process should be employed if there is some doubt about any result obta<strong>in</strong>ed.For <strong>in</strong>stance, it can be shown that the square of a delta function cannot be given a satisfactorymean<strong>in</strong>g.Delta Function NormalizationReturn<strong>in</strong>g to the resultψ(x ′ ) =∫ +∞−∞〈x ′ |x〉ψ(x)dx (8.94)we see that the <strong>in</strong>ner product 〈x ′ |x〉, must be <strong>in</strong>terpreted as a delta function:〈x ′ |x〉 = δ(x − x ′ ). (8.95)The states |x〉 are said to be delta function normalized, <strong>in</strong> contrast to the orthonormalproperty of discrete basis states. One result of this, as has been po<strong>in</strong>ted out earlier, is thatstates such as |x〉 are of <strong>in</strong>f<strong>in</strong>ite norm and so cannot be normalized to unity. Such statesstates cannot represent possible physical states of a system, though it is often convenient,with caution, to speak of such states as if they were physically realizable. Mathematical(and physical) paradoxes can arise if care is not taken. However, l<strong>in</strong>ear comb<strong>in</strong>ationsof these states can be normalized to unity, as this follow<strong>in</strong>g example illustrates. If weconsider a state |ψ〉 given by|ψ〉 =∫ +∞−∞|x〉〈x|ψ〉dx, (8.96)3 This raises the question as to whether or not it would matter what representation of the delta function isused. Provided the function f (x) is bounded over (−∞, +∞) there should be no problem, but if the functionf (x) is unbounded over this <strong>in</strong>terval, e.g. f (x) = exp(x 2 ), then only the rectangular representation of thedelta function will give a sensible answer.


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 107then〈ψ|ψ〉 =∫ +∞But 〈x|ψ〉 = ψ(x) and 〈ψ|x〉 = ψ(x) ∗ , so that〈ψ|ψ〉 =−∞∫ ∞−∞〈ψ|x〉〈x|ψ〉dx. (8.97)|ψ(x)| 2 dx. (8.98)Provided |ψ(x)| 2 is a well behaved function, <strong>in</strong> particular that it vanish as x → ±∞, this<strong>in</strong>tegral will converge to a f<strong>in</strong>ite result, so that the state |ψ〉 can <strong>in</strong>deed be normalized tounity, and if so, then we can <strong>in</strong>terpret |ψ(x)| 2 dx as the probability of f<strong>in</strong>d<strong>in</strong>g the particle <strong>in</strong>the region (x, x + dx), which is just the standard Born <strong>in</strong>terpretation of the wave function.8.6.4 Separable State <strong>Spaces</strong>We have seen that state spaces of <strong>in</strong>f<strong>in</strong>ite dimension can be set up with either a denumerably<strong>in</strong>f<strong>in</strong>ite number of basis states, i.e. the basis states are discrete but <strong>in</strong>f<strong>in</strong>ite <strong>in</strong> number,or else a non-denumerably <strong>in</strong>f<strong>in</strong>ite number of basis states, i.e. the basis states are labelledby a cont<strong>in</strong>uous parameter. S<strong>in</strong>ce a state space can be spanned by more than one set ofbasis states, it is worthwhile <strong>in</strong>vestigat<strong>in</strong>g whether or not a space of <strong>in</strong>f<strong>in</strong>ite dimensioncan be spanned by a set of denumerable basis states, as well as a set of non-denumerablebasis states. An example of where this is the case was given earlier, that of a particle<strong>in</strong> an <strong>in</strong>f<strong>in</strong>itely deep potential well, see p95. It transpires that not all vector spaces of<strong>in</strong>f<strong>in</strong>ite dimension have this property, i.e. that they can have both a denumerable and anon-denumerable set of basis states. <strong>Vector</strong> spaces which can have both k<strong>in</strong>ds of basisstates are said to be separable, and <strong>in</strong> quantum mechanics it is assumed that state spacesare separable.8.7 States of Macroscopic SystemsIn the examples given above, it was assumed that an exhaustive list of results that couldbe obta<strong>in</strong>ed <strong>in</strong> the measurement of some observable of a quantum system could be usedto set up the basis states for the state space of the system. The value of do<strong>in</strong>g this is,of course, to be determ<strong>in</strong>e by the success or otherwise of these ideas. That quantummechanics is such an overwhelm<strong>in</strong>gly successful theory <strong>in</strong>dicates that there is someth<strong>in</strong>gcorrect <strong>in</strong> this procedure, but the question that arises is this: why does it not appear towork for macroscopic systems, i.e. for systems which we know can be fully adequatelyexpla<strong>in</strong>ed by standard classical physics? The answer appears to lie <strong>in</strong> the fact that <strong>in</strong>all the examples discussed above, whether or not the Hilbert space is of f<strong>in</strong>ite or <strong>in</strong>f<strong>in</strong>itedimension, i.e. whether or not we are talk<strong>in</strong>g about sp<strong>in</strong> up or sp<strong>in</strong> down of a sp<strong>in</strong> halfparticle, or the position of a particle <strong>in</strong> space, the implicit assumption is that the system weare consider<strong>in</strong>g is totally isolated from all other systems, <strong>in</strong> particular from any <strong>in</strong>fluenceof the surround<strong>in</strong>g environment. After all, when we talked about a system, such as anO − 2ion, we are ignor<strong>in</strong>g all the other physical <strong>in</strong>fluences that could act on this system,i.e. we do not need to mention, <strong>in</strong> our specification of the state of the system, anyth<strong>in</strong>gother than properties that directly perta<strong>in</strong> to the system of <strong>in</strong>terest. The assumption is


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 108made, as it is <strong>in</strong> classical physics, that such <strong>in</strong>fluences are sufficiently weak that they canbe ignored to a good approximation. In effect, we are suppos<strong>in</strong>g that the systems underconsideration are isolated systems, that is, systems that are isolated from the effect of anyexternal perturbations.Classically, at the macroscopic level, we can usually cont<strong>in</strong>ue to ignore weak perturb<strong>in</strong>g<strong>in</strong>fluences when specify<strong>in</strong>g the state of a system. In fact, when def<strong>in</strong><strong>in</strong>g a ‘system’ wetypically <strong>in</strong>clude <strong>in</strong> what we refer to as the ‘system’, all the participants <strong>in</strong> the physicalprocess be<strong>in</strong>g described that <strong>in</strong>teract strongly with each other. Anyth<strong>in</strong>g else that weaklyaffects these constitutents is ignored. For <strong>in</strong>stance, when describ<strong>in</strong>g the orbital dynamicsof the Earth as it revolves around the Sun, we might need to take <strong>in</strong>to account the gravitationalpull of the Moon – the system is the Earth, the Sun and the Moon. But we donot really need to take <strong>in</strong>to account the effect of the background microwave radiation leftover after the Big Bang. Or, when describ<strong>in</strong>g the collision between two billiard balls, itis probably necessary to <strong>in</strong>clude the effect of roll<strong>in</strong>g friction, but it not really necessaryto take <strong>in</strong>to account the frictional drag due to air resistance. Of course, sometimes it isnecessary to <strong>in</strong>clude external <strong>in</strong>fluences even when weak: to describe a system com<strong>in</strong>g tothermal equilibrium with its surround<strong>in</strong>gs it is necessary to extend the system by <strong>in</strong>clud<strong>in</strong>gthe environment <strong>in</strong> the dynamical model. In any of these examples, the same classicalphysics methods and philosophy applies.There is a subtle difference when it comes to try<strong>in</strong>g to apply the quantum ideas developedso far to macroscopic systems. The same, weak perturbations that can be put to one side<strong>in</strong> a classical description of a macroscopic system turn out to have a far-reach<strong>in</strong>g effect if<strong>in</strong>cluded <strong>in</strong> a quantum description of the same system. If we were to attempt to describe amacroscopic system accord<strong>in</strong>g to the laws of quantum mechanics, we would f<strong>in</strong>d that anyl<strong>in</strong>ear superposition of different possible states of the system evolves on a fantasticallyshort time scale to a classical mixture of the different possibilities. For <strong>in</strong>stance, if wewere to attempt to describe the state of a set of car keys <strong>in</strong> terms of two possibilities: <strong>in</strong>your pocket |p〉 or <strong>in</strong> your brief case |b〉, then a state of the form|ψ〉 = 1 √2( |p〉 + |b〉)(8.99)could be used to represent a possible ‘quantum state’ of the keys. But this quantum statewould be exceed<strong>in</strong>gly short lived (on a time scale ∼ 10 −40 sec), and would evolve <strong>in</strong>to thetwo alternative possibilities: a 50% chance of the keys be<strong>in</strong>g <strong>in</strong> the state |p〉, i.e. a 50%chance of f<strong>in</strong>d<strong>in</strong>g your keys <strong>in</strong> your pocket, and a 50% chance of be<strong>in</strong>g <strong>in</strong> the state |b〉,i.e. a 50% chance of f<strong>in</strong>d<strong>in</strong>g them <strong>in</strong> your brief case. But this is no longer a superpositionof these two states. Instead, the keys are either <strong>in</strong> the state |p〉 or the state |b〉. Whatthis effectively means is that randomness is still there, i.e. repeat<strong>in</strong>g an experiment underidentical conditions can give randomly vary<strong>in</strong>g results. But the state of the keys is nolonger represented by a state vector, so there are no longer any quantum <strong>in</strong>terferenceeffects present. The randomness can then be looked upon as be<strong>in</strong>g totally classical <strong>in</strong>nature, i.e. as be<strong>in</strong>g due to our ignorance of <strong>in</strong>formation that is <strong>in</strong> pr<strong>in</strong>ciple there, butimpossibly difficult to access. In effect, the quantum system behaves like a noisy classicalsystem.The process that washes out the purely quantum effects is known as decoherence. S<strong>in</strong>ceit is effectively impossible to isolate any macroscopic system from the <strong>in</strong>fluence of its


<strong>Chapter</strong> 8 <strong>Vector</strong> <strong>Spaces</strong> <strong>in</strong> <strong>Quantum</strong> <strong>Mechanics</strong> 109surround<strong>in</strong>g environment 4 , all macroscopic systems (with very few exceptions such assuperfluid SQUID junctions) are subject to decoherence. This process is believed to playa crucial role <strong>in</strong> why, at the macroscopic level, physical systems, which are all <strong>in</strong>tr<strong>in</strong>sicallyquantum mechanical, behave <strong>in</strong> accordance with the classical laws of physics. It is alsoone of the ma<strong>in</strong> corrupt<strong>in</strong>g <strong>in</strong>fluences that prevent a quantum computer from function<strong>in</strong>gas it should. <strong>Quantum</strong> computers rely for their function<strong>in</strong>g on the ‘qubits’ rema<strong>in</strong><strong>in</strong>g <strong>in</strong>l<strong>in</strong>ear superpositions of states, but the ever-present decoher<strong>in</strong>g effects of the environmentwill tend to destroy these delicate quantum states before a computation is completed, orelse at the very least <strong>in</strong>troduce errors as the computation proceeds. Controll<strong>in</strong>g decoherenceis therefore one of the major challenges <strong>in</strong> the development of viable quantumcomputers.So, the bottom l<strong>in</strong>e is that it is only for protected isolated systems that quantum effects aremost readily observed, and it is for microscopic systems that this state of affairs is to befound. But that is not to say that quantum effects are not present at the macroscopic level.Peculiar quantum effects associated with the superposition of states are not to be seen, butthe properties of matter <strong>in</strong> general, and <strong>in</strong>deed the properties of the forces of nature, areall <strong>in</strong>tr<strong>in</strong>sically quantum <strong>in</strong> orig<strong>in</strong>.(1693)4 ‘No man is an Iland, <strong>in</strong>tire of itselfe’ – J. Donne, Devotions upon Emergent Occasions Meditation XVII

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!