Skip to article frontmatterSkip to article content

In this Chapter we will learn about one of the basic aspects of quantum mechanics — incompatible observables . As we will see in more detail below, quantum mechanics predicts that it is impossible for a particle to have multiple definite properties at the same time. For example, a particle cannot have a definite position and a definite momentum simultaneously, or a definite energy and a definite position (as just two examples). In fact, we can say much more than this, and this is really the main lesson: there is a strong trade-off between how well-defined properties can be. That is, if a particle has a fairly well defined position, this necessitates it having a broad spread of momenta. This is the content of the famous Heisenberg uncertainty principle (HUP), which we will introduce here. This is a fundamental fact about nature, and shows that certain observable properties are incompatible with each other.

3.1Gaussian example

We can illustrate the main idea behind the incompatibility of position and momentum by considering a simple example — that of a gaussian wavefunction. Consider the following wavefunction

ψ(x)=1a2πex2/4a2,\psi(x) = \sqrt{\frac{1}{a\sqrt{2\pi}}} e^{-x^2/4a^2},

with a parameter a>0a > 0, which determines the width of the wavefunction, and where 1a2π\sqrt{\frac{1}{a\sqrt{2\pi}}} is a normalisation factor, which ensures that this is a normalised wavefunction (this is left as an exercise). This wavefunction is depicted below:

Picture of a gaussian function

Figure 3.1:A Gaussian wavefunction, as given by (3.1). The parameter a>0a>0 determines the ‘width’ of the wavefunction.

As we can see, it has the Bell-curve shape, and is centred at the origin. Direct calculations show that the expected position and standard deviation are

x=0,Δx=x2x2=a.\begin{align*} \av{x} &= 0,\\ \Delta x &= \sqrt{\av{x^2} - \av{x}^2} = a. \end{align*}

The fact that the standard deviation Δx\Delta x is equal to aa is the reason we say that aa determines the width (since it uniquely specifies the standard deviation, which is a very useful and common way of quantifying the width of a probability density function). What does it mean for the particle to have a small width? It means that if we were to measure the position, we would be very likely to find the particle within a small region of space (in this case, close to the origin, since this is the only region where the probability density (x)=ψ(x)2\pd(x) = |\psi(x)|^2 is large). This is where we should think of the quantum particle as being located, roughly speaking. If on the other hand aa becomes large, then the particle will be found in a larger region of space when we measure its position.

The question we would now like to ask is what momentum does the particle have, as we consider varying the width of the spatial wavefunction aa? We can answer this by finding the momentum wavefunction ψ~(p)\tilde{\psi}(p), which determines the probability density (p)=ψ~(p)2\pd(p) = |\tilde{\psi}(p)|^2 for the particle to have momentum pp.

In an exercise, you will show that

ψ~(p)=2aπea2p2/2.\tilde{\psi}(p) = \sqrt{\frac{\sqrt{2}a}{\sqrt{\pi}\hbar}}e^{-a^2p^2/\hbar^2}.

(Remarkably) this is also a Gaussian (now in the variable pp instead of xx). However, the expected momentum and standard deviation of momentum are

p=0,Δp=p2p2=2a.\begin{align*} \av{p} &= 0,\\ \Delta p &= \sqrt{\av{p^2} - \av{p}^2} = \frac{\hbar}{2a}. \end{align*}

What we see is that the spread of momentum is now inversely proportional to aa. This means that if aa becomes smaller — so that the particle has a smaller width in space (more precisely, so that a measurement of the position of the particle is likely to find it closer to the origin), it gains a bigger spread of momentum, meaning that a momentum measurement would find that the particle has large momentum with higher probability. Conversely, if the wavefunction has a wider width (a is larger), the spread of momentum in fact goes down!

3.2The Heisenberg Uncertainty Principle

We often refer to these widths as uncertainties, since they tell us how uncertain our measurements of position and momentum will be. That is, how close or far we are from obtaining a very deterministic outcome when performing a measurement of that property of the particle.

What is remarkable is that what we observed above isn’t just a special property of Gaussian wavefunctions, but is a general fact about quantum mechanics. In fact,

It is impossible in quantum mechancis for a particle to simultaneously have a small uncertainty Δx\Delta x in position and a small uncertainty Δp\Delta p in momentum.

The famous Heisenberg Uncertainty Principle makes this statement quantitative: it says that the product of the uncertainties can never be smaller than a fundamental constant:

ΔxΔp2.\Delta x \Delta p \geq \frac{\hbar}{2}.

This shows that if we have a particle in a particular quantum state ψ\ket{\psi} with corresponding spatial wavefunction ψ(x)\psi(x), and we calculate the uncertainty in the position of the particle Δx\Delta x, then even before calculating the momentum wavefunction, we already know that Δp/2Δx\Delta p \geq \hbar/2\Delta x. Hence if Δx\Delta x is small, then this implies that Δp\Delta p is necessarily going to be large.

Returning to our Gaussian wavefunction example from above, we saw in (3.2) and (3.3) that Δx=a\Delta x = a and Δp=/2a\Delta p = \hbar/2a, and so in this case

ΔxΔp=a×2a=2,\Delta x \Delta p = a \times \frac{\hbar}{2a} = \frac{\hbar}{2},

that is, independent of aa, a Gaussian wavefunction saturates the HUP: its product of uncertainties in position and momentum are in fact as small as allowed by quantum mechanics! We won’t prove this, but it can be shown that Gaussian wavefunctions are the only wavefunctions which saturate the HUP. As we will see in numerous examples, ΔxΔp>/2\Delta x \Delta p > \hbar/2 for any other non-Gaussian wavefunction. For this reason, you will see the Gaussian wavefunction referred to as the minimum uncertainty wavefunction in some textbooks. We will also see later on in this unit, that the Gaussian wavefunction is very important, as it corresponds to an ground state (the energy eigenstate of lowest energy) of a harmonic oscillator — i.e. the quantum description of a mass on a spring, or a pendulum, or, in fact the electromagnetic field!

3.3Incompatible Observables

Because of the fact that position and momentum satisfy the HUP, we say that that are incompatible observables: these are two properties of a quantum particle that are somehow in conflict or tension with each other — such that having one property well defined precludes the other property being well defined too. This is a completely novel aspect of quantum mechanics compared to classical physics, where there is no such difficulty. For example, in classical mechanics, a particle has both a definite position and a definite momentum at all times.

Crucially, it is not just position and momentum which are incompatible. In fact most observable properties are incompatible in quantum mechanics! That is, apart from some special cases, if we consider two distinct physical properties, it is very likely that they are incompatible, and that a particle cannot have definite values for both properties simultaneously.

The question we will answer in the next section is how to determine whether two observables are incompatible or not? We will see that this is determined to a purely mathematical property of the operators associated to each observable property, known as commutation:

Two physical observables are compatible if (and only if) their associated operators commute.

We will explain what this means in the next section.

3.4Commutation

Recall that operators are things which ‘act’ on a quantum state. It is possible to act with multiple operators in a sequence. We can therefore ask a natural question: whether the order matters or not. That is, if we act with one operator and then a second, is this the same as first acting with the second operator, followed by the first? The answer in general is no. This is precisely what is captured by commutation: two operators commute if the two orders lead to the same action. On the other hand, two operators for which the two orders lead to different overall actions are said to not commute.

In order to capture this mathematically, we introduce the so-called commutator of two operators. We denote the commutator of (two arbitrary operators) A^\hat{A} and B^\hat{B} by [A^,B^][\hat{A}, \hat{B}], and it is defined by

[A^,B^]=A^B^B^A^.[\hat{A},\hat{B}] = \hat{A}\hat{B} - \hat{B}\hat{A}.

That is, the commutator is the difference between the two orders. Having introduced the commutator, we can re-state in a more succinct manner what it means for two operators to commute:

A^\hat{A} and B^\hat{B} commute if (and only if) [A^,B^]=0[\hat{A}, \hat{B}] = 0.

This is because the commutator will vanish if A^B^=B^A^\hat{A}\hat{B} = \hat{B}\hat{A}, i.e. precisely when the two different orders coincide.

In the context of quantum mechanics, we are usually interested not in the operators A^\hat{A} and B^\hat{B} directly, but rather thier associated operators A^w\op{A} and B^w\op{B} (that act on wavefunctions ψ(x)\psi(x) rather than on quantum states ψ\ket{\psi}). We will use the same notation to refer to the associated commutator in this context:

[A^w,B^w]ψ(x)=A^w(B^wψ(x))B^w(A^wψ(x)),[\op{A}, \op{B}]\psi(x) = \op{A}\left(\op{B}\psi(x)\right) - \op{B}\left(\op{A}\psi(x)\right),

where we have additionally included brackets, to make it clear exactly what it means in this case. Note that, in this case, we really need the wavefunction in order for this to make sense properly, even though the wavefunction is completely arbitrary.

Crucially, the commutation of any two operators A^\hat{A} and B^\hat{B} can be determined from A^w\op{A} and B^w\op{B}. To explain what this means, we will use the example of X^\hat{X} and P^\hat{P}. As an exercise, you will show that

[X^w,P^w]ψ(x)=x(ixψ(x))+ix(xψ(x))=iψ(x).\begin{align*} [\op{X}, \op{P}]\psi(x) &= x\left(-i\hbar\frac{\partial}{\partial x}\psi(x)\right) + i\hbar\frac{\partial}{\partial x}\left(x\psi(x)\right) \\ &= i\hbar \psi(x). \end{align*}

From this we can ‘read off’ the commutator of X^\hat{X} and P^\hat{P}:

[X^,P^]=i.[\hat{X}, \hat{P}] = i\hbar.

This is often referred to as the canonical commutation relation since it is so important and central.

To arrive at (3.10), we simply replaced everywhere X^w\op{X} and P^w\op{P} by X^\hat{X} and P^\hat{P}, and removed the wavefunction. This is generally true. Namely,

[A^w,B^w]ψ(x)=C^wψ(x)    [A^,B^]=C^.[\op{A},\op{B}]\psi(x) = \op{C}\psi(x) \implies [\hat{A},\hat{B}] = \hat{C}.

That is, we calculate the commutator of A^w\op{A} and B^w\op{B} on a wavefunction ψ(x)\psi(x), and call this C^wψ(x)\op{C}\psi(x) (in general, it will be some operator acting on the wavefunction). Then, from this, using (1.21) we can find the corresponding operator C^\hat{C} associated to C^w\op{C}.

Coming back to the big picture: we see in (3.10) that X^\hat{X} and P^\hat{P} do not commute, and hence (as we already knew in this case!) they are incompatible observables. If we now want to consider other pairs of properties, and want to know if they are incompatible or not, we follow the same procedure: we calculate the commutator of their associated operators (most likely the operators acting on their wavefunctions) and whether or not this vanishes answers our physical question.

3.5Exercises