Skip to article frontmatterSkip to article content

Chapter 1: History and Background

School of Physics, University of Bristol
Video: History and Background

Quantum mechanics constitutes a huge departure from classical physics and completely alters the view we have of how the universe functions. In order to understand where quantum mechanics came from, and why it was necessary to replace classical physics by it, in this first section we briefly describe some of the key phenomena that arose, roughly around the turn of the last century, that classical physics was simply unable to give a satisfactory explanation of. The list here will be neither comprehensive nor in chronological order, but is meant just to provide a short account of the main difficulties faced by classical physics, for which a new physical theory was ultimately necessary. We will also discuss at the same time some of the early ideas and insights which would eventually develop into the theory that we now call quantum mechanics. The goal of this section is to set the scene and provide the a small amount of context for the following sections, where we will begin our exploration of quantum mechanics.

1.1The photoelectric effect

A key effect that classical physics was unable to explain is the photoelectric effect. It is observed that if light is shone on a piece of metal, then electrons are emitted. This effect can be studied carefully by forming a capacitor out of two metal plates in vacuum, placing this in a circuit, and applying a voltage (potential difference) across the plates.

Picture of Photoelectric effect

Figure 1.1:The photoelectric effect. Schematic diagram of the photoelectric effect. Light of a fixed frequency ff is shone on one of the metal plates of a capacitor (which is in a vacuum). This causes electrons to be emitted. Some of these arrive at the other place, and a current is generated. By placing a potential across the plates the maximum kinetic energy of the emitted electrons can be measured.

When light is shone on one of the plates, it causes a current to flow, which is how we know that electrons are being emitted. Varying the potential difference between the plates allows us to study the kinetic energy of the emitted electrons. It is observed that if light of a fixed wavelength λ, and therefore frequency f=cλf = c\lambda, is shone, then for all metals the maximum kinetic energy EmaxE_\mathrm{max} of the electrons emitted follows a simple rule

Emax=hfϕ,E_\mathrm{max} = h f - \phi,

where hh is a constant of nature known as Planck’s constant (since it was discovered by Planck, while exploring blackbody radiation), and equal to

\begin{equation}
h = 6.626 \times 10^{-34} Js,
\end{equation}

and ϕ is known as the workfunction, and is in general different for different metals. This maximum energy is determined by shining the light on the plate at the lower potential, and measuring the potential difference that needs to be put between the plates in order to just stop the current, since in order to reach the other plate (and generate a current) the electron needs enough kinetic energy to overcome this potential difference.

There are some particularly surprising aspects about what is observed:

  • As the intensity of light is increased, the rate at which electrons are emitted increases -- producing a bigger current -- but does not affect the maximum energy of the emitted photons, as measured by varying the potential.
  • Even for extremely weak light, a small current is observed to begin flowing immediately when the light is shone on the metal. That is, there is no delay after shining the light before some current starts flowing.
  • Low frequency/long wavelength light, for example red light, for certain metals is unable to generate any current, no matter how intense the light, while very weak light of higher frequency/shorter wavelength, for example violet light, is able to generate a weak current.

All of these aspects of the photoelectric effect seem impossible to explain within classical physics. In classical physics, the energy density of the light is proportional to the amplitude squared, and does not depend upon the wavelength. Moreover, the energy arrives at a constant rate, and so if the light is very weak, it should take a long time before enough energy is supplied in order to account for the kinetic energy of the emitted electron, so there should be a long delay before the current begins to flow. Finally, why it should be that long wavelength light is simply unable to free any electrons was a mystery.

1.1.1The photon

Einstein was able to provide an explanation for the photoelectric effect by postulating that classical electromagnetism does not in fact completely describe the properties of light. He said that we need to assume moreover that light is quantized in small `packets’ known as photons. Einstein postulated that a photon of wavelength λ and frequency ff has energy

E=hf,E = hf,

which we will refer to as the Einstein relation. With this postulate, the maximum energy that an electron can gain is the energy of the photon E=hfE = h f. Part of this energy will be used in order to liberate the electron from the surface -- this is precisely the workfunction ϕ. The remaining energy is then converted into the kinetic energy of the freed electron.

This model of the photon also accounts for the two surprising aspects of the photoelectric effect. As the intensity is increased, the number of photons increases, accounting for the larger current. It also explains why a current is observed immediately even for very weak light -- since now the rate at which photons arrive will be very slow, but whenever a photon does arrive, it arrives with all of its energy, and there is a chance of emitting an electron.

1.2Atoms

Atoms also provided big challenges for classical physics, from two completely different perspectives: The observation of spectral lines, and their stability.

1.2.1Spectral lines

It is observed that if a gas of atoms (all of the same element) is `excited’ -- for example by making an electrical spark -- then light is emitted. If this light is analysed spectroscopically, then the light is found to contain only certain very precise wavelengths, and none others. These are known as spectral lines. These line were moreover found to have intricate structure. For example, for Hydrogen, it was found that all the observed wavelengths fitted the formula

1λ=RH(1m21n2),\frac{1}{\lambda} = R_H \left(\frac{1}{m^2} - \frac{1}{n^2}\right),

for integers mm and nn, and where RHR_H is a constant known as Rydberg’s constant.

true

Figure 1.2:Spectral lines. Example diagram of the emission spectrum of Hydrogen (showing only lines in the visible part of the spectrum).

In order to explain the existence of spectral lines from the perspective of classical physics, the small number of allowed frequencies of the emitted light must arise from some characteristic vibrations within the atom. But what could these characteristic vibrations be? Moreover, where does the intricate relation between the allowed wavelengths come from? Classical physics seems to provide no satisfactory answer to these questions, and therefore the existence of spectral lines was a mystery.

1.2.2The stability of the atom

A second problem concerns the stability of the atom. Once the nucleus and the electron had been discovered, one possible model for an atom was the `solar system’ model, of a positively charged nucleus at the centre, around which the electron orbits.

Classical electromagnetism says however that accelerating charges emit radiation, and that this radiation carries away with it energy. The electron, which is undergoing circular motion, is accelerating constantly, and should therefore emit radiation and lose energy. Ultimately the electron should therefore spiral into the nucleus as it loses its energy and slows down. Atoms in vacuum are however observed to be very stable, surviving almost indefinitely -- much longer than the timescale over which the electron should lose all its energy and spiral into the nucleus. No satisfactory way of accounting for the stability of atoms from such a classical model was ever found.

1.2.3Quantisation of angular momentum and the Bohr atom

Bohr was able to provide partial answers to both of these problems by postulating that angular momentum of the electron is quantised. In particular, Bohr postulated that much like the energy of a photon is quantised, so too is the angular momentum LL of an electron in orbit around a nucleus. Bohr postulated that an elementary quanta of orbital angular momentum is the Reduced Plank constant

=h2π=1.05×1034Js\hbar = \frac{h}{2\pi} = 1.05 \times 10^{-34} Js

This constant is commonly referred to simply as ‘‘h-bar’’. Bohr postulated that the angular momentum of electrons is only ever an integer multiple of \hbar, and no other value, L=nL = n\hbar.

Under this assumption, Bohr was able to give a stable model for an atom. In particular, electrons do not radiate continuously as classical electromagnetism predicts, but lose angular momentum and energy in finite, quantised amounts, by emitting photons. This radiation continues until the electron has reached the minimum angular momentum of \hbar, after which it no longer radiates, and orbits the nucleus in a stable fashion. Bohr found that the radii and energies of the allowed orbits are

rn=ϵ0h2πMee2n2,En=Mee48ϵ02h21n2,r_n = \frac{\epsilon_0 h^2 }{\pi M_e e^2}n^2,\quad E_n = -\frac{M_e e^4}{8 \epsilon_0^2 h^2} \frac{1}{n^2},

where ϵ0\epsilon_0 is the permittivity of free space, MeM_e is the mass of the electron, ee is the electron charge, and nn is an arbitrary positive integer.

The spectral lines are also explained by this model. When an electron `jumps’ from one orbit to another, its energy changes, and the photon carries away this energy. In particular, if it jumps from the orbit with energy EnE_n to the one with energy EmE_m, then the photon must carry away the energy ΔE=EmEn\Delta E = E_m - E_n. However, Einstein already told us that a photon with energy ΔE\Delta E has frequency f=ΔE/hf = \Delta E/h and wavelength λ=hc/ΔE\lambda = hc/\Delta E. Thus

1λ=ΔEhc=EmEnhc=Mee48ϵ02h3c(1n21m2).\frac{1}{\lambda} = \frac{\Delta E}{hc} = \frac{E_m - E_n}{hc} = \frac{M_e e^4}{8 \epsilon_0^2 h^3 c}\left(\frac{1}{n^2} - \frac{1}{m^2}\right).

The constant out the front exactly matches the experimentally observed Rydberg constant, and thus reproduced the spectral lines of Hydrogen. In fact, this prediction of the Bohr model led to the search, and subsequent discovery of new lines outside the visible part of the electromagnetic spectrum, which had previously not been observed.

1.3Matter waves

Einstein postulated that electromagnetic radiation was in fact comprised of ‘packets’ which he called photons, which can roughly be thought of as particles. De Broglie postulated that matter might conversely behave like a wave. When electromagnetic radiation has energy EE, its momentum is p=E/cp = E/c. Using the Einstein formula for the energy of the photon, E=hf=hc/λE = hf = hc/\lambda, we see that

p=Ec=hcλc=hλ.p = \frac{E}{c} = \frac{hc}{\lambda c} = \frac{h}{\lambda}.\vph

De Broglie postulated that this formula holds universally for both radiation and matter, and prescribed that if a particle has momentum pp then it will behave like a wave of wavelength

p=hλp = \frac{h}{\lambda}

At first sight this seems strange, but it can be readily seen that the wavelengths for ‘‘everyday’’ objects are so small that they would be imperceptible. However, for elementary particles, in conditions which could be formed in the laboratory at that time, in principle this wavelength should have been observable. Two-slit experiments were subsequently carried out, and interference effects, at the wavelength predicted by de Broglie, were observed, indicating that indeed matter can exhibit wave-like properties too.

However, this raised a huge number of questions. Were particles really waves? How does this fit with the conclusion of the photoelectric effect, that waves are in fact particles? There were numerous other phenomena that couldn’t be explained -- subtle aspects of black-body radiation, the way x-rays scatter off solids, the so-called Compton effect. Everything pointed towards the need for a radical re-thinking of the physical world. Quantum mechanics was invented to answer these questions, and in the next section we will start of exploration of the basic aspects of this new theory.