Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Quantum field theory, Part 1

Up to this point, we have seen both Special Relativity and General Relativity, both of which are written in the language of tensors. But we have only seen relativity on big scales acting on macroscopic objects, not the small scales of quantum mechanics. So it natural to ask whether we can describe quantum particles in a relativistic fashion.

The answer is yes, and in fact, the study of the intersection of relativity and quantum mechanics - called relativistic quantum field theory - is central to modern physics. It also has useful applications in giving a theoretical explanation of previously-unknown atomic transitions and performing incredibly-precise quantum calculations for atoms, particularly in extreme high-energy conditions. Some features of quantum mechanics we take for granted - like the photon as the quantum of light - are actually only possible to fully explain using quantum field theory. So let’s dive in!

Introduction to quantum electrodynamics

Throughout our discussion of spontaneous and stimulated emission, we have discussed photons everywhere. But photons are not something that can be completely described using the Schrödinger equtaion. Photons must be described by quantum electrodynamics, the quantum theory of the electromagnetic field.

Remember that light is classically-described as a wave, which obeys the solution to the Maxwell equations of electromagnetism in free space:

2Et2c22E=02Bt2c22B=0\begin{align} \dfrac{\partial^2 \mathbf{E}}{\partial t^2} - c^2 \nabla^2 \mathbf{E} = 0 \\ \dfrac{\partial^2 \mathbf{B}}{\partial t^2} - c^2 \nabla^2 \mathbf{B} = 0 \end{align}

But in classical electrodynamics, we find that due to relativistic effects, the electric field and magnetic field can no longer be thought of as distinct entities. Rather, they must be considered together as part of an electromagnetic field. Instead of separate electric and magnetic fields, we define a common electromagnetic 4-potential, which is a more fundamental physical quantity from which the fields arise. The electromagnetic 4-potential is given by Aμ=(φ,Ax,Ay,Az)A^\mu = (\varphi, A_x, A_y, A_z) where φ\varphi is the electric scalar potential and A=(Ax,Ay,Az)\mathbf{A} = (A_x, A_y, A_z) is the magnetic vector potential. The definitions of φ\varphi and A\mathbf{A} are:

E=φ1cAtB=×A\begin{align} \mathbf{E} &= -\nabla \varphi - \dfrac{1}{c} \dfrac{\partial \mathbf{A}}{\partial t} \\ \mathbf{B} &= \nabla \times \mathbf{A} \end{align}

The electromagnetic 4-potential, being a potential, can be shifted by an arbitrary constant without changing the fields (and thus the physics). Indeed, they can even be shifted by the gradient of a function f(r,t)f(\mathbf{r}, t) without changing the fields, with the transformations:

φ=φftA=A+f\begin{align} \varphi = \varphi - \dfrac{\partial f}{\partial t} \\ \mathbf{A} = \mathbf{A} + \nabla f \end{align}

This is known as gauge freedom, and therefore we must impose certain conditions to restrict the electromagnetic four-potential to a particular form, just as we define a reference position in classical mechanics to define the classical potential energy. Applying these chosen conditions (called “gauges”) is called gauge fixing. There are two common gauges used: the Coulomb gauge, commonly used for non-relativistic quantum field theory, and the Lorenz (note: not “Lorentz”) gauge, usually used for relativistic quantum field theory. As we work in the non-relativistic regime, we will choose the Coulomb gauge, which sets the condition A=0\nabla \cdot \mathbf{A} = 0. Thus, the equations of motion become:

2φt2c22φ=02At2c22A=0\begin{align} \dfrac{\partial^2 \varphi}{\partial t^2} - c^2 \nabla^2 \varphi = 0 \\ \dfrac{\partial^2 \mathbf{A}}{\partial t^2} - c^2 \nabla^2 \mathbf{A} = 0 \end{align}

The general, normalized solutions to the system of PDEs in a region of volume VV is given by:

A(r,t)=1V2k[Ak(k)eikr+Ak(k)eikr]\mathbf{A}(\mathbf{r}, t) = \dfrac{1}{V\sqrt2} \sum_{\mathbf{k}} \left[\mathbf{A}_\mathbf{k}(\mathbf{k}) e^{i \mathbf{k} \cdot \mathbf{r}} + \mathbf{A}_\mathbf{k}^*(\mathbf{k}) e^{-i\mathbf{k} \cdot \mathbf{r}}\right]

Where k\mathbf{k} is the wavevector, which is directly related to the momentum of a wave as well as its wavelength by k=2πλ|\mathbf{k}| = \dfrac{2\pi}{\lambda}, and where Ak(k)\mathbf{A}_\mathbf{k}(\mathbf{k}) represents the polarization (directionality) and field strength of the wave, which are given respectively by the direction and the magnitude of Ak(k)\mathbf{A}_\mathbf{k}(\mathbf{k}). Here, Ak\mathbf{A}_\mathbf{k}^* represents the complex conjugate of Ak(k)\mathbf{A}_\mathbf{k}(\mathbf{k}). Those who are aware may recognize this as a Fourier expansion in terms of the normal modes; if that terminology is unfamiliar, don’t worry about it. Importantly, the wavevector is related to the angular frequency by ωk=kc\omega_\mathbf{k} = \mathbf{k} c, which will be significant later.

The classical field theory of electromagnetism

Now, we will use the tools of classical field theory to be able to analyze the electromagnetic field, because classical quantities of fields translate into operators in quantum field theory. In classical field theory, we define a Lagrangian L(q,q˙,q)\mathcal{L}(q, \dot q, \nabla q) based on the coordinate qq as well as its time and space derivatives (in more precise terms we should write qμq^\mu to denote that this is a coordinate). In our case, q=Aμq = A^\mu is the electromagnetic four-potential. We may also define a Hamiltonian H(q,p)\mathcal{H}(q, p) based on the coordinate qq as well as the canonical momentum pp obtained from p=Lq˙p = \dfrac{\partial \mathcal{L}}{\partial \dot q} (this would also be more precisely be written as pμp^\mu). The Lagrangian of the electromagnetic field in free space is given by:

L=18π(E2B2)=18π(φ1cAt)218π(×A)2\begin{align} \mathcal{L} &= \dfrac{1}{8\pi} (\mathbf{E}^2 - \mathbf{B}^2) \\ &=\dfrac{1}{8\pi} \left(-\nabla \varphi - \dfrac{1}{c} \dfrac{\partial \mathbf{A}}{\partial t}\right)^2 - \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2 \\ \end{align}

We can also write this in terms of the canonical momentum. First, let us derive it by findind the derivative of the Lagrangian with respect to At\dfrac{\partial \mathbf{A}}{\partial t}. This comes out to:

p=LA˙=L(tA)=14πc(φ+1cAt)=14πcE\begin{align} \mathbf{p} &= \dfrac{\partial \mathcal{L}}{\partial \dot{\mathbf{A}}} \\ &= \dfrac{\partial \mathcal{L}}{\partial(\partial_t \mathbf{A})} \\ &= \dfrac{1}{4\pi c} \left(\nabla \varphi + \dfrac{1}{c} \dfrac{\partial \mathbf{A}}{\partial t}\right) \\ &= -\frac{1}{4\pi c}\mathbf{E} \end{align}

That is to say, we have found that the canonical momentum of the electromagnetic four-potential p\mathbf{p} is related to the electric field by E=4πcp\mathbf{E} = -4\pi c\mathbf{p}. This result allows us to write the Lagrangian in the simplified form:

L=18π(E2B2)=18π(4πcp)218π(×A)2=2πc2p218π(×A)2\begin{align} \mathcal{L} &= \dfrac{1}{8\pi} (\mathbf{E}^2 - \mathbf{B}^2) \\ &= \dfrac{1}{8\pi}(-4\pi c\mathbf{p})^2 - \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2 \\ &= 2\pi c^2\mathbf{p}^2 - \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2 \end{align}

It also means that we can rearrange to express At\dfrac{\partial \mathbf{A}}{\partial t} in terms of p\mathbf{p}, something that will be very useful later:

p=14πc(φ+1cAt)4πcp=φ+1cAtc(4πcpφ)=At\begin{gather} \mathbf{p} = \dfrac{1}{4\pi c} \left(\nabla \varphi + \dfrac{1}{c} \dfrac{\partial \mathbf{A}}{\partial t}\right) \\ 4\pi c \mathbf{p} =\nabla \varphi + \dfrac{1}{c} \dfrac{\partial \mathbf{A}}{\partial t} \\ c(4\pi c \mathbf{p} - \nabla \varphi) = \dfrac{\partial \mathbf{A}}{\partial t} \end{gather}

We may use these results to find the Hamiltonian of the electromagnetic field:

H=pq˙L=pAt2πc2p2+18π(×A)2=pc(4πcpφ)2πc2p2+18π(×A)2=2πc2p2cpφ+18π(×A)2\begin{align} \mathcal{H} &= \mathbf{p} \cdot \dot{\mathbf{q}} - \mathcal{L} \\ &= \mathbf{p} \cdot \dfrac{\partial \mathbf{A}}{\partial t} -2\pi c^2\mathbf{p}^2 + \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2 \\ &= \mathbf{p} \cdot c(4\pi c \mathbf{p} - \nabla \varphi) -2\pi c^2\mathbf{p}^2 + \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2 \\ &= 2\pi c^2 \mathbf{p}^2 - c\mathbf{p} \cdot \nabla \varphi + \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2 \end{align}

Up to this point, we have been very sloppy with our language: what we have been calling the Lagrangian is actually called the Lagrangian density, and what we have been calling the Hamiltonian is actually called the Hamiltonian density. We may find the Lagrangian and the Hamiltonian by integrating the Lagrangian and Hamiltonian densities over space (here d3x=dxdydzd^3 x = dx\, dy\, dz):

L=Ld3x=d3x[2πc2p218π(×A)2]H=Hd3x=d3x[2πc2p2cpφ+18π(×A)2]\begin{align} L &= \int \mathcal{L}\, d^3x \\ &= \int d^3 x \left[2\pi c^2\mathbf{p}^2 - \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2\right] \\ H &= \int \mathcal{H} \, d^3x \\ &= \int d^3 x \left[2\pi c^2 \mathbf{p}^2 - c\mathbf{p} \cdot \nabla \varphi + \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2\right] \end{align}

But notice something peculiar. If we explicitly evaluate the middle term, using the results we have derived previously, we get:

cpφ=c(14πcE)φ=14π(E)φF(ϕ)=(F)ϕ=0\begin{align} c\mathbf{p} \cdot \nabla \varphi &= c\left(-\frac{1}{4\pi c}\mathbf{E}\right) \cdot \nabla \varphi \\ &= \underbrace{-\dfrac{1}{4\pi} (\nabla \cdot \mathbf{E}) \varphi}_{\mathbf{F} \cdot (\nabla \phi) = (\mathbf{F} \cdot \nabla) \phi} \\ &= 0 \end{align}

Which comes from the vector calculus identity F(ϕ)=(F)ϕ\mathbf{F} \cdot (\nabla \phi) = (\mathbf{F} \cdot \nabla) \phi and the fact that E=0\nabla \cdot \mathbf{E} = 0 in free space. Thus the middle term in our expression for the Hamiltonian drops out, leaving us with just:

H=d3x[2πc2p2+18π(×A)2]H = \int d^3 x \left[2\pi c^2 \mathbf{p}^2 + \dfrac{1}{8\pi} (\nabla \times \mathbf{A})^2\right]

This, together with the general solution we found for A(r,t)\mathbf{A}(\mathbf{r}, t), lays the classical foundation for quantizing the electromagnetic field.

Quantization of the electromagnetic field

Before we proceed onto the process of treating the electromagnetic field in a fully quantum way, let us take some time to review the quantum harmonic oscillator. The quantum harmonic oscillator has a Hamiltonian H^\hat H given by:

H^=p^22m+12mω2q^2\hat H = \dfrac{\hat p^2}{2m} + \dfrac{1}{2} m\omega^2 \hat q^2

Where q^\hat q is the coordinate used, which, in non-QFT, is equal to the position operator x^\hat x. One may write the Hamiltonian in a more useful form by defining the operators a^\hat a and a^\hat a^\dagger, as follows:

a^=mω2q^+i12mωp^a^=mω2q^i12mωp^\begin{align} \hat a &= \sqrt{\dfrac{m\omega}{2\hbar}} \hat q + i \dfrac{1}{\sqrt{2m\omega \hbar}} \hat p \\ \hat a^\dagger &= \sqrt{\dfrac{m\omega}{2\hbar}} \hat q - i \dfrac{1}{\sqrt{2m\omega \hbar}} \hat p \\ \end{align}

Which allows the Hamiltonian to be written as:

H^=ω(a^a^12)\hat H = \hbar \omega\left(\hat a \hat a^\dagger - \dfrac{1}{2}\right)

We may now take these results for the non-QFT quantum harmonic oscillator and generalize it to the quantized electromagnetic field. To do so, we promote the classical quantities to their quantum operator analogues: we replace the generalized coordinate q^\hat q with our field expression q^Ak\hat q \to \mathbf{A}_\mathbf{k}, and we replace the momentum operator p^\hat p with the canonical momentum we just derived earlier, and substitute them into the a^\hat a and a^\hat a^\dagger operators. For the momentum, we know that p=14πcE\mathbf{p} = -\dfrac{1}{4\pi c} \mathbf{E}; using the relationship E=φ1cAt\mathbf{E} = -\nabla \varphi - \dfrac{1}{c} \dfrac{\partial \mathbf{A}}{\partial t}, which just becomes E=1cAt\mathbf{E} = -\dfrac{1}{c} \dfrac{\partial \mathbf{A}}{\partial t} in our case, yields:

p=14πcE=14πc2At=i8πc2Vωkk[Ak(k)eikr+Ak(k)eikr]=12Vk[pkeikr+pkeikr]pk=iωk4πc2Ak\begin{align} \mathbf{p} &= -\dfrac{1}{4\pi c} \mathbf{E} \\ &= \dfrac{1}{4\pi c^2} \dfrac{\partial \mathbf{A}}{\partial t} \\ &=\dfrac{-i}{8\pi c^2 \sqrt{V}}\omega_\mathbf{k} \sum_{\mathbf{k}} \left[\mathbf{A}_\mathbf{k}(\mathbf{k}) e^{i \mathbf{k} \cdot \mathbf{r}} + \mathbf{A}_\mathbf{k}^*(\mathbf{k}) e^{-i\mathbf{k} \cdot \mathbf{r}}\right] \\ &= \dfrac{1}{2\sqrt{V}} \sum_{\mathbf{k}} \left[\mathbf{p}_\mathbf{k} e^{i \mathbf{k} \cdot \mathbf{r}} + \mathbf{p}_\mathbf{k}^* e^{-i\mathbf{k} \cdot \mathbf{r}}\right] \\ &\qquad \Rightarrow \mathbf{p}_\mathbf{k} = \dfrac{-i\omega_\mathbf{k}}{4\pi c^2} \mathbf{A}_\mathbf{k} \end{align}

From here, we may define the operators Q^\hat Q and P^\hat P, by:

Q^=mωkq^=mωkAkP^=1mωkp^=1mωk(iωk4πc2Ak)=i4πc21mωkAk\begin{align} \hat Q &= \sqrt{\dfrac{m\omega_\mathbf{k}}{\hbar}} \hat q \\ &= \sqrt{\dfrac{m\omega_\mathbf{k}}{\hbar}} \mathbf{A}_\mathbf{k} \\ \hat P &= \dfrac{1}{\sqrt{m\hbar \omega}_\mathbf{k}} \hat p \\ &= \dfrac{1}{\sqrt{m\hbar \omega}_\mathbf{k}} \left(\dfrac{-i\omega_\mathbf{k}}{4\pi c^2} \mathbf{A}_\mathbf{k}\right) \\ &= \dfrac{-i}{4\pi c^2} \dfrac{1}{\sqrt{m\hbar \omega}_\mathbf{k}} \mathbf{A}_\mathbf{k} \end{align}

For which we then have:

a^=12(Q^+iP^)=mωk2Ak+14πc212mωkAka^=12(Q^iP^)=mωk2Ak14πc212mωkAk\begin{align} \hat a &= \dfrac{1}{\sqrt{2}}(\hat Q + i \hat P) \\ &= \sqrt{\dfrac{m\omega_\mathbf{k}}{2\hbar}} \mathbf{A}_\mathbf{k} + \dfrac{1}{4\pi c^2} \dfrac{1}{\sqrt{2m\hbar \omega}_\mathbf{k}} \mathbf{A}_\mathbf{k} \\ \hat a^\dagger &= \dfrac{1}{\sqrt{2}}(\hat Q - i\hat P) \\ &= \sqrt{\dfrac{m\omega_\mathbf{k}}{2\hbar}} \mathbf{A}_\mathbf{k} - \dfrac{1}{4\pi c^2} \dfrac{1}{\sqrt{2m\hbar \omega}_\mathbf{k}} \mathbf{A}_\mathbf{k} \end{align}

The Hamiltonian becomes the Hamiltonian density, and so it must be integrated over all of space:

H^=d3x[ωk(a^a^12)]\hat H = \int d^3 x \, \left[\hbar \omega_\mathbf{k}\left(\hat a \hat a^\dagger - \dfrac{1}{2}\right)\right]

It is common to denote the states of the quantum electromagnetic field with n|n\rangle, where 0|0\rangle is the ground state, 1|1\rangle is the first excited state, 2|2\rangle is the second excited state, and so on. The creation and annihilation operators have the properties that:

a^n=cnn1a^n=cnn+1\begin{align} \hat a |n\rangle = c_n |n-1\rangle \\ \hat a^\dagger |n\rangle = c_n^* |n+1\rangle \\ \end{align}

That is to say, applying the creation operator a^\hat a^\dagger raises the electromagnetic field from state n|n\rangle to a higher-energy state n+1|n + 1\rangle, while applying the annihilation operator a^\hat a lowers the electromagnetic field from state n|n\rangle to a lower-energy state n1|n - 1\rangle, with cn,cnc_n, c_n^* being normalization constants given by:

cn=ncn=n+1\begin{align} c_n &= \sqrt{n} \\ c_n^* &= \sqrt{n + 1} \end{align}

This process continues until we reach the ground state, where n=0n = 0, and thus applying the annihilation operator on the ground state yields a^0=(0)0=0\hat a | 0 \rangle = (\sqrt{0}) | 0 \rangle = 0.

Calculating the ground-state energy of the electromagnetic field

When we say “ground-state energy”, we should be careful about our use of words: there is no singular ground-state energy for a quantum field, as it constantly fluctuates in energy due to the Heisenberg uncertainty principle ΔEΔt2\Delta E \Delta t \geq \dfrac{\hbar}{2}. Thus the ground-state energy we speak of is in fact the average ground-state and would be given by the expectation value of the Hamiltonian:

E0=0H^0=0d3x[ωk(a^a^12)]0=0(d3x(ωka^a^)12d3xωk)0\begin{align} \langle E_0\rangle &= \langle 0 |\hat H |0\rangle \\ &= \big \langle 0 \big| \int d^3 x \, \left[\hbar \omega_\mathbf{k}\left(\hat a \hat a^\dagger - \dfrac{1}{2}\right)\right] \big|0\big\rangle \\ &= \big \langle 0 \big| \left( \int d^3 x \left( \hbar \omega_\mathbf{k} \hat a \hat a^\dagger\right) - \dfrac{1}{2} \int d^3 x\, \hbar \omega_\mathbf{k}\right)\big|0\rangle \end{align}

Remember that the annihilation operator annihilates the ground state, so the first term in the integral is zero. We are simply left with:

E0=0(12d3xωk)0\langle E_0 \rangle = \big \langle 0 \big| \left( - \dfrac{1}{2} \int d^3 x\, \hbar \omega_\mathbf{k}\right)\big|0\rangle

But wait! This integral looks divergent! Indeed it is, since every point in space has a nonzero energy 12ωk\dfrac{1}{2} \hbar \omega_\mathbf{k}, and therefore if we integrate this over all space, the integral becomes infinite and unphysical. Thus, we usually just ignore this integral by redefining the energy of a state as the energy above the ground state. By this redefinition, the ground state would then be at E0=0E_0 = 0, and the successive excited states would have non-infinite energies En>0E_n > 0.

However, if we wanted, we could restrict our integral over a particular region, such as a cubical box of side length LL. Then, assuming that the electromagnetic field is restricted to one wavelength within the box, the integral could be evaluated, and we get a reasonable value for the energy:

E0=0(12d3xωk)0=0(120L0L0Ldxdydzωk)0=0L3ωk20=L3ωk200=L3ωk2\begin{align} \langle E_0 \rangle &= \big \langle 0 \big| \left( - \dfrac{1}{2} \int d^3 x\, \hbar \omega_\mathbf{k}\right)\big|0\rangle \\ &= \big \langle 0 \big| \left( - \dfrac{1}{2} \int_0^L \int_0^L \int_0^L dx\, dy\, dz\, \hbar \omega_\mathbf{k}\right)\big|0\rangle \\ &= -\langle 0 |\dfrac{L^3 \hbar \omega_\mathbf{k}}{2} |0 \rangle \\ &= -\dfrac{L^3 \hbar \omega_\mathbf{k}}{2} \langle 0 | 0 \rangle \\ &= -\dfrac{L^3 \hbar \omega_\mathbf{k}}{2} \end{align}

This is exactly what happens in the Casimir effect, an effect that arises due to the nonzero ground-state energy of the quantum electrodynamical vacuum, which leads to a force between two metal plates brought close together. Within finite distances and volumes, it is indeed possible to calculate the ground-state energy, and this is essential to many quantum effects.

Gauge coupling

Unlike the quantum electrodynamical field A^μ\hat A_\mu, which reduces to the classical electromagnetic 4-potential (field) AμA_\mu in the classical limit, the quantum electron field ψ\psi has no classical field it reduces to. In the classical picture, AμA_\mu satisfies a wave equation, and therefore describes (among other things) electromagnetic waves; light is an electromagnetic wave and can be described by AμA_\mu. But in the classical picture, there is no analog for ψ\psi; an electron is just a particle, it isn’t a field. In fact, only massless particles with integer spin are associated with classical fields (the only two particles that satisfy this requirement are the photon and graviton, but the latter has yet to be experimentally confirmed and remains purely theoretical for now).

QED correction to the Coulomb potential

Uehling potential

Multi-state systems with the quantized electromagnetic field

Derive the Einstein A and B coefficients for spontaneous and stimulated emission.

The nonrelativistic and classical limit

Other corrections

See https://books.google.com/books?id=nxz2CAAAQBAJ&lpg=PA103&pg=PA99#v=onepage&q&f=false This guide is based on: