- A.22.1 Classical selectostatics
- A.22.2 Classical selectodynamics
- A.22.3 Quantum selectostatics
- A.22.4 Poincaré and Einstein try to save the universe
- A.22.5 Lorenz saves the universe
- A.22.6 Gupta-Bleuler condition
- A.22.7 The conventional Lagrangian
- A.22.8 Quantization following Fermi
- A.22.9 The Coulomb potential and the speed of light

A.22 Forces by particle exchange

As noted in chapter 7.5.2, the fundamental forces of nature
arise from the exchange of particles. This addendum will illustrate
the general idea. It will first derive the hypothetical
Koulomb

force due to the exchange of equally
hypothetical particles called fotons.

The Koulomb potential provides a fairly simple model of a quantum field. It also provides a simple context to introduce some key concepts in quantum field theories, such as Green’s functions, variational calculus, Lagrangians, the limitation of the speed of light, description in terms of momentum modes, Fock space kets, annihilation and creation operators, antiparticles, special relativity, the imperfections of physicists, and Lorentz invariance. The Koulomb potential can also readily be modified to explain nuclear forces. However, that will have to wait until a later addendum, {A.41}.

In the current addendum, the Koulomb potential provides the starting point for a discussion of the electromagnetic field. The classical Maxwell equations for the electromagnetic field will be derived in a slightly unconventional way. Who needs to know classical electromagnetics when all it takes is quantum mechanics, relativity, and a few plausible guesses to derive electromagnetics from scratch?

To quantize the electromagnetic field is not that straightforward; it has unexpected features that do not occur for the Koulomb field. This book follows the derivation as formulated by Fermi in 1932. This derivation is the basis for more advanced modern quantum field approaches. These advanced theories will not be covered, however.

Essentially, the Fermi derivation splits off the Coulomb potential from the electromagnetic field. What is left is then readily described by a simple quantum field theory much like for the Koulomb potential. This is sufficient to handle important applications such as the emission or absorption of radiation by atoms and atomic nuclei. That, however, will again be done in subsequent addenda.

A word to the wise. While this addendum is on the calculus level like virtually everything else in this book, there is just quite a lot of mathematics. Some mathematical maturity may be needed not to get lost. Note that this addendum is not needed to understand the discussion of the emission and absorption of radiation in the subsequent addenda.

A.22.1 Classical selectostatics

The Koulomb force holds the sarged spotons and selectons together in satoms. The force is due to the exchange of massless particles called fotons between the sarged particles. (It will be assumed that the spoton is an elementary particle, though really it consists of three skarks.)

This subsection will derive the selectostatic Koulomb force by representing the fotons by a classical field, not a quantum field. The next subsection will explain classical selectodynamics, and how it obeys the speed of light. Subsection A.22.3 will eventually fully quantize the selectic field. It will show how quantum effects modify some of the physics expected from the classical analysis.

Physicists have some trouble measuring the precise properties of the selectic field. However, a few basic quantum ideas and some reasonable guesses readily substitute for the lack of empirical data. And guessing is good. If you can guess a self-consistent Koulomb field, you have a lot of insight into its nature.

Consider first the wave function for the exchanged foton in isolation.
A foton is a boson without spin. That means that its wave function is
a simple function, not some vector. But since the foton is massless,
the Schrödinger equation does not apply to it. The appropriate equation
follows from the relativistic expression for the energy of a massless
particle as given by Einstein, chapter 1.1.2
(1.2):

Here is the foton energy, its linear momentum, and the speed of light. The squares are used because momentum is really a vector, not a number like energy.

Quantum mechanics replaces the momentum vector by the operator

Note the vector operator , called

nablaor

del.This operator is treated much like an ordinary vector in various computations. Its properties are covered in Calculus III in the U.S. system. (Brief summaries of properties of relevance here can be found in the notations section.)

The Hamiltonian eigenvalue problem for a foton wave function
then takes the form

A solution to this equation is an energy eigenstate. The corresponding value of is the energy of the state. (To be picky, the above is an eigenvalue problem for the square Hamiltonian. But eigenfunctions of an operator are also eigenfunctions of the square operator. The reverse is not always true, but that is not a concern here.)

Using the momentum operator as given above and some rearranging, the
eigenvalue problem becomes

For foton wave functions that are not necessarily energy eigenstates,
quantum mechanics replaces the energy by the operator
. That gives the time-dependent
Klein-Gordon equation as:

Now consider solutions of this equation of the form

Here is a positive constant called the angular frequency. Substitution in the time-dependent Klein-Gordon equation shows that this solution also satisfies the time-independent Klein-Gordon equation, with energy

That is the famous

Planck-Einsteinrelation. It is implicit in the association of with .

Note however that there will also be a solution of the form

This solution too has energy . The difference in sign in the exponential is taken to mean that the particle moves backwards in time. Note that changing the sign in the exponential is equivalent to changing the sign of the time . At least it is if you require that cannot be negative. If a particle moves backwards in time, it is called an

antiparticle.So the wave function above describes an

antifoton.

There is really no physical difference between a foton and an antifoton. That is not necessarily true for other types of particles. Quantities such as electric charge, lepton number, baryon number, strangeness, etcetera take opposite values for a particle and its antiparticle.

There is a very important difference between the Klein-Gordon equation and the Schrödinger equation. The Schrödinger equation describes nonrelativistic physics where particles can neither be destroyed nor created. Mass must be conserved. But the Klein-Gordon equation applies to relativistic physics. In relativistic physics particles can be created out of pure energy or destroyed following Einstein’s famous relationship , chapter 1.

There is a mathematical consequence to this. It concerns the integral

(In this addendum, integrals like this are over all space unless explicitly stated otherwise. It is also assumed that the fields vanish quickly enough at large distances that such integrals are finite. Alternatively, for particles confined in a large box it is assumed that the box is periodic, chapter 6.17.) Now for solutions of the Schrödinger equation, the integral keeps the same value, 1, for all time. Physically, the integral represent the probability of finding the particle. The probability of finding the particle if you look in all space must be 1.

But fotons are routinely destroyed or created by sarged particles. So the probability of finding a foton is not a preserved quantity. (It is not even clear what finding a foton would mean in the first place.) The Klein-Gordon equation reflects that. It does not preserve the integral . (There is one exception: if the wave function is purely described by particle states or purely described by antiparticle states, the integral is still preserved.)

But the Klein-Gordon equation does preserve an other integral,
{D.32}. That is

Now if the number of fotons is not a preserved quantity, what can this preserved integral stand for? Not momentum or angular momentum, which are vectors. The integral must obviously stand for the energy. Energy is still preserved in relativity, even if the number of particles of a given type is not.

Of course, the energy of a foton wave function is also given by the Planck-Einstein relation. But wave functions are not observable. Still, fotons do affect spotons and selectons. That is observable. So there must be an observable foton field. This observable field will be called the foton potential. It will be indicated by simply , without a subscript f. Quantum uncertainty in the values of the field will be ignored in this subsection. So the field will be modeled as a classical (i.e. nonquantum) field.

And if there is an observable field, there must be an observable
energy associated with that field. Now what could the expression for
the energy in the field be? Obviously it will have to take the form
of the integral above. What other options are there that are
plausible? Of course, there will be some additional empirical
constant. If the integral is constant, then any multiple of it will
be constant too. And the above integral will not have units of energy
as it is. The needed empirical constant is indicated by
and is called, um no, the permissivity of space. It is a measure of
how efficient the foton field is in generating energy. To be precise,
for arcane historical reasons the constant in the energy is actually
defined as half the permissivity. The bottom line is that the
expression for the energy in the observable foton field is:

That is really all that is needed to figure out the properties of classical selectostatics in this subsection. It will also be enough to figure out classical selectodynamics in the next subsection.

The first system that will be considered here is that of a foton field and a single spoton. It will be assumed that the spoton is pretty much located at the origin. Of course, in quantum mechanics a particle must have some uncertainty in position, or its kinetic energy would be infinite. But it will be assumed that the spoton wave function is only nonzero within a small distance of the origin. Beyond that distance, the spoton wave function is zero.

However, since this is a classical derivation and not a quantum one,
the term spoton wave function

must not be used. So
imagine instead that the spoton sarge is smeared out
over a small region of radius around the origin.

For a smeared out sarge, there will be a sarge density

, defined as the local sarge per unit volume.
This sarge density can be expressed mathematically as

Here is some function that describes the detailed shape of the smeared-out sarge distribution. The integral of this function must be 1, because the sarge density must integrate to the total spoton sarge . So:

To ensure that the sarge density is zero for distances from the origin greater than the given small value , must be zero at these distances. So:

In the limit that becomes zero, becomes the so-called three-dimensional “Dirac delta function” . This function is totally concentrated at a single point, the origin. But its integral over that single point is still 1. That is only possible if the function value at the point is infinite. Now infinity is not a proper number, and so the Dirac delta function is not a proper function. However, mathematicians have in fact succeeded in generalizing the idea of functions to allow delta functions. That need not concern the discussion here because “Physicists are sloppy about mathematical rigor,” as Zee [52, p. 22] very rightly states. Delta functions are named after the physicist Dirac. They are everywhere in quantum field theory. That is not really surprising as Dirac was one of the major founders of the theory. See section 7.9 for more on delta functions.

Here the big question is how the spoton manages to create a foton
field around itself. That is not trivial. If there was a nonzero
probability of finding an energetic foton well away from the spoton,
surely it would violate energy conservation. However, it turns out
that the time-independent Klein-Gordon equation (A.101)
actually has a very simple solution where the foton energy appears
to be zero away from the origin. In spherical coordinates, it is

Here is some constant which is still arbitrary about this stage. To check the above solution, plug it into the energy eigenvalue problem (A.101) with zero.

This then seems to be a plausible form for the observable potential
away from the spoton at the origin. However, while the
energy of a potential appears to be zero, it is not really.
Such a potential is infinite at the origin, and you cannot just ignore
that. The correct foton field energy is given by the earlier integral
(A.103). For a steady potential, it can be written as

All that then raises the question why there is a foton field in the first place. The interest in this subsection is in the selectostatic field. That is supposed to be the stable ground state of lowest energy. According to the above, the state of lowest energy would be when there is no foton field; 0.

And so it is. The only reasonable way to explain that there is a nontrivial foton field in the ground state of the spoton-foton system is if the foton field energy is compensated for by something else. There must be an energy of interaction between the foton field and the spoton.

Consider the mathematical form that this energy could take in a given
volume element . Surely the simplest possibility is
that it is proportional to the potential at the location
times the sarge . Therefore the total
energy of spoton-foton interaction is presumably

The question is now, what is the ground state foton field? In other words, for what potential is the complete system energy minimal? To answer that requires “variational calculus.” Fortunately, variational calculus is just calculus. And you need to understand how it works if you want to make any sense at all out of books on quantum field theory.

Suppose that you wanted an equation for the minimum of some function
depending on a single variable . The equation would be
that 0 at the position of the minimum
. In terms of differentials, that would mean that
the function does not change going from position to a
slightly different position :

It is the same for the change in net energy . Assume that is the desired potential at minimum net energy. Then at the net energy should not change when you change by an infinitesimal amount . Or rather, by an infinitesimal amount : the symbol is used in variational calculus instead of . That is to avoid confusion with any symbol that may already be around.

So the requirement for the ground state potential is

Using the expressions (A.104) and (A.105) for the energies, that means that

The usual rules of calculus can be used, (see {A.2} for more details). The only difference from basic calculus is that the change may depend on the point that you look at. In other words, it is some arbitrary but small function of the position . For example,

Also, by itself is validly approximated as , but is a completely separate quantity that can be anything. Working it out gives

Performing an integration by parts moves the from to and adds a minus sign. Then the two integrals combine as

If this is supposed to be zero for whatever you take the small change in field to be, then the parenthetical expression in the integral will have to be zero. If the parenthetical expression is nonzero somewhere, you can easily make up a nonzero change in that region so that the integral is nonzero.

The parenthetical expression can now be rearranged to give the final
result:

minwas left away again as the ground state is the only state of interest here anyway.

The above equation is the famous “Poisson equation” for the selectostatic potential
. The same equation appears in electrostatics, chapter
13.3.4. So far, this is all quite encouraging. Note also
that the left hand side is the steady Klein-Gordon equation. The
right hand side is mathematically a forcing

term; it
forces a nonzero solution for .

Beyond the small vicinity of radius around the origin,
the spoton sarge density in the right hand side is zero. That means
that away from the spoton, you get the time-independent Klein-Gordon
equation (A.101) with 0. That was a good guess, earlier.
Assuming spherical symmetry, away from the spoton the solution to the
Poisson equation is then indeed

But now that the complete Poisson equation (A.106) is known, the constant can be figured out, {D.2}. The precise field turns out to be

For unit value of the above solution is called the “fundamental solution” or “Green’s function” of the Poisson equation. It is the solution due to a delta function.

If the spoton is not at the origin, but at some position
, you simply replace by the distance from that
point:

Now the net energy is of interest. It can be simplified by
substituting the Poisson equation (A.106) in the expression
(A.104) for the foton field energy and adding the interaction
energy (A.105). That gives

which simplifies to

Note that the spoton-foton interaction energy is twice the foton field energy, and negative instead of positive. That means that the total energy has been lowered by an amount equal to the foton field energy, despite the fact that the field energy itself is positive.

The fact that there is a foton field in the ground state has now been explained. The interaction with the spoton lowers the energy more than the field itself raises it.

Note further from the solution for above that is large in the vicinity of the spoton. As a result, the energy in the foton field becomes infinite when the spoton sarge contracts to a point. (That is best seen from the original integral for the foton field energy in (A.104).) This blow up is very similar to the fact that the energy in a classical electromagnetic field is infinite for a point charge. For the Koulomb field, the interaction energy blows up too, as it is twice the foton field energy. All these blow ups are a good reason to use a sarge density rather than a point sarge. Then all energies are normal finite numbers.

The final step to derive the classical Koulomb force is to add a selecton. The selecton is also sarged, so it too generates a field. To avoid confusion, from now on the field generated by the spoton will always be indicated by , and the one generated by the selecton by . The variational analysis can now be repeated including the selecton, {D.37.1}. That shows that there are three effects that produce the Koulomb force between the spoton and selecton:

- 1.
- the selecton sarge interacts with the potential generated by the spoton;
- 2.
- the spoton sarge interacts with the potential generated by the selecton;
- 3.
- the energy in the combined foton field is different from the sum of the energies of the separate fields and .

All three effects turn out to produce the same energy, but the first
two energies are negative and the third positive. So the net energy
change is the same as if there was just item 1, the interaction of the
selecton sarge density with the potential
produced by the spoton. That is of course given by
a similar expression as before:

The expression for was given above in (A.107) for any arbitrary position of the spoton . And it will be assumed that the selecton sarge density is spread out a bit just like the spoton one, but around a different location . Then the interaction energy becomes

Since is assumed small, the selecton sarge density is
only nonzero very close to the nominal position .
Therefore you can approximate as in the fraction
and take it out of the integral as a constant. Then the delta
function integrates to 1, and you get

That then is the final energy of the Koulomb interaction between the two sarged particles. Because the spoton and the selecton both interact with the foton field, in effect it produces a spoton-selecton interaction energy.

Of course, in classical physics you would probably want to know the
actual force on say the selecton. To get it, move the origin of the
coordinate system to the spoton and rotate it so that the selecton
is on the positive -axis. Now give the selecton a small
displacement in the -direction. Slowly of
course; this is supposed to be selectostatics. Because of energy
conservation, the work done by the force during
this displacement must cause a corresponding small decrease in energy.
So:

But on the positive -axis, is just the -position of the selecton , so

It is seen that if the sarges have equal sign, the force is in the negative -direction, towards the spoton. So sarges of the same sign attract.

More generally, the force on the selecton points towards the spoton if the sarges are of the same sign. It points straight away from the spoton if the sarges are of opposite sign.

The Koulomb energy looks almost exactly the same as the Coulomb energy in electrostatics. Recall that the Coulomb energy was used in chapter 4.3 to describe the attraction between the proton and electron in a hydrogen atom. The difference is that the Coulomb energy has no minus sign. That means that while like sarges attract, like charges repel each other. For example, two spotons attract, but two protons repel.

Now a spoton must necessarily create a foton field that is attractive to spotons. Otherwise there should be no field at all in the ground state. And if spotons create fields that attract spotons, then spotons attract. So the Koulomb force is clearly right.

It is the Coulomb force that does not seem to make any sense. Much more will be said about that in later subsections.

A.22.2 Classical selectodynamics

According to the previous section the Koulomb energy between a spoton
and a selecton is given by

However, this result can only be correct in a stationary state like a ground state, or maybe some other energy state.

To see the problem, imagine that the spoton is suddenly given a kick. According to the Koulomb potential given above, the selecton notices that instantly. There is no time in the Koulomb potential, so there is no time delay. But Einstein showed that no observable effect can move faster than the speed of light. So there should be a time delay.

Obviously then, to discuss unsteady evolution will require the full governing equations for selectodynamics. The big question is how to find these equations.

The quantum mechanics in this book is normally based on some Hamiltonian . But there is a more basic quantity for a system than the Hamiltonian. That quantity is called the “Lagrangian” . If you can guess the correct Lagrangian of a system, its equations of motion follow. That is very important for quantum field theories. In fact, a lot of what advanced quantum field theories really do is guess Lagrangians.

To get at the Lagrangian for selectodynamics, consider first the
motion of the spoton for a given foton field .
The Hamiltonian of the spoton by itself is just the energy of the
spoton. As discussed in the previous subsection, a spoton has a
potential energy of interaction with the given foton field

However, to discuss the dynamics of the spoton, it is easier to
consider it a point particle located at a single moving point
. Therefore it will be assumed that the sarge
density is completely concentrated at that one point. That means that
the only value of the foton field of interest is the value at
. And the sarge distribution integrates to the
net spoton sarge . So the above energy of
interaction becomes approximately

Note that in this addendum the position components are indicated as , , and instead of the more familiar , , and or , , and . That is in order that a generic position component can be indicated by where can be 1, 2, or 3.

In addition to the interaction energy above there is the kinetic
energy of the spoton,

Here is the mass of the spoton and its velocity,

The kinetic energy can be written out in terms of the velocity components as

Now the Hamiltonian of the spoton is the sum of the kinetic and
potential energies. But the Lagrangian is the difference
between the kinetic and potential energies:

This Lagrangian can now be used to find the equation of motion of the
spoton. This comes about in a somewhat weird way. Suppose that there
is some range of times, from a time to a time ,
during which you want to know the motion of the spoton. (Maybe the
spoton is at rest at time and becomes again at rest at time
.) Suppose further that you now compute the so-called
action

integral

If you use the correct velocity and position of the spoton, you will get some number. But now suppose that you use a slightly different (wrong) spoton path. Suppose it is different by a small amount , which of course depends on time. You would think that the value of the action integral would change by a corresponding small amount. But that is not true. Assuming that the path used in the original integral was indeed the right one, and that the change in path is infinitesimally small, the action integral does not change. Mathematically

Yes, this is again variational calculus. The action may not be minimal at the correct spoton path, but it is definitely stationary at it.

Probably this sounds like a stupid mathematical trick. But in the so-called path integral approach to quantum field theory, the action is central to the formulation.

For classical physics the action by itself is pretty useless.
However, with some manipulations, you can get the evolution equations
for your system out of it, {A.1}. They are found
as

Note that for the governing equations it does not matter at all what you take the times and in the action to be. They are pretty vaguely defined anyway. You might want to let them go to minus and plus infinity to get rid of them.

The next step is to write out the governing equation (A.112)
in terms of physical quantities. To do that correctly, the trick is
that the Lagrangian must be treated as a function of velocity and
position, as independent variables. In reality velocity and position
are not independent; velocity is the derivative of position. But when
differentiating the Lagrangian you are supposed to forget about that.
Consider how this works out for the -component, 1,

That are simple differentiations taking the given Lagrangian at face value.

However, when you do the remaining time derivative in (A.112)
you have to do it properly, treating the velocity as the function of
time that it is. That gives the final equation of motion as

Note that the left hand side is mass times acceleration in the
-direction. So the right hand side must be the selectic force on
the spoton. This force is called the Sorentz force.

It is seen that the Sorentz force is proportional to the derivative of
the foton potential, evaluated at the position of the spoton. If you
compare the Sorentz force with the force in electrostatics, you see
that the force in electrostatics has an additional minus sign. That
reflects again that equal sarges attract, while equal charges repel.

So far, it was assumed that the foton field was given. But in reality
the foton field is not given, it depends on the motion of the spoton.
To describe the field, its energies must be added to the Lagrangian
too. The total energy in the foton field was given in the previous
subsection as (A.103). Using some shorthand notation, this
becomes

The shorthand is to indicate derivatives by subscripts, as in

with 1, 2, or 3 for the , , and derivatives respectively. For example, would be the partial -derivative of .

Actually, even more concise shorthand will be used. If an index like
occurs twice in a term, summation over that index is to be
understood. The summation symbol will then not be shown. That is
called the Einstein summation convention. So the energy in the foton
field will be indicated briefly as

(Note that , so occurs twice in the second term of the integrand.) All this is done as a service to you, the reader. You are no doubt getting tired of having to look at all these mathematical symbols.

Now the first term in the energy above is a time derivative, just like
was the time derivative of the spoton position.
So this term has presumably the same sign in the Lagrangian, while the
sign of the other term flips over. That makes the total
selectodynamic Lagrangian equal to

The last two terms are as before for a given field.

However, for the final term it is now desirable to go back to the
representation of the spoton in terms of a sarge density
, as in (A.110). The final term as
written would lead to a nasty delta function in the analysis of the
field. In the sarge density form the term can be brought inside the
integral to give the complete Lagrangian as

An integrand of a spatial integral in a Lagrangian is called a
“Lagrangian density” and indicated by the symbol .
In this case:

The action principle can readily be extended to allow for Lagrangian
densities, {D.37}. The equations of motion for the field
are then found to be

Working this out much like for the equation of motion of the spoton
gives, taking to the other side,

Saxwell wave equationof selectodynamics. If there is also a selecton, say, its sarge density can simply be added to the spoton one in the right hand side.

To check the Saxwell equation, first consider the case that the system is steady, i.e. independent of time. In that case the Saxwell wave equation becomes the Poisson equation of the previous subsection as it should. (The second term is summed over the three Cartesian directions . That gives .) So the spoton produces the same steady Koulomb field (A.107) as before. So far, so good.

How about the force on a selecton in this field? Of course, the
force on a selecton is a Sorentz force of the same form as
(A.113),

self-interactionproduces no net force. That is fortunate because if the selecton was really a point sarge, the self-interaction is mathematically singular.) Now minus the potential (A.107) times the selecton sarge gave the energy of the spoton-selecton interaction in the previous subsection. And minus the derivative of that gave the force on the selecton. A look at the force above then shows it is the same.

So in the steady case the Saxwell equation combined with the Sorentz force does reproduce selectostatics correctly. That means that the given Lagrangian (A.114) contains all of selectostatics in a single concise mathematical expression. At the minimum. Neat, isn’t it?

Consider next the case that the time dependence cannot be ignored.
Then the time derivative in the Saxwell equation (A.116)
cannot be ignored. In that case the left hand side in the equation is
the complete unsteady Klein-Gordon equation. Since there is a nonzero
right-hand side, mathematically the Saxwell equation is an
inhomogeneous Klein-Gordon equation. Now it is known from the theory
of partial differential equations that the Klein-Gordon equation
respects the speed of light. As an example, imagine that at time
= 0 you briefly shake the spoton at the origin and then put it back
where it was. The right hand side of the Saxwell equation is then
again back to what it was. But near the origin, the foton field
will now contain additional disturbances. These
disturbances evolve according to the homogeneous Saxwell equation,
i.e. the equation with zero right hand side. And it is easy to check
by substitution that the homogeneous equation has solutions of the
form

That are waves traveling in the x-direction with the speed of light . The wave shape is the arbitrary function and is preserved in time. And note that the -direction is arbitrary. So waves like this can travel in any direction. The perturbations near the origin caused by shaking the spoton will consist of such waves. Since they travel with the speed of light, they need some time to reach the selecton. The selecton will not notice anything until this happens. However, when the perturbations in the foton field do reach the selecton, they will change the foton field at the selecton. That then will change the force (A.117) on the selecton.

It follows that selectodynamics, as described by the Lagrangian (A.114), also respects the speed of light limitation.

A.22.3 Quantum selectostatics

The previous subsections derived the Koulomb force between sarged
particles. This force was due to foton exchange. While the
derivations used some ideas from quantum mechanics, they were
classical. The effect of the fotons took the form of a potential
that the sarged particles interacted with. This potential
was a classical field; it had a definite numerical value at each
point. To be picky, there really was an undetermined constant in the
potential . But its gradient produced
the fully determined Sorentz force per unit sarge (A.117).
This force can be observed

by a sarged spoton or
selecton.

However, that very fact violates the fundamental postulates of quantum mechanics as formulated at the beginning of this book, chapter 3.4. Observable values should be the eigenvalues of Hermitian operators that act on wave functions. While the foton potential was loosely associated with a foton wave function, wave functions should not be observable.

Now if classically every position has its own observable local
potential , then in a proper quantum description every
position must be associated with its own Hermitian operator
. In the terminology of addendum
{A.15.9}, the foton field must be a
quantum field;

an infinite amount of operators, one
for each position.

The objective in this subsection is to deduce the form of this quantum field. And the type of wave function that it operates on. The results will then be used to verify the Koulomb force between stationary sarges as found the first subsection. It is imperative to figure out whether like sarges still attract in a proper quantum description.

Doing this directly would not be easy. It helps a lot if the field is written in terms of linear momentum eigenstates.

In fact, typical quantum field theories depend very heavily on this trick. However, often such theories use relativistic combined energy-momentum states in four-dimensional space-time. This subsection will use simpler purely spatial momentum states. The basic idea is the same. And it is essential for understanding the later Fermi derivation of the Coulomb potential.

Linear momentum states are complex exponentials of the form . Here is a constant vector called the wave number vector. The momentum of such a state is given in terms of the wave number vector by the de Broglie relation as . (It may be noted that the states need an additional constant to properly normalize them, chapter 6.18. But for conciseness, in this addendum that normalization constant will be absorbed in the constants multiplying the exponentials.)

If a field is written in terms of linear momentum states,
its value at any point is given by:

Note that if you know the coefficients of the momentum states, it is equivalent to knowing the field . Then the field at any point can in principle be found by doing the sum.

The expression above assumes that the entire system is confined to a very large periodic box, as in chapter 6.17. In infinite space the sum becomes an integral, section 7.9. That would be much more messy. (But that is the way you will usually find it in a typical quantum field analysis.) The precise values of the wave number vectors to sum over for a given periodic box were given in chapter 6.18 (6.28); they are all points in figure 6.17.

The first subsection found the selectostatic potential that was produced by a spoton, (A.107). This potential was a classical field; it had a definite numerical value for each position. The first step will be to see how this potential looks in terms of momentum states. While the final objective is to rederive the classical potential using proper quantum mechanics, the correct answer will need to be recognized when written in terms of momentum states. Not to mention that the answer will reappear in the discussion of the Coulomb potential. For simplicity it will be assumed that the spoton is at the origin.

According to the first subsection, the classical potential was the
solution to a Poisson equation; a steady Klein-Gordon equation with
forcing by the spoton:

As a reminder that is a classical potential, not a quantum one, a subscript

clhas been added. Also note that since this is a now a quantum description, the spoton sarge density has been identified as the spoton sarge times the square magnitude of the spoton wave function .

Now the classical potential is to be written in the form

To figure out the coefficients , plug it in the Poisson equation above. That gives

Note that in the left hand side each produced a factor for total.

Now multiply this equation at both sides by some sample
complex-conjugate momentum eigenfunction and
integrate over the entire volume of the periodic box. In the
left hand side, you only get something nonzero for the term in the sum
where because eigenfunctions are orthogonal. For that
term, the exponentials multiply to 1. So the result is

Now in the right hand side, assume again that the spoton is almost exactly at the origin. In other words, assume that its wave function is zero except very close to the origin. In that case, the exponential in the integral is approximately 1 when the spoton wave function is not zero. Also, the square wave function integrates to 1. So the result is, after clean up,

This expression applies for any wave number vector , so you can leave the underline away. It fully determines in terms of the momentum states:

This solution is definitely one to remember. Note in particular that the coefficients of the momentum states are a constant divided by . Recall also that for a unit value of , this solution is the fundamental solution, or Green’s function, of the Poisson equation with point wise forcing at the origin.

If the requirement that the spoton wave function is completely at the
origin is relaxed, the integral involving the spoton wave function
stays:

Note that the integration variable over the spoton wave function has been renamed to avoid confusion with the position at which the potential is evaluated. The above result is really better to work with in this subsection, since it does not suffer from some convergence issues that the Green’s function solution has. And it is exact for a spoton wave function that is somewhat spread out.

Now the objective is to reproduce this classical result using a proper quantum field theory. And to find the force when a selecton is added to the system.

To do so, consider initially a system of fotons and a single spoton. The spoton will be treated as a nonrelativistic particle. Then its wave function describes exactly one spoton. The spoton wave function will be treated as given. Imagine something keeping the spoton in a ground state squeezed around the origin. Maxwell’s demon would work. He has not been doing much anyway after he failed his thermo test.

Next the fotons. Their description will be done based upon linear momentum states. Such a state corresponds to a single-foton wave function of the form .

To keep it simple, for now only a single momentum state will be considered. In other words, only a single wave number vector will be considered. But there might be multiple fotons in the state, or even uncertainty in the number of fotons.

Of course, at the end of the day the results must still be summed over all values of .

Some notations are needed now. A situation in which there are no
fotons in the considered state will be indicated by the “Fock
space ket” . If there is one foton in the state,
it is indicated by , two by , etcetera. In
the mathematics of quantum field theory, kets are taken to be
orthonormal, {A.15}:

The ground state wave function for the combined spoton-fotons system
is then assumed to be of the form

(It may be noted that in typical quantum field theories, a charged
relativistic particle would also be described in terms of kets and
some quantum field . However, unlike for a
photon, for a charged particle would normally be a
complex quantum field. Then or something
along these lines provides a real probability for a photon to
observe

the particle. That resembles the Born
interpretation of the nonrelativistic wave function somewhat,
especially for a spinless particle. Compare [[17, pp. 49, 136, 144]]. The field
will describe both the particle and its oppositely charged
antiparticle. The spoton wave function as used here
represents some nonrelativistic limit in which the antiparticle has
been approximated away from the field, [[17, pp. 41-45]].
Such a nonrelativistic limit simply does not exist for a real scalar
field like the Koulomb one.)

Now, of course, the Hamiltonian is needed. The Hamiltonian determines
the energy. It consists of three parts:

The first part is the Hamiltonian for the spoton in isolation. It consists of the kinetic energy of the spoton, as well as the potential provided by the fingers of the demon. By definition

where is the energy of the spoton in isolation.

The second part is the Hamiltonian of the free foton field. Each
foton in the considered state should have an energy with
. That is the energy that you get if you
substitute the momentum eigenfunction into the
Klein-Gordon eigenvalue problem (A.101) for a massless
particle. And if one foton has an energy , then
of them should have energy , so

Finally, the third part of the total Hamiltonian is the interaction between the spoton and the foton field. This is the tricky one. First recall the classical expression for the interaction energy. According to the previous subsection, (A.111), it was . Here was the classical foton potential, evaluated at the position of the spoton.

In quantum field theory, the observable field gets replaced
by a quantum field . The interaction
Hamiltonian then becomes

(A.124) |

To answer that, first note that sarged particles can create and destroy fotons. The above interaction Hamiltonian must express that somehow. After all, it is the Hamiltonian that determines the time evolution of systems in quantum mechanics.

Now in quantum field theories, creation and destruction of particles
are accounted for through creation and annihilation operators,
{A.15}. A creation operator creates a single
particle in a momentum state . An
annihilation operator annihilates a single particle from such
a state. More precisely, the operators are defined as

Note incidentally that the foton field Hamiltonian given earlier can
now be rewritten as

In general this Hamiltonian will still need to be summed over all values of .

But surely, the creation and annihilation of particles should also
depend on where the spoton is. Fotons in the considered state have a
spatially varying wave function. That should be reflected in the
quantum field somehow. To find the correct
expression, it is easiest to first perform a suitable normalization of
the foton state. Now the full wave function corresponding to the
single-foton momentum eigenstate in empty space is

Here is some normalization constant to be chosen. The above wave function can be verified by putting it into the Klein-Gordon equation (A.102). The energy of the foton is given in terms of its wave function above as . But the energy in the foton field is also related to the observable field ; classical selectostatics gives that relation as (A.103). If you plug the foton wave function above into that classical expression, you do not normally get the correct energy . There is no need for it; the foton wave function is not observable. However, it makes things simpler if you choose so that the classical energy does equal . That gives a energy-normalized wave function

In those terms, the needed quantum field turns out to be

You might of course wonder about that second term. Mathematically it is needed to make the operator Hermitian. Recall that operators in quantum mechanics need to be Hermitian to ensure that observable quantities have real, rather than complex values, chapter 2.6. To check whether an operator is Hermitian, you need to check that it is unchanged when you take it to the other side of an inner product. Now the wave function is a numerical quantity that changes into its complex conjugate when taken to the other side. And changes into and vice-versa when taken to the other side, {A.15.2}. So each term in changes into the other one, leaving the sum unchanged. So the operator as shown is indeed Hermitian.

But what to make physically of the two terms? One way of thinking about it is that the observed field is real because it does not just involve an interaction with an foton, but also with an antifoton.

In general, the quantum field above would still need to be summed over all wave numbers . (Or integrated over in infinite space). It may be noted that for given the sum of the creation operator terms over all can be understood as a field operator that creates a particle at position , [34, p. 24]. That is a slightly different definition of the creation field operator than given in {A.15.9}, [42, pp. 22]. But for nonrelativistic particles (which have nonzero rest mass) it would not make a difference.

With the quantum field now identified, the Hamiltonian of
the spoton-fotons interaction becomes finally

Note that the spoton has uncertainty in position. The spoton position in the Hamiltonian above is just a possible spoton position. In usage it will still get multiplied by the square spoton wave function magnitude that gives the probability for that position. Still, at face value the interaction of the spoton with the field takes place at the location of the spoton. Interactions in quantum field theories are “local.” At least on macroscopic scales that is needed to satisfy the limitation of the speed of light.

Having a Hamiltonian allows quantum selectodynamics to be explored. That will be done to some detail for the case of the electromagnetic field in subsequent addenda. However, here the only question that will be addressed is whether classical selectostatics as described in the first subsection was correct. In particular, do equal sarges still attract in the quantum description?

Selectostatics of the spoton-fotons system should correspond to the
ground state of the system. The ground state has the lowest possible
energy. You can therefore find the ground state by finding the state
of lowest possible system energy. That is the same trick as was used
to find the ground states of the hydrogen molecular ion and the
hydrogen molecule in chapters 4.6 and 5.2. The
expectation value

of the system energy is defined by
the inner product

Here the Hamiltonians have already been described above.

Now to find the ground state, the lowest possible value of the expectation energy above is needed. To get that, the inner products between the kets in the factors must be multiplied out. First apply the Hamiltonians (A.122), (A.123), and (A.129) on the wave function , (A.121), using (A.125). Then apply the orthogonality relations (A.120) of kets. Do not forget the complex conjugate on the left side of an inner product. That produces

Here the dots stand for terms involving the coefficients of states with three or more fotons.

Note that the first term in the right hand side above is the energy of the spoton by itself. That term is a given constant. The question is what foton states produce the lowest energy for the remaining terms. The answer is easy if the spoton sarge is zero. Then the terms in the last two lines are zero. So the second line shows that must all be zero. Then there are no fotons; only the state with zero fotons is then left in the system wave function (A.121).

If the spoton sarge is nonzero however, the interaction terms in the last two lines can lower the energy for suitable nonzero values of the constants , , .... To simplify matters, it will be assumed that the spoton sarge is nonzero but small. Then so will be the constants. In that case only the terms need to be considered; the other terms in the last two lines involve the product of two small constants, and those cannot compete. Further the normalization condition in (A.121) shows that will be approximately 1 since even is small. Then may be assumed to be 1, because any eigenfunction is indeterminate by a factor of magnitude 1 anyway.

Further, since any complex number may always be written as its
magnitude times some exponential of magnitude 1, the second last
line of the energy above can be written as

Replacing everywhere by gives the corresponding expression for the last line. The complete expression for the energy then becomes

In the last term, note that the sign of inside an absolute value does not make a difference. Using the Euler formula (2.5) on the trailing exponentials gives

But in the ground state, the energy should be minimal. Clearly, that requires that the cosine is at its maximum value 1. So it requires taking so that .

That still leaves the magnitude to be found. Note that the
final terms in expression above are now of the generic form

where and are positive constants. That is a quadratic function of . By differentiation, it is seen that the minimum occurs at and has a value . Putting in what and are then gives

The second part of the energy is the energy lowering achieved by having a small probability of a single foton in the considered momentum state.

This energy-lowering still needs to be summed over all states to
get the total:

Note that the final two inner products represent separate integrations. Therefore to avoid confusion, the subscript p was dropped from one integration variable. In the sum, the classical field (A.119) can now be recognized:

Ignoring the differences in notation, the energy lowering is exactly the same as (A.108) found in the classical analysis. The classical analysis, while not really justified, did give the right answer.

However now an actual picture of the quantum ground state has been obtained. It is a quantum superposition of system states. The most likely state is the one where there are no fotons at all. But there are also small probabilities for system states where there is a single foton in a single linear momentum foton state. This picture does assume that the spoton sarge is small. If that was not true, things would get much more difficult.

Another question is whether the observable values of the foton potential are the same as those obtained in the classical analysis. This is actually a trick question because even the classical foton potential is not observable. There is still an undetermined constant in it. What is observable are the derivatives of the potential: they give the observable selectic force per unit sarge on sarged particles.

Now, in terms of momentum modes, the derivatives of the classical
potential can be found by differentiating (A.119). That gives

Recall again the convention introduced in the previous subsection that a subscript on indicates the derivative , where , , and correspond to , , and respectively. So the above expression gives the selectic force per unit sarge in the , , or direction, depending on whether is 1, 2, or 3.

The question is now whether the quantum analysis predicts the same observable forces. Unfortunately, the answer here is no. The observable forces have quantum uncertainty that the classical analysis missed. However, the Ehrenfest theorem of chapter 7.2.1 suggests that the expectation forces should still match the classical ones above.

The quantum expectation force per unit sarge in the
-direction is given by

Here is the ground state energy. Note that in this case the full, time-dependent wave function is used. That is done because in principle an observed field could vary in time as well as in space. Substituting in the -derivative of the quantum field (A.128) gives

Note that here is not a possible position of the spoton, but a given position at which the selectic force per unit sarge is to be found. Also note that the time dependent exponentials have dropped out against each other; the expectation forces are steady like for the classical field.

The above expression can be multiplied out as before. Using the
obtained expression for , and the fact that
1 because wave
functions are normalized, that gives.

Summed over all , the two terms in the right hand side produce the same result, because opposite values of appear equally in the summation. In other words, for every term in the first sum, there is a term in the second sum that produces the same value. And that then means that the expectation selectic forces are the same as the classical ones. The classical analysis got that right, too.

To see that there really is quantum uncertainty in the forces, it
suffices to look at the expectation square forces. If there
was no uncertainty in the forces, the expectation square forces would
be just the square of the expectation forces. To see that that is not
true, it is sufficient to simply take the spoton sarge zero. Then
the expectation field is zero too. But the expectation square field
is given by

Multiplying this out gives

Since Planck’s constant is not zero, this is not zero either. So even without the spoton, a selectic force measurement will give a random, but nonzero value. The average of a large number of such force measurements will be zero, but not the individual measurements.

The above expression can be compared with the corresponding
of a single foton, as given by (A.127).
That comparison indicates that even in the ground state in empty
space, there is still half a foton of random field energy left.
Recall now the Hamiltonian (A.126) for the foton field.
Usually, this Hamiltonian would be defined as

The additional expresses the half foton of energy left in the ground state. The ground state energy does not change the dynamics. However, it is physically reflected in random nonzero values if the selectic field is measured in vacuum.

The bad news is that if you sum these ground state energies over all values of , you get infinite energy. The exact same thing happens for the photons of the electromagnetic field. Quantum field theories are plagued by infinite results; this “vacuum energy” is just a simple example. What it really means physically is as yet not known either. More on this can be found in {A.23.4}.

The final issue to be addressed is the attraction between a spoton and a selecton. That can be answered by simply adding the selecton to the spoton-fotons analysis above, {D.37.2}. The answer is that the spoton-selecton interaction energy is the same as found in the classical analysis.

So equal sarges still attract.

A.22.4 Poincaré and Einstein try to save the universe

The Koulomb universe is a grim place. In selectodynamics, particles with the same sarge attract. So all selectons clump together into one gigantic ball. Assuming that spotons have the opposite sarge, they clump together into another big ball at the other end of the universe. But actually there is no justification to assume that spotons would have a different sarge from selectons. That then means that all matter clumps together into a single gigantic satom. A satom like that will form one gigantic, obscene, black hole. It is hardly conductive to the development of life as we know it.

Unfortunately, the Koulomb force is based on highly plausible, apparently pretty unavoidable assumptions. The resulting force simply makes sense. None of these things can be said about the Coulomb force.

But maybe, just maybe, the Koulomb juggernaut can be tripped up by some legal technicality. Things like that have happened before.

Now in a time not really that very long ago, there lived a
revolutionary of mathematics called Poincaré. Poincaré dreamt of
countless shining stars that would sweep through a gigantic, otherwise
dark universe. And around these stars there would be planets
populated by living beings called observers.

But if
the stars all moved in random directions, with random speeds, then
which star would be the one at rest? Which star would be the king
around which the other stars danced? Poincaré thought long and hard
about that problem. No!

he thundered eventually;
“It shall not be. I hereby proclaim that all stars are created
equal. Any observer at any star can say at any given time that its
star is at rest and that the other stars are moving. On penalty of
dead, nothing in physics may indicate that observer to be
wrong.”

Now nearby lived a young physicist called Einstein who was very lazy. For example, he almost never bothered to write the proper summation symbols in his formulae. Of course, that made it difficult for him to find a well paying job in some laboratory where they smash spotons into each other. Einstein ended up working in some patent office for little pay. But, fortunate for our story, working in a patent office did give Einstein a fine insight in legal technicalities.

First Einstein noted that the Proclamation of Poincaré meant that observers at different stars had to disagree seriously about the locations and times of events. However, it would not be complete chaos. The locations and times of events as perceived by different observers would still be related. The relation would be a transformation that the famous physicist Lorentz had written down earlier, chapter 1.2.1 (1.6).

And the Proclamation of Poincaré also implied that different observers had to agree about the same laws of physics. So the laws of physics should remain the same when you change them from one observer to the next using the Lorentz transformation. Nowadays we would say that the laws of physics should be “Lorentz invariant.” But at the time, Einstein did not want to use the name of Lorentz in vain.

Recall now the classical action principle

of
subsection A.22.2. The so-called action integral had to be
unchanged under small deviations from the correct physics. The
Proclamation of Poincaré demands that all observers must agree that
the action is unchanged. If the action is unchanged for an observer
at one star, but not for one at another star, then not all stars are
created equal.

To see what that means requires a few fundamental facts about special relativity, the theory of systems in relative motion.

The Lorentz transformation badly mixes up spatial positions and times
of events as seen by different observers. To deal with that
efficiently, it is convenient to combine the three spatial coordinates
and time into a single four-dimensional vector, a four-vector, chapter
1.2.4. Time becomes the “zeroth
coordinate” that joins the three spatial coordinates. In
various notations, the four-vector looks like

First of all, note that the zeroth coordinate receives an additional factor , the speed of light. That is to ensure that it has units of length just like the other components. It has already been noted before that the spatial coordinates , , and are indicated by , , and in this addendum. That allows a generic component to be indicated by for 1, 2, or 3. Note also that curly brackets are a standard mathematical way of indicating a set or collection. So stands for the set of all three values; in other words, it stands for the complete position vector . That is the primary notation that will be used in this addendum.

However, in virtually any quantum field book, you will find four-vectors indicated by . Here is an index that can have the values 0, 1, 2, or 3. (Except that some books make time the fourth component instead of the zeroth.) An by itself probably really means , in other words, the complete four-vector. Physicists have trouble typing curly brackets, so they leave them away. When more than one index is needed, another Greek symbol will be used, like . However, would stand for just the spatial components, so for the position vector . The give-away is here that a roman superscript is used. Roman superscript would mean the same thing as ; the spatial components only.

There are similar notations for the derivatives of a function
:

Note again that time derivatives in this addendum are indicated by a subscript and spatial derivatives by a subscript for 1, 2, or 3.

Quantum field books use for derivatives. They still have problems with typing curly brackets, so by itself probably means the set of all four derivatives. Similarly would probably mean the spatial gradient .

The final key fact to remember about special relativity is:

Dot products between four-vectors are very important because all observers agree about the values of these dot products. They are Lorentz invariant. (In nonrelativistic mechanics, all observers agree about the usual dot products between spatial vectors. That is no longer true at relativistic speeds.)In dot products between four-vectors, the product of the zeroth components gets a minus sign.

One warning. In almost all modern quantum field books, the products
of the spatial components get the minus sign instead of the
time components. The purpose is to make the relativistic dot product
incompatible with the nonrelativistic one. After all,
backward compatibility

is so, well, backward. (One
source that does use the compatible dot product is [48].
This is a truly excellent book written by a Nobel Prize winning
pioneer in quantum field theory. It may well be the best book on the
subject available. Unfortunately it is also very mathematical and the
entire thing spans three volumes. Then again, you could certainly
live without supersymmetry.)

One other convention might be mentioned. Some books put a factor in the zeroth components of four-vectors. That takes care of the minus sign in dot products automatically. But modern quantum field books do not this.

Armed with this knowledge about special relativity, the Koulomb force
can now be checked. Action is defined as

Here the time range from to should be chosen to include the times of interest. Further is the so-called Lagrangian.

If all observers agree about the value of the action in
selectodynamics, then selectodynamics is Lorentz invariant. Now the
Lagrangian of classical selectodynamics was of the form, subsection
A.22.2,

Here the Lagrangian density of the foton field was

To this very day, a summation symbol may not be used to reveal to nonphysicists that the last term needs to be summed over all three values of . That is in honor of the lazy young physicist, who tried to save the universe.

Note that the parenthetical term in the Lagrangian density is simply the square of the four-vector of derivatives of . Indeed, the relativistic dot product puts the minus sign in front of the product of the time derivatives. Since all observers agree about dot products, they all agree about the values of the Lagrangian density. It is Lorentz invariant.

To be sure, it is the action and not the Lagrangian density that must be Lorentz invariant. But note that in the action, the Lagrangian density gets integrated over both space and time. Such integrals are the same for any two observers. You can easily check that from the Lorentz transformation chapter 1.2.1 (1.6) by computing the Jacobian of the integration between observers.

(OK, the limits of integration are not really the same for different observers. One simple way to get around that is to assume that the field vanishes at large negative and positive times. Then you can integrate over all space-time. A more sophisticated argument can be given based on the derivation of the action principle {A.1.5}. From that derivation it can be seen that it suffices to consider small deviations from the correct physics that are localized in both space and time. It implies that the limits of integration in the action integral are physically irrelevant.)

(Note that this subsection does no longer mention periodic boxes. In relativity periodicity is not independent of the observer, so the current arguments really need to be done in infinite space.)

The bottom line is that the first, integral, term in the Lagrangian produces a Lorentz-invariant action. The second term in the Lagrangian is the nonrelativistic kinetic energy of the spoton. Obviously the action produced by this term will not be Lorentz invariant. But you can easily fix that up by substituting the corresponding relativistic term as given in chapter 1.3.2. So the lack of Lorentz invariance of this term will simply be ignored in this addendum. If you want, you can consider the spoton mass to be the moving mass in the resulting equations of motion.

The final term in the Lagrangian is the problem. It represents the spoton-fotons interaction. The term by itself would be Lorentz invariant, but it gets integrated with respect to time. Now in relativity time intervals are not the same for different observers. So the action for this term is not Lorentz invariant. Selectodynamics cannot be correct. The Koulomb juggernaut has been stopped by a small legal technicality.

(To be sure, any good lawyer would have pointed out that there is no problem if the spoton sarge density, instead of the spoton sarge , is the same for different observers. But the Koulomb force was so sure about its invincibility that it never bothered to seek competent legal counsel.)

The question is now of course how to fix this up. That will hopefully produce a more appealing universe. One in which particles like protons and electrons have charges rather than sarges . Where these charges allow them to interact with the photons of the electromagnetic field. And where these photons assure that particles with like charges repel, rather than attract.

Consider the form of the problem term in the Koulomb action:

It seems logical to try to write this in relativistic terms, like

Here is the zeroth component of the change in spoton four-vector . The product of the two parenthetical factors is definitely not Lorentz invariant. But suppose that you turn each of the factors into a complete four-vector? Dot products are Lorentz invariant. And the four-vector corresponding to is clearly .

But the photon potential must also become a four-vector, instead of a
scalar. That is what it takes to achieve Lorentz invariance. So
electrodynamics defines a four-vector of potentials of the form

Here is the so-called

vector potentialwhile is now the electrostatic potential.

The interaction term in the action now becomes, replacing the spoton
sarge by minus the proton charge ,

In writing out the dot product, note that the spatial components of are simply the proton velocity components . That gives the interaction term in the action as

Once again nonphysicists may not be told that the second term in parentheses must be summed over all three values of since appears twice.

The integrand above is the interaction term in the electromagnetic
Lagrangian,

For now at least.

The Lagrangian density of the photon field is also needed. Since
the photon field is a four-vector rather than a scalar, the
self-evident electromagnetic density is

Here the constant is called the “permittivity of space.” Note that the second term in parentheses must be summed over both and . The curious sign pattern for the parenthetical terms arises because it involves two dot products: one from the square four-gradient (derivatives), and one from the square four-potential. Simply put, having electrostatic potentials is worth a minus sign, and having time derivatives is too.

It might be noted that in principle the proper Lagrangian density could be minus the above expression. But a minus sign in a Lagrangian does not change the motion. The convention is to choose the sign so that the corresponding Hamiltonian describes energies that can be increased by arbitrarily large amounts, not lowered by arbitrarily large amounts. Particles can have unlimited amounts of positive kinetic energy, not negative kinetic energy.

Still, it does seem worrisome that the proper sign of the Lagrangian density is not self-evident. But that issue will have to wait until the next subsection.

Collecting things together, the self-evident Lagrangian for
electromagnetic field plus proton is

Here was given above.

The first thing to check now is the equation of motion for the proton.
Following subsection A.22.2, it can be found from

Substituting in the Lagrangian above gives

This can be cleaned up, {D.6}. In short, first an
electric” field and a “magnetic

field are defined as, in vector notation,

Here 1, 2, or 3 corresponds to the , , or components respectively. Also follows in the periodic sequence and precedes . In these terms, the simplified equation of motion of the proton becomes, in vector notation,

The left hand side is mass times acceleration. Relativistically speaking, the mass should really be the moving mass here, but OK. The right hand side is known as the

Lorentz force.

Note that there are 4 potentials with 4 derivatives each, for a
total of 16 derivatives. But matter does not observe all 16
individually. Only the 3 components of the electric field and the 3
of the magnetic field are actually observed. That suggests that there
may be changes to the fields that can be made that are not observable.
Such changes are called gage (or gauge) changes.

The
name arises from the fact that a gage is a measuring device. You and
I would then of course say that these changes should be called
nongage changes. They are not measurable. But
gage

is really shorthand for “Take that, you
stupid gage.”

Consider the most general form of such gage changes. Given potentials
and , equivalent potentials can be created as

Here can be any function of space and time that you want.

The potentials and give the exact same electric and magnetic fields as and . (These claims are easily checked using a bit of vector calculus. Use Stokes to show that they are the most general changes possible.)

The fact that you can make unmeasurable changes to the potentials like
that is called the gage

(or gauge) property of the
electromagnetic field. Nonphysicists probably think it is something
you read off from a voltage gage. Hilarious, isn’t it?

Observable or not, the evolution equations of the four potentials are
also needed. To find them it is convenient to spread the proton
charge out a bit. That is the same trick as was used in subsection
A.22.2. For the spread-out charge, a “charge
density” can be defined as the charge per unit
volume. It is also convenient to define a “current
density” as the charge density times its
velocity. Then the proton-photons interaction terms in the Lagrangian
are:

The interaction terms can now be included in the Lagrangian density to
give the total Lagrangian

If there are more charged particles than just a proton, their charge and current densities will combine into a net and .

The field equations now follow similarly as in subsection
A.22.2. For the electrostatic potential:

where is the combined Lagrangian density. Worked out and converted to vector notation, that gives

This is the same equation as for the Koulomb potential earlier.

Similarly, for the components of the vector potential

That gives

The above equations are again Klein-Gordon equations, so they respect the speed of light. And since the action is now Lorentz invariant, all observers agree with the evolution. That seems very encouraging.

Consider now the steady case, with no charge motion. The current density is then zero. That leads to zero vector potentials. Then there is no magnetic field either, (A.130).

The steady equation (A.135) for the electrostatic field is exactly the same as the one for the Koulomb potential. But note that the electric force per unit charge is now minus the gradient of the electrostatic potential, (A.130) and (A.132). And that means that like charges repel, not attract. All protons in the universe no longer clump together into one big ball. And neither do electrons. That sounds great.

But wait a second. How come that apparently protons suddenly manage to create fields that are repulsive to protons? What happened to energy minimization? It seems that all is not yet well in the universe.

A.22.5 Lorenz saves the universe

The previous subsection derived the self-evident equations of electromagnetics. But there were some worrisome aspects. A look at the Hamiltonian can clarify the problem.

Given the Lagrangian (A.134) of the previous subsection, the
Hamiltonian can be found as, {A.1.5}:

That gives

(This would normally still need to be rewritten in terms of canonical momenta, but that is not important here.)

Note that the electrostatic potential produces negative electromagnetic energy. That means that the electromagnetic energy can have arbitrarily large negative values for large enough .

That then answers the question of the previous subsection: “How come a proton produces an electrostatic field that repels it? What happened to energy minimization?” There is no such thing as energy minimization here. If there is no lowest energy, then there is no ground state. Instead the universe should evolve towards larger and larger electrostatic fields. That would release infinite amounts of energy. It should blow life as we know it to smithereens. (The so-called second law of thermodynamics says, simply put, that thermal energy is easier to put into particles than to take out again. See chapter 11.)

In fact, the Koulomb force would also produce repulsion between equal sarges, if its field energy was negative instead of positive. Just change the sign of the constant in classical selectodynamics. Then its universe should blow up too. Unlike what you will read elsewhere, the difference between the Koulomb force, (or its more widely known sibling, the Yukawa force of {A.41}), and the Coulomb force is not simply that the photon wave function is a four-vector. It is whether negative field energy appears in the most straightforward formulation.

As the previous subsection noted, you might assume that the electrodynamic Lagrangian, and hence the Hamiltonian, would have the opposite sign. But that does not help. In that case the vector potentials would produce the negative energies. Reversing the sign of the Hamiltonian is like reversing the direction of time. In either direction, the universe gets blown to smithereens.

To be sure, it is not completely sure that the universe will be blown to smithereens. A negative field energy only says that it is in theory possible to extract limitless amounts of energy out of the field. But you would still need some actual mechanism to do so. There might not be one. Nature might be carefully constrained so that there is no dynamic mechanism to extract the energy.

In that case, there might then be some mathematical expression for the constraint. As another way to look at that, suppose that you would indeed have a highly unstable system. And suppose that there is still something recognizable left at the end of the first day. Then surely you would expect that whatever is left is special in some way. That it obeys some special mathematical condition.

So presumably, the electromagnetic field that we observe obeys some
special condition, some constraint. What could this constraint be?
Since this is very basic physics, you would guess it to be relatively
simple. Certainly it must be Lorentz invariant. The simplest
condition that meets this requirement is that the dot product of the
four-gradient with the four-potential is zero. Written
out that produces the so-called “Lorenz condition:”

Please note that the Lorenz condition is named after the Danish physicist Ludvig Lorenz, not the Dutch physicist Hendrik Lorentz. Almost all my sources mislabel it the Lorentz condition. The savior of the universe deserves more respect. Always remember: the Lorenz condition is Lorentz invariant.

(You might wonder why the first term in the Lorenz condition does not
have the minus sign of dot products. One way of thinking about it is
that the four-gradient in its natural

condition
already has a minus sign on the time derivative. Physicists call it a
covariant

four-vector rather than a
contravariant

one. A better way to see it is to grind
it out; you can use the Lorentz transform (1.6) of chapter
1.2.1 to show directly that the above form is the same
for different observers. But those familiar with index notation will
recognize immediately that the Lorenz condition is Lorentz invariant
from the fact that it equals 0, and that has
as both subscript and superscript. See chapter
1.2.5 for more.)

To be sure, the Lorenz condition can only be true if the interaction
with matter does not produce violations. To check that, the evolution
equation for the Lorenz condition quantity can be obtained from the
Klein-Gordon equations of the previous subsection. In particular, in
vector notation take (A.135) plus
(A.136) to get

This important result is known as “Maxwell’s continuity equation.” It expresses conservation of charge. (To see that, take any arbitrary volume. Integrate both sides of the continuity equation over that volume. The left hand side then becomes the time derivative of the charge inside the volume. The right hand side becomes, using the [divergence] [Gauss] [Ostrogradsky] theorem, the net inflow of charge. And if the charge inside can only change due to inflow or outflow, then no charge can be created out of nothing or destroyed.) So charge conservation can be seen as a consequence of the need to maintain the Lorenz condition.

Note that the Lorenz condition (A.138) looks mathematically just like the continuity equation. It produces conservation of the integrated electrostatic potential. In subsection A.22.7 it will be verified that it is indeed enough to produce a stable electromagnetic field. One with meaningfully defined energies that do not run off to minus infinity.

Note that charge conservation by itself is not quite enough to ensure that the Lorenz condition is satisfied. However, if in addition the Lorenz quantity and its time derivative are zero at just a single time, it is OK. Then (A.139) ensures that the Lorenz condition remains true for all time.

A.22.6 Gupta-Bleuler condition

The ideas of the previous subsection provide one way to quantize the electromagnetic field, [[17, 6]].

As already seen in subsection A.22.3 (A.128), in
quantum field theory the potentials become quantum fields,
i.e. operator fields. For electromagnetics the quantum field
four-vector is a bit more messy

Since a four-vector has four components, a general four-vector can be written as a linear combination of four chosen basis four-vectors , , , and . (That is much like a general vector in three dimensions can be written as a linear combination of , , and .) The four basis vectors physically represent different possible

polarizationsof the electromagnetic field. That is why they are typically aligned with the momentum of the wave rather than with some Cartesian axis system and its time axis. Note that each polarization vector has its own annihilation operator and creation operator . These annihilate respectively create photons with that wave number vector and polarization.

(Electromagnetic waves in empty space are special; for them only two independent polarizations are possible. Or to be precise, even in empty space the Klein-Gordon equations with Lorenz condition allow a third polarization. But these waves produce no electric and magnetic fields and contain no net electromagnetic energy. So they are physically irrelevant. You can call them “gage equivalent to the vacuum.” That sounds better than irrelevant.)

The Lorenz condition of the previous subsection is again needed to get rid of negative energy states. The question is now exactly how to phrase the Lorenz condition in quantum terms.

(There is an epidemic among my, highly authorative, sources that come up with negative norm states without Lorenz condition. Now the present author himself is far from an expert on quantum field theories. But he knows one thing: norms cannot be negative. If you come up with negative norms, it tells you nothing about the physics. It tells you that you are doing the mahematics wrong. I believe the correct argument goes something like this: “Suppose that we can do our usual stupid canonical quantization tricks for this system. Blah Blah. That gives negative norm states. Norms cannot be negative. Ergo: we cannot do our usual stupid canonical quantization tricks for this system.” If you properly define the creation and annihilation operators to put photons in negative energy states, there is no mathematical problem. The commutator relation for the negative energy states picks up a minus sign and the norms are positive as they should. Now the mathematics is sound and you can start worrying about problems in the physics. Like that there are negative energy states. And maybe lack of Lorentz invariance, although the original system is Lorentz invariant, and I do not see what would not be Lorentz invariant about putting particles in the negative energy states.)

The simplest idea would be to require that the quantum field above satisfies the Lorenz condition. But the quantum field determines the dynamics. Like in the classical case, you do not want to change the dynamics. Instead you want to throw certain solutions away. That means that you want to throw certain wave functions away.

The strict condition would be to require (in the Heisenberg picture
{A.12})

for all physically observable states . Here the parenthetical expression is the operator of the Lorenz quantity that must be zero. The above requirement makes an eigenvector of the Lorenz quantity with eigenvalue zero. Then according to the rules of quantum mechanics, chapter 3.4, the only measurable value of the Lorenz quantity is zero.

But the above strict condition is too restrictive. Not even the vacuum
state with no photons would be physically observable. That is because
the creation operators in and will create
nonzero photon states when applied on the vacuum state. That suggests
that only the annihilation terms should be included. That then gives
the “Gupta-Bleuler condition:”

for physically observable states . Here the superscript on the quantum fields means that only the annihilation operator terms are included.

You might of course wonder why the annihilation terms are indicated by
a plus sign, instead of the creation terms. After all, it are the
creation operators that create more photons. But the plus sign
actually stands for the fact that the annihilation terms are
associated with an time dependence instead of
. Yes true, has a minus
sign, not a plus sign. But has the normal sign,
and normal

is represented by a plus sign. Is not
addition more normal than subtraction? Please do not pull at your
hair like that, there are less drastic ways to save on professional
hair care.

Simply dropping the creation terms may seem completely arbitrary. But
it actually has some physical logic to it. Consider the inner product

This is the amount of state produced by applying the Lorenz quantity on the physically observable state . The strict condition is equivalent to saying that this inner product must always be zero; no amount of any state may be produced. For the Gupta-Bleuler condition, the above inner product remains zero if is a physically observable state. (The reason is that the creation terms can be taken to the other side of the inner product as annihilation terms. Then they produce zero if is physically observable.) So the Gupta-Bleuler condition implies that no amount of any physically observable state may be produced by the Lorenz quantity.

There are other ways to do quantization of the electromagnetic field. The quantization following Fermi, as discussed in subsection A.22.8, can be converted into a modern quantum field theory. But that turns out to be a very messy process indeed, [[17, 6]]. The derivation is essentially to mess around at length until you more or less prove that you can use the Lorenz condition result instead. You might as well start there.

It does turns out that the so-called path-integral

formulation of quantum mechanics does a very nice job here,
[52, pp. 30ff]. It avoids many of the contortions of
canonical quantization like the ones above.

In fact, a popular quantum field textbook, [34, p. 79], refuses to do canonical quantization of the electromagnetic field at all, calling it an awkward subject. This book is typically used during the second year of graduate study in physics, so it is not that its readers are unsophisticated.

A.22.7 The conventional Lagrangian

Returning to the classical electromagnetic field, it still needs to be examined whether the Lorenz condition has made the universe safe for life as we know it.

The answer depends on the Lagrangian, because the Lagrangian determines the evolution of a system. So far, the Lagrangian has been written in terms of the four potentials and (with = 1, 2, and 3) of the electromagnetic field. But recall that matter does not observe the four potentials directly. It only notices the electric field and the magnetic field . So it may help to reformulate the Lagrangian in terms of the electric and magnetic fields. Concentrating on the observed fields is likely to show up more clearly what is actually observed.

With a bit of mathematical manipulation, {D.37.3}, the
self-evident electromagnetic Lagrangian density can be written as:

Here the dots stand for terms that do not affect the motion. (Since in the action, Lagrangian densities get integrated over space and time, terms that are pure spatial or time derivatives integrate away. The quantities relevant to the action principle vanish at the limits of integration.)

The term inside the curly brackets is zero according to the Lorenz condition (A.138). Therefore, it too does not affect the motion. (To be precise, the term does not affect the motion because it is squared. By itself it would affect the motion. In the formal way in which the Lagrangian is differentiated, one power is lost.)

The conventional Lagrangian density is found by disregarding the terms
that do not change the motion:

So the conventional Lagrangian density of the electromagnetic field is completely in terms of the observable fields.

As an aside, it might be noted that physicists find the above
expression too intuitive. So you will find it in quantum field books
in relativistic index notation as:

Here the “field strength tensor” is defined by

Note that the indices on each are subscripts instead of superscripts as they should be. That means that you must add a minus sign whenever the index on an is 0. If you do that correctly, you will find that from the 16 values, some are zero, while the rest are components of the electric or magnetic fields. To go from to , you must raise both indices, so add a minus sign for each index that is zero. If you do all that the same Lagrangian density as before results.

Because the conventional Lagrangian density is different from the self-evident one, the field equations (A.135) and (A.136) for the potentials pick up a few additional terms. To find them, repeat the analysis of subsection A.22.4 but use the conventional density above in (A.134). Note that you will need to write the electric and magnetic fields in terms of the potentials using (A.131). (Using the field strength tensor is actually somewhat simpler in converting to the potentials. If you can get all the blasted sign changes right, that is.)

Then the conventional field equations become:

Here is again the charge density and the current density of the charges that are around,

The additional terms in each equation above are the two before the equals signs. Note that these additional terms are zero on account of the Lorenz condition. So they do not change the solution.

The conventional field equations above are obviously more messy than the original ones. Even if you cancel the second order time derivatives in (A.140). However, they do have one advantage. If you use these conventional equations, you do not have to worry about satisfying the Lorenz condition. Any solution to the equations will give you the right electric and magnetic fields and so the right motion of the charged particles.

To be sure, the potentials will be different if you do not satisfy the Lorenz condition. But the potentials have no meaning of their own. At least not in classical electromagnetics.

To verify that the Lorenz condition is no longer needed, first recall
the indeterminacy in the potentials. As subsection A.22.4
discussed, more than one set of potentials can produce the same
electric and magnetic fields. In particular, given potentials
and , you can create equivalent potentials as

Here can be any function of space and time that you want. The potentials and give the exact same electric and magnetic fields as and . Such a transformation of potentials is called a

gage transform.

Now suppose that you have a solution and of the
conventional field equations, but it does not satisfy the Lorenz
condition. In that case, simply apply a gage transform as above to
get new fields and that do satisfy the Lorenz
condition. To do so, write out the Lorenz condition for the new
potentials,

You can always choose the function to make this quantity zero. (Note that that gives an inhomogeneous Klein-Gordon equation for .)

Now it turns out that the new potentials and still satisfy the conventional equations. That can be seen by straight substitution of the expressions for the new potentials in the conventional equations. So the new potentials are perfectly OK: they satisfy both the Lorenz condition and the conventional equations. But the original potentials and produced the exact same electric and magnetic fields. So the original potentials were OK too.

The evolution equation (A.140) for the electrostatic field is
worth a second look. Because of the definition of the electric field
(A.130), it can be written as

Maxwell’s first equation,chapter 13.2. It ties the charge density to the electric field quite rigidly.

Maxwell’s first equation is a consequence of the Lorenz condition. It would not be required for the original Klein-Gordon equations without Lorenz condition. In particular, it is the Lorenz condition that allows the additional two terms in the evolution equation (A.140) for the electrostatic potential. These then eliminate the second order time derivative from the equation. That then turns the equation from a normal evolution equation into a restrictive spatial condition on the electric field.

It may be noted that the other evolution equation (A.141) is Maxwell’s fourth equation. Just rewrite it in terms of the electric and magnetic fields. The other two Maxwell equations follow trivially from the definitions (A.130) of the electric and magnetic fields in terms of the potentials.

Since there is no Lorenz condition for the conventional equations, it becomes interesting to find the corresponding Hamiltonian. That should allow the stability of electromagnetics to be examined more easily.

The Hamiltonian for electromagnetic field plus a proton may be found
the same way as (A.137) in subsection A.22.5,
{A.1.5}. Just use the conventional Lagrangian
density instead. That gives

But the proton charge density may eliminated using Maxwell’s first equation above. An additional integration by parts of that term then causes it to drop away against the previous term. That gives the conventional energy as

The first term is the energy in the observable fields and the final term is the kinetic energy of the proton.

The simplified energy above is no longer really a Hamiltonian; you cannot write Hamilton’s equations based on it as in {A.1.5}. But it does still give the energy that is conserved.

The energy above is always positive. So it can no longer be lowered by arbitrary amounts. The system will not blow up. And that then means that the original Klein-Gordon equations (A.135) and (A.136) for the fields are stable too as long as the Lorenz condition is satisfied. They produce the same evolution. And they satisfy the speed of light restriction and are Lorentz invariant. Lorenz did it!

Note also the remarkable result that the interaction energy between proton charge and field has disappeared. The proton can no longer minimize any energy of interaction between itself and the field it creates. Maxwell’s first equation is too restrictive. All the proton can try to do is minimize the energy in the electric and magnetic fields.

A.22.8 Quantization following Fermi

Quantizing the electromagnetic field is not easy. The previous
subsection showed a couple of problems. The gage property implies
that the electromagnetic potentials and are
indeterminate. Also, taking the Lorenz condition into account, the
second order time derivative is lost in the Klein-Gordon equation for
the electrostatic potential . The equation turns into
Maxwell’s first equation,

That is not an evolution equation but a spatial constraint for the electric field in terms of the charge density .

Various ways to deal with that have been developed. The quantization procedure discussed in this subsection is a simplified version of the one found in Bethe’s book, [6, pp. 255-271]. It is due to Fermi, based on earlier work by Dirac and Heisenberg & Pauli. This derivation was a great achievement at the time, and fundamental to more advanced quantum field approaches, [6, p. 266]. Note that all five mentioned physicists received a Nobel Prize in physics at one time or the other.

The starting point in this discussion will be the original potentials and of subsection A.22.4. The ones that satisfied the Klein-Gordon equations (A.135) and (A.136) as well as the Lorenz condition (A.138).

It was Fermi who recognized that you can make things a lot simpler for
yourself if you write the potentials as sums of exponentials of the
form :

That is the same trick as was used in quantizing the Koulomb potential in subsection A.22.3. However, in classical mechanics you do not call these exponentials momentum eigenstates. You call them

Fourier modes.The principle is the same. The constant vector that characterizes each exponential is still called the wave number vector. Since the potentials considered here vary with time, the coefficients and are functions of time.

Note that the coefficients are vectors. These will have
three independent components. So the vector potential can be written
more explicitly as

where , , and are unit vectors. Fermi proposed that the smart thing to do is to take the first of these unit vectors in the same direction as the wave number vector . The corresponding electromagnetic waves are called

longitudinal.The other two unit vectors should be orthogonal to the first component and to each other. That still leaves a bit choice in direction. Fortunately, in practice it does not really make a difference exactly how you take them. The corresponding electromagnetic waves are called

transverse.

In short, the fields can be written as

From those expressions, and the directions of the unit vectors, it can
be checked by straight substitution that the curl

of
the longitudinal potential is zero:

A vector field with zero curl is called “irrotational.” (The term can be understood from fluid mechanics; there the curl of the fluid velocity field gives the local average angular velocity of the fluid.)

The same way, it turns out that that the divergence

of
the transverse potential is zero

A field with zero divergence is called “solenoidal.” (This term can be understood from magnetostatics; a magnetic field, like the one produced by a solenoid, an electromagnet, has zero divergence.)

To be fair, Fermi did not really discover that it can be smart to take vector fields apart into irrotational and solenoidal parts. That is an old trick known as the “Helmholtz decomposition.”

Since the transverse potential has no divergence, the longitudinal potential is solely responsible for the Lorenz condition (A.138). The transverse potential can do whatever it wants.

The real problem is therefore with the longitudinal potential and the electrostatic potential . Bethe [6] deals with these in terms of the Fourier modes. However, that requires some fairly sophisticated analysis. It is actually easier to return to the potentials themselves now.

Reconsider the expressions (A.130) for the electric and magnetic fields in terms of the potentials. They show that the electrostatic potential produces no magnetic field. And neither does the longitudinal potential because it is irrotational.

They do produce a combined electric field
. But this electric field is
irrotational, because the longitudinal potential is, and the gradient
of any scalar function is. That helps, because then the
Stokes theorem of calculus implies that the electric field
is minus the gradient of some scalar
potential:

Note that normally is not the same as the electrostatic potential , since there is also the longitudinal potential. To keep them apart, will be called the

Coulomb potential.

As far as the divergence of the electric field
is concerned, it is the same as the
divergence of the complete electric field. The reason is that the
transverse field has no divergence. And the divergence of the
complete electric field is given by Maxwell’s first equation.
Together these observations give

Note that the final equation is a Poisson equation for the Coulomb potential.

Now suppose that you replaced the electrostatic field with the Coulomb potential and had no longitudinal field at all. It would give the same electric and magnetic fields. And they are the only ones that are observable. They give the forces on the particles. The potentials are just mathematical tools in classical electromagnetics.

So why not? To be sure, the combination of the Coulomb potential and remaining vector potential will no longer satisfy the Lorenz condition. But who cares?

Instead of the Lorenz condition, the combination of Coulomb potential
plus transverse potential satisfies the so-called “Coulomb condition:”

(A.145) |

Lorenz gage,while the new ones use the

Coulomb gage.

Because the potentials and do no longer satisfy the Lorenz condition, the Klein-Gordon equations (A.135) and (A.136) do no longer apply. But the conventional equations (A.140) and (A.141) do still apply; they do not need the Lorenz condition.

Now consider the Coulomb potential somewhat closer. As noted above
it satisfies the Poisson equation

The solution to this equation was already found in the first subsection, (A.107). If the charge distribution consists of a total of point charges, it is

(A.146) |

If the charge distribution is smoothly distributed, simply take
it apart in small point charges

. That gives

(A.147) |

The key point to note here is that the Coulomb potential has no life of its own. It is rigidly tied to the positions of the charges. That then provides the most detailed answer to the question: “What happened to energy minimization?” Charged particles have no option of minimizing any energy of interaction with the field. Maxwell’s first equation, the Poisson equation above, forces them to create a Coulomb field that is repulsive to them. Whether they like it or not.

Note further that all the mechanics associated with the Coulomb field is quasi-steady. The Poisson equation does not depend on how fast the charged particles evolve. The Coulomb electric field is minus the spatial gradient of the potential, so that does not depend on the speed of evolution either. And the Coulomb force on the charged particles is merely the electric field times the charge.

It is still not obvious how to quantize the Coulomb potential, even
though there is no longer a longitudinal field. But who cares about
the Coulomb potential in the first place? The important thing is how
the charged particles are affected by it. And the forces on the
particles caused by the Coulomb potential can be computed using the
electrostatic potential energy, {D.37.4},

Incidentally, note the required omission of the terms with in the potential energy above. Otherwise you would get infinite energy. In fact, a point charge in classical electromagnetics does have infinite Coulomb energy. Just take any of the point charges and mentally chop it up into two equal parts sitting at the same position. The interaction energy between the halves is infinite.

The issue does not exist if the charge is smoothly distributed. In
that case the Coulomb potential energy is, {D.37.4},

So the big idea is to throw away the electrostatic and longitudinal potentials and replace them with the Coulomb energy , origin unknown. Now it is mainly a matter of working out the details.

First, consider the Fermi Lagrangian. It is found by throwing out the
electrostatic and longitudinal potentials from the earlier Lagrangian
(A.134) and subtracting . That gives,
using the point charge approximation (A.133) and in vector
notation,

(A.150) |

You may wonder how you can achieve that only the transverse potential is left. That would indeed be difficult to do if you work in terms of spatial coordinates. The simplest way to handle it is to work in terms of the transverse waves (A.144). They are transverse by construction.

The unknowns are now no longer the values of the potential at the
infinitely many possible positions. Instead the unknowns are now the
coefficients and of the transverse waves. Do
take into account that since the field is real,

So the number of independent variables is half of what it seems. The most straightforward way of handling this is to take the unknowns as the real and imaginary parts of the and for half of the values. For example, you could restrict the values to those for which the first nonzero component is positive. The corresponding unknowns must then describe both the and waves.

(The 0 terms are awkward. One way to deal with it is to take an adjacent periodic box and reverse the sign of all the charges and fields in it. Then take the two boxes together to be a new bigger periodic box. The net effect of this is to shift the mesh of -values figure 6.17 by half an interval. That means that the 0 terms are gone. And other problems that may arise if you sum over all boxes, like to find the total Coulomb potential, are gone too. Since the change in values becomes zero in the limit of infinite box size, all this really amounts to is simply ignoring the 0 terms.)

The Hamiltonian can be obtained just like the earlier one
(A.137), {A.1.5}. (Or make that
{A.1.4}, since the unknowns, and
, are now indexed by the discrete values of the wave
number .) But this time it really needs to be done right,
because this Hamiltonian is supposed to be actually used. It is best
done in terms of the components of the potential and velocity vectors.
Using to index the components, the Lagrangian becomes

Now Hamiltonians should not be in terms of particle velocities,
despite what (A.137) said. Hamiltonians should be in terms
of canonical momenta,

{A.1.4}. The
canonical momentum corresponding to the velocity component
of a particle is defined as

Differentiating the Lagrangian above gives

It is this canonical momentum that in quantum mechanics gets replaced by the operator . That is important since, as the above expression shows, canonical momentum is not just linear momentum in the presence of an electromagnetic field.

The time derivatives of the real and imaginary parts of the coefficients and should be replaced by similarly defined canonical momenta. However, that turns out to be a mere rescaling of these time derivatives.

The Hamiltonian then becomes, following {A.1.4} and
in vector notation,

Note in particular that the center term is the kinetic energy of the particles, but in terms of their canonical momenta.

In terms of the waves (A.144), the integral falls apart in
separate contributions from each and mode.
That is a consequence of the orthogonality of the exponentials,
compare the Parseval identity in {A.26}. (Since the
exponentials are complex, the absolute values in the integral are now
required.) As a result, the equations for different coefficients are
only indirectly coupled by the interaction with the charged particles.
In particular, it turns out that each coefficient satisfies its own
harmonic oscillator equation with forcing by the charged particles,
{A.1.4},

If the speed of the particle gets comparable to the speed of light,
you may want to use the relativistic energy (1.2);

Sometimes, it is convenient to assume that the system under consideration also experiences an external electromagnetic field. For example, you might consider an atom or atomic nucleus in the magnetic field produced by an electromagnet. You probably do not want to include every electron in the wires of the electromagnet in your model. That would be something else. Instead you can simply add the vector potential that they produce to in the Hamiltonian. If there is also an external electrostatic potential, add a separate term to the Hamiltonian for each particle . The external fields will be solutions of the homogeneous evolution equations (A.140) and (A.141), (i.e. the equations without charge and current densities). However, the external fields will not vanish at infinity; that is why they can be nonzero without charge and current densities.

Note that the entire external vector potential is needed, not just the transverse part. The longitudinal part is not included in . Bethe [6, p. 266] also notes that the external field should satisfy the Lorenz condition. No further details are given. However, at least in various simple cases, a gage transform that kills off the Lorenz condition may be applied. See for example the gage property for a pure external field {A.19.5}. In the classical case a gage transform of the external fields does not make a difference either, because it does not change either the Lagrangian equations for the transverse field nor those for the particles. Using the Lorenz condition cannot hurt, anyway.

Particle spin, if any, is not included in the above Hamiltonian. At nonrelativistic speeds, its energy can be described as a dot produce with the local magnetic field, chapter 13.4.

So far all this was classical electrodynamics. But the interaction between the charges and the transverse waves can readily be quantized using essentially the same procedure as used for the Koulomb potential in subsection A.22.3. The details are worked out in addendum {A.23} for the fields. It allows a relativistic description of the emission of electromagnetic radiation by atoms and nuclei, {A.24} and {A.25}.

While the transverse field must be quantized, the Coulomb potential can be taken unchanged into quantum mechanics. That was done, for example, for the nonrelativistic hydrogen atom in chapter 4.3 and for the relativistic one in addendum {D.82}.

Finally, any external fields are assumed to be given; they are not quantized either.

Note that the Fermi quantization is not fully relativistic. In a
fully relativistic theory, the particles too should be described by
quantum fields. The Fermi quantization does not do that. So even the
relativistic hydrogen atom is not quite exact, even though it is
orders of magnitude more accurate than the already very accurate
nonrelativistic one. The energy levels are still wrong by the
so-called Lamb shift,

{A.38} But this is an
extremely tiny effect. Little in life is perfect, isn’t it?

A.22.9 The Coulomb potential and the speed of light

The Coulomb potential

does not respect the speed of light . Move a charge, and the Coulomb potential immediately changes everywhere in space. However, special relativity says that an event may not affect events elsewhere unless these events are reachable by the speed of light. Something else must prevent the use of the Coulomb potential to transmit observable effects at a speed greater than that of light.

To understand what is going on, assume that at time zero some charges at the origin are given a well-deserved kick. As mentioned earlier, the Klein-Gordon equations respect the speed of light. Therefore the original potentials and , the ones that satisfied the Klein-Gordon equations and Lorenz condition, are unaffected by the kick beyond a distance from the origin. The original potentials do respect the speed of light.

The Coulomb potential above, however, includes the longitudinal part of the vector potential . As the Coulomb potential reflects, does change immediately all the way up to infinity. But the transverse part also changes immediately all the way up to infinity. Beyond the limit dictated by the speed of light, the two parts of the potential exactly cancel each other. As a result, beyond the speed of light limit, the net vector potential does not change.

The bottom line is

The limitation is still there, it is just much more difficult to see. The change in current density caused by kicking the charges near the origin is restricted to the immediate vicinity of the origin. But both the longitudinal part and the transverse part extend all the way to infinity. And then so do the longitudinal and transverse potentials. It is only when you add the two that you see that the sum is zero beyond the speed of light limit.The mathematics of the Helmholtz decomposition of into and hides, but of course does not change, the limitation imposed by the speed of light.