D.71 Emergence of spin from relativity

This note will give a (relatively) simple derivation of the Dirac
equation to show how relativity naturally gives rise to spin. The
equation will be derived without ever mentioning the word spin while
doing it, just to prove it can be done. Only Dirac’s assumption
that Einstein's square root disappears,

will be used and a few other assumptions that have nothing to do with spin.

The conditions on the coefficient matrices for the linear
combination to equal the square root can be found by squaring both
sides in the equation above and then comparing sides. They turn out to be:

where the sizes of the positive and negative unit matrices in are still undecided; one of the two could in principle be of zero size.

However, since must be zero for
the three other Hermitian matrices, it is seen from
multiplying that out that they must be of the form

The matrices, whatever they are, must be square in size or the matrices would be singular and could not square to one. This then implies that the positive and negative unit matrices in must be the same size.

Now try to satisfy the remaining conditions on ,
, and using just complex numbers, rather
than matrices, for the . By multiplying out the
conditions (D.45), you see that

The first condition above would require each to be a number of magnitude one, in other words, a number that can be written as for some real angle . The second condition is then according to the Euler formula (2.5) equivalent to the requirement that

this implies that all three angles would have to be 90 degrees apart. That is impossible: if and are each 90 degrees apart from , then and are either the same or apart by 180 degrees; not by 90 degrees.

It follows that the components cannot be numbers, and must be matrices too. Assume, reasonably, that they correspond to some measurable quantity and are Hermitian. In that case the conditions above on the are the same as those on the , with one critical difference: there are only three matrices, not four. And so the analysis repeats.

Choose to describe the wave function in terms of the eigenvectors of
the matrix; this does not conflict with the earlier choice
since all half wave function vectors are eigenvectors of the positive
and negative unit matrices in . So you have

and the other two matrices must then be of the form

But now the components and can indeed be just complex numbers, since there are only two, and two angles can be apart by 90 degrees. You can take and then or . The existence of two possibilities for implies that on the wave function level, nature is not mirror symmetric; momentum in the positive -direction interacts differently with the and momenta than in the opposite direction. Since the observable effects are mirror symmetric, do not worry about it and just take the first possibility.

So, the goal of finding a formulation in which Einstein's square root falls apart has been achieved. However, you can clean up some more, by redefining the value of away. If the four-dimensional wave function vector takes the form , define , and similar for and .

In that case, the final cleaned-up matrices are

(D.46) |

sword has not been mentioned even once in this derivation. So, now please express audible surprise that the matrices turn out to be the Pauli (it can now be said) spin matrices of chapter 12.10.

But there is more. Suppose you define a new coordinate system rotated 90 degrees around the -axis. This turns the old -axis into a new -axis. Since has an additional factor , to get the normalized coefficients, you must include an additional factor in , which by the fundamental definition of angular momentum discussed in addendum {A.19} means that it describes a state with angular momentum . Similarly corresponds to a state with angular momentum and and to ones with .

For nonzero momentum, the relativistic evolution of spin and momentum
becomes coupled. But still, if you look at the eigenstates of positive
energy, they take the form:

where is a small number in the nonrelativistic limit and is the two-component vector . The operator corresponding to rotation of the coordinate system around the momentum vector commutes with , hence the entire four-dimensional vector transforms as a combination of a spin state and a spin state for rotation around the momentum vector.