9.3 The Hartree-Fock Approximation

Many of the most important problems that you want to solve in quantum mechanics are all about atoms and/or molecules. These problems involve a number of electrons around a number of atomic nuclei. Unfortunately, a full quantum solution of such a system of any nontrivial size is very difficult. However, approximations can be made, and as section 9.2 explained, the real skill you need to master is solving the wave function for the electrons given the positions of the nuclei.

But even given the positions of the nuclei, a brute-force solution for any nontrivial number of electrons turns out to be prohibitively laborious. The Hartree-Fock approximation is one of the most important ways to tackle that problem, and has been so since the early days of quantum mechanics. This section explains some of the ideas.

9.3.1 Wave function approximation

The key to the basic Hartree-Fock method is the assumptions it makes about the form of the electron wave function. It will be assumed that there are a total of electrons in orbit around a number of nuclei. The wave function describing the set of electrons then has the general form:

$\begin{displaymath} \Psi({\skew0\vec r}_1,S_{z1},{\skew0\vec r}_2,S_{z2},\ldots,{\skew0\vec r}_i,S_{zi},\ldots {\skew0\vec r}_I,S_{zI}) \end{displaymath}$

where ${\skew0\vec r}_i$ is the position of the electron numbered

, and $S_{zi}$ its spin (i.e. internal angular momentum) in a chosen

-direction. Recall that while the position of an electron can be anywhere in three-dimensional space, its spin component

can have only two measurable values: $\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em\hbar$ or $-\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em\hbar$ . Because of the factor $\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em$ , an electron is a “particle of spin one-half”. Such a particle is also called a “spin doublet” because of the two possible spin values.

The square magnitude of the wave function above gives the probability for the electrons $i=1,2,\ldots,I$ to be near the position ${\skew0\vec r}_i$ , per unit volume, with spin component $S_{zi}$ .

Of course, what the wave function is will also depend on where the nuclei are. However, in this section, the nuclei are supposed to be at given positions. Therefore to reduce the clutter, the dependence of the electron wave function on the nuclear positions will not be shown explicitly.

Hartree-Fock approximates the wave function above in terms of single-electron wave functions. Each single-electron wave function takes the form of a product of a spatial function $\pe////$ of the electron position ${\skew0\vec r}$ , times a function of the electron spin component . The spin function is either taken to be ${\uparrow}$ or ${\downarrow}$ ; by definition, function ${\uparrow}(S_z)$ equals 1 if the spin is $\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em\hbar$ , and 0 if it is $-\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em\hbar$ . Conversely, function ${\downarrow}(S_z)$ equals 0 if is $\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em\hbar$ and 1 if it is $-\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em\hbar$ . Function ${\uparrow}$ is called spin-up and ${\downarrow}$ spin-down.

A complete single-electron wave function is then of the form

$\begin{displaymath} \pe/{\skew0\vec r}/b/z/ \end{displaymath}$

where ${\updownarrow}$ is either ${\uparrow}$ or ${\downarrow}$ . Such a single-electron wave function is called an “orbital” or more accurately a “spin orbital.” The reason is that people tend to think of the single-electron wave function as describing a single electron being in a particular orbit around the nuclei with a particular spin. Wrong, of course: the electrons do not have well-defined positions on these scales, so you cannot talk about orbits But people do tend to think of the spatial orbitals $\pe/{\skew0\vec r}///$ that way anyway.

For simplicity, it will be assumed that the spin orbitals are taken to be normalized; if you integrate the square magnitude of $\pe/{\skew0\vec r}/b/z/$ over all possible positions ${\skew0\vec r}$ of the electron and sum over the two possible values of its spin , you get 1. Physically that merely expresses that the electron must be at some position and have some spin for certain (probability 1). The integral plus sum combination can be expressed using the concise bra[c]ket notation from chapter 2;

$\begin{displaymath} \left\langle\vphantom{\pe//b//}\pe//b//\hspace{-\nulldelimi... ...right.\!\left\vert\vphantom{\pe//b//}\pe//b//\right\rangle = 1 \end{displaymath}$

Such a bracket, or inner product, is equivalent to a dot product for functions.

If there is more than one electron, as will be assumed in this section, a single spin orbital $\pe//b//$ is not enough to create a valid wave function for the complete system. In fact, the “Pauli exclusion principle” says that each of the electrons must go into a different spin orbital, chapter 5.7. So a series of orbitals is needed,

$\begin{displaymath} \pe 1//b//, \pe 2//b//, \ldots, \pe n//b//, \ldots, \pe N//b// \end{displaymath}$

where the number of orbitals

must be at least as big as the number of electrons

It will be assumed that any two different spin orbitals $\pe{n}//b//$ and $\pe{{\underline n}}//b//$ are taken to be orthogonal; by definition this means that their bracket is zero:

$\begin{displaymath} \left\langle\vphantom{\pe {\underline n}//b//}\pe n//b//\hs... ...}//b//\right\rangle = 0 \quad\mbox{if}\quad n\ne{\underline n} \end{displaymath}$

In short, it is assumed that the set of spin orbitals is orthonormal; mutually orthogonal and normalized.

Note that the bracket above can be written as a product of a spatial bracket and a spin one:

$\begin{displaymath} \left\langle\vphantom{\pe {\underline n}//b//}\pe n//b//\hs... ...m{{\updownarrow}_n}{\updownarrow}_{\underline n}\right\rangle \end{displaymath}$

So for different spin orbitals to be orthogonal, either the spatial states or the spin states must orthogonal; they do not both need to be orthogonal. (To verify the expression above, just write the first bracket out in terms of a spatial integral over ${\skew0\vec r}$ and a sum over the two values of

and reorder terms.)

Note also that the spin states ${\uparrow}$ and ${\downarrow}$ are an orthonormal set:

$\begin{displaymath} \left\langle\vphantom{{\uparrow}}{\uparrow}\hspace{-\nullde... ...\!\left\vert\vphantom{{\downarrow}}{\uparrow}\right\rangle = 0 \end{displaymath}$

So if the spin states are opposite, the spatial states do not need to be orthogonal. In fact, the spatial states can then be the same.

The base Hartree-Fock method uses the absolute minimum number of orbitals $\vphantom0\raisebox{1.5pt}{$=$}$ . In that case, the simplest you could do to create a system wave function is to put electron 1 in orbital 1, electron 2 in orbital 2, etcetera. That would give the system wave function

$\begin{displaymath} \pe1/{\skew0\vec r}_1/b/z1/ \pe2/{\skew0\vec r}_2/b/z2/ \pe3/{\skew0\vec r}_3/b/z3/ \ldots \pe I/{\skew0\vec r}_I/b/zI/ \end{displaymath}$

A product of single-electron wave functions like this is called a “Hartree product.”

But a single Hartree product like the one above is physically not acceptable as a wave function. The Pauli exclusion principle is only part of what is needed, chapter 5.7. The full requirement is that a system wave function must be antisymmetric under electron exchange: the wave function must simply change sign when any two electrons are swapped. But if, say, electrons 1 and 2 are swapped in the Hartree product above, it produces the new Hartree product

$\begin{displaymath} \pe1/{\skew0\vec r}_2/b/z2/ \pe2/{\skew0\vec r}_1/b/z2/ \pe3/{\skew0\vec r}_3/b/z3/ \ldots \pe I/{\skew0\vec r}_I/b/zI/ \end{displaymath}$

That is a fundamentally different wave function, not just minus the first Hartree product; orbitals $\pe1//b//$ and $\pe2//b//$ are not allowed to be equivalent.

To get a wave function that does simply change sign when electrons 1 and 2 are swapped, you can take the first Hartree product minus the second one. That solves that problem. But it is not enough: the wave function must also simply change sign if electrons 1 and 3 are swapped. Or if 2 and 3 are swapped, etcetera.

So you must add more Hartree products with swapped electrons to the mix. A lot more in fact. There are ways to order electrons, and each ordering adds one Hartree product to the mix. (The Hartree product gets a plus sign or a minus sign in the mix depending on whether the number of swaps to get there from the first one is even or odd). So for, say, a single carbon atom with $\vphantom0\raisebox{1.5pt}{$=$}$ 6 electrons, writing down the full Hartree-Fock wave function would mean writing down $\vphantom0\raisebox{1.5pt}{$=$}$ 720 Hartree products. Roughly a thousand of them, in short. Of course, writing all that out would be insane. Fortunately, there is a more concise way to write the complete wave function; it uses a so-called Slater determinant,

$\begin{displaymath} \frac{1}{\sqrt{I!}} \left\vert \begin{array}{cccccc} \pe... ...\ldots & \pe I/{\skew0\vec r}_I/b/zI/ \end{array} \right\vert \end{displaymath}$

(9.15)

The determinant multiplies out to the

individual Hartree products. (See chapter 5.7 and the notations section for more on determinants.) The factor 1 $\raisebox{.5pt}{$/$}$ $/\sqrt{I!}$ is there to ensure that the wave function remains normalized after summing the

Hartree products together.

The most general system wave function $\Psi$ using only $\vphantom0\raisebox{1.5pt}{$=$}$ orbitals is any coefficient of magnitude 1 times the above Slater determinant. However, displaying the Slater determinant fully as above is still a lot to write and read. Therefore, from now on the Slater determinant will be abbreviated as in

$\begin{displaymath} \Psi = a {\left\vert{\rm det}\;\pe 1//b//,\pe 2//b//,\ldots,\pe n//b//,\ldots,\pe I//b//\right\rangle} % \end{displaymath}$

(9.16)

where ${\left\vert{\rm det}\;\ldots\right\rangle}$ is the Slater determinant.

It is important to realize that using the minimum number of single-electron functions will unavoidably produce an error that is mathematically speaking not small {N.16}. To get a vanishingly small error, you would need a large number of different Slater determinants, not just one. Still, the results you get with the basic Hartree-Fock approach may be good enough to satisfy your needs. Or you may be able to improve upon them enough with post-Hartree-Fock methods.

But none of that would be likely if you just selected the single-electron functions $\pe1//b//$ , $\pe2//b//$ , ...at random. The cleverness in the Hartree-Fock approach will be in writing down equations for these single-electron wave functions that produce the best approximation possible with a single Slater determinant.

Recall the approximate solutions that were written down for the electrons in atoms in chapter 5.9. These solutions were really single Slater determinants. To improve on these results, you might think of trying to find more accurate ways to average out the effects of the neighboring electrons than just putting them in the nucleus as that chapter essentially did. You could smear them out over some optimal area, say. But even if you did that, the Hartree-Fock solution will still be better, because it gives the best possible approximation obtainable with any single determinant.

That assumes of course that the spins are taken the same way. Consider that problem for a second. Typically, a nonrelativistic approach is used, in which spin effects on the energy are ignored. Then spin only affects the antisymmetrization requirements.

Things are straightforward if you try to solve, say, a helium atom. The correct ground state takes the form

$\begin{displaymath} \Psi_{\rm {He}}({\skew0\vec r}_1,{\skew0\vec r}_2) \times ... ...row}(S_{z2}) -{\downarrow}(S_{z1}){\uparrow}(S_{z2})}{\sqrt2}, \end{displaymath}$

The factor $\Psi_{\rm {He}}({\skew0\vec r}_1,{\skew0\vec r}_2)$ is the spatial wave function that has the absolutely lowest energy, regardless of any antisymmetrization concerns. This wave function must be symmetric (unchanged) under electron exchange since the two electrons are identical and the ground state is unique. The antisymmetrization requirement is met because the spins combine together as shown in the second factor above; this factor changes sign when the electrons are exchanged. So the spatial state does not have to change sign.

The combined spin state shown in the second factor above is called the singlet state, chapter 5.5.6. In the singlet state the two spins cancel each other completely: the net electron spin is zero. If you measure the net spin component in any direction, not just the chosen -direction, you get zero.

Based on the exact helium ground state wave function above, you would take the Hartree-Fock approximation to be of the form

$\begin{displaymath} {\left\vert{\rm det}\;\pe1//u//,\pe2//d//\right\rangle} \end{displaymath}$

and then you would make things easier for yourself by postulating a priori that the spatial orbitals are the same, $\pe1////$ $\vphantom0\raisebox{1.5pt}{$=$}$ $\pe2////$ . Lo and behold, when you multiply out the Slater determinant,

$\begin{displaymath} \frac{1}{\sqrt{2}} \left\vert \begin{array}{cc} \pe1/{\s... ...2/u/z2/ & \pe1/{\skew0\vec r}_2/d/z2/ \end{array} \right\vert \end{displaymath}$

you get

$\begin{displaymath} \pe1/{\skew0\vec r}_1///\pe1/{\skew0\vec r}_2/// \times \f... ...arrow}(S_{z2})-{\downarrow}(S_{z1}){\uparrow}(S_{z2})}{\sqrt2} \end{displaymath}$

This automagically reproduces the correct singlet spin state! (The approximation comes in because the exact spatial ground state, $\Psi_{\rm {He}}({\skew0\vec r}_1,{\skew0\vec r}_2)$ is not just the product of two single-electron functions as in Hartree-Fock.) And you only need to find one spatial orbital instead of two.

As discussed in chapter 5.9, a beryllium atom has two electrons with opposite spins in the 1s shell like helium, and two more in the 2s shell. An appropriate Hartree-Fock wave function would be

$\begin{displaymath} {\left\vert{\rm det}\;\pe1//u//,\pe1//d//,\pe3//u//,\pe3//d//\right\rangle} \end{displaymath}$

in other words, two pairs of orbitals with the same spatial states and opposite spins. Similarly, Neon has an additional 6 paired electrons in a closed 2p shell, and you could use 3 more pairs of orbitals with the same spatial states and opposite spins. The number of spatial orbitals that must be found in such solutions is only half the number of electrons. This procedure is called the “closed shell Restricted Hartree-Fock (RHF)” method. It restricts the form of the spatial states to be pair-wise equal.

But now look at lithium. Lithium has two paired 1s electrons like helium, and an unpaired 2s electron. For the third orbital in the Hartree-Fock determinant, you will now have to make a choice: whether to take it of the form $\pe3//u//$ or $\pe3//d//$ . Lets assume you take $\pe3//u//$ , so the wave function is

$\begin{displaymath} {\left\vert{\rm det}\;\pe1//u//,\pe2//d//,\pe3//u//\right\rangle} \end{displaymath}$

You have introduced a bias in the determinant: there is now a real difference between the spatial orbitals $\pe1////$ and $\pe2////$ : $\pe1//u//$ has the same spin as the third spin orbital, but $\pe2//d//$ the opposite.

If you find the best approximation to the energy among all possible spatial orbitals $\pe1////$ , $\pe2////$ , and $\pe3////$ , you will end up with orbitals $\pe1////$ and $\pe2////$ that are not the same. Allowing for them to be different is called the “Unrestricted Hartree-Fock (UHF)” method. In general, you no longer require that equivalent spatial orbitals are the same in their spin-up and spin down versions. For a bigger system, you will end up with one set of orthonormal spatial orbitals for the spin-up orbitals and a different set of orthonormal spatial orbitals for the spin-down ones. These two sets of orthonormal spatial orbitals are not mutually orthogonal; the only reason the complete spin orbitals are still orthonormal is because the two spins are orthogonal, $\left\langle\vphantom{{\downarrow}}{\uparrow}\hspace{-\nulldelimiterspace}\hspace{.03em}\right.\!\left\vert\vphantom{{\uparrow}}{\downarrow}\right\rangle$ $\vphantom0\raisebox{1.5pt}{$=$}$ 0.

If instead of using unrestricted Hartree-Fock, you insist on demanding that the spatial orbitals for spin up and down do form a single set of orthonormal functions, it is called “open shell Restricted Hartree-Fock (RHF).” In the case of lithium, you would then demand that $\pe2////$ equals $\pe1////$ . Since the best (in terms of energy) solution has them different, your solution is then no longer the best possible. You pay a price, but you now only need to find two spatial orbitals rather than three. The spin orbital $\pe3//u//$ without a matching opposite-spin orbital counts as an open shell. For nitrogen, you might want to use three open shells to represent the three different spatial states 2p, 2p, and 2p with an unpaired electron in it.

If you use unrestricted Hartree-Fock instead, you will need to compute more spatial functions, and you pay another price, spin. Since all spin effects in the Hamiltonian are ignored, it commutes with the spin operators. So, the exact energy eigenfunctions are also, or can be taken to be also, spin eigenfunctions. Restricted Hartree-Fock has the capability of producing approximate energy eigenstates with well defined spin. Indeed, as you saw for helium, in restricted Hartree-Fock all the paired spin-up and spin-down states combine into zero-spin singlet states. If any additional unpaired states are all spin up, say, you get an energy eigenstate with a net spin equal to the sum of the spins of the unpaired states. This allows you to deal with typical atoms, including lithium and nitrogen, very nicely.

But a true unrestricted Hartree-Fock solution does not have correct, definite, spin. For two electrons to produce states of definite combined spin, the coefficients of spin-up and spin-down must come in specific ratios. As a simple example, an unrestricted Slater determinant of $\pe1//u//$ and $\pe2//d//$ with unequal spatial orbitals multiplies out to

$\begin{displaymath} {\left\vert{\rm det}\;\pe1//u//,\pe2//d//\right\rangle} = ... ...ew0\vec r}_2///{\downarrow}(S_{z1}){\uparrow}(S_{z2})}{\sqrt2} \end{displaymath}$

or, writing the spin combinations in terms of singlets (which change sign under electron exchange) and triplets (which do not),

$\begin{eqnarray*} & & \frac{\pe1/{\skew0\vec r}_1///\pe2/{\skew0\vec r}_2///+\p... ...wnarrow}(S_{z2})+{\downarrow}(S_{z1}){\uparrow}(S_{z2})}{\sqrt2} \end{eqnarray*}$

So the spin will be some combination of the singlet state, the first term, and a triplet state, the second. And the precise combination will depend on the spatial locations of the electrons to boot. Now while the singlet state has net spin 0, triplet states have net spin 1. So the net spin is uncertain, either 0 or 1, even though it should not be. (Spin 1 implies that the measured component of the spin in any direction must be one of the triplet of values $\hbar$ , 0, or $-\hbar$ . For the particular triplet state shown above, the component of spin in the

-direction happens to be zero. But the net spin is not; in a direction normal to the

-direction, the spin will be measured to be either $\hbar$ or $-\hbar$ .) However, despite the spin problem, it may be noted that unrestricted wave functions are commonly used as first approximations of doublet (spin $\leavevmode \kern.03em\raise.7ex\hbox{\the\scriptfont0 1}\kern-.2em /\kern-.21em\lower.56ex\hbox{\the\scriptfont0 2}\kern.05em$ ) and triplet (spin 1) states anyway [46, p. 105].

To show that all this can make a real difference, take the example of the hydrogen molecule, chapter 5.2, when the two nuclei are far apart. The correct electronic ground state is

$\begin{displaymath} \frac{\psi_{\rm {L}}({\skew0\vec r}_1)\psi_{\rm {R}}({\skew... ...arrow}(S_{z2})-{\downarrow}(S_{z1}){\uparrow}(S_{z2})}{\sqrt2} \end{displaymath}$

where $\psi_{\rm {L}}({\skew0\vec r}_1)\psi_{\rm {R}}({\skew0\vec r}_2)$ is the state in which electron 1 is around the left proton and electron 2 around the right one, and $\psi_{\rm {R}}({\skew0\vec r}_1)\psi_{\rm {L}}({\skew0\vec r}_2)$ is the same state but with the electrons reversed. Note that, like for the helium atom, the spatial state is symmetric under electron exchange. However, it is not just a product of two single-electron functions but a sum of two of such products. Note also that the correct spin state is the singlet one with zero net spin, just like for the helium atom. It takes care of the antisymmetrization requirement.

Now try to approximate this solution with a restricted closed shell Hartree-Fock wave function of the form

$\begin{displaymath} {\left\vert{\rm det}\;\pe1//u//,\pe1//d//\right\rangle} \end{displaymath}$

Multiplying out the determinant gives

$\begin{displaymath} \pe1/{\skew0\vec r}_1///\pe1/{\skew0\vec r}_2/// \times \f... ...arrow}(S_{z2})-{\downarrow}(S_{z1}){\uparrow}(S_{z2})}{\sqrt2} \end{displaymath}$

Note that you do get the correct singlet spin state. But $\pe1////$ will be something like $(\psi_{\rm {L}}+\psi_{\rm {R}})$ $\raisebox{.5pt}{$/$}$ $\sqrt2$ ; the energy of either electron is lowest when it is near one of the nuclei. If you multiply out the resulting spatial wave function, the terms include $\psi_{\rm {L}}\psi_{\rm {L}}$ and $\psi_{\rm {R}}\psi_{\rm {R}}$ , in addition to the correct $\psi_{\rm {L}}\psi_{\rm {R}}$ and $\psi_{\rm {R}}\psi_{\rm {L}}$ . That produces a 50/50 chance that the two electrons are found around the same nucleus. That is all wrong, since the electrons repel each other: if one electron is around the left nucleus, the other electron should be around the right nucleus. The computed energy, which should be that of two neutral hydrogen atoms far apart, will be much too high due to electron-electron repulsion.

(Fortunately, at the nuclear separation distance corresponding to the ground state of the complete molecule, the errors are much less, [46, p. 166]. Note that if you put the two nuclei completely on top of each other, you get a helium atom, for which Hartree-Fock gives a much more reasonable electron energy. Only when you are breaking the bond, dissociating the molecule, i.e. taking the nuclei far apart, do you get into major trouble.)

If instead you would use unrestricted Hartree-Fock, say

$\begin{displaymath} {\left\vert{\rm det}\;\pe1//u//,\pe2//d//\right\rangle} \end{displaymath}$

you should find $\pe1////$ $\vphantom0\raisebox{1.5pt}{$=$}$ $\psi_{\rm {L}}$ and $\pe2////$ $\vphantom0\raisebox{1.5pt}{$=$}$ $\psi_{\rm {R}}$ (or vice versa), which would produce a wave function

$\begin{displaymath} \frac{ \psi_{\rm {L}}({\skew0\vec r}_1)\psi_{\rm {R}}({\sk... ...w0\vec r}_2){\downarrow}(S_{z1}){\uparrow}(S_{z2})}{\sqrt{2}}. \end{displaymath}$

In both terms, if the first electron is around the one nucleus, the second electron is around the other. So this produces the correct energy, that of two neutral hydrogen atoms. But the spin is now all wrong. It is not a singlet state, but the combination of a singlet and a triplet state already written down earlier. Little in life is ideal, is it?

(Actually there is a dirty trick to fix this. Note that which of the two orbitals you give spin-up and which spin-down is physically immaterial. So there is a trivially different solution

$\begin{displaymath} {\left\vert{\rm det}\;\pe1//d//,\pe2//u//\right\rangle} \end{displaymath}$

If you take a 50/50 combination of the original Slater determinant and minus the one above, you get the correct singlet spin state. And the spatial state will now be the correct average of $\psi_{\rm {L}}\psi_{\rm {R}}$ and $\psi_{\rm {R}}\psi_{\rm {L}}$ to boot. This spatial state is more accurate than just two neutral atoms if the distance between the nuclei decreases, chapter 5.2. All this for free! This sort of dirty trick in Hartree-Fock is called a “spin adapted configuration.” It is usually used to deal with a few open shells in an otherwise closed-shell restricted Hartree-Fock configuration.)

All of the above may be much more than you ever wanted to hear about the wave function. The purpose was mainly to indicate that things are not as simple as you might initially suppose. As the examples showed, some understanding of the system that you are trying to model definitely helps. Or experiment with different approaches.

Let’s go on to the next step: how to get the equations for the spatial orbitals $\pe1////,\pe2////,\ldots$ that give the most accurate approximation of a multi-electron problem. The expectation value of energy will be needed for that, and to get that, first the Hamiltonian is needed. That will be the subject of the next subsection.

9.3.2 The Hamiltonian

The nonrelativistic Hamiltonian of the system of electrons consists of a number of contributions. First there is the kinetic energy of the electrons; the sum of the kinetic energy operators of the individual electrons:

$\begin{displaymath} {\widehat T}^{\rm E} = - \sum_{i=1}^I \frac{\hbar^2}{2m_{\... ...\partial y_i^2} + \frac{\partial^2}{\partial z_i^2} \right). \end{displaymath}$

(9.17)

Next there is the potential energy due to the ambient electric field that the electrons move in. It will be assumed that this field is caused by nuclei, numbered using an index , and having charge (i.e. there are protons in nucleus number ). In that case, the total potential energy due to nucleus-electron attractions is, summing over all electrons and over all nuclei:

$\begin{displaymath} V^{\rm NE} = - \sum_{i=1}^I \left(\sum_{j=1}^J \frac{Z_j e^2}{4\pi\epsilon_0} \frac{1}{r^{\rm n}_{ij}}\right) \end{displaymath}$

(9.18)

where $r^{\rm n}_{ij}\equiv\vert{\skew0\vec r}_i-{\skew0\vec r}^{\,\rm {n}}_j\vert$ is the distance between electron number

and nucleus number

, and $\epsilon_0$ $\vphantom0\raisebox{1.5pt}{$=$}$ 8.85 10 $\POW9,{-12}$ C $\POW9,{2}$ /J m is the permittivity of space.

And now for the black plague of quantum mechanics, the electron to electron repulsions. The potential energy for those repulsions is

$\begin{displaymath} V^{\rm EE}= \sum_{i=1}^I \sum_{{\underline i}>i}^I \frac{e^2}{4\pi\epsilon_0} \frac{1}{r_{i{\underline i}}} \end{displaymath}$

(9.19)

where $r_{i{\underline i}}\equiv\vert{\skew0\vec r}_i-{\skew0\vec r}_{\underline i}\vert$ is the distance between electron number

and electron number ${\underline i}$ . To avoid counting each repulsion energy twice, (the second time with reversed electron order), the second electron number is required to be larger than the first.

Without this repulsion between different electrons, you could solve for each electron separately, and all would be nice. But you do have it, and so you really need to solve for all electrons at once, usually an impossible task. You may recall that when chapter 5.9 examined the atoms heavier than hydrogen, those with more than one electron, the discussion cleverly threw out the electron to electron repulsion terms, by assuming that the effect of each neighboring electron is approximately like canceling out one proton in the nucleus. And you may also remember how this outrageous assumption led to all those wrong predictions that had to be corrected by various excuses. The Hartree-Fock approximation tries to do better than that.

It is helpful to split the Hamiltonian into the single electron terms and the troublesome interactions, as follows,

$\begin{displaymath} H = \sum_{i=1}^I h^{\rm e}_i + \sum_{i=1}^I \sum_{{\underline i}>i}^I v^{\rm ee}_{i{\underline i}} % \end{displaymath}$

(9.20)

where $h^{\rm e}_i$ is the single-electron Hamiltonian of electron

$\begin{displaymath} h^{\rm e}_i = - \frac{\hbar^2}{2m_{\rm e}}\nabla_i^2 + \s... ...}^J \frac{Z_j e^2}{4\pi\epsilon_0} \frac{1}{r^{\rm n}_{ij}} % \end{displaymath}$

(9.21)

and $v^{\rm ee}_{i{\underline i}}$ is the electron

to electron ${\underline i}$ repulsion potential energy,

$\begin{displaymath} v^{\rm ee}_{i{\underline i}} = \frac{e^2}{4\pi\epsilon_0} \frac{1}{r_{i{\underline i}}} % \end{displaymath}$

(9.22)

Note that $h^{\rm e}_1$ , $h^{\rm e}_2$ , ..., $h^{\rm e}_I$ all take the same general form; the difference is just in which electron you are talking about. That is not surprising because the electrons all have the same properties. Similarly, the difference between $v^{\rm ee}_{12}$ , $v^{\rm ee}_{13}$ , ..., $v^{\rm ee}_{(I-1)I}$ is just in which pair of electrons you talk about.

9.3.3 The expectation value of energy

As was discussed in more detail in section 9.1, to find the best possible Hartree-Fock approximation, the expectation value of energy will be needed. For example, the best approximation to the ground state is the one that has the smallest expectation value of energy.

The expectation value of energy $\left\langle{E}\right\rangle$ is defined as the inner product

$\begin{displaymath} {\left\langle\vphantom{H\Psi}\Psi\hspace{0.3pt}\right\vert}H{\left\vert\Psi\vphantom{\Psi H}\right\rangle} \end{displaymath}$

where

is the Hamiltonian as given in the previous subsection. There is a problem with using this expression mindlessly, though. Take once again the example of the arsenic atom. There are 33 electrons in this atom, so you could try to choose 33 promising single-electron wave functions to describe it. You could then try to multiply out the Slater determinant for $\Psi$ , but that produces 33!, or about 4 10 $\POW9,{36}$ , Hartree products. If you put these 33! terms in both sides of the inner product, you get (33!) $\POW9,{2}$ or 7.5 10 $\POW9,{73}$ pairs of terms, each producing one inner product that must be integrated. Now since there are 3 coordinates for each of the positions of the 33 electrons, this means that each term requires integration over 99 scalar coordinates. Even using only 10 points in each direction, that would mean evaluating 10 $\POW9,{99}$ integration points for each of the 7.5 10 $\POW9,{73}$ pairs of terms. A computer that could do that is unimaginable. As of 2014, the fastest computer in the world can do no more than 10 $\POW9,{25}$ floating point computations if it stays at it for 10 years.

Fortunately, it turns out, {D.52}, that almost all of those integrations are trivial since the single-electron functions are orthonormal. If you sit down and identify what is really left, you find that only a few three-dimensional and six-dimensional inner products survive the weeding-out process.

In particular, the single-electron Hamiltonians from the previous subsection produce only single-electron energy expectation values of the general form

$\begin{displaymath} \fbox{$\displaystyle E^{\rm e}_n \equiv {\left\langle\vpha... ...vert\pe n////\vphantom{\pe n////h^{\rm e}}\right\rangle} $} % \end{displaymath}$

(9.23)

If you had only one single electron, and it was in the spatial single-particle state $\pe{n}/{\skew0\vec r}///$ , the above inner product would be its energy.

The combined single-electron energy for all electrons is then

$\begin{displaymath} \sum_{n=1}^I E^{\rm e}_n \end{displaymath}$

It is just as if you had electron 1 in state $\pe1////$ , electron 2 in state $\pe2////$ , etcetera. Of course, that is not really true. Antisymmetrization requires that all electrons are partly in all states. Indeed, if you look a bit closer at the math, you see that each of the

electrons contributes an equal fraction 1 $\raisebox{.5pt}{$/$}$

to each of the

terms above. But it does not make a real difference. Without electron-electron interactions, quantum mechanics would be so much easier!

But the repulsions are there. The Hamiltonians of the repulsions turn out to produce six-dimensional spatial inner products of two types. The inner products of the first type are called “Coulomb integrals:”

$\begin{displaymath} \fbox{$\displaystyle J_{n{\underline n}} \equiv {\left\lan... ...{\pe n////\pe{\underline n}////v^{\rm ee}}\right\rangle} $} % \end{displaymath}$

(9.24)

To understand the Coulomb integrals better, the inner product above can be written out explicitly as an integral, while also expanding $v^{\rm ee}$ :

$\begin{displaymath} \int_{{\rm all}\;{\skew0\vec r}} \int_{{\rm all}\;{\underli... ...{\,\rm d}^3{\skew0\vec r}{\,\rm d}^3{\underline{\skew0\vec r}} \end{displaymath}$

The integrand equals the probability of an electron in state $\pe{n}////$ to be found near a position ${\skew0\vec r}$ , times the probability of an electron in state $\pe{\underline n}////$ to be found near a position ${\underline{\skew0\vec r}}$ , times the Coulomb repulsion energy if the two electrons are at those positions. In short, $J_{n{\underline n}}$ is the expectation value of the Coulomb repulsion potential between an electron in state $\pe{n}////$ and one in state $\pe{\underline n}////$ . Thinking again of electron 1 in state $\pe1////$ , electron 2 in state $\pe2////$ , etcetera, the total Coulomb repulsion energy would be

$\begin{displaymath} \sum_{n=1}^I \sum_{{\underline n}>n}^I J_{n{\underline n}} \end{displaymath}$

which is indeed the correct combined sum of the Coulomb integrals.

Unfortunately, that is not the complete story for the repulsion energy. Recall that there are different ways in which you can distribute the electrons over the single particle states. And the antisymmetrization requirement requires that the system wave function is an equal combination of all these different possibilities. In terms of classical physics, it might still seem that this should make no difference: if any one of these possibilities is true, then the others must be untrue. But quantum mechanics allows states in which the electrons are distributed in one way to interact with states in which they are distributed in another way. That produces the so-called “exchange integrals:”

$\begin{displaymath} \fbox{$\displaystyle K_{n{\underline n}} \equiv {\left\lan... ...{\pe n////\pe{\underline n}////v^{\rm ee}}\right\rangle} $} % \end{displaymath}$

(9.25)

Written out explicitly, that equals

$\begin{displaymath} \int_{{\rm all}\;{\skew0\vec r}} \int_{{\rm all}\;{\underli... ...{\,\rm d}^3{\skew0\vec r}{\,\rm d}^3{\underline{\skew0\vec r}} \end{displaymath}$

It is an interaction of the possibility that the first electron is in state $\pe{n}////$ and the second in state $\pe{\underline n}////$ with the possibility that the second electron is in state $\pe{n}////$ and the first in state $\pe{\underline n}////$ . This book likes to call terms like this twilight terms, since in terms of classical physics they do not make sense.

It may be noted that a single Hartree product satisfying the Pauli exclusion principle would not produce exchange integrals; in such a wave function, there is no possibility for an electron to be in another state. But don't start thinking that the exchange integrals are there just because the wave function must be antisymmetric under electron exchange. They, and others, would show up in any reasonably general wave function. You can think of the exchange integrals instead as Coulomb integrals with the electrons in the right hand side of the inner product exchanged.

Adding it all up, the expectation energy of the complete system of electrons can be written as

$\begin{displaymath} \fbox{$\displaystyle \left\langle{E}\right\rangle = \sum_... ...rrow}_{\underline n}\right\rangle ^2 K_{n{\underline n}} $} % \end{displaymath}$

(9.26)

Note that the above expression sums over all values of ${\underline n}$ , not just ${\underline n}>n$ . That counts each pair of single-electron wave functions twice, so factors one-half have been added to compensate. It also adds terms in which ${\underline n}=n$ , both electrons in the same state, which is not allowed by the Pauli principle. But since $J_{nn}=K_{nn}$ , these additional terms cancel each other.

Note also the spin inner products multiplying the exchange terms. These are zero if the two states have opposite spin, so there are no exchange contributions between electrons in spin orbitals of opposite spins. And if the spin orbitals have the same spin, the spin inner product is 1, so the square is somewhat superfluous.

There are also some a priori things you can say about the Coulomb and exchange integrals, {D.53}; they are real, and additionally

$\begin{displaymath} J_{nn} = K_{nn} \qquad J_{n{\underline n}} = J_{{\underli... ...}K_{n{\underline n}} \mathrel{\raisebox{-1pt}{$\geqslant$}}0 % \end{displaymath}$

(9.27)

Note in particular that since the $K_{n{\underline n}}$ terms are positive, they lower the net expectation energy of the system. So a wave function consisting of a single Hartree product, which produces no exchange terms, cannot be the state of lowest energy. Even without the antisymmetrization requirement, you would need Hartree products with the electrons exchanged, simply to lower the energy.

It is actually somewhat tricky to prove that the $K_{n{\underline n}}$ terms are positive and so lower the energy, {D.53}. But there is a simple physical reason why you might guess that an antisymmetric wave function would lower the electron-electron repulsion energy compared to the individual Hartree products from which it is made up. In particular, the Coulomb repulsion between electrons becomes very large when they get close together. But for an anti-symmetric wave function, unlike for a single Hartree product, the relative probability of electrons of the same spin getting close together is vanishingly small. That prevents any strong Coulomb repulsion between electrons of the same spin.

(Recall that the relative probability for electrons to be at given positions and spins is given by the square magnitude of the wave function at those positions and spins. Now an antisymmetric wave function must be zero wherever any two electrons are at the same position with the same spin, making this impossible. After all, if you swap the two electrons, the antisymmetric wave function must change sign. But since neither electron changes position nor spin, the wave function cannot change either. Something can only change sign and stay the same if it is zero. See also {A.34}.)

The analysis given in this subsection can easily be extended to generalized orbitals that take the form

$\begin{displaymath} \pp n/{\skew0\vec r}//z/ = \pe n+/{\skew0\vec r}/u/z/+\pe n-/{\skew0\vec r}/d/z/. \end{displaymath}$

However, the normal unrestricted spin-up or spin-down orbitals, in which either $\pe{n+}////$ or $\pe{n-}////$ is zero, already satisfy the variational requirement $\delta\left\langle{E}\right\rangle$ $\vphantom0\raisebox{1.5pt}{$=$}$ 0 even if generalized variations in the orbitals are allowed, {N.17}.

In any case, the expectation value of energy has been found.

9.3.4 The canonical Hartree-Fock equations

The previous subsection found the expectation value of energy for any electron wave function described by a single Slater determinant. The final step is to find the orbitals that produce the best approximation of the true wave function using such a single determinant. For the ground state, the best single determinant would be the one with the lowest expectation value of energy. But surely you would not want to guess spatial orbitals at random until you find some with really, really, low energy.

What you would like to have is specific equations for the best spatial orbitals that you can then solve in a methodical way. And you can have them using the methods of section 9.1, {D.54}. In unrestricted Hartree-Fock, for every spatial orbital $\pe{n}/{\skew0\vec r}///$ there is an equation of the form:

$\begin{displaymath} \fbox{$\displaystyle \begin{array}[b]{l} \displaystyle h... ... r}/// = \epsilon_n \pe n/{\skew0\vec r}/// \end{array} $} % \end{displaymath}$

(9.28)

These are called the canonical Hartree-Fock equations. For equations valid for the restricted closed-shell and single-determinant open-shell approximations, see the derivation in {D.54}.

Recall that $h^{\rm e}$ is the single-electron Hamiltonian consisting of the electron's kinetic energy and potential energy due to nuclear attractions, and that $v^{\rm ee}$ is the potential energy of repulsion between the electron and another at a position ${\underline{\skew0\vec r}}$ :

$\begin{displaymath} h^{\rm e}= - \frac{\hbar^2}{2m_{\rm e}}\nabla^2 - \sum_{j=... ...e r}\equiv\vert{\skew0\vec r}- {\underline{\skew0\vec r}}\vert \end{displaymath}$

So, if there were no electron-electron repulsions, i.e. $v^{\rm ee}$ $\vphantom0\raisebox{1.5pt}{$=$}$ 0, the canonical equations above would be single-electron Hamiltonian eigenvalue problems of the form $h^{\rm e}\pe{n}////$ $\vphantom0\raisebox{1.5pt}{$=$}$ $\epsilon_n\pe{n}////$ where $\epsilon_n$ would be the energy of the single-electron orbital. This is really what happened in the approximate analysis of atoms in chapter 5.9: the electron to electron repulsions were ignored there in favor of nuclear strength reductions, and the result was single-electron hydrogen-atom orbitals.

In the presence of electron to electron repulsions, the equations for the orbitals can still symbolically be written as if they were single-electron eigenvalue problems,

$\begin{displaymath} {\cal F}\pe n/{\skew0\vec r}/b/z/ = \epsilon_n\pe n/{\skew0\vec r}/b/z/ \end{displaymath}$

where ${\cal F}$ is called the “Fock operator,” and is written out further as:

$\begin{displaymath} {\cal F}= h^{\rm e}+ v^{\rm HF}. \end{displaymath}$

The first term in the Fock operator is the single-electron Hamiltonian. The mischief is in the innocuous-looking second term $v^{\rm {HF}}$ . Supposedly, this is the potential energy related to the repulsion by the other electrons. What is it? Well, it will have to be the terms in the canonical equations (9.28) not described by the single-electron Hamiltonian $h^{\rm e}$ :

$\begin{eqnarray*} v^{\rm HF} \pe/{\skew0\vec r}/b/z/ & = & \sum_{{\underline n}... .../v^{\rm ee}}\right\rangle} \pe{\underline n}/{\skew0\vec r}/b/z/ \end{eqnarray*}$

The definition of the Fock operator is unavoidably in terms of spin rather than just spatial orbitals: the spin of the state on which it operates must be known to evaluate the final term.

Note that the above expression did not give an expression for $v^{\rm {HF}}$ by itself, but only for $v^{\rm {HF}}$ applied to an arbitrary single-electron function $\pe//b//$ . The reason is that $v^{\rm {HF}}$ is not a normal potential at all: the second term, the one due to the exchange integrals, does not multiply $\pe//b//$ by a potential function, it shoves it into an inner product! The Hartree-Fock potential $v^{\rm {HF}}$ is an operator, not a normal potential energy. Given a single-electron function including spin, it produces another single-electron function including spin.

Actually, even that is not quite true. The Hartree-Fock potential is only an operator after you have found the orbitals $\pe1//b//$ , $\pe2//b//$ , ..., $\pe{\underline n}//b//$ , ..., $\pe{I}//b//$ appearing in it. While you are still trying to find them, the Fock operator is not even an operator, it is just a thing. However, given the orbitals, at least the Fock operator is a Hermitian one, one that can be taken to the other side if it appears in an inner product, and that has real eigenvalues and a complete set of eigenfunctions, {D.55}.

So how do you solve the canonical Hartree-Fock equations for the orbitals $\pe{n}////$ ? If the Hartree-Fock potential $v^{\rm {HF}}$ was a known operator, you would have only linear, single-electron eigenvalue problems to solve. That would be relatively easy, as far as those things come. But since the operator $v^{\rm {HF}}$ contains the unknown orbitals, you do not have a linear problem at all; it is a system of coupled cubic equations in infinitely many unknowns. The usual way to solve it is iteratively: you guess an approximate form of the orbitals and plug it into the Hartree-Fock potential. With this guessed potential, the orbitals may then be found from solving linear eigenvalue problems. If all goes well, the obtained orbitals, though not perfect, will at least be better than the ones that you guessed at random. So plug those improved orbitals into the Hartree-Fock potential and solve the eigenvalue problems again. Still better orbitals should result. Keep going until you get the correct solution to within acceptable accuracy.

You will know when you have got the correct solution since the Hartree-Fock potential will no longer change; the potential that you used to compute the final set of orbitals is really the potential that those final orbitals produce. In other words, the final Hartree-Fock potential that you compute is consistent with the final orbitals. Since the potential would be a field if it was not an operator, that explains why such an iterative method to compute the Hartree-Fock solution is called a “self-consistent field method.” It is like calling an iterative scheme for the Laplace equation on a mesh a “self-consistent neighbors method,” instead of point relaxation. Surely the equivalent for Hartree-Fock, like “iterated potential” or potential relaxation would have been much clearer to a general audience?

9.3.5 Additional points

This brief section was not by any means a tutorial of the Hartree-Fock method. The purpose was only to explain the basic ideas in terms of the notations and coverage of this book. If you actually want to apply the method, you will need to take up a book written by experts who know what they are talking about. The book by Szabo and Ostlund [46] was the main reference for this section, and is recommended as a well written introduction. Below are some additional concepts you may want to be aware of.

9.3.5.1 Meaning of the orbital energies

In the single electron case, the orbital energy $\epsilon_n$ in the canonical Hartree-Fock equation

$\begin{displaymath} \begin{array}[b]{l} \displaystyle h^{\rm e}\pe n/{\skew0\... ...w0\vec r}/// = \epsilon_n \pe n/{\skew0\vec r}/// \end{array}\end{displaymath}$

represents the actual energy of the electron. It also represents the ionization energy, the energy required to take the electron away from the nuclei and leave it far away at rest. This subsubsection will show that in the multiple electron case, the “orbital energies” $\epsilon_n$ are not orbital energies in the sense of giving the contributions of the orbitals to the total expectation energy. However, they can still be taken to be approximate ionization energies. This result is known as “Koopman’s theorem.”

To verify the theorem, a suitable equation for $\epsilon_n$ is needed. It can be found by taking an inner product of the canonical equation above with $\pe{n}/{\skew0\vec r}///$ , i.e. by putting $\pe{n}/{\skew0\vec r}///^*$ to the left of both sides and integrating over ${\skew0\vec r}$ . That produces

$\begin{displaymath} \epsilon_n = E^{\rm e}_n + \sum_{{\underline n}=1}^I J_{n{... ...ownarrow}_{\underline n}\right\rangle ^2 K_{n{\underline n}} % \end{displaymath}$

(9.29)

which consists of the single-electron energy $E^{\rm e}_n$ , Coulomb integrals $J_{n{\underline n}}$ and exchange integrals $K_{n{\underline n}}$ as defined in subsection 9.3.3. It can already be seen that if all the $\epsilon_n$ are summed together, it does not produce the total expectation energy (9.26), because that one includes a factor ${\textstyle\frac{1}{2}}$ in front of the Coulomb and exchange integrals. So, $\epsilon_n$ cannot be seen as the part of the system energy associated with orbital $\pe{n}//b//$ in any meaningful sense.

However, $\epsilon_n$ can still be viewed as an approximate ionization energy. Assume that the electron is removed from orbital $\pe{n}//b//$ , leaving the electron at infinite distance at rest. No, scratch that; all electrons share orbital $\pe{n}//b//$ , not just one. Assume that one electron is removed from the system and that the remaining electrons stay out of the orbital $\pe{n}//b//$ . Then, if it is assumed that the other orbitals do not change, the new system’s Slater determinant is the same as the original system’s, except that column and a row have been removed. The expectation energy of the new state then equals the original expectation energy, except that $E^{\rm e}_n$ and the -th column plus the -th row of the Coulomb and exchange integral matrices have been removed. The energy removed is then exactly $\epsilon_n$ above. (While $\epsilon_n$ only involves the -th row of the matrices, not the -th column, it does not have the factor $\frac12$ in front of them like the expectation energy does. And rows equal columns in the matrices, so half the row in $\epsilon_n$ counts as the half column in the expectation energy and the other half as the half row. This counts the element ${\underline n}$ $\vphantom0\raisebox{1.5pt}{$=$}$ twice, but that is zero anyway since $J_{nn}$ $\vphantom0\raisebox{1.5pt}{$=$}$ $K_{nn}$ .)

So by the removal of the electron from (read: and) orbital $\pe{n}//b//$ , an amount of energy $\epsilon_n$ has been removed from the expectation energy. Better put, a positive amount of energy $-\epsilon_n$ has been added to the expectation energy. So the ionization energy is $-\epsilon_n$ if the electron is removed from orbital $\pe{n}//b//$ according to this story.

Of course, the assumption that the other orbitals do not change after the removal of one electron and orbital is dubious. If you were a lithium electron in the expansive 2s state, and someone removed one of the two inner 1s electrons, would you not want to snuggle up a lot more closely to the now much less shielded three-proton nucleus? On the other hand, in the more likely case that someone removed the 2s electron, it would probably not seem like that much of an event to the remaining two 1s electrons near the nucleus, and the assumption that the orbitals do not change would appear more reasonable. And normally, when you say ionization energy, you are talking about removing the electron from the highest energy state.

But still, you should really recompute the remaining two orbitals from the canonical Hartree-Fock equations for a two-electron system to get the best, lowest, energy for the new electron ground state. The energy you get by not doing so and just sticking with the original orbitals will be too high. Which means that all else being the same, the ionization energy will be too high too.

However, there is another error of importance here, the error in the Hartree-Fock approximation itself. If the original and final system would have the same Hartree-Fock error, then it would not make a difference and $\epsilon_n$ would overestimate the ionization energy as described above. But Szabo and Ostlund [46, p. 128] note that Hartree-Fock tends to overestimate the energy for the original larger system more than for the final smaller one. The difference in Hartree-Fock error tends to compensate for the error you make by not recomputing the final orbitals, and in general the orbital energies provide reasonable first approximations to the experimental ionization energies.

The opposite of ionization energy is “electron affinity,” the energy with which the atom or molecule will bind an additional free electron [in its valence shell], {N.19}. It is not to be confused with electronegativity, which has to do with willingness to take on electrons in chemical bonds, rather than free electrons.

To compute the electron affinity of an atom or molecule with electrons using the Hartree-Fock method, you can either recompute the orbitals with the additional electron from scratch, or much easier, just use the Fock operator of the electrons to compute one more orbital $\pe{I+1}//b//$ . In the later case however, the energy of the final system will again be higher than Hartree-Fock, and it being the larger system, the Hartree-Fock energy will be too high compared to the -electron system already. So now the errors add up, instead of subtract as in the ionization case. If the final energy is too high, then the computed binding energy will be too low, so you would expect $\epsilon_{I+1}$ to underestimate the electron affinity relatively badly. That is especially so since affinities tend to be relatively small compared to ionization energies. Indeed Szabo and Ostlund [46, p. 128] note that while many neutral molecules will take up and bind a free electron, producing a stable negative ion, the orbital energies almost always predict negative binding energy, hence no stable ion.

9.3.5.2 Asymptotic behavior

The exchange terms in the Hartree-Fock potential are not really a potential, but an operator. It turns out that this makes a major difference in how the probability of finding an electron decays with distance from the system.

Consider again the Fock eigenvalue problem, but with the single-electron Hamiltonian identified in terms of kinetic energy and nuclear attraction,

$\begin{displaymath} \begin{array}[b]{l} \displaystyle - \frac{\hbar^2}{2m_{\r... ...w0\vec r}/// = \epsilon_n \pe n/{\skew0\vec r}/// \end{array}\end{displaymath}$

Now consider the question which of these terms dominate at large distance from the system and therefore determine the large-distance behavior of the solution.

The first term that can be thrown out is $v^{\rm Ne}$ , the Coulomb potential due to the nuclei; this potential decays to zero approximately inversely proportional to the distance from the system. (At large distance from the system, the distances between the nuclei can be ignored, and the potential is then approximately the one of a single point charge with the combined nuclear strengths.) Since $\epsilon_n$ in the right hand side does not decay to zero, the nuclear term cannot survive compared to it.

Similarly the third term, the Coulomb part of the Hartree-Fock potential, cannot survive since it too is a Coulomb potential, just with a charge distribution given by the orbitals in the inner product.

However, the final term in the left hand side, the exchange part of the Hartree-Fock potential, is more tricky, because the various parts of this sum have other orbitals outside of the inner product. This term can still be ignored for the slowest-decaying spin-up and spin-down states, because for them none of the other orbitals is any larger, and the multiplying inner product still decays like a Coulomb potential (faster, actually). Under these conditions the kinetic energy will have to match the right hand side, implying

$\begin{displaymath} \mbox{slowest decaying orbitals: } \pe n/{\skew0\vec r}/// \sim \exp(-\sqrt{-2m_{\rm e}\epsilon_n}r/\hbar + \ldots) \end{displaymath}$

From this expression, it can also be seen that the $\epsilon_n$ values must be negative, or else the slowest decaying orbitals would not have the exponential decay with distance of a bound state.

The other orbitals, however, cannot be less than the slowest decaying one of the same spin by more than algebraic factors: the slowest decaying orbital with the same spin appears in the exchange term sum and will have to be matched. So, with the exchange terms included, all orbitals normally decay slowly, raising the chances of finding electrons at significant distances. The decay can be written as

$\begin{displaymath} \pe n/{\skew0\vec r}/// \sim \exp(-\sqrt{2m_{\rm e}\vert\e... ...on_m\vert _{\rm min,\ same\ spin,\ no\ ss}}r /\hbar + \ldots) \end{displaymath}$

(9.30)

where $\epsilon_m$ is the $\epsilon$ value of smallest magnitude (absolute value) among all the orbitals with the same spin.

However, in the case that $\pe{n}/{\skew0\vec r}///$ is spherically symmetric, (i.e. an s state), exclude other s-states as possibilities for $\epsilon_m$ . The reason is a peculiarity of the Coulomb potential that makes the inner product appearing in the exchange term exponentially small at large distance for two orthogonal, spherically symmetric states. (For the incurably curious, it is a result of Maxwell’s first equation applied to a spherically symmetric configuration like figure 13.1, but with multiple spherically distributed charges rather than one, and the net charge being zero.)

9.3.5.3 Hartree-Fock limit

The Hartree-Fock approximation greatly simplifies finding a many-dimensional wave function. But really, solving the “eigenvalue problems” (9.28) for the orbitals iteratively is not that easy either. Typically, what one does is to write the orbitals $\pe{n}////$ as sums of chosen single-electron functions $f_1,f_2,\ldots$ . You can then precompute various integrals in terms of those functions. Of course, the number of chosen single-electron functions will have to be a lot more than the number of orbitals ; if you are only using chosen functions, it really means that you are choosing the orbitals $\pe{n}////$ rather than computing them.

But you do not want to choose too many functions either, because the required numerical effort will go up. So there will be an error involved; you will not get as close to the true best orbitals as you can. One thing this means is that the actual error in the ground state energy will be even larger than true Hartree-Fock would give. For that reason, the Hartree-Fock value of the ground state energy is called the Hartree-Fock limit: it is how close you could come to the correct energy if you were able to solve the Hartree-Fock equations exactly.

In short, to compute the Hartree-Fock solution accurately, you want to select a large number of single-electron functions to represent the orbitals. But don't start using zillions of them. The problem is that even the exact Hartree-Fock solution still has a finite error; a wave function cannot in general be described accurately using only a single Slater determinant. So what would the point in computing the very inaccurate numbers to ten digits accuracy?

9.3.5.4 Correlation energy

As the previous subsubsection noted, the Hartree-Fock solution, even if computed exactly, will still have a finite error. You might think that this error would be called something like “Hartree-Fock error.” Or maybe “representation error“ or single-determinant error, since it is due to an incomplete representation of the true wave function using a single Slater determinant.

However, the Hartree-Fock error in energy is called “correlation energy.” The reason is because there is a energizing correlation between the more impenetrable and poorly defined your jargon, and the more respect you will get for doing all that incomprehensible stuff.

And of course the word error should never be used in the first place, God forbid. Or those hated non-experts might figure out that Hartree-Fock has an error in energy so big that it makes the base approximation pretty much useless for chemistry.

To understand what physicists are referring to with correlation, reconsider the form of the Hartree-Fock wave function, as described in subsection 9.3.1. It consisted of a single Slater determinant. However, that Slater determinant in turn consisted of a lot of Hartree products, the first of which was

$\begin{displaymath} \pe1/{\skew0\vec r}_1/b/z1/ \pe2/{\skew0\vec r}_2/b/z2/ \pe3/{\skew0\vec r}_3/b/z3/ \ldots \pe I/{\skew0\vec r}_I/b/zI/ \end{displaymath}$

The other Hartree products were different only in the order in which the electrons appear in the product. And since the electrons are all the same, the order does not make a difference: each of these Hartree products has the same expectation energy. Each also satisfies the Pauli exclusion principle but, by itself, not the antisymmetrization requirement.

Now, consider what the Born statistical interpretation says about the single Hartree product above. It says that the probability of electron 1 to be within a vicinity of volume ${\rm d}^3{\skew0\vec r}_1$ around a given position ${\skew0\vec r}_1$ with given spin $S_{z1}$ , and electron 2 to be within a vicinity of volume ${\rm d}^3{\skew0\vec r}_2$ around a given position ${\skew0\vec r}_2$ with given spin $S_{z2}$ , etcetera, is given by

$\begin{eqnarray*} && \left\vert\pe1/{\skew0\vec r}_1/b/z1/\right\vert^2 {\rm d}... ...t^2 {\rm d}^3{\skew0\vec r}_3 \\ && \quad \quad \quad{} \ldots \end{eqnarray*}$

This takes the form of a probability for electron 1 to be in the given state that is independent of where the other electrons are, times a probability for electron 2 to be in the given state that is independent of where the other electrons are, etcetera. In short, in a single Hartree product the electrons do not care where the other electrons are. Their positions are uncorrelated.

Uncorrelated positions would be OK if the electrons did not repel each other. In that case, each electron would indeed not care where the other electrons are. Then all Hartree products would have the same energy, which would also be the energy of the complete Slater determinant.

But electrons do repel each other. So, if electron 1 is at a given position ${\skew0\vec r}_1$ , electron 2 can reduce its potential energy by preferring positions farther away from that position. It cannot overdo it, as that will increase its kinetic energy too much, but there is some room for improvement. So the exact wave function will have correlations between the positions of different electrons. Based on arguments like that, physicists then come up with the term correlation energy.

Not so fast, physicists! For one, a Slater determinant is not a single Hartree product but already includes some electron correlations. Also, correlation energy is not the same as “error in energy caused by incorrect correlations.” And “error in energy caused by incorrect correlations” is not the same as “error in energy for an incorrect solution, including incorrect correlations.” And the last is what the Hartree-Fock error really is. Note that while there is some rough qualitative relation between potential energy and electron position correlations, you cannot find the potential energy by pontificating about electrons trying to stay away from each other. And the correct energy state is found by delicately balancing subtle reductions in potential energy against subtle increases in kinetic energy. The kinetic energy does not even care about electron correlations. However, the kinetic energy is wrong too when applied on a single Slater determinant.

See note {N.18} for more.

9.3.5.5 Configuration interaction

Since the base Hartree-Fock approximation has an error that is far too big for typical chemistry applications, the next question is what can be done about it. The basic answer is simple: use more that orbitals, i.e. single-particle wave functions. As already noted in section 5.7, if you include enough orthonormal basis functions, using all their possible Slater determinants, you can approximate any function to arbitrary accuracy.

After the , (or $\raisebox{.5pt}{$/$}$ 2 in the restricted closed-shell case,) spatial orbitals have been found, the Hartree-Fock operator becomes just a Hermitian operator, and can be used to compute further orthonormal orbitals $\pe{I+1}//b//,\pe{I+2}//b//,\ldots$ . You can add these to the mix, say to get a better approximation to the true ground state wave function of the system.

You might want to try to start small. If you include just one more orbital $\pe{I+1}//b//$ , you can already form more Slater determinants: you can replace any of the orbitals in the original determinant by the new function $\pe{I+1}//b//$ . So you can now approximate the true wave function by the more general expression

$\begin{eqnarray*} \Psi &=& a_0 \Big({\left\vert{\rm det}\;\pe1//b//,\pe2//b//,... ...b//,\pe2//b//,\pe3//b//,\ldots,\pe I+1//b//\right\rangle} \Big) \end{eqnarray*}$

where the coefficients $a_1,a_2,\ldots$ are to be chosen to approximate the ground state energy more closely and

is a normalization constant.

The additional Slater determinants are called “excited determinants”. For example, the first excited state

$\begin{displaymath} {\left\vert{\rm det}\;\pe I+1//b//,\pe2//b//,\pe3//b//,\ldots,\pe I//b//\right\rangle} \end{displaymath}$

is like a state where you excited an electron out of the lowest state $\pe1//b//$ into an elevated energy state $\pe{I+1}//b//$ .

(However, note that if you really wanted to satisfy the variational requirement $\delta\left\langle{E}\right\rangle$ $\vphantom0\raisebox{1.5pt}{$=$}$ 0 for such a state, you would have to recompute the orbitals from scratch, using $\pe{I+1}//b//$ in the Fock operator instead of $\pe1//b//$ . That is not what you want to do here; you do not want to create totally new orbitals, just more of them.)

It may seem that this must be a winner: as much as more determinants to further minimize the energy. Unfortunately, now you pay the price for doing such a great job with the single determinant. Since, hopefully, the Slater determinant is the best single determinant that can be formed, any changes that are equivalent to simply changing the determinant's orbitals will do no good. And it turns out that the -determinant wave function above is equivalent to the single-determinant wave function

$\begin{displaymath} \Psi = a_0 {\left\vert{\rm det}\;\pe1//b//+a_1\pe{I+1}//b//... ...{I+1}//b//, \ldots,\pe{I}//b//+a_I\pe{I+1}//b//\right\rangle} \end{displaymath}$

as you can check with some knowledge of the properties of determinants. Since you already have the best single determinant, all your efforts are going to be wasted if you try this.

You might try forming another set of excited determinants by replacing one of the orbitals in the original Hartree-Fock determinant by $\pe{I+2}//b//$ instead of $\pe{I+1}//b//$ , but the fact is that the infinitesimal variational condition $\delta\left\langle{E}\right\rangle$ $\vphantom0\raisebox{1.5pt}{$=$}$ 0 is still going to be satisfied when the wave function is the original Hartree-Fock one. For small changes in wave function, the additional determinants can still be pushed inside the Hartree-Fock one. To ensure a decrease in energy, you want to include determinants that allow a nonzero decrease in energy even for small changes from the original determinant, and that requires doubly excited determinants, in which two different original states are replaced by excited ones like $\pe{I+1}//b//$ and $\pe{I+2}//b//$ .

Note that you can form such determinants; the number of determinants rapidly explodes when you include more and more orbitals. And a mathematically convergent process would require an asymptotically large set of orbitals, compare chapter 5.7. How big is your computer?

Most people would probably call improving the wave function representation using multiple Slater determinants something like multiple-determinant representation, or excited-determinant correction.. However, it is called configuration interaction. The reason is that every hated non-expert will wonder whether the physicist is talking about the configuration of the nuclei or the electrons, and what it is interacting with.

(Actually, configuration refers to the practitioner configuring all those determinants, no kidding. The interaction is with the computer used to do so. Suppose you were creating the numerical mesh for some finite difference or finite element computation. If you called that configuration interaction instead of “mesh generation,” because it required you to configure all those mesh points through interacting with your computer, some people might doubt your sanity. But in physics, the standards are not so high.)