We are going to use the concept of exchange energy to demonstrate why two electrons interacting via their Coulomb interaction prefer to have their spins parallel rather than anti-parallel, assuming all other factors are equal.
This preference arises from the quantum mechanical principle that the total wavefunction of two indistinguishable fermions (such as electrons) must be antisymmetric with respect to the exchange of particles. This principle leads to the Pauli exclusion principle, which states that no two electrons can occupy the same quantum state simultaneously.
When two electrons have parallel spins (), the spatial part of their wavefunction must be anti-symmetric to satisfy the antisymmetry of the total wavefunction. An anti-symmetric spatial wavefunction reduces the overlap between the electron wavefunctions, which in turn reduces the Coulomb repulsion between them. This reduced repulsion lowers the total energy of the system, leading to a situation where:
The energy difference between these two configurations is known as the exchange energy. The exchange energy is a quantum mechanical effect with no classical analog, and it is a direct consequence of the antisymmetry requirement for fermions.
1. Stating the problem
The first excited state of a Helium atom is a straightforward example to consider because it only involves two electrons. We focus on the first excited state to compare the energy configurations of parallel versus anti-parallel spins when one electron is in the orbital and the other is in the orbital. By examining these configurations, we can determine which one has the lower energy.
In the ground state of the Helium atom, both electrons occupy the orbital. In this configuration, the Pauli exclusion principle requires the spins to be anti-parallel, so it is not possible to compare parallel and anti-parallel spin configurations in the ground state
The Hamiltonian for a system with two interacting electrons can be decomposed into the following terms:
where,
- represents the hamiltonian for electron 1, assuming it is the only electron present. It consists of the kinetic energy and the potential energydue to the Coulomb interaction between electron 1 and the nucleus. This is given by:
In this term, is the distance of electron 1 from the nucleus. - has the same form,but it describes the kinetic and potential energy for electron 2:
.
If the electrons did not interact with each other, the total Hamiltonian would simply be the sum of and . However, because the electrons do interact, there is an additional term; - represents the Coulomb interactions between the two electrons.
This term is positive because the interaction is repulsive (both electrons have the same negative charge). Here, is the distance between the two electrons.
We are interested in calculating the energy for the cases where the spins of the electrons are parallel and where they are anti-parallel.
2. Energy of the system
The Pauli exclusion principle states that any atomic or molecular orbital can accommodate a maximum of two electrons, and these electrons must have opposite spins. A more rigorous expression of the Pauli principle is that the total electronic wave function must be antisymmetric with respect to the exchange of any two electrons.
In our specific problem involving two electrons, the wave function must change sign when the electrons are swapped. The wave function is composed of a spatial part, dependent on the positions of electrons 1 and 2, and a spin part, dependent on the orientation of their spins. Thus, there are two configurations of the wave function for two electrons that adhere to the Pauli principle:
- A symmetric spin part with an antisymmetric spatial part.
- An antisymmetric spin part with a symmetric spatial part.
A symmetric spin part means that the spin part of the wave function remains unchanged when the spins of the two electrons are exchanged. This occurs when the electron spins are parallel. In contrast, an antisymmetric spin part means that swapping the spins of the two electrons results in the spin part of the wave function changing sign. This happens when the electron spins are antiparallel.
The spatial parts of the wave function (symmetric and antisymmetric) correspond to these different spin arrangements. In summary, when the spins are parallel, the spatial part of the wave function must be antisymmetric, and when the spins are antiparallel, the spatial part must be symmetric.
2.1 Symmetric spin part
For the spin part to be symmetric (), the spatial part of the wave function has to be anti-symmetric. The way that we can make an anti-symmetric wave function of electron 1 and electron 2, is to put electron 1 in the orbital, and electron 2 in the orbital (). Of course we can equally likely have electron 1 in the and electron 2 in the orbital (). We can write this in the following way:
where the root 2 is to normalize it. Furthermore, he minus sign in between is to make the wave function anti-symmetric. If I exchange electron 1 with electron 2, the wave function as a whole changes sign. You’d end up with:
2.2 Anti-symmetric spin part
For the anti-symmetric spin part (), we can write down the symmetric spatial part of the same form as follows:
Now we see a plus sign, so that means if I swap electron 1 with electron 2, this spatial part of the wave function remains unchanged. It is symmetric with respect to the exchange of electron 1 and 2.
3. Solving the expectation value
Given that we have two spatial wave functions, we can now work out the energy in the two cases ( versus ).
The energy can be calculated with the expectation value of the wave function, which is a function of electron 1 and electron 2, with the Hamiltonian that we wrote down before. Do remember that only operates on wave functions of electron 1, similarly for , while operates on the parts of both electron 1 and 2.
To go from the first to the second line, we implemented the form of the wave function from before, with plus for the symmetric case and minus for the anti-symmetric case of the spatial wave function. The next step will be to work them all out, which gives the lengthy equation shown below:
- In the first row we have a term with electron 1 and the orbital, and a second term with electron 1 and the orbital. What this describes is the energy of electron 1 in the orbital plus the energy of electron 1 in the orbital. We will call this .
- In the second row, it is similar, but now it is for electron 2. This is the energy of electron 2 in the orbital plus the energy of electron 2 in the orbital. We call this . Since the electrons are indistinguishable, it will be the same energy as for the first electron.
- In the first term (line 3), we have electron 1 in orbital and electron 2 in , with the Coulomb interaction operator. In the second term (line 4), we have electron 1 in the orbital and electron 2 in the orbital. We call both terms, . So far all the signs have been positive, which changes in the last term.
- here we notice the , where it is positive when we have a symmetric spatial form of the wave function and negative for the anti-symmetric spatial form. This is exactly what is going to change the relative energies of the two wave functions. In the first term (line 5) you see electron 1 in the orbital and electron 2 in the orbital, times the coulomb interaction, and we see that now electron 1 is in the orbital and electron 2 in the . That means we have exchanged the orbital that each electron is sitting in. Likewise in line 6, we have done the same, but where electron 1 starts in and electron 2 starts in . We call these terms
We can shorten the equation to look like:
where + is for the symmetric spatial part, thus anti-parallel spins (); and – is for the anti-symmetric spatial part, and parallel spins ().
3.1 Coulomb interaction
The expression for is
where represents the Coulomb interaction between two electrons, as described above. The Coulomb interaction integral can be written as:
The complex conjugate and the real part can be rewritten as the square modulus of the electronic wave function squared. We can further simplify this equation, by defining the charge densities:
The integral then further simplifies to:
This integral represents the total Coulomb interaction between the charge densities of electrons in the and orbitals. The positive sign correctly indicates that this is the sum of the Coulomb interactions from both possible configurations (electron 1 in the orbital and electron 2 in the orbital, plus electron 1 in the orbital and electron 2 in the orbital).
So, these terms in represent the quantum mechanical exact analog to the classical coulomb interaction between two charge distributions.
3.2 Exchange interaction
The expression for is:
In this expression, the interaction remains the Coulomb interaction between the two electrons, but with their spatial coordinates exchanged between the orbitals. The key point here is that this term cannot be expressed in terms of charge densities because the orbital of each electron on the left side of the inner product differs from the orbital on the right side. For example, in the first term, electron 1 is in the orbital on the left but in the orbital on the right. The same applies to electron 2.
This type of interaction is purely quantum mechanical and has no classical analog. It is known as the exchange interaction and arises due to the indistinguishable nature of electrons and the requirement that the total wavefunction (spatial part multiplied by the spin part) be antisymmetric under the exchange of two electrons.
4. The result
The exchange interaction plays a crucial role in determining the energy of different spin states:
Symmetric Spatial Wave Function (Anti-Parallel Spins, ):
When the spatial part of the wavefunction is symmetric, the spin part must be anti-symmetric, corresponding to anti-parallel spins. The energy associated with this configuration is higher because the symmetric spatial wavefunction increases the overlap between electron wavefunctions, leading to greater Coulomb repulsion.Anti-Symmetric Spatial Wave Function (Parallel Spins, ):
When the spatial part of the wavefunction is anti-symmetric, the spin part must be symmetric, corresponding to parallel spins. This configuration reduces the overlap between the electron wavefunctions, resulting in lower Coulomb repulsion and, consequently, a lower energy.
As a result, the energy of the symmetric spatial wavefunction (anti-parallel spins) is higher by an amount than that of the anti-symmetric spatial wavefunction (parallel spins). The arrangement of parallel spins (with a net magnetic moment) typically corresponds to a lower energy state because of the reduced repulsion.
As a side note, in Helium, the ground state is non-magnetic, as both electrons occupy the orbital and pair up with anti-parallel spins, forming a singlet state. This singlet state has the lowest energy due to the complete filling of the orbital and the minimized electron repulsion in this configuration.
WE’ve now seen the exchange interaction in a single atom, but it is also used to explain ferromagnetism. Read more on this topic on my article on the basics of ferromagnetism.