Time-Independent Perturbation Theory
Most useful Hamiltonians cannot be solved exactly. Time-independent perturbation theory gives controlled corrections when the true Hamiltonian is close to a solvable one. It explains fine structure, Zeeman splitting, Stark shifts, anharmonic oscillators, and many effective models.
Sakurai presents nondegenerate and degenerate perturbation theory, then applies it to hydrogenlike atoms, fine structure, and electromagnetic-field interactions. Ballentine gives a careful stationary-state perturbation chapter with variational comparison. The Gottfried-named notes emphasize Rayleigh-Schrodinger expansions and hydrogen in fields. Schiff's classic treatment is often more coordinate-based and calculation-heavy.
Definitions
Write the Hamiltonian as
where is exactly solvable and is a bookkeeping parameter set to at the end.
The unperturbed eigenproblem is
For a nondegenerate level, expand
and
The common normalization convention is intermediate normalization:
which implies
For a degenerate subspace, one must first diagonalize inside that subspace. Ordinary nondegenerate formulas fail because denominators become zero.
Key results
For a nondegenerate level,
The first-order state correction is
The second-order energy correction is
For a degenerate level with basis inside the degenerate subspace, build the perturbation matrix
Its eigenvalues are the first-order energy shifts, and its eigenvectors give the correct zeroth-order linear combinations.
In the linear Stark effect, a hydrogen perturbation has the form
up to charge-sign convention. Degeneracy within the shell allows a first-order shift. In contrast, the nondegenerate ground state has no first-order Stark shift by parity because
In a weak magnetic field, Zeeman shifts are controlled by magnetic moments and angular momentum projections. The simplest orbital form is
with spin and fine-structure refinements depending on coupling regime. Sakurai's treatment makes the angular-momentum basis choice explicit.
Visual
| Case | First action | First-order shift | Main danger |
|---|---|---|---|
| Nondegenerate | compute diagonal matrix element | small denominators | |
| Degenerate | diagonalize in subspace | eigenvalues of | using wrong basis |
| Parity-odd perturbation on even state | check symmetry | often zero | missing second-order effect |
| Magnetic splitting | choose angular-momentum basis | projection-dependent | mixing weak/strong field regimes |
Worked example 1: Two-level nondegenerate perturbation
Problem. Let
with . Find the first nonzero correction to the lower energy.
Method.
- The lower unperturbed state is
- The first-order shift is
- The only other state is
- The coupling matrix element is
- The second-order shift is
Checked answer. The lower level moves downward. Exact diagonalization gives for small .
Worked example 2: Degenerate perturbation in a two-state subspace
Problem. Two unperturbed states have the same energy . In this subspace,
Find the first-order corrected energies and states.
Method.
-
Nondegenerate perturbation theory is invalid because .
-
Diagonalize the perturbation matrix:
- The eigenvalues are
- The normalized eigenvectors are
- Therefore the first-order energies are
Checked answer. The perturbation splits the degeneracy linearly, and the correct starting states are symmetric and antisymmetric combinations.
Code
import numpy as np
delta = 10.0
v = 0.5
h = np.array([[0, v], [v, delta]], dtype=float)
exact = np.linalg.eigvalsh(h)[0]
perturbative = -v**2 / delta
print("exact lower energy:", exact)
print("second-order estimate:", perturbative)
print("relative error:", abs((exact - perturbative) / exact))
Common pitfalls
- Using nondegenerate formulas inside a degenerate subspace. Zero denominators are not a minor algebra issue; the basis is wrong.
- Forgetting symmetry selection rules. Parity can make matrix elements vanish before integration.
- Trusting perturbation theory near level crossings or small denominators.
- Ignoring normalization conventions for corrected states.
- Calling a first-order zero shift "no effect." The leading effect may be second order.
- Mixing exact eigenstates of the full Hamiltonian with unperturbed states inside matrix elements.
- Applying weak-field Zeeman formulas in regimes where spin-orbit coupling is no longer the dominant internal interaction.
Perturbation theory begins with a modeling decision: what is , and what is small? The best is not always the most obvious algebraic split; it should capture the dominant physics and leave a perturbation whose matrix elements are small compared with relevant level spacings. If the perturbation connects nearly degenerate states, the expansion parameter is effectively large even when the operator looks small. This is why level spacing appears in denominators.
Symmetry should be used before summing states. If is odd under parity, its diagonal matrix element in a parity eigenstate vanishes. If is rotationally invariant, it cannot mix states with different total angular momentum labels in the usual way. Selection rules reduce both computation and error. Sakurai's angular-momentum-first structure pays off here because perturbation theory in atoms is often mostly a problem of choosing the right coupled basis.
Degenerate perturbation theory is best understood as exact diagonalization of the perturbation inside a small subspace. The perturbation chooses the linear combinations that nature uses as the correct zeroth-order states. Once that diagonalization is done, corrections from states outside the degenerate subspace can be treated with formulas resembling the nondegenerate case. Skipping this first step leads to infinite denominators and physically meaningless "corrections."
Always compare perturbative results with limiting behavior. A repulsive positive perturbation should raise an energy if the diagonal expectation is positive. Coupling to a higher level usually pushes a lower level downward at second order, as in level repulsion. A formula that becomes larger as the perturbation becomes smaller is signaling a denominator or degeneracy problem. Ballentine's careful stationary-state treatment is useful because it keeps these assumptions visible.
Perturbative state corrections are just as important as energy corrections when computing observables. A first-order energy shift may vanish, while a first-order change in the state produces a nonzero expectation value for another operator. Electric polarizability is the standard example: the first-order Stark shift of a nondegenerate parity eigenstate can vanish, but the field mixes opposite-parity states into the wave function and produces a second-order energy shift and induced dipole response.
In matrix problems, exact diagonalization of a small model is an excellent check. Expand the exact eigenvalues in powers of the small parameter and compare term by term with perturbation theory. This confirms signs, denominators, and degeneracy handling. Sakurai frequently uses compact operator formulas, but small matrices expose the same physics without hidden sums over infinite bases.
Finally, perturbation theory is an asymptotic tool, not always a convergent one. Adding more terms does not guarantee improvement outside the method's domain. Physical judgment about small parameters, symmetries, and nearby levels matters as much as algebraic fluency.
For operator perturbations in infinite-dimensional spaces, convergence of sums over intermediate states is another hidden assumption. Selection rules, closure relations, or sum rules can simplify these expressions, but they do not remove the need to check whether the perturbation and states are physically admissible. Formal sums that diverge signal missing physics, an invalid approximation, or the need for renormalized parameters in more advanced settings.
This is why perturbative answers should always be paired with stated small parameters and validity conditions.
State those assumptions explicitly before trusting numerical agreement.