Skip to main content

Time-Independent Perturbation Theory

Most useful Hamiltonians cannot be solved exactly. Time-independent perturbation theory gives controlled corrections when the true Hamiltonian is close to a solvable one. It explains fine structure, Zeeman splitting, Stark shifts, anharmonic oscillators, and many effective models.

Sakurai presents nondegenerate and degenerate perturbation theory, then applies it to hydrogenlike atoms, fine structure, and electromagnetic-field interactions. Ballentine gives a careful stationary-state perturbation chapter with variational comparison. The Gottfried-named notes emphasize Rayleigh-Schrodinger expansions and hydrogen in fields. Schiff's classic treatment is often more coordinate-based and calculation-heavy.

Definitions

Write the Hamiltonian as

H=H0+λV,H=H_0+\lambda V,

where H0H_0 is exactly solvable and λ\lambda is a bookkeeping parameter set to 11 at the end.

The unperturbed eigenproblem is

H0n(0)=En(0)n(0).H_0|n^{(0)}\rangle=E_n^{(0)}|n^{(0)}\rangle.

For a nondegenerate level, expand

En=En(0)+λEn(1)+λ2En(2)+,E_n=E_n^{(0)}+\lambda E_n^{(1)}+\lambda^2E_n^{(2)}+\cdots,

and

n=n(0)+λn(1)+λ2n(2)+.|n\rangle=|n^{(0)}\rangle+\lambda|n^{(1)}\rangle+\lambda^2|n^{(2)}\rangle+\cdots.

The common normalization convention is intermediate normalization:

n(0)n=1,\langle n^{(0)}|n\rangle=1,

which implies

n(0)n(1)=0.\langle n^{(0)}|n^{(1)}\rangle=0.

For a degenerate subspace, one must first diagonalize VV inside that subspace. Ordinary nondegenerate formulas fail because denominators become zero.

Key results

For a nondegenerate level,

En(1)=n(0)Vn(0).E_n^{(1)}=\langle n^{(0)}|V|n^{(0)}\rangle.

The first-order state correction is

n(1)=knk(0)k(0)Vn(0)En(0)Ek(0).|n^{(1)}\rangle =\sum_{k\neq n} {|k^{(0)}\rangle\langle k^{(0)}|V|n^{(0)}\rangle \over E_n^{(0)}-E_k^{(0)}}.

The second-order energy correction is

En(2)=knk(0)Vn(0)2En(0)Ek(0).E_n^{(2)} =\sum_{k\neq n} {|\langle k^{(0)}|V|n^{(0)}\rangle|^2 \over E_n^{(0)}-E_k^{(0)}}.

For a degenerate level with basis α\vert \alpha\rangle inside the degenerate subspace, build the perturbation matrix

Vαβ=αVβ.V_{\alpha\beta}=\langle \alpha|V|\beta\rangle.

Its eigenvalues are the first-order energy shifts, and its eigenvectors give the correct zeroth-order linear combinations.

In the linear Stark effect, a hydrogen perturbation has the form

V=eEzV=e\mathcal E z

up to charge-sign convention. Degeneracy within the n=2n=2 shell allows a first-order shift. In contrast, the nondegenerate ground state has no first-order Stark shift by parity because

100z100=0.\langle 100|z|100\rangle=0.

In a weak magnetic field, Zeeman shifts are controlled by magnetic moments and angular momentum projections. The simplest orbital form is

ΔE=μBBm,\Delta E=\mu_B B m_\ell,

with spin and fine-structure refinements depending on coupling regime. Sakurai's treatment makes the angular-momentum basis choice explicit.

Visual

CaseFirst actionFirst-order shiftMain danger
Nondegeneratecompute diagonal matrix elementnVn\langle n\vert V\vert n\ranglesmall denominators
Degeneratediagonalize VV in subspaceeigenvalues of VαβV_{\alpha\beta}using wrong basis
Parity-odd perturbation on even statecheck symmetryoften zeromissing second-order effect
Magnetic splittingchoose angular-momentum basisprojection-dependentmixing weak/strong field regimes

Worked example 1: Two-level nondegenerate perturbation

Problem. Let

H0=(000Δ),V=(0vv0),H_0=\begin{pmatrix}0&0\\0&\Delta\end{pmatrix}, \qquad V=\begin{pmatrix}0&v\\v&0\end{pmatrix},

with Δ>0\Delta\gt 0. Find the first nonzero correction to the lower energy.

Method.

  1. The lower unperturbed state is
0=(10),E0(0)=0.|0\rangle=\begin{pmatrix}1\\0\end{pmatrix}, \qquad E_0^{(0)}=0.
  1. The first-order shift is
E0(1)=0V0=0.E_0^{(1)}=\langle0|V|0\rangle=0.
  1. The only other state is
1=(01),E1(0)=Δ.|1\rangle=\begin{pmatrix}0\\1\end{pmatrix}, \qquad E_1^{(0)}=\Delta.
  1. The coupling matrix element is
1V0=v.\langle1|V|0\rangle=v.
  1. The second-order shift is
E0(2)=v2E0(0)E1(0)=v2Δ.E_0^{(2)}={|v|^2\over E_0^{(0)}-E_1^{(0)}} =-{v^2\over \Delta}.

Checked answer. The lower level moves downward. Exact diagonalization gives E=(ΔΔ2+4v2)/2v2/ΔE_-=(\Delta-\sqrt{\Delta^2+4v^2})/2\approx -v^2/\Delta for small v/Δv/\Delta.

Worked example 2: Degenerate perturbation in a two-state subspace

Problem. Two unperturbed states a,b\vert a\rangle,\vert b\rangle have the same energy E0E_0. In this subspace,

V=(0gg0).V=\begin{pmatrix}0&g\\g&0\end{pmatrix}.

Find the first-order corrected energies and states.

Method.

  1. Nondegenerate perturbation theory is invalid because Ea(0)Eb(0)=0E_a^{(0)}-E_b^{(0)}=0.

  2. Diagonalize the perturbation matrix:

det(ϵggϵ)=ϵ2g2=0.\det\begin{pmatrix}-\epsilon&g\\g&-\epsilon\end{pmatrix} =\epsilon^2-g^2=0.
  1. The eigenvalues are
ϵ+=g,ϵ=g.\epsilon_+=g,\qquad \epsilon_-=-g.
  1. The normalized eigenvectors are
+=12(a+b),=12(ab).|+\rangle={1\over\sqrt2}(|a\rangle+|b\rangle), \qquad |-\rangle={1\over\sqrt2}(|a\rangle-|b\rangle).
  1. Therefore the first-order energies are
E±=E0+λ(±g).E_\pm=E_0+\lambda(\pm g).

Checked answer. The perturbation splits the degeneracy linearly, and the correct starting states are symmetric and antisymmetric combinations.

Code

import numpy as np

delta = 10.0
v = 0.5
h = np.array([[0, v], [v, delta]], dtype=float)
exact = np.linalg.eigvalsh(h)[0]
perturbative = -v**2 / delta

print("exact lower energy:", exact)
print("second-order estimate:", perturbative)
print("relative error:", abs((exact - perturbative) / exact))

Common pitfalls

  • Using nondegenerate formulas inside a degenerate subspace. Zero denominators are not a minor algebra issue; the basis is wrong.
  • Forgetting symmetry selection rules. Parity can make matrix elements vanish before integration.
  • Trusting perturbation theory near level crossings or small denominators.
  • Ignoring normalization conventions for corrected states.
  • Calling a first-order zero shift "no effect." The leading effect may be second order.
  • Mixing exact eigenstates of the full Hamiltonian with unperturbed states inside matrix elements.
  • Applying weak-field Zeeman formulas in regimes where spin-orbit coupling is no longer the dominant internal interaction.

Perturbation theory begins with a modeling decision: what is H0H_0, and what is small? The best H0H_0 is not always the most obvious algebraic split; it should capture the dominant physics and leave a perturbation whose matrix elements are small compared with relevant level spacings. If the perturbation connects nearly degenerate states, the expansion parameter is effectively large even when the operator looks small. This is why level spacing appears in denominators.

Symmetry should be used before summing states. If VV is odd under parity, its diagonal matrix element in a parity eigenstate vanishes. If VV is rotationally invariant, it cannot mix states with different total angular momentum labels in the usual way. Selection rules reduce both computation and error. Sakurai's angular-momentum-first structure pays off here because perturbation theory in atoms is often mostly a problem of choosing the right coupled basis.

Degenerate perturbation theory is best understood as exact diagonalization of the perturbation inside a small subspace. The perturbation chooses the linear combinations that nature uses as the correct zeroth-order states. Once that diagonalization is done, corrections from states outside the degenerate subspace can be treated with formulas resembling the nondegenerate case. Skipping this first step leads to infinite denominators and physically meaningless "corrections."

Always compare perturbative results with limiting behavior. A repulsive positive perturbation should raise an energy if the diagonal expectation is positive. Coupling to a higher level usually pushes a lower level downward at second order, as in level repulsion. A formula that becomes larger as the perturbation becomes smaller is signaling a denominator or degeneracy problem. Ballentine's careful stationary-state treatment is useful because it keeps these assumptions visible.

Perturbative state corrections are just as important as energy corrections when computing observables. A first-order energy shift may vanish, while a first-order change in the state produces a nonzero expectation value for another operator. Electric polarizability is the standard example: the first-order Stark shift of a nondegenerate parity eigenstate can vanish, but the field mixes opposite-parity states into the wave function and produces a second-order energy shift and induced dipole response.

In matrix problems, exact diagonalization of a small model is an excellent check. Expand the exact eigenvalues in powers of the small parameter and compare term by term with perturbation theory. This confirms signs, denominators, and degeneracy handling. Sakurai frequently uses compact operator formulas, but small matrices expose the same physics without hidden sums over infinite bases.

Finally, perturbation theory is an asymptotic tool, not always a convergent one. Adding more terms does not guarantee improvement outside the method's domain. Physical judgment about small parameters, symmetries, and nearby levels matters as much as algebraic fluency.

For operator perturbations in infinite-dimensional spaces, convergence of sums over intermediate states is another hidden assumption. Selection rules, closure relations, or sum rules can simplify these expressions, but they do not remove the need to check whether the perturbation and states are physically admissible. Formal sums that diverge signal missing physics, an invalid approximation, or the need for renormalized parameters in more advanced settings.

This is why perturbative answers should always be paired with stated small parameters and validity conditions.

State those assumptions explicitly before trusting numerical agreement.

Connections