Probability and Density Matrices
Statistical mechanics begins by replacing an impossible microscopic description with a precise probabilistic one. A gas with particles has a well-defined Hamiltonian, but the detailed microstate is neither measurable nor useful for thermodynamic questions. What matters is the probability distribution over microstates and the way macroscopic observables become sharply predictable when many weakly correlated degrees of freedom are combined.
Schwabl starts here because the later ensembles are not guesses about ignorance. They are distributions or density matrices constrained by macroscopic data and by the microscopic equations of motion. The same language also ties classical phase-space densities to quantum statistical operators, so this page is the common vocabulary for the rest of the section.
Definitions
A random variable has probability density when is the probability that lies in . Normalization and expectation are
The moments and variance are
The characteristic function is the Fourier transform of the density,
For many variables , the covariance matrix is
In quantum mechanics, a pure state defines the density operator
A mixed ensemble with states and probabilities has
Pure states obey and . A genuine mixed state has . This distinction becomes central when we use rather than a single wavefunction to describe equilibrium.
Key results
The first structural result is the central limit theorem in its statistical-mechanical form. Let be independent identically distributed random variables with mean and variance , and let
Then for large , the distribution of approaches a Gaussian with
This is why thermodynamic quantities are reproducible. Energy, magnetization, and particle number have fluctuations of order while their means are usually of order , so relative fluctuations vanish in the thermodynamic limit.
The cumulant generating form of the characteristic function makes this transparent:
where , , and higher cumulants describe non-Gaussian corrections. For , the characteristic function is . After centering and scaling by , the second cumulant survives and higher cumulants are suppressed by powers of .
For density matrices, the dynamical counterpart is the von Neumann equation,
It is the quantum analogue of Liouville flow. A stationary equilibrium density matrix must commute with the Hamiltonian, , which is why equilibrium ensembles can be written as functions of conserved operators such as and .
Another key result is entropy as a functional of a distribution or density matrix:
Schwabl later derives the usual ensembles by maximizing this entropy subject to macroscopic constraints, or equivalently by considering weakly coupled subsystems of a microcanonical total system.
Two refinements are worth keeping in mind throughout the subject. First, independence is an idealization; many statistical-mechanical variables are weakly correlated rather than strictly independent. The central-limit conclusion remains useful when correlations decay fast enough that the system can be divided into many almost independent correlation volumes. If the correlation length is finite in dimensions, a macroscopic sample of volume contains roughly independent blocks, so relative fluctuations are suppressed by the square root of that number. This estimate fails near a critical point, where grows and fluctuations become macroscopic.
Second, entropy maximization is not a substitute for dynamics. It selects the least biased stationary distribution compatible with known constraints, but the list of constraints must come from conserved quantities and physical preparation. For an ordinary isolated gas, energy, volume, and particle number are enough. For an integrable or nearly integrable system, there may be many additional conserved quantities, and a simple microcanonical or canonical ensemble may not describe all relaxation phenomena. Schwabl's later discussion of irreversibility returns to this distinction between formal probability assignments and actual approach to equilibrium.
In quantum notation, diagonalizing shows why the von Neumann entropy has the same form as the classical expression. If
then
The basis-independent trace formula matters because the same mixed state can be represented by different convex decompositions into nonorthogonal pure states. The eigenvalues of , not a particular story about preparation, determine entropy and equilibrium averages.
Visual
| Object | Classical probability | Quantum statistics | Typical use |
|---|---|---|---|
| State description | or | density operator | encode incomplete microscopic information |
| Normalization | total probability | ||
| Observable average | macroscopic prediction | ||
| Pure state test | delta-like idealization | one microscopic state | |
| Mixed state test | broad distribution | statistical ensemble | |
| Equilibrium condition | stationary under flow | time-independent ensemble |
Worked example 1: Central-limit scaling of energy fluctuations
Problem: A model solid has independent modes. Each mode has mean energy and standard deviation . Estimate the mean total energy, the standard deviation, and the relative fluctuation.
Method:
- The total energy is .
- The mean is additive:
- The variance is additive for independent modes:
Thus
- The relative fluctuation is
Checked answer: even though the microscopic mode energy is random, the macroscopic energy is effectively sharp. The suppression factor is .
Worked example 2: Density matrix for a spin mixture
Problem: A spin- ensemble contains and . Compute , , and .
Method:
- In the basis ,
- The operator is
- The expectation value is
- The purity is
Checked answer: , so this is mixed. It is not the pure spinor , which would also contain off-diagonal coherence.
Code
import numpy as np
rng = np.random.default_rng(7)
N = 200_000
trials = 4_000
# Exponential variables are visibly non-Gaussian, but their sums become Gaussian.
samples = rng.exponential(scale=1.0, size=(trials, N // 1000))
sums = samples.sum(axis=1)
mean = sums.mean()
std = sums.std(ddof=1)
relative = std / mean
rho = np.diag([0.7, 0.3])
sz = 0.5 * np.array([[1.0, 0.0], [0.0, -1.0]])
print("sum mean", mean)
print("sum std", std)
print("relative fluctuation", relative)
print("<Sz>/hbar", np.trace(rho @ sz))
print("purity", np.trace(rho @ rho))
Common pitfalls
- Confusing a classical probability mixture with a coherent quantum superposition. A mixture has probabilities; a superposition has amplitudes and phase-sensitive off-diagonal density-matrix elements.
- Forgetting that fluctuations add as variances, not as standard deviations, when variables are independent.
- Treating the central limit theorem as requiring Gaussian microscopic variables. It requires finite variance and weak enough dependence, not Gaussian inputs.
- Using for every density matrix. That identity is special to pure states.
- Dropping normalization factors in probability densities; without normalization, averages and entropies are meaningless.