Normal, Exponential, Gamma, Beta, and Cauchy Laws
The continuous distributions in the middle of MIT 18.440 each encode a different mechanism. The normal law describes the accumulated effect of many small mostly independent influences. The exponential law describes memoryless waiting times. The gamma law describes sums of exponential waits. The beta law describes random probabilities and order-statistic shapes. The Cauchy law is a warning: a natural density may fail to have a finite mean or variance.
These distributions are not just a catalog. Their formulas explain why later theorems need hypotheses. The central limit theorem uses finite variance; the law of large numbers needs finite mean; the Cauchy distribution violates these assumptions. The exponential and gamma laws connect continuous densities back to Poisson processes.
Figure: Normal density functions with different means and variances. Image: Wikimedia Commons, Inductiveload, public domain.
Definitions
A standard normal random variable has density
A normal random variable with mean and variance has density
An exponential random variable with rate has density
A gamma random variable with rate and shape has density
where
A beta random variable with parameters has density on
A standard Cauchy random variable has density
Key results
If is standard normal, then
If , then is normal with mean and variance . Standardization reverses this:
If is exponential with rate , then
The exponential law is memoryless:
If are independent exponential random variables, then
More generally, gamma random variables with the same rate add by shapes:
when the variables are independent.
The beta distribution appears naturally when a probability parameter is unknown. If is initially uniform on and then coin tosses produce heads and tails, the posterior density for is proportional to
which is beta.
The Cauchy distribution has no finite expectation. The positive and negative tails are too heavy for to converge. This is why one must not apply mean-based limit theorems blindly.
The normal distribution is determined by location and scale. Translating changes the mean but not the shape; multiplying by a positive constant stretches the standard deviation by that constant and the variance by its square. This is why standardization is so useful: every normal probability can be reduced to a probability for the standard normal variable .
The exponential distribution is the continuous analogue of the geometric distribution. Both are memoryless, and both describe waiting for the first success or event. The parameter conventions are different: geometric uses success probability per trial, while exponential uses rate per unit time. In a Poisson process with rate , the exponential mean is the average waiting time between events.
The gamma distribution keeps track of repeated exponential waiting. If the first event time is exponential, the time of the th event is a sum of independent exponential interarrival times and is gamma with shape . The formula with extends this distribution to non-integer shapes, but the waiting-time story is most literal for positive integer shape.
The beta distribution is flexible on . When , it is uniform. When and are large, it concentrates near . When or , it can pile density near the endpoints. In Bayesian coin-toss examples, and behave like prior counts plus observed successes and failures.
The Cauchy distribution shows why formulas must include hypotheses. It has a symmetric bell-like density, but its tails decay like , too slowly for to be finite. A sample average of independent Cauchy variables does not settle down according to the usual law of large numbers; in fact, the average has the same Cauchy distribution as each summand. This is not a contradiction, because the theorem's finite-mean assumption fails.
Visual
| Distribution | Support | Main mechanism | Mean | Variance |
|---|---|---|---|---|
| Normal | all real numbers | accumulated small effects | ||
| Exponential | memoryless waiting time | |||
| Gamma | sum of exponential waits | |||
| Beta | random probability or order statistic | |||
| Cauchy | all real numbers | heavy-tailed ratio or tangent angle | undefined | undefined |
The comparison table is meant to prevent formula memorization from replacing model recognition. A waiting-time story with no memory points to exponential; a waiting time for several events points to gamma; a proportion or unknown probability points to beta; accumulated small effects point to normal; a heavy-tailed angle or ratio story may produce Cauchy. Once the mechanism is identified, the support and parameter meanings usually follow.
These families also interact. Exponential waiting times come from Poisson processes, gamma variables are sums of exponentials, normal variables are stable under independent addition, and beta variables arise from uniform order statistics and Bayesian updating for Bernoulli trials. The Cauchy law is deliberately different: it is stable under averaging in a way that breaks the usual mean-based intuition.
Worked example 1: a standardized normal probability
Problem: Let be normal with mean and variance . Express in terms of the standard normal CDF .
Method:
- The standard deviation is .
- Standardize:
- Convert the lower endpoint:
- Convert the upper endpoint:
- Therefore
- In CDF form,
Checked answer: using approximate values and , the probability is about .
Worked example 2: competing exponential clocks
Problem: Two independent clocks ring after exponential waiting times. Clock 1 has rate per hour, and clock 2 has rate per hour. Find the distribution of the first ringing time and the probability clock 1 rings first.
Method:
- Let and .
- The first ringing time is
- For ,
Thus is exponential with rate .
- The probability clock 1 rings first is
- Substitute:
Checked answer: clock 1 has rate out of total rate , so its chance of being first is .
Code
from math import erf, sqrt, exp
def normal_cdf(x, mu=0.0, sigma=1.0):
z = (x - mu) / (sigma * sqrt(2))
return 0.5 * (1 + erf(z))
prob = normal_cdf(13, 10, 2) - normal_cdf(8, 10, 2)
print("P(8 <= X <= 13):", prob)
lambda1, lambda2 = 2.0, 3.0
print("first ringing rate:", lambda1 + lambda2)
print("P(clock 1 first):", lambda1 / (lambda1 + lambda2))
def exponential_survival(rate, t):
return exp(-rate * t)
print("P(first ringing after 0.5 hours):", exponential_survival(5, 0.5))
Common pitfalls
- Forgetting the normalizing factor in a normal density.
- Confusing exponential rate with mean .
- Applying the exponential memoryless property to normal, uniform, or gamma waiting times.
- Assuming every familiar-looking density has a finite mean. The Cauchy density is the standard warning example.
- Using a beta distribution outside . It models proportions and probabilities, not arbitrary real-valued measurements.