Poisson Random Variables and Poisson Processes
The Poisson distribution models counts of rare events in a fixed region of time or space: emissions in a time interval, calls to a call center, goals in a match, or unusual events in a large number of trials. MIT 18.440 introduces it as a limit of binomial random variables where is large, is small, and remains near a fixed value .
The Poisson process extends this count model from one interval to an entire random counting function . Its central assumptions are independent increments, stationary increments, and a constant rate. From these assumptions come Poisson counts over intervals and exponential waiting times between events, linking this page to the continuous distributions later in the course.
Figure: Poisson probability mass functions for several parameter values. Image: Wikimedia Commons, Skbkekas, CC BY 3.0.
Definitions
A random variable has the Poisson distribution with parameter if
The parameter is both the mean and the variance.
A Poisson process with rate is a random counting function , , satisfying:
- .
- is nondecreasing and integer-valued.
- Counts in disjoint time intervals are independent.
- The distribution of the number of events in an interval depends only on the interval length.
- For interval length , has Poisson parameter .
The waiting time until the first event satisfies
so is exponential with rate .
Key results
The Poisson distribution arises from the binomial law. Let . For fixed ,
As , the first factor tends to , the middle exponential factor tends to , and the last factor tends to . Therefore
The probabilities sum to one because
The expectation is
One can similarly show , or use the binomial approximation with and .
If and are independent, then
This matches the process interpretation: joining independent event streams adds their rates.
The Poisson approximation is most reliable when the count is a sum of many indicators, each with small probability, and no single indicator has a large influence. In the binomial limit this is explicit: grows, shrinks, and stays fixed. The approximation can also work for events that are not literally independent if dependence is weak and local, but the lecture-level model is the clean independent-trial limit.
The ratio of successive probabilities is often useful:
The probabilities increase while and decrease once . Thus the mass is centered near , consistent with the mean. This ratio is also a stable way to compute Poisson probabilities numerically without evaluating large factorials directly.
For a Poisson process, the phrase "rate " means expected events per unit time. Counts over an interval of length have parameter , not unless the interval length is one. This scaling is essential when changing units. A rate of per hour is the same as per minute, and a minute interval has parameter .
Independent increments imply that what happens in disjoint intervals does not affect the count distribution in another interval. Stationary increments imply that only the length of the interval matters, not its location on the time axis. Together, these properties make the process mathematically tractable and lead to exponential interarrival times.
The no-event probability is the bridge between counts and waits. The event is exactly the event , so
Differentiating gives the exponential density . This is why the Poisson and exponential distributions should be learned as a pair, not as unrelated formulas.
Visual
| Object | Formula | Interpretation |
|---|---|---|
| Poisson count | rare-event count | |
| Mean | expected count | |
| Variance | spread equals rate parameter | |
| Process count | events in interval | |
| First wait | exponential survival | |
| Superposition | rates add | independent streams combine |
The sequence diagram describes a small-time heuristic. Over a tiny interval of length , a rate process has probability about of one event and much smaller probability of two or more events. Splitting a longer interval into many tiny pieces then resembles many Bernoulli trials with small success probability. Taking the limit gives the Poisson distribution with parameter .
This heuristic also explains why simultaneous events have probability zero in the ideal continuous-time model. If the interval is made very small, the chance of two or more events in that interval is negligible compared with . Real systems may have batching or dependence, but the mathematical Poisson process assumes events arrive singly in continuous time.
Worked example 1: rare poker event approximation
Problem: Suppose the probability that a five-card poker hand is a royal flush is . In independent hands, approximate the probability of seeing exactly two royal flushes.
Method:
- The exact count is binomial with
- The Poisson parameter is
- Use the Poisson probability for :
- Substitute:
- Numerically,
Thus
Checked answer: the expected number is about , so seeing exactly two is quite plausible.
Worked example 2: waiting for the first event in a process
Problem: Calls arrive according to a Poisson process at rate per hour. What is the probability that no call arrives in the next minutes, and what is the expected waiting time until the first call?
Method:
- Convert minutes to hours:
- The count in this interval is Poisson with parameter
- The probability of no call is
- The first waiting time is exponential with rate per hour.
- Its expectation is
Checked answer: the probability of waiting longer than the mean is for an exponential random variable, exactly matching the no-call probability over minutes.
Code
from math import exp, factorial
def poisson_pmf(lam, k):
return exp(-lam) * (lam ** k) / factorial(k)
lam = 1_000_000 / 649_740
print("lambda:", lam)
print("P(two royal flushes):", poisson_pmf(lam, 2))
rate_per_hour = 3
t_hours = 20 / 60
print("P(no calls in 20 minutes):", poisson_pmf(rate_per_hour * t_hours, 0))
print("Expected wait in minutes:", 60 / rate_per_hour)
# Verify total mass over a large initial range.
print("Poisson mass k<=30:", sum(poisson_pmf(4.0, k) for k in range(31)))
Common pitfalls
- Using a Poisson approximation when events are not rare or trials are not close to independent.
- Confusing the process rate with the interval parameter .
- Forgetting that variance equals mean for a Poisson random variable. If data have much larger variance than mean, a simple Poisson model may be poor.
- Treating independent increments as automatic. It is an assumption of the Poisson process model.
- Thinking exponential waiting times are an extra assumption unrelated to Poisson counts. In the process model, they are two views of the same structure.