Skip to main content

Common Continuous Distributions

Continuous distributions model measurements that vary on intervals: times, lengths, errors, proportions, variances, and transformed statistics. They replace probability at a point with probability over intervals. For example, the probability that a lifetime is exactly 5.0000005.000000 years is usually zero, but the probability that it lies between 55 and 66 years can be positive.

Lane et al.'s statistics text treats the normal distribution in depth and later introduces distributions such as tt and chi-square for inference. This probability page gives a theory-focused reference to the main continuous families, while cross-linking to statistics where the inferential use is the main topic.

Several exponential density curves are plotted for different rate parameters.

Figure: Exponential density functions for several rate parameters. Image: Wikimedia Commons, Skbkekas, CC BY 3.0.

Definitions

A continuous random variable XX has density fXf_X if

P(aXb)=abfX(x)dx.P(a\le X\le b)=\int_a^b f_X(x)\,dx.

The Uniform distribution on [a,b][a,b] has constant density

f(x)=1ba,axb.f(x)=\frac{1}{b-a},\quad a\le x\le b.

The Exponential distribution with rate λ>0\lambda\gt 0 has density

f(x)=λeλx,x0.f(x)=\lambda e^{-\lambda x},\quad x\ge 0.

The Normal distribution with mean μ\mu and variance σ2\sigma^2 has density

f(x)=1σ2πexp((xμ)22σ2).f(x)=\frac{1}{\sigma\sqrt{2\pi}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right).

The Gamma distribution with shape α>0\alpha\gt 0 and rate λ>0\lambda\gt 0 has density

f(x)=λαΓ(α)xα1eλx,x>0.f(x)=\frac{\lambda^\alpha}{\Gamma(\alpha)}x^{\alpha-1}e^{-\lambda x},\quad x>0.

The Beta distribution with parameters α,β>0\alpha,\beta\gt 0 has density on (0,1)(0,1):

f(x)=Γ(α+β)Γ(α)Γ(β)xα1(1x)β1.f(x)=\frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)} x^{\alpha-1}(1-x)^{\beta-1}.

The Chi-square distribution with ν\nu degrees of freedom is a Gamma distribution:

χν2Gamma(ν2,12)\chi^2_\nu \sim \operatorname{Gamma}\left(\frac{\nu}{2},\frac{1}{2}\right)

when using the rate parameterization.

If ZN(0,1)Z\sim N(0,1) and Uχν2U\sim \chi^2_\nu are independent, then the Student tt distribution is

T=ZU/ν.T=\frac{Z}{\sqrt{U/\nu}}.

If U1χν12U_1\sim\chi^2_{\nu_1} and U2χν22U_2\sim\chi^2_{\nu_2} are independent, then the FF distribution is

F=U1/ν1U2/ν2.F=\frac{U_1/\nu_1}{U_2/\nu_2}.

Key results

DistributionSupportMeanVarianceCommon role
Uniform(a,b)(a,b)[a,b][a,b](a+b)/2(a+b)/2(ba)2/12(b-a)^2/12equally likely interval values
Exponential(λ)(\lambda)[0,)[0,\infty)1/λ1/\lambda1/λ21/\lambda^2waiting times with constant hazard
Normal(μ,σ2)(\mu,\sigma^2)R\mathbb{R}μ\muσ2\sigma^2errors, sums, approximations
Gamma(α,λ)(\alpha,\lambda)(0,)(0,\infty)α/λ\alpha/\lambdaα/λ2\alpha/\lambda^2waiting time to repeated events
Beta(α,β)(\alpha,\beta)(0,1)(0,1)α/(α+β)\alpha/(\alpha+\beta)αβ(α+β)2(α+β+1)\frac{\alpha\beta}{(\alpha+\beta)^2(\alpha+\beta+1)}random proportions
Chi-square(ν)(\nu)[0,)[0,\infty)ν\nu2ν2\nusums of squared normals
tνt_\nuR\mathbb{R}00 if ν>1\nu\gt 1ν/(ν2)\nu/(\nu-2) if ν>2\nu\gt 2standardized mean with estimated variance
Fν1,ν2F_{\nu_1,\nu_2}[0,)[0,\infty)ν2/(ν22)\nu_2/(\nu_2-2) if ν2>2\nu_2\gt 2depends on degreesvariance ratios

Memorylessness of exponential. If XExponential(λ)X\sim\operatorname{Exponential}(\lambda), then

P(X>s+tX>s)=P(X>t).P(X>s+t\mid X>s)=P(X>t).

The exponential is the continuous analogue of the geometric distribution.

Normal standardization. If XN(μ,σ2)X\sim N(\mu,\sigma^2), then

Z=XμσN(0,1).Z=\frac{X-\mu}{\sigma}\sim N(0,1).

Thus normal probabilities can be reduced to the standard normal CDF Φ\Phi.

Gamma sums. If independent exponential waiting times share rate λ\lambda, then their sum has a Gamma distribution. Specifically, the waiting time until the α\alpha-th event in a Poisson process has Gamma(α,λ)(\alpha,\lambda) distribution when α\alpha is a positive integer.

Several continuous families are connected by transformations. If ZN(0,1)Z\sim N(0,1), then Z2χ12Z^2\sim\chi^2_1, and sums of independent squared standard normals produce chi-square distributions with more degrees of freedom. Ratios involving a standard normal and an independent chi-square variable produce tt distributions. Ratios of scaled independent chi-square variables produce FF distributions. These relationships explain why the same families appear repeatedly in sampling theory.

Location and scale also matter. If ZZ has a standard distribution, then X=μ+σZX=\mu+\sigma Z shifts the center by μ\mu and stretches the spread by σ\sigma. Normal distributions are closed under this transformation, but many positive distributions are better described by shape and rate or shape and scale. Always record the parameterization. For Gamma and Exponential distributions, "rate" λ\lambda and "scale" θ=1/λ\theta=1/\lambda are reciprocals, and confusing them changes the mean.

For reliability and survival problems, the survival function S(x)=P(X>x)S(x)=P(X\gt x) is often clearer than the CDF. The exponential survival function S(x)=eλxS(x)=e^{-\lambda x} makes the constant-hazard assumption visible.

Continuous families are often chosen for shape constraints. Beta distributions can be U-shaped, uniform, left-skewed, right-skewed, or concentrated near the center depending on α\alpha and β\beta. Gamma distributions can be highly right-skewed for small shape and more symmetric for large shape. Normal distributions are symmetric and light-tailed. These shape facts should be checked against context before fitting a model or using a table.

Quantiles are often more stable to communicate than densities. For example, saying that 95%95\% of a normal distribution lies within about two standard deviations of the mean is more interpretable than quoting density heights. For skewed distributions, report asymmetric intervals rather than forcing a mean-plus-minus-standard-deviation summary.

Visual

Modeling clueCandidate distributionWhy
all values in an interval are equally likelyUniformconstant density
time until next event at constant rateExponentialmemoryless waiting time
sum of many small independent effectsNormalcentral limit behavior
time until several eventsGammasum of exponentials
unknown probability or proportionBetasupport is 00 to 11
sum of squared standard normalsChi-squarequadratic normal variation

Worked example 1: exponential reliability

Problem. A component lifetime XX is exponential with mean 200200 hours. Find the rate λ\lambda, the probability it lasts at least 300300 hours, and the probability it lasts another 100100 hours given that it has already lasted 300300 hours.

Method.

  1. For an exponential distribution, E[X]=1/λE[X]=1/\lambda. Since the mean is 200200,
λ=1200=0.005.\lambda=\frac{1}{200}=0.005.
  1. The survival function is
P(X>x)=eλx.P(X>x)=e^{-\lambda x}.
  1. Probability of lasting at least 300300 hours:
P(X300)=e0.005(300)=e1.50.2231.P(X\ge 300)=e^{-0.005(300)}=e^{-1.5}\approx 0.2231.
  1. Conditional probability of lasting another 100100 hours after 300300:
P(X>400X>300)=P(X>400)P(X>300).P(X>400\mid X>300)=\frac{P(X>400)}{P(X>300)}.
  1. Substitute survival probabilities:
e0.005(400)e0.005(300)=e0.005(100)=e0.50.6065.\begin{aligned} \frac{e^{-0.005(400)}}{e^{-0.005(300)}} &=e^{-0.005(100)}\\ &=e^{-0.5}\\ &\approx 0.6065. \end{aligned}

Checked answer. λ=0.005\lambda=0.005, P(X300)0.2231P(X\ge 300)\approx 0.2231, and the conditional probability is about 0.60650.6065. The last result illustrates memorylessness.

Worked example 2: normal probability by standardization

Problem. Exam scores are approximately normal with mean 7070 and standard deviation 88. What proportion of scores lies between 6262 and 8484?

Method.

  1. Let XN(70,82)X\sim N(70,8^2).

  2. Standardize the lower endpoint:

z1=62708=1.z_1=\frac{62-70}{8}=-1.
  1. Standardize the upper endpoint:
z2=84708=1.75.z_2=\frac{84-70}{8}=1.75.
  1. Convert to standard normal probability:
P(62X84)=P(1Z1.75).P(62\le X\le 84)=P(-1\le Z\le 1.75).
  1. Use the standard normal CDF:
P(1Z1.75)=Φ(1.75)Φ(1).P(-1\le Z\le 1.75)=\Phi(1.75)-\Phi(-1).
  1. With Φ(1.75)0.9599\Phi(1.75)\approx 0.9599 and Φ(1)0.1587\Phi(-1)\approx 0.1587,
P(62X84)0.95990.1587=0.8012.P(62\le X\le 84)\approx 0.9599-0.1587=0.8012.

Checked answer. About 80.1%80.1\% of scores lie between 6262 and 8484 under the normal model.

Code

from scipy.stats import expon, norm, gamma, beta, chi2, t, f

# Exponential reliability.
mean_life = 200
rate = 1 / mean_life
survive_300 = expon.sf(300, scale=1 / rate)
survive_another_100 = expon.sf(100, scale=1 / rate)
print(rate, survive_300, survive_another_100)

# Normal interval probability.
mu, sigma = 70, 8
prob = norm.cdf(84, loc=mu, scale=sigma) - norm.cdf(62, loc=mu, scale=sigma)
print(prob)

# A small reference table.
print("Gamma mean:", gamma.mean(a=3, scale=1 / 2)) # rate 2 means scale 1/2
print("Beta mean:", beta.mean(a=2, b=5))
print("Chi-square 95th percentile:", chi2.ppf(0.95, df=10))
print("t 97.5th percentile:", t.ppf(0.975, df=12))
print("F 95th percentile:", f.ppf(0.95, dfn=5, dfd=20))

Common pitfalls

  • Confusing rate and scale. SciPy often uses scale = 1 / rate, while many textbooks write exponential and gamma distributions with rate λ\lambda.
  • Treating density height as probability. For continuous variables, probabilities are areas.
  • Assuming all bell-shaped data are normal. Tail behavior and skew matter.
  • Applying tt, chi-square, or FF formulas without their normal-sample assumptions when doing inference.
  • Forgetting support. A normal model can assign small probability to impossible negative values; sometimes this is acceptable, sometimes not.
  • Using the exponential memoryless property for non-exponential lifetimes. Aging components often do not have constant hazard.

Connections