Bivariate Normal Distribution — Definition, Formula & Examples
The bivariate normal distribution describes the joint probability distribution of two random variables that are each normally distributed and linked by a linear correlation. It is the simplest multivariate extension of the familiar bell curve.
A continuous random vector follows a bivariate normal distribution if its joint probability density function is determined entirely by five parameters: the means and , the standard deviations and , and the correlation coefficient . Every linear combination is univariate normal, and the conditional distributions and are also normal.
Key Formula
Where:
- = Means of X and Y
- = Standard deviations of X and Y (both > 0)
- = Correlation coefficient between X and Y, where −1 < ρ < 1
How It Works
The distribution produces an elliptical, mound-shaped surface over the -plane. When the ellipse axes align with the coordinate axes and , are independent. As increases toward 1, the ellipse narrows and tilts, reflecting stronger linear dependence. Contours of equal probability density are ellipses centered at . You can compute marginal distributions by integrating out one variable — each marginal is simply a univariate normal. Conditional distributions are also normal, with a mean that shifts linearly with the conditioning value.
Worked Example
Problem: Suppose (X, Y) follows a bivariate normal distribution with μ_X = 50, μ_Y = 100, σ_X = 10, σ_Y = 20, and ρ = 0.6. Find the conditional distribution of Y given X = 60.
Conditional mean: The conditional mean of Y given X = x is given by the formula below.
Conditional standard deviation: The conditional standard deviation does not depend on the observed value of X.
Answer: Given X = 60, Y is normally distributed with mean 112 and standard deviation 16, i.e., Y | X = 60 ~ N(112, 16²).
Why It Matters
The bivariate normal distribution underpins simple linear regression — the assumption that errors are normal and linearly related leads directly to this model. It appears routinely in fields like finance (modeling correlated asset returns) and engineering (tolerance analysis of paired measurements).
Common Mistakes
Mistake: Assuming that two individually normal variables are automatically bivariate normal.
Correction: Two marginals can each be normal while their joint distribution is not bivariate normal. Bivariate normality requires that every linear combination aX + bY also be normal, which is a stronger condition.
