Mathwords logoMathwords

Chebyshev's Inequality — Definition, Formula & Examples

Chebyshev's Inequality states that for any dataset (regardless of its shape or distribution), at least 11k21 - \frac{1}{k^2} of the data values fall within kk standard deviations of the mean, where k>1k > 1.

For any random variable XX with finite mean μ\mu and finite, positive variance σ2\sigma^2, and for any real number k>1k > 1, the inequality P(Xμkσ)1k2P(|X - \mu| \geq k\sigma) \leq \frac{1}{k^2} holds. Equivalently, at least a fraction 11k21 - \frac{1}{k^2} of the probability mass lies within kk standard deviations of the mean.

Key Formula

P(Xμ<kσ)11k2P(|X - \mu| < k\sigma) \geq 1 - \frac{1}{k^2}
Where:
  • XX = A random variable (or data value)
  • μ\mu = The mean of the distribution or dataset
  • σ\sigma = The standard deviation
  • kk = Number of standard deviations from the mean (must be greater than 1)

How It Works

Choose a value of kk representing the number of standard deviations from the mean. Plug kk into 11k21 - \frac{1}{k^2} to find the minimum proportion of data guaranteed to lie in the interval (μkσ,  μ+kσ)(\mu - k\sigma,\; \mu + k\sigma). The result is a lower bound — the actual proportion is often higher, especially for symmetric or bell-shaped distributions. The power of this inequality is that it applies to every distribution with a finite mean and variance, making no assumptions about shape.

Worked Example

Problem: A dataset has a mean of 50 and a standard deviation of 5. Using Chebyshev's Inequality, find the minimum percentage of data values that lie between 35 and 65.
Find k: The interval from 35 to 65 spans 15 units on each side of the mean. Divide by the standard deviation to get k.
k=65505=3k = \frac{65 - 50}{5} = 3
Apply the formula: Substitute k = 3 into Chebyshev's Inequality.
11k2=119=890.8891 - \frac{1}{k^2} = 1 - \frac{1}{9} = \frac{8}{9} \approx 0.889
Interpret: Convert the fraction to a percentage.
8988.9%\frac{8}{9} \approx 88.9\%
Answer: At least 88.9% of the data values must lie between 35 and 65.

Why It Matters

Chebyshev's Inequality is essential in introductory probability and statistics courses because it provides guaranteed bounds without assuming normality. It is used in quality control and finance to estimate the spread of data when the underlying distribution is unknown or skewed.

Common Mistakes

Mistake: Using k = 1 and expecting a useful bound.
Correction: At k = 1 the formula gives 11/1=01 - 1/1 = 0, which tells you nothing. Chebyshev's Inequality only provides meaningful bounds when k > 1.