Mathwords logoMathwords

Jensen's Inequality — Definition, Formula & Examples

Jensen's Inequality states that for a convex function, the function applied to an average is always less than or equal to the average of the function applied to individual values. In short, convexity pushes the function's output upward relative to the midpoint.

If φ\varphi is a convex function on an interval II and XX is a random variable with E[X]E[X] existing and XX taking values in II, then φ(E[X])E[φ(X)]\varphi(E[X]) \leq E[\varphi(X)]. The inequality reverses if φ\varphi is concave.

Key Formula

φ(E[X])E[φ(X)]\varphi\bigl(E[X]\bigr) \leq E\bigl[\varphi(X)\bigr]
Where:
  • φ\varphi = A convex function (inequality reverses for concave functions)
  • XX = A random variable (or a set of values being averaged)
  • E[]E[\cdot] = Expected value (or weighted average)

How It Works

Identify whether your function is convex (curves upward, like x2x^2 or exe^x) or concave (curves downward, like lnx\ln x or x\sqrt{x}). For a convex function, applying it to the mean gives a result no larger than the mean of the applied values. For a concave function, the direction flips: the function of the mean is at least as large as the mean of the function values. This relationship holds for weighted averages, expectations, and integrals alike.

Worked Example

Problem: Let XX take values 1 and 3 each with probability 12\tfrac{1}{2}. Verify Jensen's Inequality for the convex function φ(x)=x2\varphi(x) = x^2.
Compute E[X]: Find the expected value of XX.
E[X]=12(1)+12(3)=2E[X] = \tfrac{1}{2}(1) + \tfrac{1}{2}(3) = 2
Compute the left side: φ(E[X]): Apply the function to the expected value.
φ(E[X])=22=4\varphi(E[X]) = 2^2 = 4
Compute the right side: E[φ(X)]: Apply the function to each value first, then take the expectation.
E[φ(X)]=12(12)+12(32)=12(1)+12(9)=5E[\varphi(X)] = \tfrac{1}{2}(1^2) + \tfrac{1}{2}(3^2) = \tfrac{1}{2}(1) + \tfrac{1}{2}(9) = 5
Compare: Check whether the inequality φ(E[X])E[φ(X)]\varphi(E[X]) \leq E[\varphi(X)] holds.
454 \leq 5 \quad \checkmark
Answer: The inequality holds: φ(E[X])=45=E[φ(X)]\varphi(E[X]) = 4 \leq 5 = E[\varphi(X)].

Why It Matters

Jensen's Inequality underpins core results in statistics and information theory, including why variance is nonnegative (apply it with φ(x)=x2\varphi(x)=x^2) and why the log-likelihood in the EM algorithm behaves the way it does. In finance, it explains why the expected return of a nonlinear portfolio differs from the return at the expected price — a concept central to risk management and option pricing.

Common Mistakes

Mistake: Applying the inequality in the wrong direction — assuming φ(E[X])E[φ(X)]\varphi(E[X]) \geq E[\varphi(X)] for a convex function.
Correction: Remember: convex means the function bulges downward between endpoints, so evaluating at the average underestimates. The mnemonic 'convex = cup = ≤' can help: the cup holds the higher value on the right side.