Negative Binomial Distribution — Definition, Formula & Examples
The negative binomial distribution models the number of independent Bernoulli trials needed to achieve a specified number of successes. It answers questions like 'How many trials will it take to get my 5th success?'
Let be the number of trials required to obtain exactly successes in a sequence of independent Bernoulli trials, each with success probability . Then follows a negative binomial distribution with parameters and , and its probability mass function gives for
Key Formula
Where:
- = Total number of trials until the r-th success
- = Desired number of successes (positive integer)
- = Probability of success on each trial
- = Number of ways to place r − 1 successes in the first x − 1 trials
How It Works
You use the negative binomial distribution when you repeat an experiment with two outcomes (success or failure) and want to know the probability that the -th success occurs on exactly the -th trial. The last trial must be a success, so you choose how to arrange successes among the first trials, then multiply by the probability of the remaining failures and the final success. Note that some textbooks define the random variable as the number of failures before the -th success; always check which convention your course uses.
Worked Example
Problem: A free-throw shooter makes 80% of her shots. What is the probability that her 3rd made free throw occurs on exactly the 5th attempt?
Identify parameters: Here r = 3 successes, p = 0.8, and we want x = 5 total trials.
Compute the binomial coefficient: Choose where the first 2 successes fall among the first 4 trials.
Apply the formula: Multiply the coefficient by the success and failure probabilities raised to the appropriate powers.
Answer: The probability is approximately 0.1229, or about 12.3%.
Why It Matters
The negative binomial distribution appears in quality control (how many items must you inspect to find defectives?), clinical trials (how many patients until respond to treatment?), and ecology (modeling organism counts with overdispersion). It generalizes the geometric distribution, which is the special case .
Common Mistakes
Mistake: Confusing it with the binomial distribution by fixing the number of trials instead of the number of successes.
Correction: The binomial fixes the number of trials and counts successes. The negative binomial fixes the number of successes and counts trials (or failures). Make sure you identify which quantity is predetermined.
