Cantor Set — Definition, Formula & Examples
The Cantor set is the set of all points in the interval that remain after you repeatedly remove the open middle third of every remaining segment, infinitely many times.
Define and , where the operation removes the open middle third of each closed interval in . The Cantor set is . Equivalently, consists of all real numbers in whose base-3 (ternary) expansion contains only the digits 0 and 2.
Key Formula
Where:
- = The Cantor set
- = The union of closed intervals remaining after n removal steps
How It Works
Start with the closed interval . Remove the open middle third , leaving . At the next step, remove the open middle third of each remaining interval, leaving four intervals. Continue this process indefinitely. The Cantor set is what survives after infinitely many removals. Despite removing more and more material, the set is uncountably infinite — yet it has total length (Lebesgue measure) equal to zero.
Worked Example
Problem: Show that the total length removed from [0,1] during the Cantor set construction equals 1, confirming the Cantor set has measure zero.
Step 1: At step 1, remove 1 interval of length .
Step 2: At step 2, remove 2 intervals each of length . At step , remove intervals each of length .
Step 3: Evaluate the geometric series with ratio .
Answer: The total length removed is 1, so the Cantor set has Lebesgue measure 0.
Why It Matters
The Cantor set is a foundational counterexample in real analysis and topology. It demonstrates that a set can be uncountable yet have zero length, and it is a natural example of a fractal with Hausdorff dimension . Understanding it is essential in measure theory, where it motivates the distinction between "large" in cardinality and "large" in measure.
Common Mistakes
Mistake: Assuming the Cantor set contains only the endpoints of removed intervals (and is therefore countable).
Correction: The Cantor set is uncountable. Every number in with a ternary expansion using only digits 0 and 2 belongs to , and there are uncountably many such numbers (they biject onto via base-2 reinterpretation).
