Why induction works

Induction is a proving technique. The most elementary version of induction is used to prove statements of the form

where is the set of natural numbers, that is and is a statement about . For example, could be

That this particular statement is true for all is known as the Goldbach Conjecture and it remains a famous open problem.

The well-ordering principle

The property that makes induction possible is the well-ordering principle:

Axiom (Well-ordering principle).

Every non-empty subset of the naturals has a least element.

Proving this is beyond the scope of the course, but the curious reader will verify that the next two corollaries of the theorem are indeed logically equivalent to it. What this means meta-mathematically is that any statement provable by one technique is provable by all. Knowing which one to pick is a skill acquired by practice.

Some questions to deepen your understanding of the well-ordering principle follow.

Exercises.

  1. Is it true that every non-empty subset of the naturals has a maximal element?
  2. Is the well-ordering principle true if you replace the naturals with the integers?

First example: Gauss summation formula

The way this principle is applied is that, whenever some natural number has a property, there must always be a first natural number with that property. Let us start with a simple example often attributed to Gauss.

Theorem (Gauss summation formula).

For any natural number ,

Proof:

Let be the set of natural numbers for which the statements does not hold. That is, is the set of all counterexamples, which I now show must be empty.

By contradiction, suppose that . Then the well-ordering principle dictates that there must be a smallest natural inside of . First, notice that cannot be zero, because zero does satisfy the proposition:

Then it makes sense to consider the natural number , which by minimality (meaning, by the fact that is the smallest element of ) cannot be in , and thus must satisfy the proposition. That is,

But then, if I add on both sides of that equality,

which is the conclusion of the proposition but for . This is a contradiction. Hence, such an cannot exist and thus or, in words, all naturals satisfy the proposition.

The principle of mathematical induction

There is an easier way of writing the previous proof, namely using the following principle instead.

Corollary (Principle of mathematical induction).

Suppose that satisfies that

  1. ; and that
  2. for all , implies .

Then, must contain every natural number. That is, .

Proof:

By contradiction, suppose that satisfies (1.) and (2.) but that the conclusion is wrong, that is, that . Then, there exists some natural number not in . By the well-ordering principle, there is a smallest natural not in .

By the assumption (1.), . Then, by the choice of , . But then, applying the assumption (2.), , which is a contradiction.

Therefore, such an cannot exist and hence .

Reflect.

How would the conclusion change if (1.) is changed to instead? What about ?

How to write proofs by induction

This principle offers a sort of “cooking recipe” for writing proofs: First, verify that (or whatever natural the induction starts at) has the property you want to prove for all . This is called the Base case. Then, assume the Inductive hypothesis, or the fact that . Finally, perform the Inductive step, where you use the fact that has the sought property to show that and thus you are done.

Exercise.

Rewrite the proof of the Gauss summation formula using this form.

Warning:

A common mistake is having an inductive hypothesis that assumes what you want to prove in the first place. For example, assuming that for all , as part of the inductive hypothesis.

While this structure is useful for standardizing solutions and learning how to write proofs, I discourage rote memorization, as it hinders understanding more complicated proofs that might not be easily written in this format.

Example of an incorrect proof by induction

Warning:

Careful when applying the inductive hypothesis to the inductive step. The following example illustrates one possible complication.

Fallacious claim.

All trains have just one type of car.

Fallacious proof:

By induction on , the length of the train.

Base case: When , the train has just one car, and thus all cars are the same type.

Inductive hypothesis: Suppose that for , all trains of length just have one type of car.

Inductive step: Let be the cars of an arbitrary train of length . By the inductive hypothesis, are the same type and are the same type (given that they both are trains of length ). Since and are the same type and and are the same type, then all cars must be the same type. Thus, the train has just one type of car.

Exercise.

Find the first mistake in the previous fallacious proof.

Strong induction

As I move to more complicated proofs by induction, one might run into the following situation where reducing the statement from to is unhelpful or tricky. In this case, it would be much better to reduce it to say or some other (or even multiple!) natural . Here is an example of this scenario.

Claim.

Alice has an infinite amount of $6 coins and $10 and $15 bills. Use induction to prove that Alice can form any amount of at least $30 using these. For example, Alice can form $31 by taking one of each, or she can form $32 by taking two $10 bills and two coins.

The principle of strong mathematical induction

You can strengthen the hypothesis by not just assuming that the property you want to prove holds for the previous natural but indeed all smaller naturals:

Corollary (Principle of strong mathematical induction).

Suppose that satisfies that

  1. for every , if for all , , then .

Then, .

There is no need to add the assumption that in this case, since this is a consequence of property (1.). Indeed, by vacuous truth, any natural (of which there are none) satisfies that .

Warning:

This does not mean that, in general, proofs by strong induction do not need a base case.

Proof:

Suppose that . Then, by the well-ordering principle, there must be a least natural . By minimality, any natural must be in . But by property (1.), this would imply that , a contradiction.

I illustrate the fact that, in many cases, you still need to provide the base case to complete the proof by rewriting the proof of the Claim from above.

# Can you write a computer program that
# explicitly finds these parameters?
# Something like this:
 
def money(n: int):
	# {YOUR CODE HERE}
	return s, t, u

Implementing this particular idea in code produces a greedy algorithm that attempts to maximize the number of coins. Find similar proofs/algorithms that maximize the bills instead. This makes the proof longer to write. Why? An important insight to be gained is that from a deep mathematical perspective:

Induction and recursion are the same phenomenon.

Moreover, the attentive reader will verify that the base cases were tacitly used in the original proof, hinting the fact that both induction types are really just different ways of writing the same proof.

Reflect.

In this section I showed you an example of how a proof by strong induction can be much shorter than the “weak” variant. Can you find a problem that is much harder to solve by weak induction than by strong induction? See here.