next up previous contents index
Next: References Up: Some Case Studies Previous: A theorem of number

 

The induction axiom

The first explicit attempts to axiomatize whole number arithmetic were made near the end of the 19th century. The most famous presentations of such axioms are Dedekind's 1888 book, Was sind und was sollen die Zahlen [1] and Peano's Arithmetices Principia Novo Methodo Exposita [10] of 1889. Peano listed four truths about equality and five special postulates for the positive integers. His special postulates were, [7, page 473,]:

     
  1. 1 is a number.
  2. The successor of any number is a number.
  3. No two numbers have the same successor.
  4. 1 is not the successor of any number.
  5. Any property which belongs to 1 and also to the successor of any number which has it belongs to all numbers.

Our axioms for ordered rings replace Peano's postulates 1 - 4. Peano's 5th postulate which has come to be known as the axiom of induction is the subject of this Section.

Peano is using here a somewhat antiquated language about properties. We translate his word property to mean set and we say that a number belongs to a set rather than a set belonging to a number. This is just a matter of convention. We would express his axiom using second order logic as introduced on page gif as


displaymath2715

Again, this axiom is only supposed to refer to the positive integers.

Another way to phrase this same axiom is to say that the empty set is the only set of positive integers with no least element. This is the version that will be most useful for us. Translated to the domain of all integers, positive, negative, and zero, this becomes, ``Any non-empty set with a lower bound has a least element.'' Writing this as a logical formula gives us our axiom of induction:


 equation626

This axiom uses the abbreviation, tex2html_wrap_inline2725 as shorthand for tex2html_wrap_inline2727.


 theorem630

proof633

This may not seem like a very exciting theorem since it is so obviously true. The reader should keep in mind however that our task is to present a logical analysis, not to actually do number theory. We have an axiom system and if it is adequate, it must be able to prove the obvious facts as well the more esoteric ones. So our real purpose in proving this theorem is not to show that it is true, but rather that it is a logical consequence of the axioms. Besides, the analysis of its proof gives us a chance to introduce plenty of logic.

We begin our analysis of this proof by translating the statement of the theorem into formal logic. There are many equivalent ways one could read this theorem. I shall choose the one requiring the most logical analysis. So one could say that it means,
displaymath2716

where again tex2html_wrap_inline2745 is an abbreviation for tex2html_wrap_inline2747 so that, according to axiom 10 on page gif, tex2html_wrap_inline2749 is equivalent to y < x.

The proof is a proof by contradiction. We begin by adding the negation of the theorem to our data base of facts and adopt the goal of trying to prove a contradiction. Usually this means deducing enough facts to add to the data base so that some statement and its negation are both facts. Going for a proof by contradiction has one big advantage and one big disadvantage. The advantage is that we have an extra fact to work with. The disadvantage is that the goal is kind of vague. With the goal just being, ``Find some contradiction,'' we cannot work backwards from that goal very easily until we have a better idea as to just which contradiction we are seeking. So we must really begin with some forward argument.

The new fact to add to the data base is the negation of the theorem,


 equation639

When faced with a fact like this it is almost always advisable to ``move the negation sign into the formula.'' This means replacing the formula by an equivalent formula in which the negation sign does not come right at the beginning. There are very specific rules about how this is done. Here they are.

  1.   tex2html_wrap_inline2753 iff tex2html_wrap_inline2755
  2.   tex2html_wrap_inline2757 iff tex2html_wrap_inline2759
  3.   tex2html_wrap_inline2761 iff tex2html_wrap_inline2763
  4.   tex2html_wrap_inline2765 iff tex2html_wrap_inline2767
  5.   tex2html_wrap_inline2769 iff tex2html_wrap_inline2771
  6.   tex2html_wrap_inline2773 iff A

We can apply rule 1 to replace (2.16) by the equivalent fact,


 equation652

Then use rule 4 to push the negation sign in still further, replacing (2.17) by,


 equation657


exercise660

We can now apply the tex2html_wrap_inline1613-left-(P) rule on page gif to (2.18) using the new variable b to get,


 equation667

It is our intention to use (2.19) to deduce that tex2html_wrap_inline2791 is non-empty. There is a problem however. Our logical symbolism does not yet include the richness necessary to name this set. Nor does it include the axioms necessary for proving anything about sets. We can overcome this in an extremely simple and direct manner. A constant symbol is just a string of symbols we wish to use in a certain way, so declare the string, tex2html_wrap_inline2791, to be a constant symbol in the universe of sets and add to our axiom list the two new axioms,


 equation671

 equation674

Of course, it would be somewhat ad hoc to do this just for this one set. We need to add axioms for every possible formula. If A(x) is a formula and x is the only free variable of A(x), we declare tex2html_wrap_inline2801 to be a constant symbol for the universe of sets and add to our axiom base two axioms corresponding to (2.20) and (2.21). This will suffice for all the applications we will present, but the reader should be warned that it is not enough to fully exploit the induction axiom. For the full system, we must allow A(x) to have free variables other than x. Then tex2html_wrap_inline2801 would no longer be a constant symbol but would have to be a function symbol. We will not go into details.

So with axiom (2.20) and fact (2.19) we can use the tex2html_wrap_inline1615-left-(P) rule to deduce tex2html_wrap_inline2811 and so, applying the tex2html_wrap_inline1613-right-(P) rule,


 equation684

Now we need to prove the other hypothesis of the induction axiom,


 equation687

Using tex2html_wrap_inline1613-right-(P), we can replace this goal with the new goal:


equation691

To prove this goal, use the tex2html_wrap_inline1615-right-(P) rule and the tex2html_wrap_inline1595-right-(P) rule. In other words, introduce a new variable, c, and assume tex2html_wrap_inline2823 in order to prove 0 < c. This goal follows easily using axiom (2.21) with the tex2html_wrap_inline1615-left-(P) and tex2html_wrap_inline1595-left-(P) rules.

Now use the tex2html_wrap_inline1615-left-(P) rule on the induction axiom (2.15), putting the term tex2html_wrap_inline2791 in for X. Then combine the result with (2.22) and (2.23). Apply the tex2html_wrap_inline1613-left-(P) (with new variable a) and tex2html_wrap_inline1611-left-(P) rules to the result to get the two new facts,


 equation704

 equation707

From (2.25) and axiom (2.21) it follows that 0 < a < 1. We can now use tex2html_wrap_inline1615-left-(P) on axiom 13 to deduce that tex2html_wrap_inline2847. This is a clear contradiction with what we get by putting tex2html_wrap_inline2849 in for x in (2.26), completing the proof.


definition715

 theorem717

proof720

As far as logic is concerned, the new feature of this proof is that we used the fact that either n is prime or it is not prime. Another logical law you may use in a proof is the  Law of the excluded middle. It says that you may at any time add any fact of the form tex2html_wrap_inline2915 to the data base of facts. Such facts of course get used with the tex2html_wrap_inline1655-left-(P) rule which says that you break the proof down into two subproofs, one using A as a fact, the other using tex2html_wrap_inline1829.


exercise729

Our proof of the next theorem uses the factorial function, n!. This function is defined by induction for all integers tex2html_wrap_inline2925 with the equations:

0! = 1
tex2html_wrap_inline2929

There does not appear to be any real good way to include such functions into our logical symbolism. The only feasible way is to make up a new function symbol, ! and add it to the language of arithmetic and then add axioms to define it. So we must add the axioms,

  1. tex2html_wrap_inline2933
  2. 0! = 1
  3. tex2html_wrap_inline2937


 exercise736


 theorem739


proof742


exercise746

This concludes our excursion into number theory. I hope the reader has been enlightened by our sometimes painful analysis of the obvious.


next up previous contents index
Next: References Up: Some Case Studies Previous: A theorem of number

Richard Mansfield
Wed Oct 21 14:09:45 EDT 1998