Section 1.5 Writing Proofs
Subsection 1.5.1 What Is A Proof?
As I said in Section 1.1, a major goal of this course is the introduction of more formalized mathematical language, particular for definitions and proofs. In this section, I want to start that process. I’m going to talk about several general ways to structure a proof, so that you can recognize them in the course and so that you can start to construct them yourself.
So what is a mathematical proof? A simple definition is hard, but my best attempt is very general: a proof is a formal, logical, mathematical argument that succeeds in convincing an informed reader. Let me talk through this definition.
- First, a proof is an argument. It is persuasive: it has a proposition, and it wants to convince its reader that the proposition is true. In this, it shares something with all the other persuasive writing throughout university, including scientific studies, aesthetic critiques, and persuasive essays.
- Second, a proof is logical. Though mathematical discovery can happen in a variety of ways, the presentation of mathematical ideas happen through logic. A proof will be a series of logical statements and implications. If successful, the chain of logic will lead from the assumptions of the proof to the desired conclusion.
- Third, a proof is formal. Mathematics has formal notations, definitions, and rules for manipulating expressions. A proof follows these formalities. It can’t treat mathematical ideas and definitions loosely or casually; it has to follow the rules and strict conventions of mathematics.
- Finally, it succeeds in convincing the reader. At the end of a proof, the question should always be: are you convinced? Proofs are accepted by the mathematical community when a sufficient number of qualified readers are convinced by the mathematical formality and logical sequence. In this course, when I present a proof, I will try to repeatedly ask if you are convinced by the proof. If you are not, either the proof is flawed, or your understanding of the argument is weak and you need to go back and study the assumptions, notations, and structure more carefully.
Subsection 1.5.2 How to Write a Proof
I desperately wish there were something simple and straightforward I could say here about how to write a proof. I wish I could give you a reasonable three-step process that would always produce good proofs. Unfortunately, no such thing exists. Writing proofs is the most difficult skill in mathematics. Writing a good proof is comparable to writing a good essay. I learn by reading good examples, practicing my own, and being patient with the long process.
In this course, I’m not going to give you any short, easy method for writing proofs (since, as I just said, no such method exists). Instead, I’m going to try to start the process of exposing you to proofs. In many ways, learning to write proofs is done by example — by reading lots of proofs and seeing how they work. I’ll show you a bunch of short proofs in this course and ask you to try to emulate them in the activities and assignments.
That said, I will give some pointers and ideas on how to start.
- Remember that a proof is a chain of logical arguments. You should see this structure in the proofs you write. Look out for any sudden jumps or gaps in the logic.
- Use your starting assumptions. If you are proving something about a prime number, but never use the fact that it is prime, something very strange is going on. Either you are making a mistake, or the assumption that the number is prime is unnecessary.
- Don’t assume what you are trying to prove. The statement that you want to prove should show up at the very end of your proof. You can’t start with the statement you want to prove — logical implications don’t work that way. A proof should be a logical chain that starts with the assumptions of the proof and end with the desired conclusion.
- There is a huge difference between the rough work that goes into sketching a proof and the final proof. In the rough work, I will often start with the conclusion and make various attempts to see how I might get there. I try things, play around with ideas, do test cases. But when I write up a proof, I take all this rough work and try to put it into a formal logical chain with the desired statement coming at the end, as a conclusion.
- A good proof includes good writing as well as good symbolic mathematics. Write clear, complete sentences. Treat proof-writing with the same level of care as you would any academic writing.
- There are a number of special proof techniques that are worth some attention. I’ll spend the next section on two of these techniques. Not all proofs need a named technique, but these two are common enough to point out at the start.
- I can’t prove by example. Proof is not inductive reasoning, like that which is used in scientific experiments. A proof should prove all possible instances of the proposition, without exception. A bunch of examples are interesting and can provide hints, but they never constitue a proof. (That said, working with examples is often a great idea when working on a proof. Example gives insight and often show the way to the general arguent. They just can’t be used as the persuasive evidence in the proof.)
- Similarly, I can’t prove by diagram, drawing or illustration. Drawing pictures of what is going on is often a great way to build intuition, to understand the situation. It can often guild you in the right direction for constructing a proof. But a visualization itself does not constitute a complete logical argument. It doesn’t meet the standard that mathematics has set for a proof.
Subsection 1.5.3 Proof by Contradiction
There are many common structures and techniques used with proofs. I’m going to share with you just one of the most common techniques: proof by contradiction. This technique relies on the assumption that any (well-formed) statement in mathematics is either true or false. To prove a statement is true, I can equivalently show that it is impossible for the statement to be false.
To prove by contradiction, I assume the statement I want to prove is false. Then, starting from that assumption, I make logical conclusions. These logical conclusions lead me to an impossible situation — a contradiction. This contradiction can take many forms: a contradiction of some statement I know is true, an impossibility of a statement being both true and false, or some other situation which cannot make sense. Since I end up with this contradiction, my original assumption must have been wrong. Since I assumed the statement was false, the statement must therefore be true.
This is perhaps easier to see in an example. I’ll do two examples, both of which are classic proof examples. (If you’ve study anything about proofs elsewhere, there is a good chance you’ve seen these examples.) First, I’ll prove that there are infinitely many prime numbers.
- Step 1: Assume the statement is false. The statement says there are infinitely many prime numbers. If that is false, there must be only finitely many prime numbers. So my assumption is that there are finitely many prime numbers.
- Step 2: Work from the assumption to build a contradiction. The assumption says that there are finitely many prime numbers. Since there are only finitely many, I can index them: \(p_1, p_2, p_3, \ldots p_k\text{.}\) This is a finite list of some length \(k\text{.}\) Then consider a very large number formed by multiplying these all together and adding 1.\begin{equation*} m = p_1p_2 \ldots p_k + 1 \end{equation*}What are the prime factors of this number \(m\text{?}\) I know that all numbers break down uniquely into prime factors, so it must have prime factors. However, every prime number \(p_i\) in my list (which, by assumption, is all prime numbers) is a factor of \(m-1\text{.}\) A prime number can’t be the factor of two consecutive numbers, so none of the prime numbers are factors of \(m\text{.}\)
- Step 3: State the contradiction. The number \(m\) must have prime factors, but cannot have prime factors, since none of the primes \(p_i\) are factors. This is a contradiction: the number must have prime factors but has none.
- Step 4: Conclude that the original statement was false, which finishes the proof. The assumption that there were only finitely many prime numbers led to a contradiction. Therefore, by contradiction, I have proved that there are infinitely many prime numbers.
This proof is non-constructive. It tells me that there are infinitely many prime numbers, but it doesn’t tell me anything about them: what they are, how to find them, how they are distributed on the number line, etc. This is often a major weakness of proofs by contradiction: they may not give concrete information about the objects in the proposition.
Another classic proof example is the proof that \(\sqrt{2}\) is irrational. I will also prove this by contradiction.
- Step 1: Assume the statement is false. The statement says that \(\sqrt{2}\) is irrational. If that is false, then \(\sqrt{2}\) must be rational. Therefore, my assumption is that \(\sqrt{2}\) is rational.
- Step 2: Work from the assumption to build a contradiction. If \(\sqrt{2}\) is rational, then there are integers \(a\) and \(b\) such that \(\sqrt{2} = \frac{a}{b}\text{.}\) Since any fraction can be written in the simplest terms, I can assume that \(a\) and \(b\) have no common factors. Then I can do some arithmetic with this equation.\begin{align*} \sqrt{2} \amp = \frac{a}{b}\\ 2 \amp = \frac{a^2}{b^2} \\ 2b^2 \amp = a^2 \end{align*}\(2\) divides the left side of the equation, so it must divide the right side as well, meaning that \(a^2\) must be even. But the squares of odd numbers are odd, and the squares of even numbers are even. Therefore, \(a\) must be even. I can write \(a = 2c\) for some integer \(c\text{.}\) Then I’ll replace \(a\) by \(2c\) in the equation and do a bit more arithmetic.\begin{align*} 2b^2 \amp = a^2 \\ 2b^2 \amp = (2c)^2 \\ 2b^2 \amp = 4c^2 \\ b^2 \amp = 2c^2 \end{align*}Now \(2\) divides the right side of the equation, so it must divide the left as well. That means that \(b^2\) is even, so \(b\) must be even.
- Step 3: State the contradiction. I assumed that \(a\) and \(b\) have no common factors since I wrote the fraction in the simplest terms. Now I have proved that both numbers are even, so \(2\) is a factor of both. This is a contradiction: \(a\) and \(b\) have no common factors, but \(2\) is a common factor.
- Step 4: Conclude that the original statement was false, which finishes the proof. The assumption that \(\sqrt{2}\) was rational produced to a contradiction, so I have proved that \(\sqrt{2}\) must be irrational.
Proof by contradiction, as I mentioned at the start, relies on the assumption that any (well-formed) statement in mathematics is either true or false. That seems like a reasonable assumption in a logical system, and proof by contradiction is an important tool in mainstream mathematics. However, some versions of mathematics have rejected this assumption. There are some versions of mathematics that do not allow proof by contradiction. (These are called intuitionistic or constructivist mathematics).