Inverse Linear Transforms and Matrix Inversion

Section 7.1 Inverse Linear Transforms and Matrix Inversion

Subsection 7.1.1 Inverse Transforms

I’ve now defined linear transformations. It is possible that a linear transformation is invertible, much like a function of numbers. Let me give the definition.

Definition 7.1.1.

Let \(T: \RR^n \rightarrow \RR^n\text{.}\) The inverse transformation of \(T\) is a transformation \(S: \RR^n \rightarrow \RR^n\) which undoes what \(T\) did. Equivalently, it is a transformation \(S\) such that both the compositions \(T \circ S\) and \(S \circ T\) are the identity. For an arbitrary transformation, there is no guarantee that an inverse exists; if it does, it is written \(T^{-1}\text{.}\)

Since I’ve encoded transformations as matrices, I can use matrices to understand the inverses of linear transformations.

Subsection 7.1.2 Inverse Matrices

Before returning to inverse transformations, I’m going to talk about the algebraic idea of inversion and reciprocals. In number systems, when there is an identity, there usually is a way of getting back to that identity. Zero is the identity for addition. For any number \(a\text{,}\) there exists a number \((-a)\) so that \(a + (-a) = 0\) gets back to the identity (this obviously works in numbers systems that allow negatives, such as \(\ZZ\)). One is the identity in multiplication. For any non-zero number \(a\text{,}\) there is the number \(\frac{1}{a}\) so that \(a \frac{1}{a} = 1\) gets back to the identity (this obviously works in numbers systems that allow reciprocals). For matrices, I have already defined the identity matrix, which is the identity for matrix multiplication. If \(A\) is a \(k \times l\) matrix, then \(AI_l = I_k A = A\text{.}\) For matrices, multiplying by the identity doesn’t change the matrix.

For matrices, I have the same question as I had for numbers systems. For any matrix \(M\text{,}\) is there another matrix \(N\) such that \(MN = I\text{?}\) Is there a way to get back to the identity? Multiplication of numbers already showed that no such number need exist, since I cannot divide by zero. For matrices, I have to be even more cautious. For now, here is the formal definition.

Definition 7.1.2.

Let \(M\) be a \(n \times n\) (square) matrix. Then the inverse of \(M\) is the unique matrix \(M^{-1}\) (if it exists) such that \(MM^{-1} = M^{-1}M = I_n\text{.}\)

I should note a couple of thing about this definition. First, it only applies to square matrices. I don’t even try to invert non-square matrices. Second, both multiplications are required: \(MM^{-1}\) and \(M^{-1}M\text{.}\) Both are required because matrix multiplication is non-commutative. In general, these two orders could result in different products. In this definition, I insist that both orders produce the identity matrix.

Subsection 7.1.3 Calculating Inverse Matrices

The definition is good, but I am left with the problem of determining which matrices have inverses and calculating those inverses. It turns out there is a convenient algorithm using techniques I have already presented. I will write an extended matrix \((M|I_n)\) with the original matrix and the identity matrix next to each other, separated by a vertical line. Then I row reduce, treating the conglomeration as one matrix with long rows. If the row reduction of \(M\) results in the identity on the left, then the matrix has the form \((I|M^{-1})\text{;}\) the right-hand side will be the inverse.

This algorithm leads to two observations. First, obviously, it is a way to calculate inverses. But it also is a condition: an \(n \times n\) matrix is invertible, in this algorithm, only if it row reduces to the identity. This is part of a general result, which is the first of several conditions for invertible matrices.

Proposition 7.1.3.

A \(n \times n\) matrix is invertible if and only if it row reduces to the identity matrix \(I_n\text{.}\) Since the identity has \(n\) leading ones, this is equivalent to the matrix having rank \(n\text{.}\)

Definition 7.1.4.

The set of all intervible \(n \times n\) matrices with real coefficient forms a group. It is called the General Linear Group and written \(GL_n(\RR)\text{.}\) If I wanted to change the coefficients to another set of scalars \(S\text{,}\) I would write \(GL_n(S)\text{.}\) When the coefficients are understood, the notation is sometimes simplified to \(GL_n\) or \(GL(n)\text{.}\)

Prev Top Next