Section 10.2 The Chain Rule
Subsection 10.2.1 Multivariable Composition
We defined partial derivatives to measure rates of change in a particular variable. We extended this to change in any unit direction with directional derivatives. We can extend this even further, but considering the change in a function as we move along a parametric curve in the domain.
Let \(f(x,y,z): \RR^3 \rightarrow\RR\text{,}\) be a potential energy function. Let \(\gamma(t) = (x(t), y(t), z(t))\) be a curve moving through \(\RR^3\text{.}\) I want to know how quickly energy is gained or lost move along the path \(\gamma\text{.}\) The energy along \(\gamma\) is \(f(\gamma(t)) = f(x(t), y(t), z(t))\text{.}\) The rate of change is \(\frac{df}{dt}\text{.}\) But now \(f\) is a composition, \(f(\gamma(t))\text{,}\) so this must be a chain rule calculation. What is the chain rule when we have three (or more) components?.
Subsection 10.2.2 The Chain Rule
Proposition 10.2.1.
(The Chain Rule) Let \(f: \RR^n \rightarrow \RR\) be a scalar function and \(\gamma(t)\) a parametric curve in \(\RR^n\) inside the domain of \(f\text{.}\) The derivative of \(f\) along \(\gamma\) is
The total rate of change is the sum of the rates of changes in each of the variables.
For reference, here is the chain rule in \(\RR^3\text{.}\)
This is a slightly strange extension of the single-variable chain rule. Originally, the chain rule was for any composition. However, the chain rule is very specifically for the composition of a scalar field with a parameteric curve. However, upon reflection, you could realize that this is really the only possibility. If \(f\) and \(g\) are both functions \(\RR^3 \rightarrow \RR\text{,}\) then the composition \(f \circ g\) or \(g \circ f\) isn't even defined. The output of one is a scalar, but the input needs to be a vector. The only reasonable composition using the function we already know is to compose with a parametric curce, since a parametric curve outputs a vector. In Calculus IV, we will work with vector fields which could allow for other compositions, but for now, this is the only possibility.
Subsection 10.2.3 Chain Rule Examples
Example 10.2.2.
The variables are now mixed, so I have to make replacements. I repalce \(x\) and \(y\) with the matching components of the parametric curve. This matches what we do with the single variable chain rule: we differentiate then we replace the variable. The single variable notation, with the evaluation bar, it not typically used in multivariable situations.
Example 10.2.3.
Consider the potential gravitational energy function
If we move along a curve \(\gamma\text{,}\) it is nice to know how the potential energy changes. A helical path out of the gravity well might be \(\gamma(t) = (\sin t, \cos t, t)\text{.}\) We differentiate along this path using the chain rule.
Notice that if we let \(|\gamma(t)| = \sqrt{1 + t^2}\) at the start, we could have written \(P(t) = \frac{-GmM}{|\gamma(t)|} = \frac{-GmM}{\sqrt{1+t^2}}\) and the \(P^\prime(t) = \frac{GmMt}{\sqrt{(1+t^2)^3}}\) could have been calculated directly. That would have been easier, but its nice to get confirmation that working with the chain rule leads to the right result.