In calculus, the inverse function rule is a formula that expresses the derivative of the inverse of a bijective and differentiable function f in terms of the derivative of f. More precisely, if the inverse of f {\displaystyle f} is denoted as f − 1 {\displaystyle f^{-1}} , where f − 1 ( y ) = x {\displaystyle f^{-1}(y)=x} if and only if f ( x ) = y {\displaystyle f(x)=y} , then the inverse function rule is, in Lagrange's notation,
This formula holds in general whenever f {\displaystyle f} is continuous and injective on an interval I, with f {\displaystyle f} being differentiable at f − 1 ( y ) {\displaystyle f^{-1}(y)} ( ∈ I {\displaystyle \in I} ) and where f ′ ( f − 1 ( y ) ) ≠ 0 {\displaystyle f'(f^{-1}(y))\neq 0} . The same formula is also equivalent to the expression
where D {\displaystyle {\mathcal {D}}} denotes the unary derivative operator (on the space of functions) and ∘ {\displaystyle \circ } denotes function composition.
Geometrically, a function and inverse function have graphs that are reflections, in the line y = x {\displaystyle y=x} . This reflection operation turns the gradient of any line into its reciprocal.[1]
Assuming that f {\displaystyle f} has an inverse in a neighbourhood of x {\displaystyle x} and that its derivative at that point is non-zero, its inverse is guaranteed to be differentiable at x {\displaystyle x} and have a derivative given by the above formula.
The inverse function rule may also be expressed in Leibniz's notation. As that notation suggests,
This relation is obtained by differentiating the equation f − 1 ( y ) = x {\displaystyle f^{-1}(y)=x} in terms of x and applying the chain rule, yielding that:
considering that the derivative of x with respect to x is 1.
Let f {\displaystyle f} be an invertible (bijective) function, let x {\displaystyle x} be in the domain of f {\displaystyle f} , and let y {\displaystyle y} be in the codomain of f {\displaystyle f} . Since f {\displaystyle f} is a bijective function, y {\displaystyle y} is in the range of f {\displaystyle f} . This also means that y {\displaystyle y} is in the domain of f − 1 {\displaystyle f^{-1}} , and that x {\displaystyle x} is in the codomain of f − 1 {\displaystyle f^{-1}} . Since f {\displaystyle f} is an invertible function, we know that f ( f − 1 ( y ) ) = y {\displaystyle f(f^{-1}(y))=y} . The inverse function rule can be obtained by taking the derivative of this equation.
The right side is equal to 1 and the chain rule can be applied to the left side:
Rearranging then gives
At x = 0 {\displaystyle x=0} , however, there is a problem: the graph of the square root function becomes vertical, corresponding to a horizontal tangent for the square function.
Let z = f ′ ( x ) {\displaystyle z=f'(x)} then we have, assuming f ″ ( x ) ≠ 0 {\displaystyle f''(x)\neq 0} : d ( f ′ ) − 1 ( z ) d z = 1 f ″ ( x ) {\displaystyle {\frac {d(f')^{-1}(z)}{dz}}={\frac {1}{f''(x)}}} This can be shown using the previous notation y = f ( x ) {\displaystyle y=f(x)} . Then we have:
By induction, we can generalize this result for any integer n ≥ 1 {\displaystyle n\geq 1} , with z = f ( n ) ( x ) {\displaystyle z=f^{(n)}(x)} , the nth derivative of f(x), and y = f ( n − 1 ) ( x ) {\displaystyle y=f^{(n-1)}(x)} , assuming f ( i ) ( x ) ≠ 0 for 0 < i ≤ n + 1 {\displaystyle f^{(i)}(x)\neq 0{\text{ for }}0<i\leq n+1} :
The chain rule given above is obtained by differentiating the identity f − 1 ( f ( x ) ) = x {\displaystyle f^{-1}(f(x))=x} with respect to x. One can continue the same process for higher derivatives. Differentiating the identity twice with respect to x, one obtains
that is simplified further by the chain rule as
Replacing the first derivative, using the identity obtained earlier, we get
Similarly for the third derivative:
or using the formula for the second derivative,
These formulas are generalized by the Faà di Bruno's formula.
These formulas can also be written using Lagrange's notation. If f and g are inverses, then
so that
which agrees with the direct calculation.