Banach fixed-point theorem

In mathematics, the Banach fixed-point theorem (also known as the contraction mapping theorem or contractive mapping theorem or Banach–Caccioppoli theorem) is an important tool in the theory of metric spaces; it guarantees the existence and uniqueness of fixed points of certain self-maps of metric spaces and provides a constructive method to find those fixed points. It can be understood as an abstract formulation of Picard's method of successive approximations.^[1] The theorem is named after Stefan Banach (1892–1945) who first stated it in 1922.^[2]^[3]

Statement

Definition. Let $(X,d)$ be a metric space. Then a map $T:X\to X$ is called a contraction mapping on $X$ if there exists $q\in [0,1)$ such that

d(T(x),T(y))\leq qd(x,y)

for all $x,y\in X.$

Banach fixed-point theorem. Let $(X,d)$ be a non-empty complete metric space with a contraction mapping $T:X\to X.$ Then $T$ admits a unique fixed point $x^{*}$ in $X$ (i.e. $T(x^{*})=x^{*}$). Furthermore, $x^{*}$ can be found as follows: start with an arbitrary element $x_{0}\in X$ and define a sequence $(x_{n})_{n\in \mathbb {N} }$ by $x_{n}=T(x_{n-1})$ for $n\geq 1.$ Then $\lim _{n\to \infty }x_{n}=x^{*}.$
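
The iteration in the theorem can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper name `fixed_point_iterate`, the tolerance, and the iteration cap are hypothetical choices, and the map $\cos$ on $[0,1]$ is used as an example of a contraction (it maps $[0,1]$ into itself and $|\cos'(x)|=|\sin x|\leq \sin 1<1$ there).

```python
import math

def fixed_point_iterate(T, x0, tol=1e-12, max_iter=1000):
    """Iterate x_n = T(x_{n-1}) until two successive iterates differ by less than tol."""
    x = x0
    for n in range(1, max_iter + 1):
        x_next = T(x)
        if abs(x_next - x) < tol:
            return x_next, n  # approximate fixed point and number of steps used
        x = x_next
    raise RuntimeError("no convergence within max_iter iterations")

# Example: cos is a contraction on [0, 1], so every start value in [0, 1] works.
x_star, steps = fixed_point_iterate(math.cos, 0.5)
```

Stopping when successive iterates are close is justified by the second inequality of Remark 1, which bounds the true error by a multiple of the last step size.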

Remark 1. The following inequalities are equivalent and describe the speed of convergence:

{\begin{aligned}d(x^{*},x_{n})&\leq {\frac {q^{n}}{1-q}}d(x_{1},x_{0}),\\[5pt]d(x^{*},x_{n+1})&\leq {\frac {q}{1-q}}d(x_{n+1},x_{n}),\\[5pt]d(x^{*},x_{n+1})&\leq qd(x^{*},x_{n}).\end{aligned}}

Any such value of $q$ is called a Lipschitz constant for $T$, and the smallest one is sometimes called "the best Lipschitz constant" of $T$.
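
As a rough numerical check of Remark 1, the sketch below tests the three inequalities for $T=\cos$ on $[0,1]$ with the Lipschitz constant $q=\sin 1$. The example map, the start value 0.5, and the helper names `bounds_hold` and `iterations_needed` are assumptions made for illustration; the reference value `x_star` is itself obtained by iterating long enough that it is stable in floating point.

```python
import math

T = math.cos
q = math.sin(1.0)  # Lipschitz constant on [0, 1]: |cos'(x)| = |sin x| <= sin 1

# Build the orbit x_0, x_1, ..., x_200 and use the last iterate as a stand-in for x*.
xs = [0.5]
for _ in range(200):
    xs.append(T(xs[-1]))
x_star = xs[-1]
d10 = abs(xs[1] - xs[0])  # d(x_1, x_0)

def bounds_hold(n):
    """Check the three inequalities of Remark 1 at index n."""
    a_priori = abs(x_star - xs[n]) <= q**n / (1 - q) * d10
    a_posteriori = abs(x_star - xs[n + 1]) <= q / (1 - q) * abs(xs[n + 1] - xs[n])
    linear = abs(x_star - xs[n + 1]) <= q * abs(x_star - xs[n])
    return a_priori and a_posteriori and linear

def iterations_needed(eps):
    """Smallest n for which the a priori bound q^n/(1-q) * d(x_1, x_0) is at most eps."""
    return math.ceil(math.log(eps * (1 - q) / d10) / math.log(q))
```

The a priori bound is usually pessimistic: it uses the worst-case constant $q$ on the whole interval, while the observed contraction rate near the fixed point can be smaller.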

Remark 2. $d(T(x),T(y))<d(x,y)$ for all $x\neq y$ is in general not enough to ensure the existence of a fixed point, as is shown by the map

T:[1,\infty )\to [1,\infty ),\,\,T(x)=x+{\tfrac {1}{x}}\,,

which lacks a fixed point. However, if $X$ is compact, then this weaker assumption does imply the existence and uniqueness of a fixed point, which can be found as a minimizer of $d(x,T(x))$: a minimizer exists by compactness and continuity, and it must be a fixed point of $T,$ since otherwise $d(T(x),T(T(x)))<d(x,T(x))$ would contradict minimality. It then follows that the fixed point is the limit of any sequence of iterations of $T.$
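
The counterexample in Remark 2 can be explored numerically. The sketch below (helper names `ratio` and `orbit` are illustrative assumptions) uses the identity $T(x)-T(y)=(x-y)\left(1-{\tfrac {1}{xy}}\right)$: the ratio $d(T(x),T(y))/d(x,y)$ is always below 1 on $[1,\infty )$ but comes arbitrarily close to 1, so no single $q<1$ works, and the iterates drift off to infinity instead of converging.

```python
def T(x):
    return x + 1.0 / x

def ratio(x, y):
    """Distance ratio d(T(x), T(y)) / d(x, y) for distinct x, y in [1, infinity)."""
    return abs(T(x) - T(y)) / abs(x - y)

# Orbit of x_0 = 1: each step moves right by 1/x_n > 0, so there is no fixed point.
orbit = [1.0]
for _ in range(1000):
    orbit.append(T(orbit[-1]))

near_origin = ratio(1.0, 2.0)     # well below 1
far_out = ratio(1000.0, 2000.0)   # below 1, but very close to it
```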

Remark 3. When using the theorem in practice, the most difficult part is typically to define $X$ properly so that $T(X)\subseteq X.$

Proof

Let $x_{0}\in X$ be arbitrary and define a sequence $(x_{n})_{n\in \mathbb {N} }$ by setting $x_{n}=T(x_{n-1})$ . We first note that for all $n\in \mathbb {N} ,$ we have the inequality

d(x_{n+1},x_{n})\leq q^{n}d(x_{1},x_{0}).

This follows by induction on $n$, using the fact that $T$ is a contraction mapping: for $n\geq 1,$ $d(x_{n+1},x_{n})=d(T(x_{n}),T(x_{n-1}))\leq q\,d(x_{n},x_{n-1}).$ Next we show that $(x_{n})_{n\in \mathbb {N} }$ is a Cauchy sequence. Let $m,n\in \mathbb {N}$ with $m>n$:

{\begin{aligned}d(x_{m},x_{n})&\leq d(x_{m},x_{m-1})+d(x_{m-1},x_{m-2})+\cdots +d(x_{n+1},x_{n})\\[5pt]&\leq q^{m-1}d(x_{1},x_{0})+q^{m-2}d(x_{1},x_{0})+\cdots +q^{n}d(x_{1},x_{0})\\[5pt]&=q^{n}d(x_{1},x_{0})\sum _{k=0}^{m-n-1}q^{k}\\[5pt]&\leq q^{n}d(x_{1},x_{0})\sum _{k=0}^{\infty }q^{k}\\[5pt]&=q^{n}d(x_{1},x_{0})\left({\frac {1}{1-q}}\right).\end{aligned}}

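Let $\varepsilon >0$ be arbitrary. If $d(x_{1},x_{0})=0,$ then $x_{0}$ is already a fixed point and the sequence is constant, so assume $d(x_{1},x_{0})>0.$ Since $q\in [0,1)$, we can find a large $N\in \mathbb {N}$ so that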

q^{N}<{\frac {\varepsilon (1-q)}{d(x_{1},x_{0})}}.

Therefore, by choosing $m$ and $n$ greater than $N$ we may write:

d(x_{m},x_{n})\leq q^{n}d(x_{1},x_{0})\left({\frac {1}{1-q}}\right)<\left({\frac {\varepsilon (1-q)}{d(x_{1},x_{0})}}\right)d(x_{1},x_{0})\left({\frac {1}{1-q}}\right)=\varepsilon .

This proves that the sequence $(x_{n})_{n\in \mathbb {N} }$ is Cauchy. By completeness of $(X,d)$ , the sequence has a limit $x^{*}\in X.$ Furthermore, $x^{*}$ must be a fixed point of $T$ :

x^{*}=\lim _{n\to \infty }x_{n}=\lim _{n\to \infty }T(x_{n-1})=T\left(\lim _{n\to \infty }x_{n-1}\right)=T(x^{*}).

As a contraction mapping, $T$ is continuous, so bringing the limit inside $T$ was justified. Lastly, $T$ cannot have more than one fixed point in $(X,d)$, since any pair of distinct fixed points $p_{1}$ and $p_{2}$ would contradict the contraction property of $T$:

d(T(p_{1}),T(p_{2}))=d(p_{1},p_{2})>qd(p_{1},p_{2}).

Applications

A standard application is the proof of the Picard–Lindelöf theorem about the existence and uniqueness of solutions to certain ordinary differential equations. The sought solution of the differential equation is expressed as a fixed point of a suitable integral operator on the space of continuous functions under the uniform norm. The Banach fixed-point theorem is then used to show that this integral operator has a unique fixed point.
One consequence of the Banach fixed-point theorem is that small Lipschitz perturbations of the identity are bi-Lipschitz homeomorphisms. Let Ω be an open set of a Banach space E; let I : Ω → E denote the identity (inclusion) map and let g : Ω → E be a Lipschitz map of constant k < 1. Then

Ω′ := (I + g)(Ω) is an open subset of E: precisely, for any x in Ω such that B(x, r) ⊂ Ω one has B((I + g)(x), r(1 − k)) ⊂ Ω′;
I + g : Ω → Ω′ is a bi-Lipschitz homeomorphism;

precisely, (I + g)⁻¹ is still of the form I + h : Ω′ → Ω with h a Lipschitz map of constant k/(1 − k). This result directly yields a proof of the inverse function theorem.

It can be used to give sufficient conditions under which Newton's method of successive approximations is guaranteed to work, and similarly for Chebyshev's third-order method.
It can be used to prove existence and uniqueness of solutions to integral equations.
It can be used to give a proof of the Nash embedding theorem.^[4]
It can be used to prove existence and uniqueness of solutions to value iteration, policy iteration, and policy evaluation of reinforcement learning.^[5]
It can be used to prove existence and uniqueness of an equilibrium in Cournot competition,^[6] and other dynamic economic models.^[7]
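
To illustrate the Picard–Lindelöf application listed above, the following sketch runs Picard iteration for the initial value problem $y'=y,\ y(0)=1$, whose integral operator is $(Ty)(t)=1+\int _{0}^{t}y(s)\,ds$. The choice of this particular equation, the representation of polynomials as coefficient lists, and the name `picard_step` are assumptions made for the example. Each application of the operator adds one more term of the Taylor series of $e^{t}$.

```python
from fractions import Fraction

def picard_step(coeffs):
    """Apply (Ty)(t) = 1 + integral from 0 to t of y(s) ds to a polynomial.

    coeffs[k] is the coefficient of t**k; the result has one more coefficient.
    """
    # Integrating c * t**k gives c / (k + 1) * t**(k + 1); the constant term is y(0) = 1.
    return [Fraction(1)] + [c / (k + 1) for k, c in enumerate(coeffs)]

y = [Fraction(1)]  # start from the constant function y_0(t) = 1
for _ in range(5):
    y = picard_step(y)
# y now holds the coefficients 1, 1, 1/2, 1/6, 1/24, 1/120
```

Exact rational arithmetic is used so that the iterates can be compared with the known series coefficients $1/k!$ without rounding.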

Converses

Several converses of the Banach contraction principle exist. The following is due to Czesław Bessaga, from 1959:

Let f : X → X be a map of an abstract set such that each iterate fⁿ has a unique fixed point, and let $q\in (0,1).$ Then there exists a complete metric on X such that f is a contraction with contraction constant q.

Indeed, very weak assumptions suffice to obtain a converse of this kind. For example, if $f:X\to X$ is a map on a T₁ topological space with a unique fixed point a, such that for each $x\in X$ we have fⁿ(x) → a, then there already exists a metric on X with respect to which f satisfies the conditions of the Banach contraction principle with contraction constant 1/2.^[8] In this case the metric is in fact an ultrametric.

Generalizations

There are a number of generalizations (some of which are immediate corollaries).^[9]

Let T : X → X be a map on a complete non-empty metric space. Then, for example, some generalizations of the Banach fixed-point theorem are:

Assume that some iterate Tⁿ of T is a contraction. Then T has a unique fixed point.
Assume that for each n there exists c_n such that d(Tⁿ(x), Tⁿ(y)) ≤ c_n d(x, y) for all x and y, and that

\sum \nolimits _{n}c_{n}<\infty .

Then T has a unique fixed point.
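
As an illustration of the first generalization (an example chosen here, not taken from the text above), consider $T=\cos$ on the whole real line. It is not a contraction, because $|\cos'(x)|=|\sin x|$ reaches 1, but $T^{2}=\cos \circ \cos$ is one, since $|(T^{2})'(x)|=|\sin(\cos x)\sin x|\leq \sin 1<1$. The sketch estimates both Lipschitz constants from difference quotients on a grid; the helper `max_slope`, the interval, and the grid size are hypothetical choices.

```python
import math

def T(x):
    return math.cos(x)

def T2(x):
    return T(T(x))  # second iterate of T

def max_slope(g, a, b, samples=20001):
    """Largest difference quotient of g between neighboring grid points on [a, b]."""
    h = (b - a) / (samples - 1)
    xs = [a + i * h for i in range(samples)]
    return max(abs(g(xs[i + 1]) - g(xs[i])) / h for i in range(samples - 1))

slope_T = max_slope(T, -10.0, 10.0)    # close to 1: no contraction constant q < 1
slope_T2 = max_slope(T2, -10.0, 10.0)  # stays below sin(1)
```

A grid estimate of this kind only gives a lower bound on the true Lipschitz constant, so it supports the analytic bound above rather than replacing it.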

In applications, the existence and uniqueness of a fixed point can often be shown directly with the standard Banach fixed-point theorem, by a suitable choice of the metric that makes the map T a contraction. Indeed, the above result by Bessaga strongly suggests looking for such a metric. See also the article on fixed point theorems in infinite-dimensional spaces for generalizations.

In a non-empty compact metric space, any function $T$ satisfying $d(T(x),T(y))<d(x,y)$ for all distinct $x,y$ has a unique fixed point. The proof is simpler than that of the Banach theorem, because the function $d(T(x),x)$ is continuous and therefore attains a minimum, which is easily shown to be zero.

A different class of generalizations arises from suitable generalizations of the notion of metric space, e.g. by weakening the defining axioms for the notion of metric.^[10] Some of these have applications, e.g., in the theory of programming semantics in theoretical computer science.^[11]

Example

The Banach fixed-point theorem and fixed-point iteration can be used to quickly obtain an approximation of $\pi$ with high accuracy. Consider the function $f(x)=\sin(x)+x$. It can be verified that $\pi$ is a fixed point of $f$, and that $f$ maps the interval $\left[3\pi /4,5\pi /4\right]$ to itself. Moreover, $f'(x)=1+\cos(x)$, and it can be verified that

0\leq 1+\cos(x)\leq 1-{\frac {1}{\sqrt {2}}}<1

on this interval. Therefore, by an application of the mean value theorem, $f$ has a Lipschitz constant less than 1 (namely $1-1/{\sqrt {2}}$). Applying the Banach fixed-point theorem shows that the fixed point $\pi$ is the unique fixed point on the interval, allowing for fixed-point iteration to be used.

For example, the value 3 may be chosen to start the fixed-point iteration, as $3\pi /4\leq 3\leq 5\pi /4$ . The Banach fixed-point theorem may be used to conclude that

\pi =f(f(f(\cdots f(3)\cdots ))).

Applying $f$ to 3 only three times already yields an expansion of $\pi$ accurate to 33 digits:

f(f(f(3)))=3.141592653589793238462643383279502\ldots \,.
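
The rapid convergence in this example can be reproduced with Python's `decimal` module. The sketch below is an illustration under stated assumptions: the sine is summed from its Taylor series because the standard library has no arbitrary-precision sine, and the names `sin_taylor` and `PI_REF` as well as the working precision of 60 digits are choices made here. `PI_REF` is the known expansion of $\pi$ to 40 decimal places and serves only as a reference for measuring the error.

```python
from decimal import Decimal, getcontext

getcontext().prec = 60  # working precision in significant digits

def sin_taylor(x):
    """Sum the Taylor series of sin at 0 until the terms drop below 1e-58."""
    term = x
    total = x
    k = 1
    cutoff = Decimal(10) ** -58
    while abs(term) > cutoff:
        # Next term: multiply by -x^2 / ((2k)(2k + 1)).
        term = -term * x * x / ((2 * k) * (2 * k + 1))
        total += term
        k += 1
    return total

def f(x):
    return x + sin_taylor(x)

PI_REF = Decimal("3.1415926535897932384626433832795028841971")

x = Decimal(3)
errors = []
for _ in range(3):
    x = f(x)
    errors.append(abs(PI_REF - x))  # distance to pi after each application of f
```

The error roughly cubes at every step, since $x-\sin x\approx x^{3}/6$ for small $x$. This is much faster than the linear rate guaranteed by the contraction constant $1-1/{\sqrt {2}}$ alone.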

Notes

[1] Kinderlehrer, David; Stampacchia, Guido (1980). "Variational Inequalities in R^N". An Introduction to Variational Inequalities and Their Applications. New York: Academic Press. pp. 7–22. ISBN 0-12-407350-6.
[2] Banach, Stefan (1922). "Sur les opérations dans les ensembles abstraits et leur application aux équations intégrales". Fundamenta Mathematicae. 3: 133–181. doi:10.4064/fm-3-1-133-181.
[3] Ciesielski, Krzysztof (2007). "On Stefan Banach and some of his results". Banach J. Math. Anal. 1 (1): 1–10. doi:10.15352/bjma/1240321550.
[4] Günther, Matthias (1989). "Zum Einbettungssatz von J. Nash" [On the embedding theorem of J. Nash]. Mathematische Nachrichten (in German). 144: 165–187. doi:10.1002/mana.19891440113. MR 1037168.
[5] Lewis, Frank L.; Vrabie, Draguna; Syrmos, Vassilis L. (2012). "Reinforcement Learning and Optimal Adaptive Control". Optimal Control. New York: John Wiley & Sons. pp. 461–517 [p. 474]. ISBN 978-1-118-12272-3.
[6] Long, Ngo Van; Soubeyran, Antoine (2000). "Existence and Uniqueness of Cournot Equilibrium: A Contraction Mapping Approach". Economics Letters. 67 (3): 345–348. doi:10.1016/S0165-1765(00)00211-1.
[7] Stokey, Nancy L.; Lucas, Robert E. Jr. (1989). Recursive Methods in Economic Dynamics. Cambridge: Harvard University Press. pp. 508–516. ISBN 0-674-75096-9.
[8] Hitzler, Pascal; Seda, Anthony K. (2001). "A 'Converse' of the Banach Contraction Mapping Theorem". Journal of Electrical Engineering. 52 (10/s): 3–6.
[9] Latif, Abdul (2014). "Banach Contraction Principle and its Generalizations". Topics in Fixed Point Theory. Springer. pp. 33–64. doi:10.1007/978-3-319-01586-6_2. ISBN 978-3-319-01585-9.
[10] Hitzler, Pascal; Seda, Anthony (2010). Mathematical Aspects of Logic Programming Semantics. Chapman and Hall/CRC. ISBN 978-1-4398-2961-5.
[11] Seda, Anthony K.; Hitzler, Pascal (2010). "Generalized Distance Functions in the Theory of Computation". The Computer Journal. 53 (4): 443–464. doi:10.1093/comjnl/bxm108.
