Sublinear function

In linear algebra, a sublinear function (or functional as is more often used in functional analysis), also called a quasi-seminorm or a Banach functional, on a vector space $X$ is a real-valued function with only some of the properties of a seminorm. Unlike seminorms, a sublinear function does not have to be nonnegative-valued and also does not have to be absolutely homogeneous. Seminorms are themselves abstractions of the more well known notion of norms, where a seminorm has all the defining properties of a norm except that it is not required to map non-zero vectors to non-zero values.

In functional analysis the name Banach functional is sometimes used, reflecting that they are most commonly used when applying a general formulation of the Hahn–Banach theorem. The notion of a sublinear function was introduced by Stefan Banach when he proved his version of the Hahn-Banach theorem.^[1]

There is also a different notion in computer science, described below, that also goes by the name "sublinear function."

Definitions

Let $X$ be a vector space over a field $\mathbb {K} ,$ where $\mathbb {K}$ is either the real numbers $\mathbb {R}$ or complex numbers $\mathbb {C} .$ A real-valued function $p:X\to \mathbb {R}$ on $X$ is called a sublinear function (or a sublinear functional if $\mathbb {K} =\mathbb {R}$ ), and also sometimes called a quasi-seminorm or a Banach functional, if it has these two properties:^[1]

Positive homogeneity/Nonnegative homogeneity:^[2] $p(rx)=rp(x)$ $p(rx)=rp(x)$ for all real $r\geq 0$ $r\geq 0$ and all $x\in X.$ $x\in X.$
- This condition holds if and only if $p(rx)=rp(x)$ for all positive real $r>0$ and all $x\in X.$
Subadditivity/Triangle inequality:^[2] $p(x+y)\leq p(x)+p(y)$ $p(x+y)\leq p(x)+p(y)$ for all $x,y\in X.$ $x,y\in X.$
- This subadditivity condition requires $p$ to be real-valued.

A function $p:X\to \mathbb {R}$ is called positive^[3] or nonnegative if $p(x)\geq 0$ for all $x\in X,$ although some authors^[4] define positive to instead mean that $p(x)\neq 0$ whenever $x\neq 0;$ these definitions are not equivalent. It is a symmetric function if $p(-x)=p(x)$ for all $x\in X.$ Every subadditive symmetric function is necessarily nonnegative.^{[proof 1]} A sublinear function on a real vector space is symmetric if and only if it is a seminorm. A sublinear function on a real or complex vector space is a seminorm if and only if it is a balanced function or equivalently, if and only if $p(ux)\leq p(x)$ for every unit length scalar $u$ (satisfying $|u|=1$ ) and every $x\in X.$

The set of all sublinear functions on $X,$ denoted by $X^{\#},$ can be partially ordered by declaring $p\leq q$ if and only if $p(x)\leq q(x)$ for all $x\in X.$ A sublinear function is called minimal if it is a minimal element of $X^{\#}$ under this order. A sublinear function is minimal if and only if it is a real linear functional.^[1]

Examples and sufficient conditions

Every norm, seminorm, and real linear functional is a sublinear function. The identity function $\mathbb {R} \to \mathbb {R}$ on $X:=\mathbb {R}$ is an example of a sublinear function (in fact, it is even a linear functional) that is neither positive nor a seminorm; the same is true of this map's negation $x\mapsto -x.$ ^[5] More generally, for any real $a\leq b,$ the map ${\begin{alignedat}{4}S_{a,b}:\;&&\mathbb {R} &&\;\to \;&\mathbb {R} \\[0.3ex]&&x&&\;\mapsto \;&{\begin{cases}ax&{\text{ if }}x\leq 0\\bx&{\text{ if }}x\geq 0\\\end{cases}}\\\end{alignedat}}$ is a sublinear function on $X:=\mathbb {R}$ and moreover, every sublinear function $p:\mathbb {R} \to \mathbb {R}$ is of this form; specifically, if $a:=-p(-1)$ and $b:=p(1)$ then $a\leq b$ and $p=S_{a,b}.$

If $p$ and $q$ are sublinear functions on a real vector space $X$ then so is the map $x\mapsto \max\{p(x),q(x)\}.$ More generally, if ${\mathcal {P}}$ is any non-empty collection of sublinear functionals on a real vector space $X$ and if for all $x\in X,$ $q(x):=\sup\{p(x):p\in {\mathcal {P}}\},$ then $q$ is a sublinear functional on $X.$ ^[5]

A function $p:X\to \mathbb {R}$ which is subadditive, convex, and satisfies $p(0)\leq 0$ is also positively homogeneous (the latter condition $p(0)\leq 0$ is necessary as the example of $p(x):={\sqrt {x^{2}+1}}$ on $X:=\mathbb {R}$ shows). If $p$ is positively homogeneous, it is convex if and only if it is subadditive. Therefore, assuming $p(0)\leq 0$ , any two properties among subadditivity, convexity, and positive homogeneity implies the third.

Properties

Every sublinear function is a convex function: For $0\leq t\leq 1,$ ${\begin{alignedat}{3}p(tx+(1-t)y)&\leq p(tx)+p((1-t)y)&&\quad {\text{ subadditivity}}\\&=tp(x)+(1-t)p(y)&&\quad {\text{ nonnegative homogeneity}}\\\end{alignedat}}$

If $p:X\to \mathbb {R}$ is a sublinear function on a vector space $X$ then^{[proof 2]}^[3] $p(0)~=~0~\leq ~p(x)+p(-x),$ for every $x\in X,$ which implies that at least one of $p(x)$ and $p(-x)$ must be nonnegative; that is, for every $x\in X,$ ^[3] $0~\leq ~\max\{p(x),p(-x)\}.$ Moreover, when $p:X\to \mathbb {R}$ is a sublinear function on a real vector space then the map $q:X\to \mathbb {R}$ defined by $q(x)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\max\{p(x),p(-x)\}$ is a seminorm.^[3]

Subadditivity of $p:X\to \mathbb {R}$ guarantees that for all vectors $x,y\in X,$ ^[1]^{[proof 3]} $p(x)-p(y)~\leq ~p(x-y),$ $-p(x)~\leq ~p(-x),$ so if $p$ is also symmetric then the reverse triangle inequality will hold for all vectors $x,y\in X,$ $|p(x)-p(y)|~\leq ~p(x-y).$

Defining $\ker p~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~p^{-1}(0),$ then subadditivity also guarantees that for all $x\in X,$ the value of $p$ on the set $x+(\ker p\cap -\ker p)=\{x+k:p(k)=0=p(-k)\}$ is constant and equal to $p(x).$ ^{[proof 4]} In particular, if $\ker p=p^{-1}(0)$ is a vector subspace of $X$ then $-\ker p=\ker p$ and the assignment $x+\ker p\mapsto p(x),$ which will be denoted by ${\hat {p}},$ is a well-defined real-valued sublinear function on the quotient space $X\,/\,\ker p$ that satisfies ${\hat {p}}^{-1}(0)=\ker p.$ If $p$ is a seminorm then ${\hat {p}}$ is just the usual canonical norm on the quotient space $X\,/\,\ker p.$

Pryce's sublinearity lemma^[2]—Suppose $p:X\to \mathbb {R}$ is a sublinear functional on a vector space $X$ and that $K\subseteq X$ is a non-empty convex subset. If $x\in X$ is a vector and $a,c>0$ are positive real numbers such that $p(x)+ac~<~\inf _{k\in K}p(x+ak)$ then for every positive real $b>0$ there exists some $\mathbf {z} \in K$ such that $p(x+a\mathbf {z} )+bc~<~\inf _{k\in K}p(x+a\mathbf {z} +bk).$

Adding $bc$ to both sides of the hypothesis ${\textstyle p(x)+ac\,<\,\inf _{}p(x+aK)}$ (where $p(x+aK)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\{p(x+ak):k\in K\}$ ) and combining that with the conclusion gives $p(x)+ac+bc~<~\inf _{}p(x+aK)+bc~\leq ~p(x+a\mathbf {z} )+bc~<~\inf _{}p(x+a\mathbf {z} +bK)$ which yields many more inequalities, including, for instance, $p(x)+ac+bc~<~p(x+a\mathbf {z} )+bc~<~p(x+a\mathbf {z} +b\mathbf {z} )$ in which an expression on one side of a strict inequality $\,<\,$ can be obtained from the other by replacing the symbol $c$ with $\mathbf {z}$ (or vice versa) and moving the closing parenthesis to the right (or left) of an adjacent summand (all other symbols remain fixed and unchanged).

Associated seminorm

If $p:X\to \mathbb {R}$ is a real-valued sublinear function on a real vector space $X$ (or if $X$ is complex, then when it is considered as a real vector space) then the map $q(x)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\max\{p(x),p(-x)\}$ defines a seminorm on the real vector space $X$ called the seminorm associated with $p.$ ^[3] A sublinear function $p$ on a real or complex vector space is a symmetric function if and only if $p=q$ where $q(x)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\max\{p(x),p(-x)\}$ as before.

More generally, if $p:X\to \mathbb {R}$ is a real-valued sublinear function on a (real or complex) vector space $X$ then $q(x)~{\stackrel {\scriptscriptstyle {\text{def}}}{=}}~\sup _{|u|=1}p(ux)~=~\sup\{p(ux):u{\text{ is a unit scalar }}\}$ will define a seminorm on $X$ if this supremum is always a real number (that is, never equal to $\infty$ ).

Relation to linear functionals

If $p$ is a sublinear function on a real vector space $X$ then the following are equivalent:^[1]

$p$ is a linear functional.
for every $x\in X,$ $p(x)+p(-x)\leq 0.$
for every $x\in X,$ $p(x)+p(-x)=0.$
$p$ is a minimal sublinear function.

If $p$ is a sublinear function on a real vector space $X$ then there exists a linear functional $f$ on $X$ such that $f\leq p.$ ^[1]

If $X$ is a real vector space, $f$ is a linear functional on $X,$ and $p$ is a positive sublinear function on $X,$ then $f\leq p$ on $X$ if and only if $f^{-1}(1)\cap \{x\in X:p(x)<1\}=\varnothing .$ ^[1]

Dominating a linear functional

A real-valued function $f$ defined on a subset of a real or complex vector space $X$ is said to be dominated by a sublinear function $p$ if $f(x)\leq p(x)$ for every $x$ that belongs to the domain of $f.$ If $f:X\to \mathbb {R}$ is a real linear functional on $X$ then^[6]^[1] $f$ is dominated by $p$ (that is, $f\leq p$ ) if and only if $-p(-x)\leq f(x)\leq p(x)\quad {\text{ for every }}x\in X.$ Moreover, if $p$ is a seminorm or some other symmetric map (which by definition means that $p(-x)=p(x)$ holds for all $x$ ) then $f\leq p$ if and only if $|f|\leq p.$

Theorem^[1]—If $p:X\to \mathbb {R}$ be a sublinear function on a real vector space $X$ and if $z\in X$ then there exists a linear functional $f$ on $X$ that is dominated by $p$ (that is, $f\leq p$ ) and satisfies $f(z)=p(z).$ Moreover, if $X$ is a topological vector space and $p$ is continuous at the origin then $f$ is continuous.

Continuity

Theorem^[7]—Suppose $f:X\to \mathbb {R}$ is a subadditive function (that is, $f(x+y)\leq f(x)+f(y)$ for all $x,y\in X$ ). Then $f$ is continuous at the origin if and only if $f$ is uniformly continuous on $X.$ If $f$ satisfies $f(0)=0$ then $f$ is continuous if and only if its absolute value $|f|:X\to [0,\infty )$ is continuous. If $f$ is non-negative then $f$ is continuous if and only if $\{x\in X:f(x)<1\}$ is open in $X.$

Suppose $X$ is a topological vector space (TVS) over the real or complex numbers and $p$ is a sublinear function on $X.$ Then the following are equivalent:^[7]

$p$ is continuous;
$p$ is continuous at 0;
$p$ is uniformly continuous on $X$ ;

and if $p$ is positive then this list may be extended to include:

$\{x\in X:p(x)<1\}$ is open in $X.$

If $X$ is a real TVS, $f$ is a linear functional on $X,$ and $p$ is a continuous sublinear function on $X,$ then $f\leq p$ on $X$ implies that $f$ is continuous.^[7]

Relation to Minkowski functions and open convex sets

Theorem^[7]—If $U$ is a convex open neighborhood of the origin in a topological vector space $X$ then the Minkowski functional of $U,$ $p_{U}:X\to [0,\infty ),$ is a continuous non-negative sublinear function on $X$ such that $U=\left\{x\in X:p_{U}(x)<1\right\};$ if in addition $U$ is a balanced set then $p_{U}$ is a seminorm on $X.$

Relation to open convex sets

Theorem^[7]—Suppose that $X$ is a topological vector space (not necessarily locally convex or Hausdorff) over the real or complex numbers. Then the open convex subsets of $X$ are exactly those that are of the form $z+\{x\in X:p(x)<1\}=\{x\in X:p(x-z)<1\}$ for some $z\in X$ and some positive continuous sublinear function $p$ on $X.$

Proof

Let $V$ be an open convex subset of $X.$ If $0\in V$ then let $z:=0$ and otherwise let $z\in V$ be arbitrary. Let $p:X\to [0,\infty )$ be the Minkowski functional of $V-z,$ which is a continuous sublinear function on $X$ since $V-z$ is convex, absorbing, and open ( $p$ however is not necessarily a seminorm since $V$ was not assumed to be balanced). From $X=X-z,$ it follows that $z+\{x\in X:p(x)<1\}=\{x\in X:p(x-z)<1\}.$ It will be shown that $V=z+\{x\in X:p(x)<1\},$ which will complete the proof. One of the known properties of Minkowski functionals guarantees ${\textstyle \{x\in X:p(x)<1\}=(0,1)(V-z),}$ where $(0,1)(V-z)\;{\stackrel {\scriptscriptstyle {\text{def}}}{=}}\;\{tx:0<t<1,x\in V-z\}=V-z$ since $V-z$ is convex and contains the origin. Thus $V-z=\{x\in X:p(x)<1\},$ as desired. $\blacksquare$

Operators

The concept can be extended to operators that are homogeneous and subadditive. This requires only that the codomain be, say, an ordered vector space to make sense of the conditions.

Computer science definition

In computer science, a function $f:\mathbb {Z} ^{+}\to \mathbb {R}$ is called sublinear if $\lim _{n\to \infty }{\frac {f(n)}{n}}=0,$ or $f(n)\in o(n)$ in asymptotic notation (notice the small $o$ ). Formally, $f(n)\in o(n)$ if and only if, for any given $c>0,$ there exists an $N$ such that $f(n)<cn$ for $n\geq N.$ ^[8] That is, $f$ grows slower than any linear function. The two meanings should not be confused: while a Banach functional is convex, almost the opposite is true for functions of sublinear growth: every function $f(n)\in o(n)$ can be upper-bounded by a concave function of sublinear growth.^[9]

Notes

Proofs

^ Let $x\in X.$ The triangle inequality and symmetry imply $p(0)=p(x+(-x))\leq p(x)+p(-x)=p(x)+p(x)=2p(x).$ Substituting $0$ for $x$ and then subtracting $p(0)$ from both sides proves that $0\leq p(0).$ Thus $0\leq p(0)\leq 2p(x)$ which implies $0\leq p(x).$ $\blacksquare$
^ If $x\in X$ and $r:=0$ then nonnegative homogeneity implies that $p(0)=p(rx)=rp(x)=0p(x)=0.$ Consequently, $0=p(0)=p(x+(-x))\leq p(x)+p(-x),$ which is only possible if $0\leq \max\{p(x),p(-x)\}.$ $\blacksquare$
^ $p(x)=p(y+(x-y))\leq p(y)+p(x-y),$ which happens if and only if $p(x)-p(y)\leq p(x-y).$ $\blacksquare$ Substituting $y:=-x$ and gives $p(x)-p(-x)\leq p(x-(-x))=p(x+x)\leq p(x)+p(x),$ which implies $-p(-x)\leq p(x)$ (positive homogeneity is not needed; the triangle inequality suffices). $\blacksquare$
^ Let $x\in X$ and $k\in p^{-1}(0)\cap (-p^{-1}(0)).$ It remains to show that $p(x+k)=p(x).$ The triangle inequality implies $p(x+k)\leq p(x)+p(k)=p(x)+0=p(x).$ Since $p(-k)=0,$ $p(x)=p(x)-p(-k)\leq p(x-(-k))=p(x+k),$ as desired. $\blacksquare$

References

^ ^a ^b ^c ^d ^e ^f ^g ^h ⁱ Narici & Beckenstein 2011, pp. 177–220.
^ ^a ^b ^c Schechter 1996, pp. 313–315.
^ ^a ^b ^c ^d ^e Narici & Beckenstein 2011, pp. 120–121.
^ Kubrusly 2011, p. 200.
^ ^a ^b Narici & Beckenstein 2011, pp. 177–221.
^ Rudin 1991, pp. 56–62.
^ ^a ^b ^c ^d ^e Narici & Beckenstein 2011, pp. 192–193.
^ Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein (2001) [1990]. "3.1". Introduction to Algorithms (2nd ed.). MIT Press and McGraw-Hill. pp. 47–48. ISBN 0-262-03293-7.{{cite book}}: CS1 maint: multiple names: authors list (link)
^ Ceccherini-Silberstein, Tullio; Salvatori, Maura; Sava-Huss, Ecaterina (2017-06-29). Groups, graphs, and random walks. Cambridge. Lemma 5.17. ISBN 9781316604403. OCLC 948670194.{{cite book}}: CS1 maint: location missing publisher (link)

Bibliography

Kubrusly, Carlos S. (2011). The Elements of Operator Theory (Second ed.). Boston: Birkhäuser. ISBN 978-0-8176-4998-2. OCLC 710154895.
Rudin, Walter (1991). Functional Analysis. International Series in Pure and Applied Mathematics. Vol. 8 (Second ed.). New York, NY: McGraw-Hill Science/Engineering/Math. ISBN 978-0-07-054236-5. OCLC 21163277.
Narici, Lawrence; Beckenstein, Edward (2011). Topological Vector Spaces. Pure and applied mathematics (Second ed.). Boca Raton, FL: CRC Press. ISBN 978-1584888666. OCLC 144216834.
Schaefer, Helmut H.; Wolff, Manfred P. (1999). Topological Vector Spaces. GTM. Vol. 8 (Second ed.). New York, NY: Springer New York Imprint Springer. ISBN 978-1-4612-7155-0. OCLC 840278135.
Schechter, Eric (1996). Handbook of Analysis and Its Foundations. San Diego, CA: Academic Press. ISBN 978-0-12-622760-4. OCLC 175294365.
Trèves, François (2006) [1967]. Topological Vector Spaces, Distributions and Kernels. Mineola, N.Y.: Dover Publications. ISBN 978-0-486-45352-1. OCLC 853623322.