Article · Wikipedia archive · Last revised Jun 13, 2026

Logarithmic norm

In mathematics, the logarithmic norm is a real-valued functional on operators, constructed from either a vector norm or an inner product, or directly from the induced operator norm. It quantifies key notions such as positive/negative definiteness in matrix theory, uniformly coercive or monotone vector fields in nonlinear analysis, and strong ellipticity in differential operators on function spaces, subject to specific boundary conditions.

Last revised
Jun 13, 2026
Read time
≈ 31 min
Length
7,125 w
Citations
21
Source

In mathematics, the logarithmic norm is a real-valued functional on operators, constructed from either a vector norm or an inner product, or directly from the induced operator norm. It quantifies key notions such as positive/negative definiteness in matrix theory, uniformly coercive or monotone vector fields in nonlinear analysis, and strong ellipticity in differential operators on function spaces, subject to specific boundary conditions.

The logarithmic norm has a wide range of applications in matrix theory, stability theory for initial and boundary value problems in differential equations, in nonlinear analysis, applied mathematics, and numerical analysis.

Comprehensive treatments of the theory and its applications can be found in recent monographs,12 on which this article is based.

History, original definition, and classical notation

The logarithmic norm was introduced in 1958, independently by Germund Dahlquist3 and Sergei Lozinskiĭ4 for square matrices and bounded linear operators. The original purpose was to estimate solutions to linear differential equations x ˙ = A x + r {\displaystyle {\dot {x}}=Ax+r} , to construct sufficient conditions for stability, and to obtain norm bounds of perturbations due to the forcing function r {\displaystyle r} .

Let A {\displaystyle A} be a square matrix and {\displaystyle \|\cdot \|} be the operator norm induced by a given vector norm. The associated logarithmic norm μ [ A ] {\displaystyle \mu [A]} is defined by

μ [ A ] = lim h 0 + I + h A 1 h , {\displaystyle \mu [A]=\lim \limits _{h\rightarrow 0^{+}}{\frac {\|I+hA\|-1}{h}}\,,}

where h {\displaystyle h} is real and I {\displaystyle I} is the identity matrix of the same dimension as A {\displaystyle A} . The limit exists on account of the convexity of the matrix norm.

The term logarithmic norm is due to Lozinskiĭ, and refers to the fact that if x ˙ = A x {\displaystyle {\dot {x}}=Ax} then

D t + log x μ [ A ] , {\displaystyle {\mathrm {D} }_{t}^{+}\log \|x\|\,\leq \,\mu [A]\,,}

where D t + {\displaystyle {\mathrm {D} }_{t}^{+}} denotes the upper right Dini derivative with respect to time. In other words, μ [ A ] {\displaystyle \mu [A]} is an upper bound for the (short-term) growth rate of the "logarithmic norm" of x ( t ) {\displaystyle x(t)} . Using logarithmic differentiation, this bound can also be written as the differential inequality

D t + x μ [ A ] x , {\displaystyle {\mathrm {D} }_{t}^{+}\|x\|\,\leq \,\mu [A]\cdot \|x\|\,,}

from which it follows that, for t 0 {\displaystyle t\geq 0} ,

e t A e t μ [ A ] . {\displaystyle \|{\mathrm {e} }^{tA}\|\,\leq \,{\mathrm {e} }^{t\mu [A]}\,.}

This bound is always preferable to the Lipschitz bound e t A {\displaystyle {\mathrm {e} }^{t\|A\|}} , since μ [ A ] A {\displaystyle \mu [A]\leq \|A\|} . Moreover, while A 0 {\displaystyle \|A\|\geq 0} , the logarithmic norm may take negative values, e.g. when A {\displaystyle A} is negative definite and the Euclidean norm is used. Thus μ [ A ] 0 {\displaystyle \mu [A]\leq 0} implies that e t A {\displaystyle {\mathrm {e} }^{tA}} is a contraction, and hence the solutions x ( t ) {\displaystyle x(t)} are stable in forward time.

The ability of the logarithmic norm to distinguish forward and reverse time evolution in dynamical systems is a key property. It may also establish whether a problem is well-posed. This asymmetry is also of importance in many applications. For example, in control theory, there is a fundamental need to discriminate between positive and negative feedback; this too can be characterized by the logarithmic norm.

Depending on the setting, the logarithmic norm is related to Lyapunov stability theory.5 Other bounds are similar to those obtained using Grönwall's lemma. Thus the state transition matrix Φ ( t , t 0 ) {\displaystyle \Phi (t,t_{0})} associated with a nonautonomous linear differential equation x ˙ = A ( t ) x {\displaystyle {\dot {x}}=A(t)x} satisfies the upper and lower bounds67 for t t 0 {\displaystyle t\geq t_{0}} ,

exp ( t 0 t μ [ A ( s ) ] d s ) Φ ( t , t 0 ) exp ( t 0 t μ [ A ( s ) ] d s ) . {\displaystyle \exp \left(-\int _{t_{0}}^{t}\mu [-A(s)]\,{\mathrm {d} }s\right)\,\leq \,\|\Phi (t,t_{0})\|\,\leq \,\exp \left(\int _{t_{0}}^{t}\mu [A(s)]\,{\mathrm {d} }s\right).}

More recently, the logarithmic norm has been extended and generalized to nonlinear maps8 as well as to unbounded operators,9 covering boundary value problems and elementary applications in elliptic and parabolic partial differential equations.1

The logarithmic norm may in some literature appear under less informative names, such as the "matrix measure" (for matrices)10 and "one-sided Lipschitz constant" (for nonlinear maps),11 although the concept neither relates to measure theory nor implies Lipschitz continuity.

Extended definitions and modern notation

For clarity and generality, the modern notation defines a greatest lower bound (or simply lower) logarithmic norm m [ A ] {\displaystyle m[A]} as well as the classical least upper bound (upper) logarithmic norm by the two left and right limits

m [ A ] = lim h 0 I + h A 1 h ; M [ A ] = lim h 0 + I + h A 1 h . {\displaystyle m[A]\,=\,\lim _{h\rightarrow 0^{-}}{\frac {\|I+hA\|-1}{h}}\,;\qquad M[A]\,=\,\lim _{h\rightarrow 0^{+}}{\frac {\|I+hA\|-1}{h}}\,.}

Here m [ A ] M [ A ] μ [ A ] {\displaystyle m[A]\leq M[A]\equiv \mu [A]} for all A {\displaystyle A} , and the lower and upper logarithmic norms are related by m [ A ] M [ A ] {\displaystyle m[A]\equiv -M[-A]} .

Alternatively, if an inner product norm is used, the lower and upper logarithmic norms are defined as the best possible bounds such that for all x {\displaystyle x} ,

m 2 [ A ] x 2 2 R e x , A x M 2 [ A ] x 2 2 , {\displaystyle m_{2}[A]\cdot \|x\|_{2}^{2}\,\leq \,{\mathrm {Re} }\,\langle x,Ax\rangle \,\leq \,M_{2}[A]\cdot \|x\|_{2}^{2}\,,}

where x 2 2 = x , x {\displaystyle \|x\|_{2}^{2}=\langle x,x\rangle } denotes the corresponding vector norm. In case of bounded linear operators in a Hilbert space setting, the inner product definition is equivalent to the original definition. But as the inner product definition also admits certain classes of unbounded operators it is more general. In the latter case one of the logarithmic norms may be infinite.

Further variants of use in semigroup theory are based on the limit

M [ A ] = lim h 0 + e h A 1 h , {\displaystyle M[A]\,=\,\lim _{h\rightarrow 0^{+}}{\frac {\|{\mathrm {e} }^{hA}\|-1}{h}}\,,}

which is still equivalent to the classical definition if A {\displaystyle A} is a bounded linear operator. In case e t A {\displaystyle {\mathrm {e} }^{tA}} is a strongly continuous semigroup for t 0 {\displaystyle t\geq 0} , various modifications of this alternative are necessary, since the "vector field" A : x y = A x {\displaystyle A:x\mapsto y=Ax} is defined by the infinitesimal generator of the semigroup, i.e.,

A x = lim h 0 + ( e h A I ) x h . {\displaystyle Ax\,=\,\lim _{h\rightarrow 0^{+}}{\frac {{\big (}{\mathrm {e} }^{hA}-I{\big )}x}{h}}\,.}

The structure of the limit using the matrix exponential is shared by many other pairs of functionals, linking the vector field and the flow of a linear differential equation in a natural manner. For example, it holds that

α [ A ] = lim h 0 + ρ [ e h A ] 1 h ; t r a c e [ A ] = lim h 0 det [ e h A ] 1 h , {\displaystyle \alpha [A]=\lim _{h\rightarrow 0^{+}}{\frac {\rho [{\mathrm {e} }^{hA}]-1}{h}}\,;\qquad {\mathrm {trace} }[A]=\lim _{h\rightarrow 0}{\frac {\det[{\mathrm {e} }^{hA}]-1}{h}}\,,}

where ρ [ A ] = max k | λ k | {\displaystyle \rho [A]=\max _{k}|\lambda _{k}|} and α [ A ] = max k R e λ k {\displaystyle \alpha [A]=\max _{k}{\mathrm {Re} }\,\lambda _{k}} are the (upper) spectral radius and (upper) spectral abscissa, respectively. Both limits exist, although neither the spectral radius nor the determinant is a convex functional. Moreover, for the trace, the left and right limits coincide. Moreover,

D t det [ e t A ] = t r a c e [ A ] det [ e t A ] . {\displaystyle {\mathrm {D} }_{t}\det \!{\big [}{\mathrm {e} }^{tA}{\big ]}\,=\,{\mathrm {trace} }[A]\cdot \det \!{\big [}{\mathrm {e} }^{tA}{\big ]}.}

In spite of the similarities, there are major differences. Thus D t + e t A M [ A ] e t A , {\displaystyle {\mathrm {D} }_{t}^{+}\|{\mathrm {e} }^{tA}\|\,\leq \,M[A]\cdot \|{\mathrm {e} }^{tA}\|\,,} where the important properties, enabling norm bounds, are that the operator norm is submultiplicative, and the logarithmic norm is subadditive. These properties do not hold for the spectral radius and abscissa, which are less powerful tools in the stability analysis of dynamical systems.

Properties and computation of logarithmic norms

For every choice of norm, the basic properties (including some redundancy) of the logarithmic norm of a matrix are the following:1

  1. M [ 0 ] = 0 , M [ I ] = 1 {\displaystyle M[0]=0,\quad M[I]=1}
  2. M [ A ] = m [ A ] {\displaystyle M[-A]=-m[A]}
  3. Boundedness A m [ A ] M [ A ] A {\displaystyle \,-\|A\|\,\leq \,m[A]\,\leq \,M[A]\,\leq \,\|A\|}
  4. Lower and upper bounds g l b [ A ] M [ A ] A {\displaystyle \,-{\mathrm {glb} }[A]\,\leq \,M[A]\,\leq \,\|A\|}
  5. Lower and upper bounds A m [ A ] g l b [ A ] {\displaystyle \,-\|A\|\,\leq \,m[A]\,\leq \,{\mathrm {glb} }[A]}
  6. Continuity | M [ A ] M [ B ] | A B {\displaystyle \,{\big |}M[A]-M[B]{\big |}\,\leq \,\|A-B\|}
  7. Real translation M [ A λ I ] = M [ A ] R e λ , λ C {\displaystyle \,M[A-\lambda I]=M[A]-{\mathrm {Re} }\,\lambda ,\qquad \lambda \in \mathbb {C} }
  8. Weak homogeneity M [ γ A ] = γ M [ A ] , γ R + {\displaystyle \,M[\gamma \,A]=\gamma \cdot M[A],\qquad \gamma \in \mathbb {R} ^{+}}
  9. Sublinearity M [ A + B ] M [ A ] + M [ B ] {\displaystyle \,M[A+B]\,\leq \,M[A]+M[B]}
  10. Superlinearity m [ A ] + m [ B ] m [ A + B ] {\displaystyle \,m[A]+m[B]\,\leq \,m[A+B]}
  11. Convexity M [ θ A + ( 1 θ ) B ] θ M [ A ] + ( 1 θ ) M [ B ] , θ [ 0 , 1 ] {\displaystyle \,M[\theta A+(1-\theta )B]\,\leq \,\theta \,M[A]+(1-\theta )\,M[B],\qquad \theta \in [0,1]}
  12. Concavity θ m [ A ] + ( 1 θ ) m [ B ] m [ θ A + ( 1 θ ) B ] , θ [ 0 , 1 ] {\displaystyle \,\theta \,m[A]+(1-\theta )\,m[B]\,\leq \,m[\theta A+(1-\theta )B],\qquad \theta \in [0,1]}
  13. Lower and upper exponential bounds e t m [ A ] e t A e t M [ A ] , t 0 {\displaystyle \,{\mathrm {e} }^{tm[A]}\,\leq \,\|{\mathrm {e} }^{tA}\|\,\leq \,{\mathrm {e} }^{tM[A]},\qquad t\geq 0}
  14. Positive definite inverse bound m [ A ] > 0 A 1 1 / m [ A ] {\displaystyle \,m[A]>0\quad \Rightarrow \quad \|A^{-1}\|\leq 1/m[A]}
  15. Negative definite inverse bound M [ A ] < 0 A 1 1 / M [ A ] {\displaystyle \,M[A]<0\quad \Rightarrow \quad \|A^{-1}\|\leq -1/M[A]}
  16. Spectral bound α [ A ] M [ A ] . {\displaystyle \,\alpha [A]\leq M[A].}

As illustrated by properties 14 - 15, the condition m [ A ] > 0 {\displaystyle m[A]>0} ( M [ A ] < 0 {\displaystyle M[A]<0} ) generalizes positive (negative) definiteness to arbitrary norms and non-symmetric matrices, while also quantifying these notions. An indefinite matrix with respect to a given norm satisfies m [ A ] 0 M [ A ] {\displaystyle m[A]\leq 0\leq M[A]} .

For the most common l p {\displaystyle l^{p}} norms, the logarithmic norm of a (complex) matrix A {\displaystyle A} can be computed as follows, where a i j {\displaystyle a_{ij}} represents the matrix element in the i t h {\displaystyle i^{\mathrm {th} }} row and j t h {\displaystyle j^{\mathrm {th} }} column, and where A {\displaystyle A^{*}} denotes the complex conjugate of A {\displaystyle A}  :

  • M 1 [ A ] = max j ( R e a j j + i j | a i j | ) {\displaystyle M_{1}[A]\,=\,\max _{j}\left({\mathrm {Re} }\,a_{jj}+\sum _{i\neq j}|a_{ij}|\right)}
  • M 2 [ A ] = α [ A + A 2 ] {\displaystyle M_{2}[A]\,=\,\alpha \left[{\frac {A+A^{*}}{2}}\right]}
  • M [ A ] = max i ( R e a i i + j i | a i j | ) {\displaystyle M_{\infty }[A]\,=\,\max _{i}\left({\mathrm {Re} }\,a_{ii}+\sum _{j\neq i}|a_{ij}|\right)}

Applications in matrix theory and spectral theory

Let A {\displaystyle A} be a (possibly complex) n × n {\displaystyle n\times n} matrix. The symmetric positive definite square root of A A {\displaystyle A^{*}A} and the Hermitian part of A {\displaystyle A} are denoted by

| A | = A A and H e ( A ) = A + A 2 , {\displaystyle |A|\,=\,{\sqrt {A^{*}A}}\qquad {\mbox{and}}\qquad {\mathrm {He} }(A)\,=\,{\frac {A+A^{*}}{2}}\,,}

respectively. Their eigenvalue problems are | A | u = σ u {\displaystyle |A|u=\sigma u} and H e ( A ) v = μ v {\displaystyle {\mathrm {He} }(A)v=\mu v} , where σ k {\displaystyle \sigma _{k}} are the n {\displaystyle n} singular values of A {\displaystyle A} , and μ k {\displaystyle \mu _{k}} are the n {\displaystyle n} logarithmic values of A {\displaystyle A} . The singular and logarithmic values are the n {\displaystyle n} stationary points of the two quadratic forms

u | A | u u u and R e v A v v v v H e ( A ) v v v , {\displaystyle {\frac {u^{*}|A|u}{u^{*}u}}\qquad {\mbox{and}}\qquad {\frac {{\mathrm {Re} }\,v^{*}Av}{v^{*}v}}\,\equiv \,{\frac {v^{*}{\mathrm {He} }(A)v}{v^{*}v}}\,,}

respectively. As for the relations to the n {\displaystyle n} eigenvalues λ k {\displaystyle \lambda _{k}} of A {\displaystyle A} , it holds that

g l b 2 [ A ] = min σ k min | λ k | max | λ k | max σ k = A 2 {\displaystyle {\mathrm {glb} }_{2}[A]\,=\,\min \sigma _{k}\,\leq \,\min |\lambda _{k}|\,\leq \,\max |\lambda _{k}|\,\leq \,\max \sigma _{k}\,=\,\|A\|_{2}}

and

m 2 [ A ] = min μ k min R e λ k max R e λ k max μ k = M 2 [ A ] {\displaystyle m_{2}[A]\,=\,\min \mu _{k}\,\leq \,\min {\mathrm {Re} }\,\lambda _{k}\,\leq \,\max {\mathrm {Re} }\,\lambda _{k}\,\leq \,\max \mu _{k}\,=\,M_{2}[A]}

together with

| det A | = k | λ k | = k σ k = det | A | {\displaystyle |\det A|\,=\,\prod _{k}|\lambda _{k}|\,=\,\prod _{k}\sigma _{k}\,=\,\det |A|}

and

R e t r a c e [ A ] = k R e λ k = k μ k = t r a c e [ H e ( A ) ] . {\displaystyle {\mathrm {Re\,trace} }[A]\,=\,\sum _{k}{\mathrm {Re} }\,\lambda _{k}\,=\,\sum _{k}\mu _{k}\,=\,{\mathrm {trace} }\left[{\mathrm {He} }(A)\right].}

As the quadratic form x A x {\displaystyle x^{*}Ax} is generally complex valued, one can only find extrema of some related real function. For the logarithmic norms one takes the real part. Another option is to take the absolute value, to obtain the numerical radius of A {\displaystyle A} , defined by

ν [ A ] = sup x 0 | x A x | x x . {\displaystyle \nu [A]\,=\,\sup _{x\neq 0}{\frac {|x^{*}Ax|}{x^{*}x}}\,.}

Because R e z | z | {\displaystyle {\mathrm {Re} }\,z\leq |z|} , it follows that for all A {\displaystyle A} ,

M 2 [ A ] ν [ A ] A 2 . {\displaystyle M_{2}[A]\leq \nu [A]\leq \|A\|_{2}.}

The numerical radius is a matrix norm (but not an operator norm). However, as it is not defined by a quadratic form, the determination of ν [ A ] {\displaystyle \nu [A]} is more intricate. But to each matrix norm there is a logarithmic norm, and

lim h 0 ν [ I + h A ] 1 h = m 2 [ A ] ; lim h 0 + ν [ I + h A ] 1 h = M 2 [ A ] , {\displaystyle \lim _{h\rightarrow 0^{-}}{\frac {\nu [I+hA]-1}{h}}\,=\,m_{2}[A]\,;\qquad \lim _{h\rightarrow 0^{+}}{\frac {\nu [I+hA]-1}{h}}\,=\,M_{2}[A]\,,}

i.e., the lower and upper logarithmic norms of the numerical radius coincide with the Euclidean logarithmic norms.1 By the Hausdorff-Toeplitz theorem, the numerical range of a matrix A {\displaystyle A} is the set

W ( A ) = φ [ 0 , 2 π ] C ( A ; φ ) {\displaystyle W(A)=\bigcap _{\varphi \in [0,2\pi ]}C(A;\varphi )}

in the complex plane, given by the intersection of the half-planes

C ( A ; φ ) = { z C : R e e i φ z M 2 [ e i φ A ] } , {\displaystyle C(A;\varphi )\,=\,{\big \lbrace }z\in {\mathbb {C} }\,:\,{\mathrm {Re} }\,{\mathrm {e} }^{-{\mathrm {i} }\varphi }z\leq M_{2}[{\mathrm {e} }^{-{\mathrm {i} }\varphi }A]{\big \rbrace }\,,}

and the numerical radius is

ν [ A ] = sup z W ( A ) | z | = max φ [ 0 , 2 π ] M 2 [ e i φ A ] . {\displaystyle \nu [A]=\sup _{z\in W(A)}|z|=\max _{\varphi \in [0,2\pi ]}M_{2}[{\mathrm {e} }^{-{\mathrm {i} }\varphi }A]\,.}

For matrix functions, the use of the von Neumann inequality and spectral sets provides the following classical result, using the Euclidean norm: let a polynomial P {\displaystyle P} be a such that | z | 1 | P ( z ) | 1 {\displaystyle |z|\leq 1\Rightarrow |P(z)|\leq 1} . Then A 2 1 P ( A ) 2 1 {\displaystyle \|A\|_{2}\leq 1\Rightarrow \|P(A)\|_{2}\leq 1} .

Using the logarithmic norm, this theorem can be generalized from unit circles to half-planes. Thus, if R {\displaystyle R} is a rational function such that R e z 0 | R ( z ) | 1 , {\displaystyle {\mathrm {Re} }\,z\leq 0\Rightarrow |R(z)|\leq 1\,,} then M 2 [ A ] 0 R ( A ) 2 1 . {\displaystyle M_{2}[A]\leq 0\Rightarrow \|R(A)\|_{2}\leq 1\,.}

This property is of importance in numerical analysis, where implicit Runge-Kutta methods approximate a differential equation x ˙ = A x {\displaystyle {\dot {x}}=Ax} by a recursion x k + 1 = R ( h A ) x k {\displaystyle x_{k+1}=R(hA)x_{k}} , where R {\displaystyle R} is a rational function, referred to as the method's stability function. The continuous system is stable with a contractive flow if M 2 [ A ] 0 {\displaystyle M_{2}[A]\leq 0} , while the discrete system is stable (a discrete contraction) if R ( h A ) 2 1 {\displaystyle \|R(hA)\|_{2}\leq 1} . Thus, if the stability function R {\displaystyle R} satisfies R e z 0 | R ( z ) | 1 {\displaystyle {\mathrm {Re} }\,z\leq 0\Rightarrow |R(z)|\leq 1} , a condition known as A-stability, then the Runge-Kutta method produces stable solutions for all negative semidefinite vector fields A {\displaystyle A} .

In a similar manner the theory of logarithmic norms can also be extended to unitarily invariant norms, e.g. the Hilbert-Schmidt norm for trace class operators, the Frobenius norm, and more generally the Schatten norms. As logarithmic values are only invariant under unitary similarity transformations, the logarithmic norms are defined subordinate to the matrix norms.

Extensions to nonlinear maps

For real-valued nonlinear maps there are various possibilities to define operator norms. The least upper bound Lipschitz constant is an operator seminorm, which is all that is required. A corresponding lower Lipschitz constant can be included to define

l [ f ] u v f ( u ) f ( v ) L [ f ] u v {\displaystyle l[f]\cdot \|u-v\|\,\leq \,\|f(u)-f(v)\|\,\leq \,L[f]\cdot \|u-v\|}

on a simply connected domain D {\displaystyle D} . The corresponding lower and upper logarithmic Lipschitz constants are defined by

m [ f ] = lim h 0 L [ I + h f ] 1 h ; M [ f ] = lim h 0 + L [ I + h f ] 1 h , {\displaystyle m[f]=\lim _{h\rightarrow 0^{-}}{\frac {L[I+hf]-1}{h}}\,;\quad M[f]=\lim _{h\rightarrow 0^{+}}{\frac {L[I+hf]-1}{h}}\,,}

where both limits exist if f {\displaystyle f} is Lipschitz continuous. For inner product norms, one can instead use the special definitions

m 2 [ f ] u v 2 2 u v , f ( u ) f ( v ) M 2 [ f ] u v 2 2 , {\displaystyle m_{2}[f]\cdot \|u-v\|_{2}^{2}\,\leq \,\langle u-v,f(u)-f(v)\rangle \,\leq \,M_{2}[f]\cdot \|u-v\|_{2}^{2}\,,}

where one or both of the logarithmic Lipschitz constants may exist even when f {\displaystyle f} is not Lipschitz continuous. Although it is difficult to compute actual values of Lipschitz and logarithmic Lipschitz constants in practice, they retain their theoretical importance as useful tools in nonlinear analysis.12

All properties listed above for linear operators are retained, provided that operations like f + g {\displaystyle f+g} etc. are allowed on a joint domain of definition. In the special case when f {\displaystyle f} is Fréchet differentiable on a convex domain D {\displaystyle D} , it further holds that

L [ f ] = sup x D f ( x ) and M [ f ] = sup x D M [ f ( x ) ] , {\displaystyle L[f]=\sup _{x\in D}\|f'(x)\|\qquad {\mbox{and}}\qquad M[f]=\sup _{x\in D}M[f'(x)]\,,}

where f ( x ) {\displaystyle f'(x)} is the Jacobian matrix of f {\displaystyle f} .

An operator having either m [ f ] > 0 {\displaystyle m[f]>0} or M [ f ] < 0 {\displaystyle M[f]<0} is called uniformly monotone, and the uniform monotonicity theorem states that

m [ f ] > 0 L [ f 1 ] 1 m [ f ] , {\displaystyle m[f]>0\quad \Rightarrow \quad L[f^{-1}]\leq {\frac {1}{m[f]}}\,,}

generalizing Properties 14 - 15 above, and quantifying the Uniform monotonicity theorem due to Browder and Minty (1963). Here, the Lipschitz constant of the inverse is defined on the image of f {\displaystyle f} , while its lower logarithmic Lipschitz constant is defined on the domain of f {\displaystyle f} .

Because composition is not distributive over addition for nonlinear maps, the von Neumann type bounds for 'rational functions of operators' have no direct counterparts in the nonlinear case. Nevertheless some results carry over. For example, in a fixed point iteration x k + 1 = g ( x k ) {\displaystyle x_{k+1}=g(x_{k})} converging to a fixed point x {\displaystyle x_{\infty }} , it can be shown that the error is bounded by

x k + 1 x L [ g ( I g ) 1 ] x k + 1 x k , {\displaystyle \|x_{k+1}-x_{\infty }\|\,\leq \,L{\big [}g\circ (I-g)^{-1}{\big ]}\cdot \|x_{k+1}-x_{k}\|\,,}

where the difference between the two most recent iterates is directly computable, and where any one of the three bounds

L [ g ( I g ) 1 ] L [ g ] 1 M [ g ] L [ g ] 1 L [ g ] , {\displaystyle L{\big [}g\circ (I-g)^{-1}{\big ]}\,\leq \,{\frac {L[g]}{1-M[g]}}\,\leq \,{\frac {L[g]}{1-L[g]}}\,,}

can be used to obtain an error estimate. In order to guarantee the simple estimate x k + 1 x x k + 1 x k {\displaystyle \|x_{k+1}-x_{\infty }\|\,\leq \,\|x_{k+1}-x_{k}\|} using the third expression, one needs L [ g ] 1 / 2 {\displaystyle L[g]\leq 1/2} to ensure that the estimate is satisfied. If instead the central option is used, it is sufficient that L [ g ] < 1 {\displaystyle L[g]<1} and M [ g ] 0 {\displaystyle M[g]\leq 0} . Using the first (best) expression directly, a sharper result holds, but only for inner product norms, viz.,

M 2 [ g ] 1 2 L 2 [ g ( I g ) 1 ] 1 , {\displaystyle M_{2}[g]\leq {\frac {1}{2}}\quad \Rightarrow \quad L_{2}{\big [}g\circ (I-g)^{-1}{\big ]}\leq 1\,,}

without further restrictions on L 2 [ g ] {\displaystyle L_{2}[g]} . In other words, even slowly convergent iterations admit a simple error estimate, provided that the logarithmic Lipschitz constant of g {\displaystyle g} is less than 1 / 2 {\displaystyle 1/2} . The bound relies on using the polarization identity, in the linear as well as the nonlinear case, and therefore only holds for the Euclidean norm.

For more complicated "rational functions" of f {\displaystyle f} there are only limited results. Consider a nonlinear system x ˙ = f ( x ) {\displaystyle {\dot {x}}=f(x)} with M 2 [ f ] 0 {\displaystyle M_{2}[f]\leq 0} , discretized by the trapezoidal rule and the implicit midpoint method, using the same time step h {\displaystyle h} . Both are A-stable Runge-Kutta methods, associated with the same rational stability function R ( z ) = ( 1 + z / 2 ) / ( 1 z / 2 ) {\displaystyle R(z)=(1+z/2)/(1-z/2)} , so if f A {\displaystyle f\equiv A} is a linear map, both methods are contractive (see above). In the nonlinear case, however, the trapezoidal rule yields the recursion

u n + 1 = ( I h f 2 ) 1 ( I + h f 2 ) ( u n ) {\displaystyle u_{n+1}\,=\,\left(I-{\frac {hf}{2}}\right)^{-1}\circ \left(I+{\frac {hf}{2}}\right)(u_{n})}

while the implicit midpoint method is

v n + 1 = ( I + h f 2 ) ( I h f 2 ) 1 ( v n ) . {\displaystyle v_{n+1}\,=\,\left(I+{\frac {hf}{2}}\right)\circ \left(I-{\frac {hf}{2}}\right)^{-1}(v_{n})\,.}

Because the two nonlinear operators do not commute, only the second recursion is contractive under the assumption M 2 [ h f ] 0 {\displaystyle M_{2}[hf]\leq 0} , i.e.,

M 2 [ f ] 0 L 2 [ ( I + h f 2 ) ( I h f 2 ) 1 ] 1. {\displaystyle M_{2}[f]\,\leq \,0\quad \Rightarrow \quad L_{2}\left[\left(I+{\frac {hf}{2}}\right)\circ \left(I-{\frac {hf}{2}}\right)^{-1}\right]\,\leq \,1.}

Thus A-stability is not sufficient. Instead a qualified condition, known as B-stability, is required for contractivity in the Euclidean norm13; the results again rely on the polarization identity. Of the two methods above, only the implicit midpoint method is B-stable, but there are other, advanced B-stable Runge--Kutta methods of high order of convergence.1415

For nonlinear maps boundedness and continuity are not equivalent. While the Lipschitzian approach to logarithmic norms above emphasizes continuity and stability, there is also an alternative, weaker approach, primarily linked to coercivity. Thus, for maps F {\displaystyle F} satisfying F ( 0 ) = 0 {\displaystyle F(0)=0} , defined on a star-shaped domain D {\displaystyle D} containing the origin, one can define an operator norm F = sup x D F ( x ) / x {\displaystyle \|F\|=\sup _{x\in D}\|F(x)\|/\|x\|} , together with an upper logarithmic norm

μ ^ [ F ] = lim h 0 + I + h F 1 h . {\displaystyle {\hat {\mu }}[F]=\lim _{h\rightarrow 0^{+}}{\frac {\|I+hF\|-1}{h}}\,.}

Likewise, in inner product spaces one may define lower and upper logarithmic norms by1

μ ˇ 2 [ F ] x 2 2 x , F ( x ) μ ^ 2 [ F ] x 2 2 . {\displaystyle {\check {\mu }}_{2}[F]\cdot \|x\|_{2}^{2}\,\leq \,\langle x,F(x)\rangle \,\leq \,{\hat {\mu }}_{2}[F]\cdot \|x\|_{2}^{2}\,.}

Differential inequalities

In dynamical systems it is of interest to estimate how far apart two solutions drift, given various types of perturbations. This concerns bounding the difference u v {\displaystyle u-v} , where

u ˙ = f ( u ) + r {\displaystyle {\dot {u}}=f(u)+r}
v ˙ = f ( v ) . {\displaystyle {\dot {v}}=f(v)\,.}

From the differential inequality

D t + u v M [ f ] u v + r , t 0 {\displaystyle {\mathrm {D} }_{t}^{+}\|u-v\|\,\leq \,M[f]\cdot \|u-v\|+\|r\|\,,\qquad t\geq 0}

the following general bound is obtained using logarithmic Lipschitz constants,

u ( t ) v ( t ) e t M [ f ] u 0 v 0 + 0 t e ( t τ ) M [ f ] r ( τ ) d τ . {\displaystyle \|u(t)-v(t)\|\,\leq \,{\mathrm {e} }^{tM[f]}\|u_{0}-v_{0}\|+\int _{0}^{t}{\mathrm {e} }^{(t-\tau )M[f]}\,\|r(\tau )\|\,{\mathrm {d} }\tau \,.}

This bound applies to linear systems, non-autonomous systems, nonlinear systems as well as some problems with where f {\displaystyle f} represents an unbounded operator, and for any preferred choice of norm. Taking r 0 {\displaystyle r\equiv 0} covers perturbed initial values, while taking u 0 = v 0 {\displaystyle u_{0}=v_{0}} covers the effect of forcing functions. In the latter case one can estimate

u ( t ) v ( t ) e t M [ f ] 1 M [ f ] r , {\displaystyle \|u(t)-v(t)\|\,\leq \,{\frac {{\mathrm {e} }^{tM[f]}-1}{M[f]}}\cdot \|r\|_{\infty }\,,}

where r = max t 0 r ( t ) {\displaystyle \|r\|_{\infty }=\max _{t\geq 0}\|r(t)\|} . The bound generally holds on compact intervals, but if M [ f ] < 0 {\displaystyle M[f]<0} it also holds on infinite intervals, showing that the problem is well posed, and that the error u v {\displaystyle u-v} is uniformly bounded in terms of r {\displaystyle r} for all t 0 {\displaystyle t\geq 0} , i.e.,

u ( t ) v ( t ) r M [ f ] . {\displaystyle \|u(t)-v(t)\|\,\leq \,-\,{\frac {\|r\|_{\infty }}{M[f]}}\,.}

In case of a stiff differential equation u ˙ = f ( u ) {\displaystyle {\dot {u}}=f(u)} on [ 0 , T ] {\displaystyle [0,T]} it holds that

e T m [ f ] e T M [ f ] , {\displaystyle {\mathrm {e} }^{-Tm[f]}\gg {\mathrm {e} }^{TM[f]}\,,}

where the expression on the left represents the maximal growth in reverse time over a time interval of length T {\displaystyle T} , while the right expression shows the maximal growth in forward time. Hence the condition

s [ f ] := m [ f ] + M [ f ] 2 1 T {\displaystyle s[f]:={\frac {m[f]+M[f]}{2}}\ll -\,{\frac {1}{T}}}

is characteristic of stiffness: the average of the logarithmic Lipschitz constants is negative, and typically several orders of magnitude larger than 1 / T {\displaystyle 1/T} . Here the functional s [ f ] {\displaystyle s[f]} is referred to as the stiffness indicator.16

By instead using the logarithmic norms of the Jacobian matrix f ( u ) {\displaystyle f'(u)} , the criterion can be given a local interpretation along the solution trajectory. If an explicit method is used to integrate the equation, then the time step h {\displaystyle h} will be restricted by numerical stability requirements to be of order h 1 / | s [ f ( u ) ] | {\displaystyle h\sim 1/\left|s[f'(u)]\right|} .

Note that

s [ f ( u ) ] = μ 1 + μ n 2 , {\displaystyle s[f'(u)]={\frac {\mu _{1}+\mu _{n}}{2}}\,,}

where μ 1 {\displaystyle \mu _{1}} and μ n {\displaystyle \mu _{n}} are the largest and smallest logarithmic values (see above) of the Jacobian f ( u ) {\displaystyle f'(u)} . Because it is relatively expensive to compute the logarithmic norms, an alternative of similar utility is to compute the average of all logarithmic values. The scaled trace τ [ ] {\displaystyle \tau [\cdot ]} is defined by

τ [ f ( u ) ] := 1 n k μ k = 1 n t r a c e [ f ( u ) ] = 1 n ( d i v f ) ( u ) . {\displaystyle \tau [f'(u)]\,:=\,{\frac {1}{n}}\sum _{k}\mu _{k}\,=\,{\frac {1}{n}}\,{\mathrm {trace} }[f'(u)]\,=\,{\frac {1}{n}}\left({\mathrm {div} }\,f\right)(u).}

In practice, τ [ f ( u ) ] {\displaystyle \tau [f'(u)]} serves well as a stiffness indicator, and is cheap to compute as it is simply the average of the diagonal elements of f ( u ) {\displaystyle f'(u)} . Moreover, it equals the scaled divergence of the vector field at u {\displaystyle u} ,

1 n ( d i v f ) ( u ) . {\displaystyle {\frac {1}{n}}\left({\mathrm {div} }\,f\right)(u)\,.}

A dynamical system with d i v f 0 {\displaystyle \,{\mathrm {div} }f\equiv 0\,} preserves phase volume (often referred to as area preserving). What is characteristic of stiff differential equations is that d i v f {\displaystyle \,{\mathrm {div} }f\,} is large and negative, implying that phase volume is compressed along the solution. The phase volume compression rate τ [ f ( u ) ] {\displaystyle \tau [f'(u)]} (per unit time) is indicative of stiffness at u {\displaystyle u} , and quantifies step size restrictions along the trajectory for explicit methods.17

Elliptic differential operators

A differential operator L {\displaystyle {\mathcal {L}}} satisfying u , L u > 0 {\displaystyle \langle u,{\mathcal {L}}u\rangle >0} is called elliptic. In particular, the logarithmic norm quantifies the concept of strong ellipticity. Applications are found in boundary value problems and partial differential equations. In the simplest case one considers piecewise continuously differentiable functions u , v {\displaystyle u,v} satisfying homogeneous Dirichlet conditions u ( 0 ) = u ( 1 ) = 0 {\displaystyle u(0)=u(1)=0} , with the inner product

u , v = 0 1 u v d x {\displaystyle \langle u,v\rangle \,=\,\int _{0}^{1}uv\,{\mathrm {d} }x\,}

inducing the usual L 2 {\displaystyle L^{2}} norm on the function space H 0 1 ( 0 , 1 ) {\displaystyle H_{0}^{1}(0,1)} . Letting u {\displaystyle u'} denote d u / d x {\displaystyle {\mathrm {d} }u/{\mathrm {d} }x} , integration by parts yields u , v = u , v {\displaystyle \langle u,v'\rangle =-\langle u',v\rangle } . Hence, for the operator L = d 2 / d x 2 {\displaystyle {\mathcal {L}}=-{\mathrm {d} }^{2}/{\mathrm {d} }x^{2}} we obtain

u , L u = u , u = u , u = u 2 2 π 2 u 2 2 , {\displaystyle \langle u,{\mathcal {L}}u\rangle \,=\,\langle u,-u''\rangle \,=\,\langle u',u'\rangle \,=\,\|u'\|_{2}^{2}\,\geq \,\pi ^{2}\,\|u\|_{2}^{2}\,,}

where the last step is the Poincaré inequality. Equality is attained for the function u = sin π x {\displaystyle u=\sin \pi x} , implying that π 2 {\displaystyle \pi ^{2}} is the best possible constant. Thus

u , L u m 2 [ L ] u 2 2 π 2 u 2 2 , {\displaystyle \langle u,{\mathcal {L}}u\rangle \,\geq \,m_{2}[{\mathcal {L}}]\cdot \|u\|_{2}^{2}\,\equiv \,\pi ^{2}\,\|u\|_{2}^{2}\,,}

showing that L {\displaystyle {\mathcal {L}}} is strongly elliptic, with m 2 [ d 2 / d x 2 ] = π 2 > 0 {\displaystyle m_{2}[-{\mathrm {d} }^{2}/{\mathrm {d} }x^{2}]=\pi ^{2}>0} quantifying the ellipticity. (The upper logarithmic norm is + {\displaystyle +\infty } , since the operator is unbounded.) An analogous derivation for Neumann conditions u ( 0 ) = u ( 1 ) = 0 {\displaystyle u(0)=u'(1)=0} results in m 2 [ L ] = π 2 / 4 {\displaystyle m_{2}[{\mathcal {L}}]=\pi ^{2}/4} , where the free boundary condition at x = 1 {\displaystyle x=1} reduces ellipticity compared with the case of Dirichlet conditions.

Considering the 1D Poisson equation u = f {\displaystyle -u''=f} with f L 2 ( 0 , 1 ) {\displaystyle f\in L^{2}(0,1)} and homogeneous Dirichlet conditions, Property 14 above offers the solution bound

u 2 f 2 m 2 [ L ] = f 2 π 2 . {\displaystyle \|u\|_{2}\,\leq \,{\frac {\|f\|_{2}}{m_{2}[{\mathcal {L}}]}}\,=\,{\frac {\|f\|_{2}}{\pi ^{2}}}\,.}

Hence the solution u {\displaystyle u} is unique and depends continuously on the data, implying stability, where the inverse of the lower logarithmic norm is the stability constant. A smaller stability constant implies that the solution is more sensitive to perturbations in the data function f {\displaystyle f} .

Similar results hold for finite element approximations, where the function space is replaced by (say) piecewise linear functions defined on a grid. This affects the logarithmic norm of L {\displaystyle {\mathcal {L}}} , which becomes somewhat larger (since the minimizer in the Poincaré inequality cannot be represented). The theory can be extended to Sobolev norms and weak solutions, e.g. allowing "point loads" (Dirac functions in the right hand side), as is common in Galerkin methods, using different norms in the solution and data and spaces as needed.

If instead a finite difference method is used to solve u = f {\displaystyle -u''=f} , the differential equation is approximated by an algebraic equation L u ~ = f ~ {\displaystyle L{\tilde {u}}={\tilde {f}}} . The matrix L {\displaystyle L} will typically inherit the ellipticity, i.e., m 2 [ L ] > 0 {\displaystyle m_{2}[L]>0} , showing that L {\displaystyle L} is positive definite and therefore invertible.

The importance of strong ellipticity in selfadjoint differential operators can be illustrated by Euler buckling, where a slender column, fixed by articulated boundary conditions (homogeneous Dirichlet conditions as above), is subjected to an axial load F {\displaystyle F} . The solution u {\displaystyle u} represents the deflection from the column's center line, and satisfies the eigenvalue problem

u F E I u = 0 , {\displaystyle -u''-{\frac {F}{EI}}\,u\,=\,0\,,}

where E {\displaystyle E} is the elasticity module of the material and I {\displaystyle I} is the cross-section moment of inertia of the column. The logarithmic norm of the associated operator is

m 2 [ d 2 d x 2 F E I ] = π 2 F E I . {\displaystyle m_{2}\left[-{\frac {{\mathrm {d} }^{2}}{{\mathrm {d} }x^{2}}}\,-\,{\frac {F}{EI}}\right]\,=\,\pi ^{2}-{\frac {F}{EI}}\,.}

As long as F / ( E I ) < π 2 {\displaystyle F/(EI)<\pi ^{2}} , the operator remains strongly elliptic, and u 0 {\displaystyle u\equiv 0} is the unique solution. However, at the critical load F c / ( E I ) = π 2 {\displaystyle F_{\mathrm {c} }/(EI)=\pi ^{2}} ellipticity is lost: the column fails (buckles) when F F c {\displaystyle F\geq F_{\mathrm {c} }} . The zero solution is then no longer unique, in agreement with the Fredholm alternative, as the problem also admits nonzero (buckled) solutions satisfying the homogeneous boundary conditions.

The logarithmic norm can also be used in initial-boundary value partial differential equations u t = A u + f ( u ) {\displaystyle u_{t}={\mathcal {A}}u+f(u)} , where it quantifies the growth or decay rates of solutions. This is primarily of use in dissipative systems. The difference between two solutions decays if

M 2 [ A ] + M 2 [ f ] < 0 , {\displaystyle M_{2}[{\mathcal {A}}]\,+\,M_{2}[f]\,<\,0\,,}

in which case the problem is well posed. This will typically require A {\displaystyle -{\mathcal {A}}} to be strongly elliptic and to dominate M 2 [ f ] {\displaystyle M_{2}[f]} for sufficient dissipation, such as in a reaction-diffusion equation.

For a more complicated differential operator, such as in convection-diffusion, M 2 [ A ] {\displaystyle M_{2}[{\mathcal {A}}]} may depend in a nontrivial way on the type of boundary conditions and the balance of convection and diffusion rates (the Péclet number), leading to questions of well posedness. For problems in several space dimensions, it is usually difficult to calculate the logarithmic norm, as it depends on the geometry of the computational domain where the boundary conditions are imposed.

References

References

  1. Söderlind, G. (2024). "Logarithmic Norms." Springer Series in Computational Mathematics vol. 63. ISBN 978-3-031-74378-8 https://doi.org/10.1007/978-3-031-74379-5
  2. Bullo, F. (2022). "Contraction Theory for Dynamical Systems". 1st ed., (19 Aug. 2022). Kindle Direct Publishing ISBN 979-8836646806
  3. Dahlquist, G. (1958). "Stability and error bounds in the numerical integration of ordinary differential equations", Almqvist & Wiksell, Uppsala 1958
  4. Lozinskiĭ, S. (1958). "Error estimates for the numerical integration of ordinary differential equations, Part I". (In Russian) Izv. Vyss. Uceb. Zaved. Matematika 6, pp 52-90 (1958)
  5. Bullo, F. (2022). "Contraction Theory for Dynamical Systems". 1st ed., (19 Aug. 2022). Kindle Direct Publishing ISBN 979-8836646806
  6. Desoer, C.; Haneda, H. (1972). "The measure of a matrix as a tool to analyze computer algorithms for circuit analysis". IEEE Transactions on Circuit Theory. 19 (5): 480–486. doi:10.1109/tct.1972.1083507.
  7. Desoer, C. A.; Vidyasagar, M. (1975). Feedback Systems: Input-output Properties. New York: Elsevier. p. 34. ISBN 9780323157797.
  8. Söderlind, G. (1986). "Bounds on nonlinear operators in finite-dimensional Banach spaces". Numerical Mathematics 50, pp 27-44.
  9. Söderlind, G. (2006). "The logarithmic norm. History and modern theory", BIT Numerical Mathematics, 46(3):631-652.
  10. Desoer, C.; Haneda, H. (1972). "The measure of a matrix as a tool to analyze computer algorithms for circuit analysis". IEEE Transactions on Circuit Theory. 19 (5): 480–486. doi:10.1109/tct.1972.1083507.
  11. Dekker, K.; Verwer, J.G. (1984). "Stability of Runge-Kutta methods for stiff nonlinear differential equations." New York: North Holland.}
  12. Söderlind, G. (1986). "Bounds on nonlinear operators in finite-dimensional Banach spaces". Numerical Mathematics 50, pp 27-44.
  13. Butcher, J.C. (1975): "A stability property of implicit Runge--Kutta methods". BIT 15: 358-361.
  14. Butcher, J.C. (2008) "Numerical methods for ordinary differential equations", 2nd ed. New York: Wiley
  15. Hairer, E.; Wanner, G. (1991). "Solving orindary differential equations II. Stiff and differential-algebraic problems." Berlin-Heidelberg-New York: Springer.
  16. Söderlind, G.; Jay, L.O.; Calvo, M. (2015) "Stiffness 1952--2012: Sixty years in search of a definition." BIT 55: 531-558.
  17. Söderlind, G. (2024) "Logarithmic Norms." Springer Series in Computational Mathematics vol. 63. pp 281-286. ISBN 978-3-031-74378-8 https://doi.org/10.1007/978-3-031-74379-5