In mathematics, the logarithmic norm is a real-valued functional on operators, constructed from either a vector norm or an inner product, or directly from the induced operator norm. It quantifies key notions such as positive/negative definiteness in matrix theory, uniformly coercive or monotone vector fields in nonlinear analysis, and strong ellipticity in differential operators on function spaces, subject to specific boundary conditions.
The logarithmic norm has a wide range of applications in matrix theory, stability theory for initial and boundary value problems in differential equations, in nonlinear analysis, applied mathematics, and numerical analysis.
Comprehensive treatments of the theory and its applications can be found in recent monographs,12 on which this article is based.
History, original definition, and classical notation
The logarithmic norm was introduced in 1958, independently by Germund Dahlquist3 and Sergei Lozinskiĭ4 for square matrices and bounded linear operators. The original purpose was to estimate solutions to linear differential equations , to construct sufficient conditions for stability, and to obtain norm bounds of perturbations due to the forcing function .
Let be a square matrix and be the operator norm induced by a given vector norm. The associated logarithmic norm is defined by
where is real and is the identity matrix of the same dimension as . The limit exists on account of the convexity of the matrix norm.
The term logarithmic norm is due to Lozinskiĭ, and refers to the fact that if then
where denotes the upper right Dini derivative with respect to time. In other words, is an upper bound for the (short-term) growth rate of the "logarithmic norm" of . Using logarithmic differentiation, this bound can also be written as the differential inequality
from which it follows that, for ,
This bound is always preferable to the Lipschitz bound , since . Moreover, while , the logarithmic norm may take negative values, e.g. when is negative definite and the Euclidean norm is used. Thus implies that is a contraction, and hence the solutions are stable in forward time.
The ability of the logarithmic norm to distinguish forward and reverse time evolution in dynamical systems is a key property. It may also establish whether a problem is well-posed. This asymmetry is also of importance in many applications. For example, in control theory, there is a fundamental need to discriminate between positive and negative feedback; this too can be characterized by the logarithmic norm.
Depending on the setting, the logarithmic norm is related to Lyapunov stability theory.5 Other bounds are similar to those obtained using Grönwall's lemma. Thus the state transition matrix associated with a nonautonomous linear differential equation satisfies the upper and lower bounds67 for ,
More recently, the logarithmic norm has been extended and generalized to nonlinear maps8 as well as to unbounded operators,9 covering boundary value problems and elementary applications in elliptic and parabolic partial differential equations.1
The logarithmic norm may in some literature appear under less informative names, such as the "matrix measure" (for matrices)10 and "one-sided Lipschitz constant" (for nonlinear maps),11 although the concept neither relates to measure theory nor implies Lipschitz continuity.
Extended definitions and modern notation
For clarity and generality, the modern notation defines a greatest lower bound (or simply lower) logarithmic norm as well as the classical least upper bound (upper) logarithmic norm by the two left and right limits
Here for all , and the lower and upper logarithmic norms are related by .
Alternatively, if an inner product norm is used, the lower and upper logarithmic norms are defined as the best possible bounds such that for all ,
where denotes the corresponding vector norm. In case of bounded linear operators in a Hilbert space setting, the inner product definition is equivalent to the original definition. But as the inner product definition also admits certain classes of unbounded operators it is more general. In the latter case one of the logarithmic norms may be infinite.
Further variants of use in semigroup theory are based on the limit
which is still equivalent to the classical definition if is a bounded linear operator. In case is a strongly continuous semigroup for , various modifications of this alternative are necessary, since the "vector field" is defined by the infinitesimal generator of the semigroup, i.e.,
The structure of the limit using the matrix exponential is shared by many other pairs of functionals, linking the vector field and the flow of a linear differential equation in a natural manner. For example, it holds that
where and are the (upper) spectral radius and (upper) spectral abscissa, respectively. Both limits exist, although neither the spectral radius nor the determinant is a convex functional. Moreover, for the trace, the left and right limits coincide. Moreover,
In spite of the similarities, there are major differences. Thus where the important properties, enabling norm bounds, are that the operator norm is submultiplicative, and the logarithmic norm is subadditive. These properties do not hold for the spectral radius and abscissa, which are less powerful tools in the stability analysis of dynamical systems.
Properties and computation of logarithmic norms
For every choice of norm, the basic properties (including some redundancy) of the logarithmic norm of a matrix are the following:1
- Boundedness
- Lower and upper bounds
- Lower and upper bounds
- Continuity
- Real translation
- Weak homogeneity
- Sublinearity
- Superlinearity
- Convexity
- Concavity
- Lower and upper exponential bounds
- Positive definite inverse bound
- Negative definite inverse bound
- Spectral bound
As illustrated by properties 14 - 15, the condition () generalizes positive (negative) definiteness to arbitrary norms and non-symmetric matrices, while also quantifying these notions. An indefinite matrix with respect to a given norm satisfies .
For the most common norms, the logarithmic norm of a (complex) matrix can be computed as follows, where represents the matrix element in the row and column, and where denotes the complex conjugate of :
Applications in matrix theory and spectral theory
Let be a (possibly complex) matrix. The symmetric positive definite square root of and the Hermitian part of are denoted by
respectively. Their eigenvalue problems are and , where are the singular values of , and are the logarithmic values of . The singular and logarithmic values are the stationary points of the two quadratic forms
respectively. As for the relations to the eigenvalues of , it holds that
and
together with
and
As the quadratic form is generally complex valued, one can only find extrema of some related real function. For the logarithmic norms one takes the real part. Another option is to take the absolute value, to obtain the numerical radius of , defined by
Because , it follows that for all ,
The numerical radius is a matrix norm (but not an operator norm). However, as it is not defined by a quadratic form, the determination of is more intricate. But to each matrix norm there is a logarithmic norm, and
i.e., the lower and upper logarithmic norms of the numerical radius coincide with the Euclidean logarithmic norms.1 By the Hausdorff-Toeplitz theorem, the numerical range of a matrix is the set
in the complex plane, given by the intersection of the half-planes
and the numerical radius is
For matrix functions, the use of the von Neumann inequality and spectral sets provides the following classical result, using the Euclidean norm: let a polynomial be a such that . Then .
Using the logarithmic norm, this theorem can be generalized from unit circles to half-planes. Thus, if is a rational function such that then
This property is of importance in numerical analysis, where implicit Runge-Kutta methods approximate a differential equation by a recursion , where is a rational function, referred to as the method's stability function. The continuous system is stable with a contractive flow if , while the discrete system is stable (a discrete contraction) if . Thus, if the stability function satisfies , a condition known as A-stability, then the Runge-Kutta method produces stable solutions for all negative semidefinite vector fields .
In a similar manner the theory of logarithmic norms can also be extended to unitarily invariant norms, e.g. the Hilbert-Schmidt norm for trace class operators, the Frobenius norm, and more generally the Schatten norms. As logarithmic values are only invariant under unitary similarity transformations, the logarithmic norms are defined subordinate to the matrix norms.
Extensions to nonlinear maps
For real-valued nonlinear maps there are various possibilities to define operator norms. The least upper bound Lipschitz constant is an operator seminorm, which is all that is required. A corresponding lower Lipschitz constant can be included to define
on a simply connected domain . The corresponding lower and upper logarithmic Lipschitz constants are defined by
where both limits exist if is Lipschitz continuous. For inner product norms, one can instead use the special definitions
where one or both of the logarithmic Lipschitz constants may exist even when is not Lipschitz continuous. Although it is difficult to compute actual values of Lipschitz and logarithmic Lipschitz constants in practice, they retain their theoretical importance as useful tools in nonlinear analysis.12
All properties listed above for linear operators are retained, provided that operations like etc. are allowed on a joint domain of definition. In the special case when is Fréchet differentiable on a convex domain , it further holds that
where is the Jacobian matrix of .
An operator having either or is called uniformly monotone, and the uniform monotonicity theorem states that
generalizing Properties 14 - 15 above, and quantifying the Uniform monotonicity theorem due to Browder and Minty (1963). Here, the Lipschitz constant of the inverse is defined on the image of , while its lower logarithmic Lipschitz constant is defined on the domain of .
Because composition is not distributive over addition for nonlinear maps, the von Neumann type bounds for 'rational functions of operators' have no direct counterparts in the nonlinear case. Nevertheless some results carry over. For example, in a fixed point iteration converging to a fixed point , it can be shown that the error is bounded by
where the difference between the two most recent iterates is directly computable, and where any one of the three bounds
can be used to obtain an error estimate. In order to guarantee the simple estimate using the third expression, one needs to ensure that the estimate is satisfied. If instead the central option is used, it is sufficient that and . Using the first (best) expression directly, a sharper result holds, but only for inner product norms, viz.,
without further restrictions on . In other words, even slowly convergent iterations admit a simple error estimate, provided that the logarithmic Lipschitz constant of is less than . The bound relies on using the polarization identity, in the linear as well as the nonlinear case, and therefore only holds for the Euclidean norm.
For more complicated "rational functions" of there are only limited results. Consider a nonlinear system with , discretized by the trapezoidal rule and the implicit midpoint method, using the same time step . Both are A-stable Runge-Kutta methods, associated with the same rational stability function , so if is a linear map, both methods are contractive (see above). In the nonlinear case, however, the trapezoidal rule yields the recursion
while the implicit midpoint method is
Because the two nonlinear operators do not commute, only the second recursion is contractive under the assumption , i.e.,
Thus A-stability is not sufficient. Instead a qualified condition, known as B-stability, is required for contractivity in the Euclidean norm13; the results again rely on the polarization identity. Of the two methods above, only the implicit midpoint method is B-stable, but there are other, advanced B-stable Runge--Kutta methods of high order of convergence.1415
For nonlinear maps boundedness and continuity are not equivalent. While the Lipschitzian approach to logarithmic norms above emphasizes continuity and stability, there is also an alternative, weaker approach, primarily linked to coercivity. Thus, for maps satisfying , defined on a star-shaped domain containing the origin, one can define an operator norm , together with an upper logarithmic norm
Likewise, in inner product spaces one may define lower and upper logarithmic norms by1
Differential inequalities
In dynamical systems it is of interest to estimate how far apart two solutions drift, given various types of perturbations. This concerns bounding the difference , where
From the differential inequality
the following general bound is obtained using logarithmic Lipschitz constants,
This bound applies to linear systems, non-autonomous systems, nonlinear systems as well as some problems with where represents an unbounded operator, and for any preferred choice of norm. Taking covers perturbed initial values, while taking covers the effect of forcing functions. In the latter case one can estimate
where . The bound generally holds on compact intervals, but if it also holds on infinite intervals, showing that the problem is well posed, and that the error is uniformly bounded in terms of for all , i.e.,
In case of a stiff differential equation on it holds that
where the expression on the left represents the maximal growth in reverse time over a time interval of length , while the right expression shows the maximal growth in forward time. Hence the condition
is characteristic of stiffness: the average of the logarithmic Lipschitz constants is negative, and typically several orders of magnitude larger than . Here the functional is referred to as the stiffness indicator.16
By instead using the logarithmic norms of the Jacobian matrix , the criterion can be given a local interpretation along the solution trajectory. If an explicit method is used to integrate the equation, then the time step will be restricted by numerical stability requirements to be of order .
Note that
where and are the largest and smallest logarithmic values (see above) of the Jacobian . Because it is relatively expensive to compute the logarithmic norms, an alternative of similar utility is to compute the average of all logarithmic values. The scaled trace is defined by
In practice, serves well as a stiffness indicator, and is cheap to compute as it is simply the average of the diagonal elements of . Moreover, it equals the scaled divergence of the vector field at ,
A dynamical system with preserves phase volume (often referred to as area preserving). What is characteristic of stiff differential equations is that is large and negative, implying that phase volume is compressed along the solution. The phase volume compression rate (per unit time) is indicative of stiffness at , and quantifies step size restrictions along the trajectory for explicit methods.17
Elliptic differential operators
A differential operator satisfying is called elliptic. In particular, the logarithmic norm quantifies the concept of strong ellipticity. Applications are found in boundary value problems and partial differential equations. In the simplest case one considers piecewise continuously differentiable functions satisfying homogeneous Dirichlet conditions , with the inner product
inducing the usual norm on the function space . Letting denote , integration by parts yields . Hence, for the operator we obtain
where the last step is the Poincaré inequality. Equality is attained for the function , implying that is the best possible constant. Thus
showing that is strongly elliptic, with quantifying the ellipticity. (The upper logarithmic norm is , since the operator is unbounded.) An analogous derivation for Neumann conditions results in , where the free boundary condition at reduces ellipticity compared with the case of Dirichlet conditions.
Considering the 1D Poisson equation with and homogeneous Dirichlet conditions, Property 14 above offers the solution bound
Hence the solution is unique and depends continuously on the data, implying stability, where the inverse of the lower logarithmic norm is the stability constant. A smaller stability constant implies that the solution is more sensitive to perturbations in the data function .
Similar results hold for finite element approximations, where the function space is replaced by (say) piecewise linear functions defined on a grid. This affects the logarithmic norm of , which becomes somewhat larger (since the minimizer in the Poincaré inequality cannot be represented). The theory can be extended to Sobolev norms and weak solutions, e.g. allowing "point loads" (Dirac functions in the right hand side), as is common in Galerkin methods, using different norms in the solution and data and spaces as needed.
If instead a finite difference method is used to solve , the differential equation is approximated by an algebraic equation . The matrix will typically inherit the ellipticity, i.e., , showing that is positive definite and therefore invertible.
The importance of strong ellipticity in selfadjoint differential operators can be illustrated by Euler buckling, where a slender column, fixed by articulated boundary conditions (homogeneous Dirichlet conditions as above), is subjected to an axial load . The solution represents the deflection from the column's center line, and satisfies the eigenvalue problem
where is the elasticity module of the material and is the cross-section moment of inertia of the column. The logarithmic norm of the associated operator is
As long as , the operator remains strongly elliptic, and is the unique solution. However, at the critical load ellipticity is lost: the column fails (buckles) when . The zero solution is then no longer unique, in agreement with the Fredholm alternative, as the problem also admits nonzero (buckled) solutions satisfying the homogeneous boundary conditions.
The logarithmic norm can also be used in initial-boundary value partial differential equations , where it quantifies the growth or decay rates of solutions. This is primarily of use in dissipative systems. The difference between two solutions decays if
in which case the problem is well posed. This will typically require to be strongly elliptic and to dominate for sufficient dissipation, such as in a reaction-diffusion equation.
For a more complicated differential operator, such as in convection-diffusion, may depend in a nontrivial way on the type of boundary conditions and the balance of convection and diffusion rates (the Péclet number), leading to questions of well posedness. For problems in several space dimensions, it is usually difficult to calculate the logarithmic norm, as it depends on the geometry of the computational domain where the boundary conditions are imposed.
References
References
- Söderlind, G. (2024). "Logarithmic Norms." Springer Series in Computational Mathematics vol. 63. ISBN 978-3-031-74378-8 https://doi.org/10.1007/978-3-031-74379-5
- Bullo, F. (2022). "Contraction Theory for Dynamical Systems". 1st ed., (19 Aug. 2022). Kindle Direct Publishing ISBN 979-8836646806
- Dahlquist, G. (1958). "Stability and error bounds in the numerical integration of ordinary differential equations", Almqvist & Wiksell, Uppsala 1958
- Lozinskiĭ, S. (1958). "Error estimates for the numerical integration of ordinary differential equations, Part I". (In Russian) Izv. Vyss. Uceb. Zaved. Matematika 6, pp 52-90 (1958)
- Bullo, F. (2022). "Contraction Theory for Dynamical Systems". 1st ed., (19 Aug. 2022). Kindle Direct Publishing ISBN 979-8836646806
- Desoer, C.; Haneda, H. (1972). "The measure of a matrix as a tool to analyze computer algorithms for circuit analysis". IEEE Transactions on Circuit Theory. 19 (5): 480–486. doi:10.1109/tct.1972.1083507.
- Desoer, C. A.; Vidyasagar, M. (1975). Feedback Systems: Input-output Properties. New York: Elsevier. p. 34. ISBN 9780323157797.
- Söderlind, G. (1986). "Bounds on nonlinear operators in finite-dimensional Banach spaces". Numerical Mathematics 50, pp 27-44.
- Söderlind, G. (2006). "The logarithmic norm. History and modern theory", BIT Numerical Mathematics, 46(3):631-652.
- Desoer, C.; Haneda, H. (1972). "The measure of a matrix as a tool to analyze computer algorithms for circuit analysis". IEEE Transactions on Circuit Theory. 19 (5): 480–486. doi:10.1109/tct.1972.1083507.
- Dekker, K.; Verwer, J.G. (1984). "Stability of Runge-Kutta methods for stiff nonlinear differential equations." New York: North Holland.}
- Söderlind, G. (1986). "Bounds on nonlinear operators in finite-dimensional Banach spaces". Numerical Mathematics 50, pp 27-44.
- Butcher, J.C. (1975): "A stability property of implicit Runge--Kutta methods". BIT 15: 358-361.
- Butcher, J.C. (2008) "Numerical methods for ordinary differential equations", 2nd ed. New York: Wiley
- Hairer, E.; Wanner, G. (1991). "Solving orindary differential equations II. Stiff and differential-algebraic problems." Berlin-Heidelberg-New York: Springer.
- Söderlind, G.; Jay, L.O.; Calvo, M. (2015) "Stiffness 1952--2012: Sixty years in search of a definition." BIT 55: 531-558.
- Söderlind, G. (2024) "Logarithmic Norms." Springer Series in Computational Mathematics vol. 63. pp 281-286. ISBN 978-3-031-74378-8 https://doi.org/10.1007/978-3-031-74379-5