Karcher means and Karcher equations of positive definite operators

By Jimmie Lawson and Yongdo Lim

Abstract

The Karcher or least-squares mean has recently become an important tool for the averaging and studying of positive definite matrices. In this paper we show that this mean extends, in its general weighted form, to the infinite-dimensional setting of positive operators on a Hilbert space and retains most of its attractive properties. The primary extension is via its characterization as the unique solution of the corresponding Karcher equation. We also introduce power means in the infinite-dimensional setting and show that the Karcher mean is the strong limit of the monotonically decreasing family of power means as . We show that each of these characterizations provide important insights about the Karcher mean.

1. Introduction

Positive definite matrices have become fundamental computational objects in many areas of engineering, statistics, quantum information, and applied mathematics. They appear as “data points” in a diverse variety of settings: covariance matrices in statistics, elements of the search space in convex and semidefinite programming, kernels in machine learning, density matrices in quantum information, and diffusion tensors in medical imaging, to cite only a few. A variety of computational algorithms have arisen for approximations, interpolation, filtering, estimation, and averaging.

The process of averaging typically involves taking some type of matrix mean for some finite number of positive matrices of fixed dimension. Since the pioneering paper of Kubo and Ando Reference 11, an extensive theory of two-variable means has sprung up for positive matrices and operators, but the -variable case for has remained problematic. Once one realizes, however, that the matrix geometric mean is the metric midpoint of and for the trace metric on the set of positive definite matrices of some fixed dimension (see, e.g., Reference 4Reference 13), it is natural to use an averaging technique over this metric to extend this mean to a larger number of variables. First M. Moakher Reference 18 and then Bhatia and Holbrook Reference 5 suggested extending the geometric mean to -points by taking the mean to be the unique minimizer of the sum of the squares of the distances:

where . This idea had been anticipated by Élie Cartan (see, for example, section 6.1.5 of Reference 2), who showed among other things such a unique minimizer exists if the points all lie in a convex ball in a Riemannian manifold, which is enough to deduce the existence of the least-squares mean globally for . A more detailed study of Riemannian centers of mass in the setting of Riemannian manifolds was carried out by H. Karcher Reference 10, whose ideas are important to the present work.

Another approach to generalizing the geometric mean to -variables, independent of metric notions, was suggested by Ando, Li, and Mathias Reference 1 via a “symmetrization procedure” and induction. The Ando-Li-Mathias paper was also important for listing, and deriving for their mean, ten desirable properties for extended geometric means. Moaker and Bhatia and Holbrook were able to establish a number of these important properties for the least-squares mean, but the important question of the monotonicity of this mean, conjectured by Bhatia and Holbrook Reference 5, was left open. However, the authors were recently able to show Reference 16 that all the properties, in particular the monotonicity, were satisfied in the more general setting of weighted means for any weight of non-negative entries summing to :

(P1)

(Consistency with scalars) if the ’s commute;

(P2)

(Joint homogeneity)

(P3)

(Permutation invariance)

(P4)

(Monotonicity) if for all then

(P5)

(Continuity) the map is continuous;

(P6)

(Congruence invariance) for any invertible

(P7)

(Joint concavity) for ;

(P8)

(Self-duality)

(P9)

(Determinental identity) and

(P10)

(AGH weighted mean inequalities)

A key ingredient in the derivation of many of these properties, the monotonicity in particular, is the fact that the trace metric on the manifold of positive definite matrices gives them the structure of a Cartan-Hadamard Riemannian manifold, in particular a manifold of non-positive curvature. This implies that equipped with the Riemannian distance metric the manifold is a metric space of non-positive curvature, an NPC-space for short, also called a -space, a widely studied class of metric spaces with a rich structure (see e.g., Reference 6, Reference 21, Reference 12, Chapter 11).

Since in the quantum as well as other settings, one is interested in the more general case of positive bounded linear operators on an infinite-dimensional Hilbert space, one would like to have suitable and effective averaging procedures for this context also. However, the significant theory that has developed for the multivariable least-squares mean does not readily carry over to the setting of positive operators on a Hilbert space, since one has no such Riemannian structure nor NPC-metric available. Nevertheless a natural question arises as to whether other (non-metric) characterizations of the least-squares mean generalize to the Hilbert space setting, a generalization that continues to satisfy the preceding properties (P1)-(P8), (P10). In this paper we suggest and shall focus on an alternative characterization of the weighted least-squares mean as the unique solution of the Karcher equation

show that this equation also has a unique positive solution in the infinite-dimensional setting, and thus refer to our generalization as the Karcher mean. (In the Riemannian setting the Karcher equation is the condition for the vanishing of the gradient of the least-squares distance function.)

In Section 2 we recall the Thompson metric and list properties of it that will be important for our development. Section 3 on power means introduces an important tool for later developments, but the fact that well-behaved power means exist for the Hilbert operator setting is of independent interest. Lim and Pálfia Reference 17 have recently shown in the finite-dimensional setting that the Karcher or least-squares mean is the limit as of the power means . We show additionally that they are monotonically decreasing, which allows us to deduce the existence of their strong limit. In Section 4 we see that this power mean limit also exists in the Hilbert space setting and use this fact for our initial provisional definition of the Karcher mean. We show in Section 5 that the Karcher mean defined in this way does indeed satisfy the Karcher equation, and in Section 6 we establish that it is the unique solution and present a list of the fundamental properties of the Karcher mean.

Although for convenience we carry out our work in , the -algebra of all bounded linear operators on a Hilbert space , our constructions only require that we be working in a monotone complete subalgebra of , as we point out in the last section.

2. The Thompson metric

For a Hilbert space let be the Banach space of bounded linear operators on equipped with the operator norm, the closed subspace of bounded self-adjoint linear operators, and let be the open convex cone of positive definite operators. The Banach Lie group of bounded invertible linear operators (with operation composition) acts on via congruence transformations: . For we write if is positive semidefinite, and if is positive definite. Note that if and only if for all .

For and the -weighted geometric mean of and is defined by

The following properties for the weighted geometric mean are well known Reference 7Reference 11Reference 14.

Lemma 2.1.

Let and let Then

(i)

for ;

(ii)

for

(iii)

(Loewner-Heinz inequality) for and

(iv)

for

(v)

(vi)

for

(vii)

for

(viii)

for any As a special case, .

The Thompson metric on is defined by where denotes the operator norm of It is known that is a complete metric on and that

where ; see Reference 7Reference 20Reference 22. We note that the Thompson metric (in the second form) exists on all normal cones of real Banach spaces. For instance,

on where

Lemma 2.2 (Reference 3Reference 7Reference 14).

Basic properties of the Thompson metric on include

(i)

for any and

(ii)

(iii)

(iv)

By the triangular inequality,

for all Indeed,

The topology induced by the Thompson metric on agrees with the relative Banach space topology Reference 22. It will be useful to have available some basic relationships between the operator norm and the Thompson metric. For this purpose we introduce the numerical radius of :

Lemma 2.3.

(i) For , ; the equality holds for in .

(ii)

For , , and , implies , and so . If , then .

Proof.

(i) The inequality is a standard one; see, for example, Reference 8. For , we have for : the last equality coming from the fact is a -algebra. Taking the supremum over all yields , so . Since any has a (unique) square root , the second assertion follows.

(ii) For each , so . Taking sups over yields . The second assertion now follows from part (i). Taking , and using for from (i) yields the last inequality.

The following non-expansive property of addition for the Thompson metric will be useful for our purpose.

Lemma 2.4.

Reference 15, Lemma 10.1(iv) Let Then

Proof.

For , suppose that . Then , , , , and thus . Hence . The general case easily follows by induction.

We denote by the simplex of positive probability vectors in , the convex hull of the set of unit coordinate vectors. We equip with the relative Thompson metric given by equation (Equation 2.2)(which induces the relative Euclidean topology). A map is said to be monotonic if for any

whenever and for A map is said to be jointly homogeneous if for all and

Proposition 2.5.

Let be jointly homogeneous and monotonic. Then the following contractive property for the Thompson metric is satisfied:

for all and

Proof.

By definition of the Thompson metric, and for all By joint homogeneity and monotonicity of ,

and similarly This implies that

3. Power means

Power means for positive definite matrices have recently been introduced by Lim and Pálfia Reference 17. Their notion and most of their results readily extend to the setting of positive operators on a Hilbert space, as we point out in this section.

Theorem 3.1.

Let and let Then for each the following equation has a unique positive definite solution:

Proof.

We show that the map defined by is a strict contraction with respect to the Thompson metric. Let By Lemma 2.2(i),(iii) and Lemma 2.4,

Since is a strict contraction, hence has a unique fixed point.

Definition 3.2 (Power means).

Let and For we denote by the unique solution of

For we define where We call the -weighted power mean of order of

Remark 3.3.

We note that and the -weighted arithmetic and harmonic means of respectively. For is the unique positive definite solution of

Indeed, if and only if

Remark 3.4.

Let Let defined by Then by the Loewner-Heinz inequality, is monotone: implies that By Theorem 3.1, is a strict contraction for the Thompson metric with the least contraction coefficient less than or equal to By the Banach fixed point theorem

Similarly, the map is a strict contraction for the Thompson metric and for any

Proposition 3.5.

Let and let Then for

where denotes the -diameter of In particular, the map defined by is continuous with respect to the Thompson metric on

Proof.

Let and

Step 1.

for all

Let By Lemmas 2.2 and 2.4,

and thus for all

Step 2.

for

Let and By Lemma 2.4 and Step 1,

which implies that

Step 3.

Let and Then

Step 4.

Let and Then from Lemmas 2.2(i),(iii) and 2.4

By the triangular inequality and Steps 2-4,

for

For , and for a permutation on -letters, we set

We list some basic properties of

Proposition 3.6.

Let and let

(1)

if the ’s commute;

(2)

(3)

for any permutation

(4)

if for all

(5)

(6)

for any

(7)

for any invertible

(8)

(9)

(10)

for any

(11)

if and only if In particular, if and only if

(12)

for if and only if

(13)

if then for any positive unital linear map where

Proof.

Item (5) was shown in Step 4 of the previous proposition. We provide a proof of (4). The other proofs are similar to those of Reference 17.

Suppose that for all Let Define and Then and for any by the Banach fixed point theorem. By the Loewner-Heinz inequality, for all and whenever Let Then and Inductively, we have for all Therefore,

Let Then and thus Therefore,

Remark 3.7.

By Proposition 3.6(2), the power mean is homogeneous and by (4) it is monotonic.

4. The power mean limit

In Reference 17 Lim and Pálfia have shown in the finite-dimensional setting that the Karcher or least-squares mean is the limit as of the (monotonically decreasing) family of power means . We take this characterization as the launch point for our approach to the infinite-dimensional Karcher mean.

We recall that the strong topology on the space of bounded linear operators is the topology of pointwise convergence. If a net of positive semidefinite operators converges strongly to , then the non-negative values must converge to a non-negative , so the cone is strongly closed. Hence the partial order is strongly closed, since and , in the strong topology imply strongly converges to . We also recall the well-known fact that any monotonically decreasing net of self-adjoint operators that is bounded below possesses an infimum to which it strongly converges (see, for example, Theorem 4.28(b) of Reference 23). Dually a monotonically increasing net that is bounded above strongly converges to its supremum.

For we define if for all and We note that the arithmetic-harmonic mean inequality.

Theorem 4.1.

Let and Then there exist such that

under the strong-operator topology. Define Then for

Proof.

Let and

Step 1.

for

Let be defined by By the Banach fixed point theorem, for any We observe from the fact and Lemma 2.1, parts (vii) and (viii), that

Applying the preceding to yields

Since is monotonic (Remark 3.4), for all Therefore,

Step 2.

for Let and let Then by definition of and We consider the map Then

Since is monotonic, for all By Remark 3.4,

Step 3.

for

Let and By Step 1, Thus,

Step 4.

Follows from Steps 1-3.

Finally the nets and are monotonic and bounded between and Therefore, there exist such that

under the strong-operator topology. By Step 2, for all Since the partial order on is strongly closed, their strong limits satisfy

Remark 4.2.

The monotonically decreasing property is new, even for the finite-dimensional setting considered by Lim and Pálfia Reference 17.

Definition 4.3.

We set and call it the -weighted Karcher mean of We set .

The basic properties of power means in Proposition 3.6 together with Theorem 4.1 provide some important properties of the Karcher mean.

Theorem 4.4.
(P1)

(Consistency with scalars) if the ’s commute;

(P2)

(Homogeneity)

(P3)

(Permutation invariance)

(P4)

(Monotonicity) if for all then

(P5)

(Continuity) for the Thompson metric ; in particular is continuous;

(P6)

(Invariancy) for any invertible

(P7)

(Joint concavity) for ;

(P8)

(Duality)

(P9)

(AGH weighted mean inequalities) .

Proof.

(P1) Follows from the fact that converges to with respect to the operator norm (see e.g. Reference 19) and by Proposition 3.6 and Theorem 4.1.

(P2) Let Then

(P5). Let and Let By Proposition 3.5, for the Thompson metric and thus for . Since for each , we have By the strong closedness of the order and the strong convergence of to as , Similarly . It follows from definition of the Thompson metric that

(P8). Follows from Proposition 3.6 (8),(9) and the strong continuity of inversion on bounded intervals (see Lemma 5.4).

By Proposition 3.6, Theorem 4.1 and the strong closedness of the order, the remaining parts of the proof are immediate.

Remark 4.5.

We note that satisfies all the preceding properties except the joint concavity. For a finite-dimensional Hilbert space, (see Reference 17) and the Karcher mean is the unique solution of the Karcher equation

Then, via the Karcher equation, the Karcher mean satisfies the joint homogeneity property (Reference 18) and the contraction property with respect to the Riemannian trace metric where the denote the eigenvalues of By Proposition 2.5, the contraction property holds for the Thompson metric. We eventually extend these results to the infinite-dimensional setting; see Theorem 6.8.

5. The Karcher equation

Let We consider the following non-linear operator equation on called the Karcher equation:

Note that multiplying by yields the equivalent equation

and we pass freely between the two.

In the finite-dimensional setting it is known that the least-squares mean (the Karcher mean) satisfies the Karcher equation, indeed is the unique positive solution of the Karcher equation (cf. Remark 4.5). Our goal in this section is to show that the Karcher mean we have defined in the preceding section satisfies the Karcher equation (and hence is aptly named).

Let denote the principal branch of the complex logarithm defined on defined by , where is the principal branch of the argument taking values in . By the holomorphic functional calculus is defined for any bounded linear operator with spectrum contained in , in particular with spectrum contained in . We recall the following standard fact.

Lemma 5.1.

For , satisfies the Karcher equation if and only if it satisfies

Proof.

We note that , so is similar to , and hence the latter has positive spectrum and thus a logarithm. Applying inner automorphism by to equation (Equation 5.7) then yields the equation

and applying inner automorphism by to equation (Equation 5.9) yields equation (Equation 5.7). Hence satisfies equation (Equation 5.7) if and only if (Equation 5.9). Now multiplying equation (Equation 5.9) by yields (Equation 5.8) and vice versa; hence the two have the same solution sets, and thus so do (Equation 5.7) and (Equation 5.8).

Lemma 5.2.

Operator multiplication is strongly jointly continuous when restricted to any bounded subset.

Proof.

If in the set of all operators with norm bounded by , strongly and strongly, then

where the last two terms go to by definition of strong convergence.

The following lemma is a special case of Theorem 3.6 from Kadison’s study of strongly continuous operator functions Reference 9 on self-adjoint operators.

Lemma 5.3.

Let be an open or closed subset of and let be a continuous bounded function. Then the corresponding operator function is strong-operator continuous on the set of bounded self-adjoint operators on a Hilbert space with spectra in .

Lemma 5.4.

The following functions are strongly continuous on .

(i) The logarithm map , which is also monotonic.

(ii) The power map for .

(iii) The binary weighted mean map for .

The last two functions have image contained in .

Proof.

Let . For , we have and so . This shows that , hence invertible, and hence . In a similar fashion one sees that is not in the spectrum of , hence spec. Since and are continuous and bounded on the bounded set , we conclude from Lemma 5.3 that the corresponding operator functions are strongly continuous when restricted to . It is well known that is monotonic on .

For and , by the Loewner-Heinz inequality. Since inversion is order-reversing, is stable under inversion, and it follows that also .

For , by Lemma 2.3(ii). Since

, and all partial steps in the computation are bounded in norm by (since ) for , the operation is strongly continuous by Lemma 5.2 and part (ii). Since the weighted mean is monotonic and idempotent,

Lemma 5.5.

Let . Then on any ball such that

where the limit is taken in the strong-operator topology.

Proof.

Let Then

Let Then as and strongly,

The following shows that the Karcher mean is a solution of the Karcher equation.

Theorem 5.6.

For each and , satisfies the Karcher equation (Equation 5.7).

Proof.

Let Let and let By Theorem 4.1, with respect to the strong topology monotonically as and for all Pick such that for all . It follows that for all .

By the order reversal of inversion is closed under inversion so that

and similarly . Thus for all and By part (ii) of Lemma 2.3 we have . By Lemma 5.2 and Lemma 5.4(ii), converges strongly to . By strong continuity of on (Lemma 5.4(i)),

for all By Lemma 5.5 applied to any open ball , , in the strong topology

for all

By definition, . Pre- and post-multiplying this equation by and substituting from equation (Equation 2.1) for the weighted mean yields for :

that is, By (Equation 5.10),

This shows that is a solution of the Karcher equation.

Corollary 5.7.

The operator also satisfies the Karcher equation.

Proof.

By definition , . By strong continuity (or order-reversion) of inversion and the definition of the negative power means, By the preceding theorem satisfies the Karcher equation:

Multiplying the equation by yields

so satisfies the Karcher equation.

6. Uniqueness of the Karcher mean

For and we consider the corresponding Karcher equation

We denote by the set of all positive definite solutions of the Karcher equation (Equation 6.11) and consider three important properties of this set.

Lemma 6.1.

for .

for any

Proof.

(1) Let Then

where the second equality follows from the fact that for any and Therefore, the left hand side equals iff the right hand side does, which translates to

(2) Let and let be the polar decomposition of , where and From , one computes directly from the Karcher equation as given in Lemma 5.1 that

Let Then (Lemma 5.1) implies that

and thus This implies that From we conclude that

By (Equation 6.12) and (Equation 6.13),

In the proof of the next theorem we use the following Banach space version of the Implicit Mapping Theorem, taken from Reference 12, Theorem 5.9 and its proof.

Theorem 6.2 (The Implicit Mapping Theorem).

Let be open sets in Banach spaces , respectively, and let be a -mapping, where is also a Banach space. Let and assume that and that

where is the partial with respect to the second variable, is a Banach space isomorphism. Then there exist neighborhoods of of and of and a -map such that and for , if and only if .

The Loewner order intervals are defined by and

Theorem 6.3.

Let Then there exists such that for any the Karcher equation has a unique solution in Furthermore, the Karcher mean is on a neighborhood of the diagonal in .

Proof.

We consider the map by

and for Then is and if and only if

We next employ the elementary differential calculus for open subsets of Banach spaces, as it appears, for example, in Reference 12, Chapter I,§3-5. The derivative of at is a bounded linear map from the tangent space at to the tangent space at , which may be considered as a linear map since the tangent bundles of and are trivial with fiber . Its action is given by

Let Then so and (which corresponds to and follows from the well-known fact that the exponential mapping has derivative the identity at ). Since satisfies , we conclude that the partial derivative of in the second variable, that is, the -variable, satisfies

Since is the identity map, in particular a Banach space isomorphism, from to , it follows directly from the Implicit Mapping Theorem (Theorem 6.2) that there exist open neighborhoods of in and of in and a -mapping such that and if and only if for all , .

Pick such that and . (This is possible since it is easily seen that is the closed -ball in the Thompson metric.) Then for any , by the monotonicity and idempotency of ,

and thus .

Since we have shown that the Karcher mean always satisfies the Karcher equation, we have , and hence by the preceding two paragraphs . Thus the Karcher mean is on the open neighborhood of and has a unique solution to the Karcher equation in , namely the Karcher mean .

Next, let Set Then for all By the preceding paragraph, is the unique solution to the Karcher equation that lies in . By Lemma 6.1 and Theorem 4.4

This shows that the Karcher equation has the unique solution on and is on the open neighborhood , since

Theorem 6.4.

The following conditions are equivalent and are all satisfied by .

(i)

is jointly homogeneous;

(ii)

is contractive for the Thompson metric;

(iii)

the equation

has a unique solution in for all and

(iv)

for all and

Furthermore, if and only if and if and only if for all and and

Proof.

We first show the satisfies condition (iii). Fix and . Define defined by . Then

where the first inequality follows from Theorem 4.4(P5) and the second from Lemma 2.2(iii). It follows that is a strict contraction for the Thompson metric and hence has a unique fixed point, which is the unique solution for the equation of (iii).

Next, we establish the equivalence between (i)-(iv).

(i) implies (ii): By Proposition 2.5 and the monotonicity of (Theorem 4.4).

(ii) implies (iii): Let and let For it follows from Lemma 2.2(iii) and the hypothesis that the map defined by is a strict contraction for the Thompson metric and hence has a unique fixed point on , i.e., has a unique solution in .

(iii) implies (iv): Let and pick as in Theorem 6.3. Pick such that

Then Clearly is then a solution of and by the monotonicity and idempotency of ,

must belong to . By the uniqueness of the Karcher solution on (Theorem 6.3),

By invariancy under congruence transformations (property (P6) of Theorem 4.4),

Since is one possibility for our original choice of , property (iii) implies This shows that and that is the unique solution of

(iv) implies (i): Follows from Lemma 6.1(1).

Finally, suppose that one of the equivalent conditions (i)-(iv) holds true. Let Then and hence

By (iv), that is,

By (ii) the map defined by is a strict contraction for the Thompson metric and has a unique fixed point. Thus

if and only if

We isolate as a separate theorem the main result of the preceding, a restatement of item (iv).

Theorem 6.5.

For , the Karcher mean is the unique solution of the Karcher equation

Remark 6.6.

In light of Theorem 6.5 it is more natural to define the Karcher mean to be the unique solution of the corresponding Karcher equation. That was certainly our motivation in naming it the Karcher mean from the beginning.

By Corollary 5.7 also satisfies the same Karcher equation as and hence by the uniqueness of solution, the two are equal. This yields the following corollary.

Corollary 6.7.

For a weight and , , and thus .

We gather together our results about the fundamental properties of the Karcher mean.

Theorem 6.8.

For a weight and , the following properties hold.

(P1)

(Consistency with scalars) if the ’s commute;

(P2)

(Joint homogeneity)

(P3)

(Permutation invariance)

(P4)

(Monotonicity) if for all then

(P5)

(Continuity) is contractive for the Thompson metric;

(P6)

(Congruence invariance) for any invertible

(P7)

(Joint concavity) for ;

(P8)

(Self-duality) and

(P9)

(AGH weighted mean inequalities)

Proof.

By Theorem 4.4 and Theorem 6.4, is jointly homogeneous and is contractive for the Thompson metric. The remaining properties appear in Theorem 4.4, except that (P8) is modified in light of the preceding corollary.

Remark 6.9.

The Karcher mean is uniquely determined by congruence invariancy self-duality and the following property

for all and Furthermore, implies that for all and the following statements are equivalent;

(i)

(ii)

for all and

(iii)

for all where

For a finite-dimensional Hilbert space, property and the previous characterization for the Karcher mean appear in Reference 24 and Reference 17, respectively. The proofs are heavily dependent on property (Y) and Hansen’s inequality and are similar to those of Reference 17Reference 24.

7. Subalgebras

For convenience and ease of presentation we have limited our considerations to the full algebra of bounded linear operators. However, we observe that the constructions can be carried out in large classes of subalgebras (which we assume always to contain the identity ). For any norm-closed -subalgebra of , will be its open cone of positive operators, and will be closed under the operation of taking weighted geometric means . Hence it will be closed under taking power means , since the power mean is the limit in the Thompson metric of a contractive map defined from the weighted geometric means on , and since the topology of the Thompson metric agrees with the relative operator norm topology. Since we have defined to be the strong limit of the monotonically decreasing family , we need that the subalgebra is monotone complete (actually, monotone -complete will suffice since one can restrict to and obtain the same infimum). Once one has closure under the Karcher mean for the subalgebra, then one sees readily that its properties that we have derived for the full algebra are inherited by the subalgebra, in particular its characterization as the unique solution of the corresponding Karcher equation. Since the von Neumann subalgebras are strongly closed, they in particular have Karcher means defined in the manner of this paper and satisfying the properties derived for it.

8. Open problems

We close with three open problems:

Problem 1.

Is the strong convergence of the power means to the Karcher mean actually convergence in the operator norm topology?

Problem 2.

For a given weight , is the map from to a -map? (We established this in a neighborhood of the diagonal.)

Problem 3.

For a given weight , is the map from to strongly continuous on order intervals ?

Acknowledgement

The authors wish to express their gratitude to Professor Dick Kadison for helpful insights about the strong topology, especially for his pointing us to reference Reference 9. The authors also thank the referee for helpful comments, particularly for pointing out a needed correction in the authors’ use of the Implicit Function Theorem in the proof of Theorem 6.3.

Mathematical Fragments

Equation (2.1)
Lemma 2.1.

Let and let Then

(i)

for ;

(ii)

for

(iii)

(Loewner-Heinz inequality) for and

(iv)

for

(v)

(vi)

for

(vii)

for

(viii)

for any As a special case, .

Equation (2.2)
Lemma 2.2 (Reference 3Reference 7Reference 14).

Basic properties of the Thompson metric on include

(i)

for any and

(ii)

(iii)

(iv)

Equation (2.3)
Lemma 2.3.

(i) For , ; the equality holds for in .

(ii)

For , , and , implies , and so . If , then .

Lemma 2.4.

Reference 15, Lemma 10.1(iv) Let Then

Proposition 2.5.

Let be jointly homogeneous and monotonic. Then the following contractive property for the Thompson metric is satisfied:

for all and

Theorem 3.1.

Let and let Then for each the following equation has a unique positive definite solution:

Remark 3.4.

Let Let defined by Then by the Loewner-Heinz inequality, is monotone: implies that By Theorem 3.1, is a strict contraction for the Thompson metric with the least contraction coefficient less than or equal to By the Banach fixed point theorem

Similarly, the map is a strict contraction for the Thompson metric and for any

Proposition 3.5.

Let and let Then for

where denotes the -diameter of In particular, the map defined by is continuous with respect to the Thompson metric on

Proposition 3.6.

Let and let

(1)

if the ’s commute;

(2)

(3)

for any permutation

(4)

if for all

(5)

(6)

for any

(7)

for any invertible

(8)

(9)

(10)

for any

(11)

if and only if In particular, if and only if

(12)

for if and only if

(13)

if then for any positive unital linear map where

Theorem 4.1.

Let and Then there exist such that

under the strong-operator topology. Define Then for

Theorem 4.4.
(P1)

(Consistency with scalars) if the ’s commute;

(P2)

(Homogeneity)

(P3)

(Permutation invariance)

(P4)

(Monotonicity) if for all then

(P5)

(Continuity) for the Thompson metric ; in particular is continuous;

(P6)

(Invariancy) for any invertible

(P7)

(Joint concavity) for ;

(P8)

(Duality)

(P9)

(AGH weighted mean inequalities) .

Remark 4.5.

We note that satisfies all the preceding properties except the joint concavity. For a finite-dimensional Hilbert space, (see Reference 17) and the Karcher mean is the unique solution of the Karcher equation

Then, via the Karcher equation, the Karcher mean satisfies the joint homogeneity property (Reference 18) and the contraction property with respect to the Riemannian trace metric where the denote the eigenvalues of By Proposition 2.5, the contraction property holds for the Thompson metric. We eventually extend these results to the infinite-dimensional setting; see Theorem 6.8.

Equation (5.7)
Lemma 5.1.

For , satisfies the Karcher equation if and only if it satisfies

Equation (5.9)
Lemma 5.2.

Operator multiplication is strongly jointly continuous when restricted to any bounded subset.

Lemma 5.3.

Let be an open or closed subset of and let be a continuous bounded function. Then the corresponding operator function is strong-operator continuous on the set of bounded self-adjoint operators on a Hilbert space with spectra in .

Lemma 5.4.

The following functions are strongly continuous on .

(i) The logarithm map , which is also monotonic.

(ii) The power map for .

(iii) The binary weighted mean map for .

The last two functions have image contained in .

Lemma 5.5.

Let . Then on any ball such that

where the limit is taken in the strong-operator topology.

Equation (5.10)
Corollary 5.7.

The operator also satisfies the Karcher equation.

Equation (6.11)
Lemma 6.1.

for .

for any

Equation (6.12)
Equation (6.13)
Theorem 6.2 (The Implicit Mapping Theorem).

Let be open sets in Banach spaces , respectively, and let be a -mapping, where is also a Banach space. Let and assume that and that

where is the partial with respect to the second variable, is a Banach space isomorphism. Then there exist neighborhoods of of and of and a -map such that and for , if and only if .

Theorem 6.3.

Let Then there exists such that for any the Karcher equation has a unique solution in Furthermore, the Karcher mean is on a neighborhood of the diagonal in .

Theorem 6.4.

The following conditions are equivalent and are all satisfied by .

(i)

is jointly homogeneous;

(ii)

is contractive for the Thompson metric;

(iii)

the equation

has a unique solution in for all and

(iv)

for all and

Furthermore, if and only if and if and only if for all and and

Theorem 6.5.

For , the Karcher mean is the unique solution of the Karcher equation

Theorem 6.8.

For a weight and , the following properties hold.

(P1)

(Consistency with scalars) if the ’s commute;

(P2)

(Joint homogeneity)

(P3)

(Permutation invariance)

(P4)

(Monotonicity) if for all then

(P5)

(Continuity) is contractive for the Thompson metric;

(P6)

(Congruence invariance) for any invertible

(P7)

(Joint concavity) for ;

(P8)

(Self-duality) and

(P9)

(AGH weighted mean inequalities)

References

Reference [1]
T. Ando, Chi-Kwong Li, and Roy Mathias, Geometric means, Linear Algebra Appl. 385 (2004), 305–334, DOI 10.1016/j.laa.2003.11.019. MR2063358 (2005f:47049),
Show rawAMSref \bib{ALM}{article}{ author={Ando, T.}, author={Li, Chi-Kwong}, author={Mathias, Roy}, title={Geometric means}, journal={Linear Algebra Appl.}, volume={385}, date={2004}, pages={305--334}, issn={0024-3795}, review={\MR {2063358 (2005f:47049)}}, doi={10.1016/j.laa.2003.11.019}, }
Reference [2]
Marcel Berger, A panoramic view of Riemannian geometry, Springer-Verlag, Berlin, 2003, DOI 10.1007/978-3-642-18245-7. MR2002701 (2004h:53001),
Show rawAMSref \bib{Be03}{book}{ author={Berger, Marcel}, title={A panoramic view of Riemannian geometry}, publisher={Springer-Verlag}, place={Berlin}, date={2003}, pages={xxiv+824}, isbn={3-540-65317-1}, review={\MR {2002701 (2004h:53001)}}, doi={10.1007/978-3-642-18245-7}, }
Reference [3]
Rajendra Bhatia, On the exponential metric increasing property, Linear Algebra Appl. 375 (2003), 211–220, DOI 10.1016/S0024-3795(03)00647-5. MR2013466 (2004m:53032),
Show rawAMSref \bib{Bh03}{article}{ author={Bhatia, Rajendra}, title={On the exponential metric increasing property}, journal={Linear Algebra Appl.}, volume={375}, date={2003}, pages={211--220}, issn={0024-3795}, review={\MR {2013466 (2004m:53032)}}, doi={10.1016/S0024-3795(03)00647-5}, }
Reference [4]
Rajendra Bhatia, Positive definite matrices, Princeton Series in Applied Mathematics, Princeton University Press, Princeton, NJ, 2007. MR2284176 (2007k:15005),
Show rawAMSref \bib{Bha}{book}{ author={Bhatia, Rajendra}, title={Positive definite matrices}, series={Princeton Series in Applied Mathematics}, publisher={Princeton University Press}, place={Princeton, NJ}, date={2007}, pages={x+254}, isbn={978-0-691-12918-1}, isbn={0-691-12918-5}, review={\MR {2284176 (2007k:15005)}}, }
Reference [5]
Rajendra Bhatia and John Holbrook, Riemannian geometry and matrix geometric means, Linear Algebra Appl. 413 (2006), no. 2-3, 594–618, DOI 10.1016/j.laa.2005.08.025. MR2198952 (2007c:15030),
Show rawAMSref \bib{BH}{article}{ author={Bhatia, Rajendra}, author={Holbrook, John}, title={Riemannian geometry and matrix geometric means}, journal={Linear Algebra Appl.}, volume={413}, date={2006}, number={2-3}, pages={594--618}, issn={0024-3795}, review={\MR {2198952 (2007c:15030)}}, doi={10.1016/j.laa.2005.08.025}, }
Reference [6]
Martin R. Bridson and André Haefliger, Metric spaces of non-positive curvature, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 319, Springer-Verlag, Berlin, 1999. MR1744486 (2000k:53038),
Show rawAMSref \bib{BrH}{book}{ author={Bridson, Martin R.}, author={Haefliger, Andr{\'e}}, title={Metric spaces of non-positive curvature}, series={Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]}, volume={319}, publisher={Springer-Verlag}, place={Berlin}, date={1999}, pages={xxii+643}, isbn={3-540-64324-9}, review={\MR {1744486 (2000k:53038)}}, }
Reference [7]
G. Corach, H. Porta, and L. Recht, Convexity of the geodesic distance on spaces of positive operators, Illinois J. Math. 38 (1994), no. 1, 87–94. MR1245836 (94i:58010),
Show rawAMSref \bib{CPR}{article}{ author={Corach, G.}, author={Porta, H.}, author={Recht, L.}, title={Convexity of the geodesic distance on spaces of positive operators}, journal={Illinois J. Math.}, volume={38}, date={1994}, number={1}, pages={87--94}, issn={0019-2082}, review={\MR {1245836 (94i:58010)}}, }
Reference [8]
Karl E. Gustafson and Duggirala K. M. Rao, Numerical range, Universitext, Springer-Verlag, New York, 1997. The field of values of linear operators and matrices, DOI 10.1007/978-1-4613-8498-4. MR1417493 (98b:47008),
Show rawAMSref \bib{GR}{book}{ author={Gustafson, Karl E.}, author={Rao, Duggirala K. M.}, title={Numerical range}, series={Universitext}, note={The field of values of linear operators and matrices}, publisher={Springer-Verlag}, place={New York}, date={1997}, pages={xiv+189}, isbn={0-387-94835-X}, review={\MR {1417493 (98b:47008)}}, doi={10.1007/978-1-4613-8498-4}, }
Reference [9]
Richard V. Kadison, Strong continuity of operator functions, Pacific J. Math. 26 (1968), 121–129. MR0231211 (37 #6766),
Show rawAMSref \bib{Ka}{article}{ author={Kadison, Richard V.}, title={Strong continuity of operator functions}, journal={Pacific J. Math.}, volume={26}, date={1968}, pages={121--129}, issn={0030-8730}, review={\MR {0231211 (37 \#6766)}}, }
Reference [10]
H. Karcher, Riemannian center of mass and mollifier smoothing, Comm. Pure Appl. Math. 30 (1977), no. 5, 509–541. MR0442975 (56 #1350),
Show rawAMSref \bib{Ka77}{article}{ author={Karcher, H.}, title={Riemannian center of mass and mollifier smoothing}, journal={Comm. Pure Appl. Math.}, volume={30}, date={1977}, number={5}, pages={509--541}, issn={0010-3640}, review={\MR {0442975 (56 \#1350)}}, }
Reference [11]
Fumio Kubo and Tsuyoshi Ando, Means of positive linear operators, Math. Ann. 246 (1979/80), no. 3, 205–224, DOI 10.1007/BF01371042. MR563399 (84d:47028),
Show rawAMSref \bib{KA}{article}{ author={Kubo, Fumio}, author={Ando, Tsuyoshi}, title={Means of positive linear operators}, journal={Math. Ann.}, volume={246}, date={1979/80}, number={3}, pages={205--224}, issn={0025-5831}, review={\MR {563399 (84d:47028)}}, doi={10.1007/BF01371042}, }
Reference [12]
Serge Lang, Fundamentals of differential geometry, Graduate Texts in Mathematics, vol. 191, Springer-Verlag, New York, 1999, DOI 10.1007/978-1-4612-0541-8. MR1666820 (99m:53001),
Show rawAMSref \bib{Lang}{book}{ author={Lang, Serge}, title={Fundamentals of differential geometry}, series={Graduate Texts in Mathematics}, volume={191}, publisher={Springer-Verlag}, place={New York}, date={1999}, pages={xviii+535}, isbn={0-387-98593-X}, review={\MR {1666820 (99m:53001)}}, doi={10.1007/978-1-4612-0541-8}, }
Reference [13]
Jimmie D. Lawson and Yongdo Lim, The geometric mean, matrices, metrics, and more, Amer. Math. Monthly 108 (2001), no. 9, 797–812, DOI 10.2307/2695553. MR1864051 (2002h:15023),
Show rawAMSref \bib{LL01}{article}{ author={Lawson, Jimmie D.}, author={Lim, Yongdo}, title={The geometric mean, matrices, metrics, and more}, journal={Amer. Math. Monthly}, volume={108}, date={2001}, number={9}, pages={797--812}, issn={0002-9890}, review={\MR {1864051 (2002h:15023)}}, doi={10.2307/2695553}, }
Reference [14]
Jimmie Lawson and Yongdo Lim, Metric convexity of symmetric cones, Osaka J. Math. 44 (2007), no. 4, 795–816. MR2383810 (2009f:46095),
Show rawAMSref \bib{LL}{article}{ author={Lawson, Jimmie}, author={Lim, Yongdo}, title={Metric convexity of symmetric cones}, journal={Osaka J. Math.}, volume={44}, date={2007}, number={4}, pages={795--816}, issn={0030-6126}, review={\MR {2383810 (2009f:46095)}}, }
Reference [15]
Jimmie Lawson and Yongdo Lim, A general framework for extending means to higher orders, Colloq. Math. 113 (2008), no. 2, 191–221, DOI 10.4064/cm113-2-3. MR2425082 (2009f:47028),
Show rawAMSref \bib{LL12}{article}{ author={Lawson, Jimmie}, author={Lim, Yongdo}, title={A general framework for extending means to higher orders}, journal={Colloq. Math.}, volume={113}, date={2008}, number={2}, pages={191--221}, issn={0010-1354}, review={\MR {2425082 (2009f:47028)}}, doi={10.4064/cm113-2-3}, }
Reference [16]
Jimmie Lawson and Yongdo Lim, Monotonic properties of the least squares mean, Math. Ann. 351 (2011), no. 2, 267–279, DOI 10.1007/s00208-010-0603-6. MR2836658,
Show rawAMSref \bib{LL1}{article}{ author={Lawson, Jimmie}, author={Lim, Yongdo}, title={Monotonic properties of the least squares mean}, journal={Math. Ann.}, volume={351}, date={2011}, number={2}, pages={267--279}, issn={0025-5831}, review={\MR {2836658}}, doi={10.1007/s00208-010-0603-6}, }
Reference [17]
Yongdo Lim and Miklós Pálfia, Matrix power means and the Karcher mean, J. Funct. Anal. 262 (2012), no. 4, 1498–1514, DOI 10.1016/j.jfa.2011.11.012. MR2873848 (2012m:15025),
Show rawAMSref \bib{LP}{article}{ author={Lim, Yongdo}, author={P{\'a}lfia, Mikl{\'o}s}, title={Matrix power means and the Karcher mean}, journal={J. Funct. Anal.}, volume={262}, date={2012}, number={4}, pages={1498--1514}, issn={0022-1236}, review={\MR {2873848 (2012m:15025)}}, doi={10.1016/j.jfa.2011.11.012}, }
Reference [18]
Maher Moakher, A differential geometric approach to the geometric mean of symmetric positive-definite matrices, SIAM J. Matrix Anal. Appl. 26 (2005), no. 3, 735–747 (electronic), DOI 10.1137/S0895479803436937. MR2137480 (2005m:15059),
Show rawAMSref \bib{Mo05}{article}{ author={Moakher, Maher}, title={A differential geometric approach to the geometric mean of symmetric positive-definite matrices}, journal={SIAM J. Matrix Anal. Appl.}, volume={26}, date={2005}, number={3}, pages={735--747 (electronic)}, issn={0895-4798}, review={\MR {2137480 (2005m:15059)}}, doi={10.1137/S0895479803436937}, }
Reference [19]
Roger D. Nussbaum and Joel E. Cohen, The arithmetic-geometric mean and its generalizations for noncommuting linear operators, Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 15 (1988), no. 2, 239–308 (1989). MR1007399 (90e:47018),
Show rawAMSref \bib{NC}{article}{ author={Nussbaum, Roger D.}, author={Cohen, Joel E.}, title={The arithmetic-geometric mean and its generalizations for noncommuting linear operators}, journal={Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4)}, volume={15}, date={1988}, number={2}, pages={239--308 (1989)}, issn={0391-173X}, review={\MR {1007399 (90e:47018)}}, }
Reference [20]
Roger D. Nussbaum, Finsler structures for the part metric and Hilbert’s projective metric and applications to ordinary differential equations, Differential Integral Equations 7 (1994), no. 5-6, 1649–1707. MR1269677 (95b:58010),
Show rawAMSref \bib{Nu1}{article}{ author={Nussbaum, Roger D.}, title={Finsler structures for the part metric and Hilbert's projective metric and applications to ordinary differential equations}, journal={Differential Integral Equations}, volume={7}, date={1994}, number={5-6}, pages={1649--1707}, issn={0893-4983}, review={\MR {1269677 (95b:58010)}}, }
Reference [21]
Karl-Theodor Sturm, Probability measures on metric spaces of nonpositive curvature, (Paris, 2002), Contemp. Math., vol. 338, Amer. Math. Soc., Providence, RI, 2003, pp. 357–390, DOI 10.1090/conm/338/06080. MR2039961 (2005d:60009),
Show rawAMSref \bib{St03}{article}{ author={Sturm, Karl-Theodor}, title={Probability measures on metric spaces of nonpositive curvature}, conference={ title={}, address={Paris}, date={2002}, }, book={ series={Contemp. Math.}, volume={338}, publisher={Amer. Math. Soc.}, place={Providence, RI}, }, date={2003}, pages={357--390}, review={\MR {2039961 (2005d:60009)}}, doi={10.1090/conm/338/06080}, }
Reference [22]
A. C. Thompson, On certain contraction mappings in a partially ordered vector space. , Proc. Amer. Math. soc. 14 (1963), 438–443. MR0149237 (26 #6727),
Show rawAMSref \bib{Th}{article}{ author={Thompson, A. C.}, title={On certain contraction mappings in a partially ordered vector space. }, journal={Proc. Amer. Math. soc.}, volume={14}, date={1963}, pages={438--443}, issn={0002-9939}, review={\MR {0149237 (26 \#6727)}}, }
Reference [23]
Joachim Weidmann, Linear operators in Hilbert spaces, Graduate Texts in Mathematics, vol. 68, Springer-Verlag, New York, 1980. Translated from the German by Joseph Szücs. MR566954 (81e:47001),
Show rawAMSref \bib{We}{book}{ author={Weidmann, Joachim}, title={Linear operators in Hilbert spaces}, series={Graduate Texts in Mathematics}, volume={68}, note={Translated from the German by Joseph Sz\"ucs}, publisher={Springer-Verlag}, place={New York}, date={1980}, pages={xiii+402}, isbn={0-387-90427-1}, review={\MR {566954 (81e:47001)}}, }
Reference [24]
T. Yamazaki, The Riemannian mean and matrix inequalities related to the Ando-Hiai inequality and chaotic order, Operators and Matrices 6 (2012), 577–588.

Article Information

MSC 2010
Primary: 47B65 (Positive operators and order-bounded operators)
Secondary: 47L07 (Convex sets and cones of operators), 15B48 (Positive matrices and their generalizations; cones of matrices)
Keywords
  • Positive operator
  • operator mean
  • Karcher equation
  • Karcher mean
  • power mean
Author Information
Jimmie Lawson
Department of Mathematics, Louisiana State University, Baton Rouge, Louisiana 70803
lawson@math.lsu.edu
MathSciNet
Yongdo Lim
Department of Mathematics, Sungkyunkwan University, Suwon 440-746, Korea
ylim@skku.edu
MathSciNet
Additional Notes

The work of the second author was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MEST) (No. 2012-005191).

Journal Information
Transactions of the American Mathematical Society, Series B, Volume 1, Issue 1, ISSN 2330-0000, published by the American Mathematical Society, Providence, Rhode Island.
Publication History
This article was received on , revised on , and published on .
Copyright Information
Copyright 2014 by the authors under Creative Commons Attribution-Noncommercial 3.0 License (CC BY NC 3.0)
Article References
  • Permalink
  • Permalink (PDF)
  • DOI 10.1090/S2330-0000-2014-00003-4
  • MathSciNet Review: 3148817
  • Show rawAMSref \bib{3148817}{article}{ author={Lawson, Jimmie}, author={Lim, Yongdo}, title={Karcher means and Karcher equations of positive definite operators}, journal={Trans. Amer. Math. Soc. Ser. B}, volume={1}, number={1}, date={2014}, pages={1-22}, issn={2330-0000}, review={3148817}, doi={10.1090/S2330-0000-2014-00003-4}, }

Settings

Change font size
Resize article panel
Enable equation enrichment

Note. To explore an equation, focus it (e.g., by clicking on it) and use the arrow keys to navigate its structure. Screenreader users should be advised that enabling speech synthesis will lead to duplicate aural rendering.

For more information please visit the AMS MathViewer documentation.