Parametric models and information geometry on W*-algebras

We introduce the notion of a smooth parametric model of normal positive linear functionals on possibly infinite-dimensional W*-algebras, generalizing the notions of parametric models used in classical and quantum information geometry. We then use the Jordan product naturally available in this context to define a Riemannian metric tensor on parametric models satisfying suitable regularity conditions. This Riemannian metric tensor reduces to the Fisher-Rao metric tensor, to the Fubini-Study metric tensor, or to the Bures-Helstrom metric tensor when suitable choices for the W*-algebra and the models are made.


Introduction
In classical and quantum information geometry one usually deals with parametrized subsets of probability distributions or of quantum states, respectively, colloquially referred to as parametric models. A typical example in the classical context is given by the family of Gaussian probability distributions, or, in the quantum context, by the family of quantum coherent states.
From both the conceptual and the practical point of view, there may be physical or theoretical constraints leading to situations in which only a certain family of probability distributions or quantum states can be modelled or physically realized (think again of Gaussian probability distributions and quantum coherent states), thus justifying the choice to work with parametric models.
From a purely mathematical point of view, on the other hand, working with parametric models is mandatory if we want to exploit the mathematical formalism of standard differential geometry [1,43,50]. Indeed, neither the space of probability distributions on a measurable outcome space nor the space of quantum states, identified with the space of density operators on a complex, separable Hilbert space, possesses the structure of a smooth manifold. Quite interestingly, this already happens in finite dimensions: in the classical case, the space of probability distributions on a discrete and finite outcome space $\mathcal{X}_n$ (with $n$ elements) can be naturally identified with the unit simplex in $\mathbb{R}^n$, which is a typical example of a smooth manifold with corners [54]; in the quantum case, the space of quantum states identified with the space of density operators on a finite-dimensional complex Hilbert space $\mathcal{H}$ is a smooth manifold with boundary known as the Bloch ball when $\dim(\mathcal{H}) = 2$ [11,35] and a stratified manifold when $\dim(\mathcal{H}) > 2$ [24]. In infinite dimensions, the situation is even worse given the technicalities associated with infinite-dimensional differential geometry.
While it can be argued that there are approaches aiming to build an infinite-dimensional non-parametric theory both in the classical [64] and the quantum case [42], we believe that they truly are parametric models in which the parameters lie in an infinite-dimensional manifold. Indeed, the seminal work by Pistone and Sempi [64] deals with the Banach manifold structure not on the whole space of probability distributions on a measure space, but rather on the space of all those probability distributions mutually absolutely continuous with respect to a given reference probability measure $\mu$. This choice clearly selects what can be reasonably called a parametric model of probability distributions. Something similar happens in the work by Jenčová [42], in which a Banach manifold structure is given not to the whole space of states on a W*-algebra $\mathcal{A}$, but rather to the space of faithful normal states on $\mathcal{A}$.
Consequently, in order to use the tools of standard differential geometry, as is customary in classical and quantum information geometry [4,5,51,58,67], we must necessarily accept the need to work with parametric models. The classical case has been thoroughly and systematically investigated also in the infinite-dimensional setting [7,8,9], while, to the best of our knowledge, the information geometry of parametric models of quantum states (especially in the infinite-dimensional setting) is still vastly unexplored.
The aim of this work is to start the exploration of this land, and to do it in such a way that both the classical and the quantum case can be handled simultaneously. The key to achieving such a unification is found in the theory of W*-algebras. Indeed, as will be thoroughly explained in the following, both probability distributions and quantum states can be thought of as states (in the functional analytic sense) on a suitable W*-algebra $\mathcal{A}$ which is Abelian in the classical case and non-Abelian in the quantum one. It is worth mentioning that the use of W*-algebras to deal with probability distributions on a discrete and finite outcome space in the context of classical information geometry has been recently proposed [36,37,38]. Moreover, the use of W*-algebras as a common arena for classical and quantum information geometry was also recently introduced by some of us in the finite-dimensional case [20,21].
In this work we aim to discuss in detail a definition of possibly infinite-dimensional parametric models in the context of W*-algebras which is well-adapted to classical and quantum information geometry. Moreover, we will also explore the Riemannian aspects of these models, showing how well-known geometric structures like the Fisher-Rao metric tensor or the Bures-Helstrom metric tensor are actually connected with algebraic operations in the W*-algebra under consideration, in particular, with the Jordan product on its self-adjoint part.
It is important to note that, following the spirit of [8], we introduce a notion of parametric model that does not require the normalization condition usually implemented when dealing with probability distributions and quantum states (i.e., the normalization of the total volume for probability distributions, or the normalization of the trace for quantum states). This choice reflects the idea that the normalization condition is a kind of ad hoc constraint which carries no meaningful information (for instance, the choice of 1 instead of 73 as a normalization constant is completely irrelevant).
The work is structured as follows. In section 2 we introduce all the background material we need on W*-algebras and their relation with classical and quantum information geometry. In section 3 we introduce the notion of a smooth parametric model of normal positive linear functionals on a W*-algebra $\mathcal{A}$. This notion allows us to treat essentially all the parametric models used in classical and quantum information geometry in the same mathematical framework. In section 4 we further refine the notion of parametric model in order to deal with geometric aspects related to Riemannian geometry. Essentially, we characterize all those parametric models for which the algebraic structure of $\mathcal{A}$ leads to the definition of a Riemannian metric tensor suitably generalizing the Fisher-Rao metric tensor. In section 5 we discuss three meaningful examples. First of all, we discuss the finite-dimensional case, in which almost all the technical difficulties disappear and there is a clear link with the Jordan-algebra analogue of Kirillov's theory of co-adjoint orbits. Then, we discuss the case in which $\mathcal{A}$ is Abelian and show that the classical case of parametric models of probability distributions endowed with the Fisher-Rao metric tensor falls perfectly within the setting we introduced. Finally, we discuss the case of rank-one, strongly-continuous unitary models on a complex, separable Hilbert space $\mathcal{H}$. This family of models is broad enough to encompass various models used in the quantum setting (e.g., the case of normal pure states of $\mathcal{B}(\mathcal{H})$ endowed with the Fubini-Study metric tensor, and various notions of generalized coherent states). Section 6 closes the work with some thoughts and comments on possible future investigations stemming from the formalism introduced.

W*-algebras and normal positive linear functionals
Before giving the precise definition of a smooth parametric model of normal positive linear functionals on a W*-algebra $\mathcal{A}$, we have to recall some notions and some well-known results about C*-algebras and W*-algebras. We assume basic knowledge of functional analysis (e.g., Banach spaces, Hilbert spaces, $L^p$-spaces), and we refer to standard introductory texts such as [12,13,65,68] for additional information on the subject.

Definition 1. A complex Banach space $(\mathcal{A}, +, \|\cdot\|)$ on which there is an associative product $(a, b) \mapsto a \cdot b \equiv ab$ such that $\|ab\| \leq \|a\|\,\|b\|$ is called a complex Banach algebra. A complex Banach algebra $(\mathcal{A}, +, \|\cdot\|, \cdot)$ on which there is a bounded, anti-linear map $a \mapsto a^\dagger \in \mathcal{A}$ such that $(a^\dagger)^\dagger = a$ and $(ab)^\dagger = b^\dagger a^\dagger$ is called a complex involutive Banach algebra. A complex involutive Banach algebra $(\mathcal{A}, +, \|\cdot\|, \cdot, \dagger)$ such that $\|a\|^2 = \|a^\dagger a\|$ is called a C*-algebra. For the sake of notational simplicity, we simply write $\mathcal{A}$ to denote a C*-algebra $(\mathcal{A}, +, \|\cdot\|, \cdot, \dagger)$. A W*-algebra is a C*-algebra $\mathcal{A}$ which is isomorphic to the Banach dual space of a Banach space $B(\mathcal{A})$ called the predual of $\mathcal{A}$. This predual space is essentially unique [68], so that $B(\mathcal{A})$ is determined by $\mathcal{A}$.
If $\mathcal{A}$ admits an identity $\mathbb{I}$, it is called a unital C*-algebra. Note that every W*-algebra has an identity [65, Par. 1.7]. An element $a$ in the C*-algebra $\mathcal{A}$ such that $a^\dagger = a$ is called self-adjoint, while an element $a$ such that $a^\dagger = -a$ is called skew-adjoint. The set of self-adjoint elements in $\mathcal{A}$ is denoted by $\mathcal{A}_{sa}$ and it is a real Banach subspace of the Banach space underlying $\mathcal{A}$ when the latter is considered as a real Banach space. More importantly, every skew-adjoint element $a \in \mathcal{A}$ can be uniquely written as $a = ib$ for some $b \in \mathcal{A}_{sa}$, and every element $a \in \mathcal{A}$ can be uniquely written as $a = b + ic$ for some $b, c \in \mathcal{A}_{sa}$. This fact implies that, as a real Banach space, $\mathcal{A}$ is the direct sum of $\mathcal{A}_{sa}$ with $i\mathcal{A}_{sa}$ (the space of skew-adjoint elements), while, as a complex Banach space, $\mathcal{A}$ is the complexification of $\mathcal{A}_{sa}$.
The set $\mathcal{A}_{sa}$ endowed with the anti-commutator product
$$\{a, b\} := \tfrac{1}{2}(ab + ba) \qquad (1)$$
is also an important example of a Banach-Jordan algebra [2], that is, a Banach space which is also a Jordan algebra whose Jordan product is continuous in the norm topology. Quite interestingly, $\mathcal{A}_{sa}$ can be turned into a Banach-Lie algebra (a Banach space which is also a Lie algebra whose Lie product is continuous in the norm topology) with the Lie product of equation (2), which is built from the commutator. The Jordan product and the Lie product on $\mathcal{A}_{sa}$ speak to each other according to the identity in equation (3), valid for all $a, b, c \in \mathcal{A}_{sa}$. Equation (3) can be verified by a direct computation for $\mathcal{A}_{sa}$, and it is at the heart of the definition of abstract Lie-Jordan algebras [26,27].
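The interplay between the Jordan product and the commutator can be checked numerically. The following is a minimal sketch, not the paper's equation (3): it assumes the normalization $\{a, b\} = \tfrac{1}{2}(ab + ba)$ and uses the plain commutator $[a, b] = ab - ba$ (any Lie-product normalization is left out), and it verifies on illustrative $2 \times 2$ self-adjoint matrices that the Jordan associator satisfies $\{\{a,b\},c\} - \{a,\{b,c\}\} = \tfrac{1}{4}[b,[a,c]]$. The helper names (`mul`, `lin`, `jordan`, `comm`) and the sample matrices are hypothetical.

```python
# Minimal sketch: 2x2 complex matrices as nested lists (standard library only).

def mul(x, y):
    """Matrix product of two 2x2 matrices."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def lin(x, y, s=1.0):
    """Entrywise x + s * y."""
    return [[x[i][j] + s * y[i][j] for j in range(2)] for i in range(2)]

def scale(c, x):
    """Entrywise c * x."""
    return [[c * x[i][j] for j in range(2)] for i in range(2)]

def jordan(x, y):
    """Jordan product {x, y} = (xy + yx) / 2 (assumed normalization)."""
    return scale(0.5, lin(mul(x, y), mul(y, x)))

def comm(x, y):
    """Plain commutator [x, y] = xy - yx."""
    return lin(mul(x, y), mul(y, x), s=-1.0)

def max_abs_diff(x, y):
    """Largest entrywise distance between two matrices."""
    return max(abs(x[i][j] - y[i][j]) for i in range(2) for j in range(2))

# Illustrative self-adjoint (Hermitian) matrices.
a = [[1, 2 - 1j], [2 + 1j, -3]]
b = [[0.5, 1j], [-1j, 2]]
c = [[-1, 4], [4, 0.25]]

# Jordan associator {{a, b}, c} - {a, {b, c}}.
associator = lin(jordan(jordan(a, b), c), jordan(a, jordan(b, c)), s=-1.0)
# Double commutator [b, [a, c]] / 4.
double_commutator = scale(0.25, comm(b, comm(a, c)))

identity_error = max_abs_diff(associator, double_commutator)
symmetry_error = max_abs_diff(jordan(a, b), jordan(b, a))
```

The associator measures the failure of associativity of the Jordan product, and the check shows that this failure is governed entirely by the commutator, which is the mechanism behind Lie-Jordan compatibility conditions.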
What is perhaps the prototypical example of a unital C*-algebra is the space $\mathcal{B}(\mathcal{H})$ of bounded linear operators on the complex Hilbert space $\mathcal{H}$, in which the addition and multiplication are those between linear operators, the norm is the operator norm, and the involution is the map sending a linear operator to its adjoint in the Hilbert-space sense. Accordingly, $\mathcal{B}_{sa}(\mathcal{H})$ is the space of bounded self-adjoint linear operators on $\mathcal{H}$, while $\mathcal{B}_+(\mathcal{H})$ is the space of bounded positive (non-negative) linear operators on $\mathcal{H}$. The C*-algebra $\mathcal{B}(\mathcal{H})$ is non-commutative whenever $\dim(\mathcal{H}) \geq 2$. From standard results in the theory of bounded operators on Hilbert spaces it follows that $\mathcal{B}(\mathcal{H})$ is the dual Banach space of the space $\mathcal{T}(\mathcal{H})$ of trace-class linear operators on $\mathcal{H}$. Consequently, $\mathcal{B}(\mathcal{H})$ is a W*-algebra. The W*-algebra $\mathcal{B}(\mathcal{H})$ is the one most often used to describe quantum systems, especially in the context of the so-called standard quantum mechanics [14,48,49], in quantum information theory and quantum computing [56,63], and in quantum information geometry [5,20,21].
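On the other hand, there are also examples of Abelian (commutative) C*-algebras which are of paramount importance to describe classical systems, especially in the context of statistical mechanics [48,49] and classical information geometry [36,37,38]. For instance, take a topological space $\mathcal{X}$ and consider the space $C(\mathcal{X})$ of complex-valued continuous functions on $\mathcal{X}$. This space is clearly an algebra with respect to the usual addition and multiplication between functions, and it admits an algebraic involution given by complex conjugation, i.e., $f^\dagger(x) = \overline{f(x)}$. Without appealing to additional structures on $\mathcal{X}$, we can define a norm-like function on $C(\mathcal{X})$ given by the supremum, i.e., $\|f\|_\infty := \sup\{|f(x)| : x \in \mathcal{X}\}$. Clearly, unless $\mathcal{X}$ is compact, there are functions for which $\|f\|_\infty$ is not finite. We are thus led to consider the subspace $C_b(\mathcal{X})$ of bounded continuous functions, and the subspace $C_0(\mathcal{X})$ of continuous functions vanishing at infinity, that is, those $f$ such that for every $\epsilon > 0$ there is a compact subset $K \subseteq \mathcal{X}$ with $|f(x)| < \epsilon$ for all $x \notin K$. Both are easily seen to be Banach spaces with respect to $\|\cdot\|_\infty$, and, more importantly, both $(C_b(\mathcal{X}), +, \|\cdot\|_\infty, \cdot, \dagger)$ and $(C_0(\mathcal{X}), +, \|\cdot\|_\infty, \cdot, \dagger)$ are easily seen to be C*-algebras for every topological Hausdorff space $\mathcal{X}$, denoted $C_b(\mathcal{X})$ and $C_0(\mathcal{X})$, respectively, with an evident abuse of notation. Quite often, the topological space $\mathcal{X}$ is taken to be locally compact and Hausdorff. We can motivate this choice noting that, when $\mathcal{X}$ is not Hausdorff, we can always consider its Stone-Čech compactification $\beta\mathcal{X}$ and $C_b(\mathcal{X}) \cong C_b(\beta\mathcal{X}) \equiv C(\beta\mathcal{X})$, and when $\mathcal{X}$ is not locally compact then $C_0(\mathcal{X})$ may be "very small" or even trivial (e.g., when $\mathcal{X}$ is an infinite-dimensional Banach space, because by Riesz's lemma every compact subset $K$ has empty interior).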
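A relevant example of an Abelian W*-algebra is borrowed from measure theory. Specifically, let $\mathcal{X} = (X, \Sigma)$ be a measurable space and let $\mu$ be a measure on it. Let $F(\mathcal{X})$ denote the space of complex-valued measurable functions on $\mathcal{X}$, and define
$$\mathcal{L}^\infty(\mathcal{X}, \mu) := \{ f \in F(\mathcal{X}) : \exists\, C \geq 0 \text{ such that } |f(x)| \leq C \text{ for } \mu\text{-almost every } x \}. \qquad (5)$$
Clearly, we can endow $\mathcal{L}^\infty(\mathcal{X}, \mu)$ with the usual pointwise product among functions, and with the involution $\dagger$ associated with complex conjugation as is done for continuous functions.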

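Concerning the norm, if we set $\|f\|_\infty := \inf\{ C \geq 0 : |f(x)| \leq C \text{ for } \mu\text{-almost every } x \}$, we only obtain a seminorm on $\mathcal{L}^\infty(\mathcal{X}, \mu)$, because $\|f\|_\infty = 0$ for every $f$ in the subspace $N$ of functions vanishing $\mu$-almost everywhere.
However, defining all algebraic operations "by projection", it is an instructive exercise to prove that $L^\infty(\mathcal{X}, \mu) = (\mathcal{L}^\infty(\mathcal{X}, \mu)/N, +, \|\cdot\|_\infty)$ is a complex Banach space and that $(L^\infty(\mathcal{X}, \mu), \cdot, \dagger)$ is a C*-algebra, which is denoted by $L^\infty(\mathcal{X}, \mu)$ with an evident abuse of notation. Then, the standard theory of $L^p$-spaces guarantees that $L^\infty(\mathcal{X}, \mu)$ is the Banach space dual of $L^1(\mathcal{X}, \mu)$, so that $L^\infty(\mathcal{X}, \mu)$ is indeed an Abelian W*-algebra.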
Let us now turn our attention from C*-algebras and W*-algebras to positive linear functionals on them. As the name suggests, they are elements of the Banach dual space $\mathcal{A}^*$ of the C*-algebra (or W*-algebra) $\mathcal{A}$, but of a very special kind. Indeed, they basically formalize and generalize the notion of positive measure from measure theory to the framework of C*-algebras and W*-algebras. From a more practical point of view, a positive linear functional on $\mathcal{A}$ is an element of $\mathcal{A}^*_{sa} \subset \mathcal{A}^*$, that is, a bounded linear functional which takes real values on self-adjoint elements (these functionals are also called self-adjoint). Moreover, as the name already suggests, they satisfy a particular positivity property. In order to state this property, we first introduce the notion of positive element in $\mathcal{A}$. An element $a \in \mathcal{A}$ such that $a = b^\dagger b$ for some $b \in \mathcal{A}$ is called positive, and it is clear that every positive element is also self-adjoint. The space of positive elements in $\mathcal{A}$ is denoted by $\mathcal{A}_+$ and is a norm-closed convex cone inside $\mathcal{A}_{sa}$.
In the classical cases discussed above, it is immediate to check that positive elements correspond to non-negative functions (or equivalence classes of non-negative functions), while, in the quantum case, they correspond to bounded non-negative linear operators, that is, all those bounded linear operators whose spectrum lies on the non-negative half-line. Actually, the notion of spectrum also makes sense for elements in a C*-algebra or W*-algebra, in such a way that positive elements are precisely those elements whose spectrum lies on the non-negative half-line, as happens for bounded linear operators.
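Then the set $\mathcal{P}$ of positive linear functionals on $\mathcal{A}$ is defined as
$$\mathcal{P} := \{ \omega \in \mathcal{A}^*_{sa} : \omega(a) \geq 0 \ \ \forall\, a \in \mathcal{A}_+ \}. \qquad (6)$$
It is almost trivial to check that $\mathcal{P}$ is a convex cone in $\mathcal{A}^*_{sa}$. A positive linear functional $\omega$ is called faithful if $\omega(a) > 0$ for every non-zero $a \in \mathcal{A}_+$. The space of faithful positive linear functionals is often denoted by $\mathcal{P}_+$. A positive linear functional with unit norm is called a state, and the space of (faithful) states on $\mathcal{A}$ is denoted by $\mathcal{S}$ ($\mathcal{S}_+$). Clearly, $\mathcal{S}$ is a convex set, and it is weakly*-compact if and only if $\mathcal{A}$ has an identity element [13, Thm. 2.3.15]. In this case, $\mathcal{S}$ is the weak*-closure of the convex envelope of the space of pure states on $\mathcal{A}$. A state $\rho$ is defined to be pure if, given $\omega \in \mathcal{P}$, it holds that $\rho - \omega$ is a positive linear functional if and only if $\omega = \lambda\rho$ with $0 \leq \lambda \leq 1$.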
Positive linear functionals on $\mathcal{A}$ possess particularly nice properties:
When $\mathcal{A}$ is a W*-algebra we may single out an interesting family of elements in $\mathcal{P}$ called normal. Specifically, we recall that $\mathcal{A}$ is isomorphic to the Banach dual space of its predual space $B$, and thus there is the canonical immersion of $B$ into its double dual $B^{**} \cong \mathcal{A}^*$ given by $x \mapsto \xi_x$ with $\xi_x(a) = a(x)$, where $a$ is identified with an element of the dual of $B$. We denote by $\mathcal{A}_*$ the norm-closed image of the predual space $B$ through the canonical immersion, and the linear functionals in $\mathcal{A}_*$ are called normal. From a topological point of view, the set $\mathcal{A}_*$ of normal linear functionals on $\mathcal{A}$ is the set of all norm-continuous linear functionals on $\mathcal{A}$ which are also continuous with respect to the weak topology on $\mathcal{A}$ determined by the predual space $B$ [65, thm. 1.13.2, p. 28]. This weak topology on $\mathcal{A}$ is referred to as the σ-weak (or ultraweak) topology on $\mathcal{A}$. Quite obviously, the normal positive linear functionals (n.p.l.f.s) are just the positive linear functionals lying in $\mathcal{A}_*$, while normal states are the states lying in $\mathcal{A}_*$. In the rest of the work, the symbols $\mathcal{P}$ and $\mathcal{S}$ refer to the sets of n.p.l.f.s and of normal states, respectively. It turns out that a positive linear functional $\omega$ being normal is equivalent to the validity of the equality
$$\omega\Big(\sum_{j \in J} e_j\Big) = \sum_{j \in J} \omega(e_j) \qquad (7)$$
for every orthogonal family $\{e_j\}_{j \in J}$ of projections in $\mathcal{A}$ [68, cor. III.3.11, p. 136].
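Going back to the quantum case $\mathcal{A} = \mathcal{B}(\mathcal{H})$, we recall that $\mathcal{A}$ is the dual space of the space $\mathcal{T}(\mathcal{H})$ of trace-class linear operators on $\mathcal{H}$. The canonical immersion of $\mathcal{T}(\mathcal{H})$ into $\mathcal{A}^* = (\mathcal{B}(\mathcal{H}))^*$ is given by $\hat{\xi} \mapsto \xi$ with
$$\xi(a) = \mathrm{Tr}_{\mathcal{H}}(\hat{\xi} a) \qquad (8)$$
for all $a \in \mathcal{A} = \mathcal{B}(\mathcal{H})$. Therefore, the space $\mathcal{P}$ of n.p.l.f.s on $\mathcal{A}$ is identified with the space of positive trace-class linear operators $\hat{\rho}$ on $\mathcal{H}$, and the space $\mathcal{S}$ of normal states on $\mathcal{A}$ is identified with the convex subset of $\mathcal{P}$ singled out by the constraint $\mathrm{Tr}_{\mathcal{H}}(\hat{\rho}) = 1$.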
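On the other hand, in the classical case $\mathcal{A} = L^\infty(\mathcal{X}, \mu)$, the W*-algebra $\mathcal{A}$ is the dual space of $L^1(\mathcal{X}, \mu)$, so that we may identify a normal linear functional $\xi \in \mathcal{A}_*$ with the complex measure $\mu_\xi$ given by
$$\mu_\xi(A) = \int_A f_\xi \, \mathrm{d}\mu \qquad (9)$$
for every measurable subset $A \in \Sigma$, with $f_\xi \in L^1(\mathcal{X}, \mu)$. Accordingly, the space $\mathcal{P}$ of n.p.l.f.s on $\mathcal{A} = L^\infty(\mathcal{X}, \mu)$ is identified with the space of non-negative $f \in L^1(\mathcal{X}, \mu)$, and the space $\mathcal{S}$ of normal states with the space of probability density functions in $L^1(\mathcal{X}, \mu)$.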
From the practical point of view, it turns out that most of the parametric models used in classical and quantum information geometry consist of n.p.l.f.s when framed in the context of W*-algebras as we do in this work. This circumstance may be partly attributed to the nice mathematical properties of n.p.l.f.s. Be that as it may, motivated by the above-mentioned empirical usefulness of n.p.l.f.s in classical and quantum information geometry, we decided to limit our analysis to this kind of positive linear functionals and to leave a more general analysis for future works.
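Given an arbitrary n.p.l.f. $\omega \in \mathcal{P}$, there exists a unique non-zero projection $p \in \mathcal{A}$ (called the support projection of $\omega$) such that
$$\omega(a) = \omega(ap) = \omega(pa) = \omega(pap) \qquad (10)$$
for all $a \in \mathcal{A}$, and such that $\omega$ is faithful when restricted to $\mathcal{A}_{pp} = p\mathcal{A}p$ [68, lem. III.3.6, p. 134]. Given the support projection $p$ of $\omega$, we have the topological direct sum decomposition
$$\mathcal{A} = \mathcal{A}_{pp} \oplus \mathcal{A}_{pq} \oplus \mathcal{A}_{qp} \oplus \mathcal{A}_{qq}, \qquad (11)$$
where the topology we are referring to is the one induced by the C*-norm, where $q = \mathbb{I} - p$, and where we have set $\mathcal{A}_{pp} = p\mathcal{A}p$, $\mathcal{A}_{pq} = p\mathcal{A}q$, $\mathcal{A}_{qp} = q\mathcal{A}p$, and $\mathcal{A}_{qq} = q\mathcal{A}q$. We often refer to $\mathcal{A}_{pp}$ as the support algebra of $\omega$. Note that $\mathcal{A}_{pp}$ and $\mathcal{A}_{qq}$ are C*-algebras and that the involution $\dagger$ gives a Banach space isomorphism between $\mathcal{A}_{qp}$ and $\mathcal{A}_{pq}$. Obviously, if $\omega$ is faithful, then $p = \mathbb{I}$, so that $\mathcal{A}_{pp} = \mathcal{A}$ and all other summands in the right-hand side of equation (11) vanish.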
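The role of the support projection can be illustrated in the simplest quantum case $\mathcal{A} = \mathcal{B}(\mathbb{C}^2)$. The sketch below is only an illustration under stated assumptions: the functional is $\omega(a) = \mathrm{Tr}(\hat{\rho} a)$ with a hypothetical, non-normalized, rank-one $\hat{\rho}$, and the helper names are made up for the example. It checks the four equal values of equation (10) and exhibits an element $n$ with $np = 0$ on which $\omega(n^\dagger n)$ vanishes.

```python
# Minimal sketch: 2x2 complex matrices as nested lists (standard library only).

def mul(x, y):
    """Matrix product of two 2x2 matrices."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(x):
    """Conjugate transpose."""
    return [[complex(x[j][i]).conjugate() for j in range(2)] for i in range(2)]

def trace(x):
    return x[0][0] + x[1][1]

# Hypothetical rank-one, non-normalized density-like operator (an assumption).
rho = [[2, 0], [0, 0]]
# Its support projection p; q = I - p projects onto the kernel of rho.
p = [[1, 0], [0, 0]]

def omega(a):
    """The functional omega(a) = Tr(rho a)."""
    return trace(mul(rho, a))

# An arbitrary element of the algebra.
a = [[1 + 2j, 3], [4j, -5]]
values = [omega(a),
          omega(mul(a, p)),
          omega(mul(p, a)),
          omega(mul(p, mul(a, p)))]

# An element n with n p = 0: omega(n^dagger n) vanishes although n != 0,
# so omega is not faithful on the whole algebra.
n = [[0, 7], [0, 1 - 1j]]
gelfand_value = omega(mul(dagger(n), n))
```

Here $\omega$ only "sees" the corner $p\mathcal{A}p$, which in this example is one-dimensional, and it is faithful there because $\omega(p) = 2 > 0$.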
Remark 1. The decomposition in equation (11) allows us to visualize the algebraic operations in $\mathcal{A}$ in terms of matrix operations. Specifically, given an arbitrary $a \in \mathcal{A}$, if we write
$$a = \begin{pmatrix} a_{pp} & a_{pq} \\ a_{qp} & a_{qq} \end{pmatrix}, \qquad (12)$$
with $a_{pp} = pap$, $a_{pq} = paq$, $a_{qp} = qap$, and $a_{qq} = qaq$, it is a matter of direct inspection to check that $a + b$ and $ab$ can be computed using matrix-like operations. It then follows that the so-called Gel'fand ideal $N_\omega$ of $\omega$ defined by
$$N_\omega := \{ a \in \mathcal{A} : \omega(a^\dagger a) = 0 \} \qquad (13)$$
can be written as
$$N_\omega = \mathcal{A}_{pq} \oplus \mathcal{A}_{qq}. \qquad (14)$$
The Gel'fand ideal is instrumental to the construction of the so-called Gel'fand-Naimark-Segal (GNS) representation of $\mathcal{A}$ associated with $\omega$ [13, Sec.

Clearly, every identifiable model is locally identifiable, while the converse is not true. Indeed, if $(M, j, \mathcal{A})$ is locally identifiable, for every point $m \in M$, the best we can do is find an open neighbourhood $U$ of $m$ such that the model is identifiable when restricted to $U$.
If $\mathcal{P}$ were a Banach manifold, it would clearly admit a tangent space at each point, and the tangent map of the immersion map $j$ associated with a parametric model $(M, j, \mathcal{A})$ would send a tangent vector at $m \in M$ into a tangent vector lying in the tangent space of $\mathcal{P}$ at $j(m) \in \mathcal{P}$. Since $\mathcal{P}$ is not a Banach manifold (not even in the finite-dimensional case), we can no longer use the tangent space of $\mathcal{P}$. However, since $\mathcal{P}$ is a subset of the Banach space $\mathcal{A}^*_{sa}$, there is a kind of "close relative" of the tangent space we can exploit: the tangent double cone of $\mathcal{P}$.

Definition 3 (Tangent Double Cone). Let $B$ be a real Banach space, $X$ a subset of $B$, and let $\xi \in X$. Then, $\eta \in B$ is said to be in the tangent double cone $T_\xi X$ of $X$ at $\xi$ if there exist $\epsilon > 0$ and a smooth curve $\gamma : (-\epsilon, \epsilon) \to B$ such that
$$\gamma(t) \in X \ \ \forall\, t \in (-\epsilon, \epsilon), \qquad \gamma(0) = \xi, \qquad \dot{\gamma}(0) = \eta. \qquad (15)$$
That is, if there exists a curve in $X$ through $\xi$ that has $\eta$ as derivative at the point $\xi$.
Remark 3. The tangent double cone reduces to the ordinary tangent space whenever the subset X considered in Definition 3 is actually a submanifold of B.
With respect to the notion of parametric model introduced in definition 2, the relevance of the tangent double cone $T_\omega\mathcal{P}$ at every n.p.l.f. $\omega$ stems from the following proposition, whose proof follows directly from the definition of tangent vector and tangent map at a point.

Proposition 1. Let $(M, j, \mathcal{A})$ be a smooth parametric model of n.p.l.f.s according to definition 2. Then, $T_m j(T_m M)$ lies in the tangent double cone $T_{j(m)}\mathcal{P}$ for all $m \in M$.
Motivated by proposition 1, we investigate the structure of $T_\omega\mathcal{P}$. A first interesting fact about $T_\omega\mathcal{P}$ is that it is actually a vector space containing only normal linear functionals.

Proposition 2. Let $B$ be a real Banach space, let $X \subseteq B$, let $\xi \in X$, and let $T_\xi X$ be the tangent double cone of $X$ at $\xi$. Then, if $X$ is a convex cone, $T_\xi X$ is a vector space. On the other hand, if there is a Banach subspace $V$ of $B$ such that $X \subseteq V$, then $T_\xi X \subseteq V$. In particular, the tangent double cone $T_\omega\mathcal{P}$ is a vector space sitting inside $(\mathcal{A}_{sa})_*$ for every $\omega \in \mathcal{P}$.
Proof. Suppose $X$ is a convex cone. Let $\eta, \zeta$ be in $T_\xi X$, and let $\gamma$ and $\sigma$ be smooth curves in $X$ starting at $\xi$ and having $\eta$ and $\zeta$ as tangent vectors at $t = 0$, respectively. Let $I_\gamma = (-\epsilon_\gamma, \epsilon_\gamma)$ be the domain of $\gamma$. Let $a \in \mathbb{R}$. If $a = 0$, then we define $I_{a\gamma} = (-\epsilon_\gamma, \epsilon_\gamma)$ as the domain of the constant curve $\gamma_a(t) = \gamma(0)$ starting at $\xi$ and having $a\eta = 0$ as the tangent vector at $t = 0$. If $a > 0$, then we define $I_{a\gamma} = (-\epsilon_\gamma/a, \epsilon_\gamma/a)$ as the domain of the curve $\gamma_a(t) = \gamma(at)$ starting at $\xi$ and having $a\eta$ as the tangent vector at $t = 0$. If $a < 0$, then we define $I_{a\gamma} = (\epsilon_\gamma/a, -\epsilon_\gamma/a)$ as the domain of the curve $\gamma_a(t) = \gamma(at)$ starting at $\xi$ and having $a\eta$ as the tangent vector at $t = 0$. Similar considerations apply for $I_\sigma$. Then, take $a, b \in \mathbb{R}$ with $a, b > 0$, $0 < \alpha < 1$, $\beta = 1 - \alpha$, and $I_\mu = (-\epsilon_\mu, \epsilon_\mu)$ with $\epsilon_\mu = \min(\alpha\epsilon_\gamma/a, \beta\epsilon_\sigma/b)$, and define the curve $\mu : I_\mu \to B$ setting $\mu(t) := \alpha\gamma(at/\alpha) + \beta\sigma(bt/\beta)$. Since $X$ is a convex cone, $\mu(t)$ lies in $X$ for every $t \in I_\mu$, and it is a matter of direct inspection to check that $\mu$ is a smooth curve starting at $\xi$ and having $a\eta + b\zeta$ as tangent vector at $t = 0$. All other cases in which $a$ and $b$ are either positive, negative or $0$ are handled similarly. It then follows that $T_\xi X$ is a vector space as claimed.

Now, assume there is a Banach subspace $V$ of $B$ such that $X \subseteq V$. Let $\eta$ be in $T_\xi X$; this means that there exists a curve $\gamma(t)$ in $X$ such that $\gamma(0) = \xi$ and $\dot{\gamma}(0) = \eta$, meaning that
$$\eta = \lim_{\epsilon \to 0} \frac{\gamma(\epsilon) - \gamma(0)}{\epsilon}. \qquad (16)$$
Since $X \subseteq V$ and $V$ is a vector space, it is clear that $\gamma(\epsilon)$, $\gamma(0)$, and $\gamma(\epsilon) - \gamma(0)$ are in $V$. Therefore, since the limit in equation (16) is taken in the norm topology of $B$, and this topology coincides with the norm topology on $V$ because the latter is a Banach (hence closed) subspace of the former, we conclude that $\eta \in V$ as claimed. Finally, since $\mathcal{P}$ is a convex cone inside $(\mathcal{A}_{sa})_* \subseteq \mathcal{A}^*_{sa}$ and $(\mathcal{A}_{sa})_*$ is a Banach subspace of $\mathcal{A}^*_{sa}$ [65, p. 29], we conclude that $T_\omega\mathcal{P}$ is a vector space sitting inside $(\mathcal{A}_{sa})_*$ for every $\omega \in \mathcal{P}$.

Remark 4. In the classical case $\mathcal{A} = L^\infty(\mathcal{X}, \mu)$, the tangent double cone at each n.p.l.f. $\omega$ is closed and coincides with the space of signed measures absolutely continuous with respect to $\mu_\omega$ [8, thm. 3.1, p. 42]. Therefore, the tangent double cone is isomorphic to the Banach space $L^1(\mathcal{X}, \mu_\omega)$.
Quite interestingly, it turns out there is a suitable notion of absolute continuity in the context of bounded linear functionals on a C*-algebra $\mathcal{A}$ [57] generalizing the measure-theoretic notion of absolute continuity to the C*-algebraic case, and such that, when $\mathcal{A}$ is a W*-algebra, the norm-closure $\overline{T_\omega\mathcal{P}}$ of the tangent double cone coincides precisely with the space of bounded linear functionals which are absolutely continuous with respect to the n.p.l.f. $\omega$, in perfect analogy with what happens in the classical case recalled in remark 4.

Definition 4 (Absolute continuity).
Let $\omega$ be a positive functional on the C*-algebra $\mathcal{A}$, let $\eta \in (\mathcal{A}_{sa})^*$, and let $B_1$ be the unit ball in $\mathcal{A}$. Then $\eta$ is said to be absolutely continuous with respect to $\omega$ if one of the following equivalent statements is true:
• For every sequence $\{a_n\}_{n \in \mathbb{N}}$ with $a_n \in \mathcal{A}_+ \cap B_1$, it holds $\lim_{n \to \infty} \omega(a_n) = 0 \implies \lim_{n \to \infty} \eta(a_n) = 0$.
We will denote the set of all self-adjoint linear functionals that are absolutely continuous with respect to a positive linear functional $\omega$ by $AC_\omega$. We now list some useful properties of $AC_\omega$:
1. for every C*-algebra $\mathcal{A}$ and every positive linear functional $\omega$ on $\mathcal{A}$, the set $AC_\omega$ is a Banach subspace of $(\mathcal{A}_{sa})^*$ which is the norm closure of
$$J_\omega := \{ \eta_a \in (\mathcal{A}_{sa})^* : a \in \mathcal{A}_{sa}, \ \eta_a(b) = \omega(\{a, b\}) \ \ \forall\, b \in \mathcal{A}_{sa} \}; \qquad (17)$$
5. in the classical case $\mathcal{A} = L^\infty(\mathcal{X}, \mu)$, projections in $\mathcal{A}$ are in one-to-one correspondence with characteristic functions on measurable subsets of $\mathcal{X}$ with finite $\mu$-measure, so that the result recalled in point 3 above is equivalent to $\mu_\xi$ being absolutely continuous with respect to $\mu_\omega$ in the measure-theoretic sense.
The set $J_\omega$ in equation (17) turns out to be particularly useful for our investigation, and we write a generic element in $J_\omega$ as $\eta_a$ with $a \in \mathcal{A}_{sa}$ to emphasize the intimate connection between elements in $J_\omega$ and elements in $\mathcal{A}_{sa}$ given by
$$\eta_a(b) = \omega(\{a, b\}) \qquad (18)$$
for all $b \in \mathcal{A}_{sa}$.
Proposition 3. The norm-closure $\overline{T_\omega\mathcal{P}}$ of the tangent double cone $T_\omega\mathcal{P}$ at $\omega \in \mathcal{P}$ coincides with $AC_\omega$.
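Proof. First, we prove that $AC_\omega \subseteq \overline{T_\omega\mathcal{P}}$. Given $a \in \mathcal{A}_{sa}$, consider the smooth curve $\omega_t$ in $(\mathcal{A}_{sa})_*$ defined in equation (19). Clearly, $\omega_0 = \omega$ and $\omega_t \in \mathcal{P}$ for all $t \in \mathbb{R}$, so that the tangent vector at $t = 0$ is in $T_\omega\mathcal{P}$ by definition of the tangent double cone. A quick computation shows that the tangent vector at $t = 0$ reads
$$\dot{\omega}_t(b)\big|_{t=0} = \omega(\{a, b\}) = \eta_a(b), \qquad (20)$$
with $\eta_a \in J_\omega$ because of equation (18), and it follows that $J_\omega \subseteq T_\omega\mathcal{P}$, so that $AC_\omega = \overline{J_\omega} \subseteq \overline{T_\omega\mathcal{P}}$ (recall point 1).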
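The first half of the proof can be probed numerically in $\mathcal{B}(\mathbb{C}^2)$. The sketch below does not reproduce the curve of equation (19); it assumes, purely for illustration, the curve $\omega_t(b) = \omega(c_t\, b\, c_t)$ with $c_t = \mathbb{I} + \tfrac{t}{2}a$, which starts at $\omega$ and stays positive because $c_t b c_t \geq 0$ whenever $b \geq 0$. The Jordan product is taken as $\{a, b\} = \tfrac{1}{2}(ab + ba)$, and the matrices and helper names are hypothetical. A central finite difference is compared with $\eta_a(b) = \omega(\{a, b\})$.

```python
# Minimal sketch: 2x2 complex matrices as nested lists (standard library only).

def mul(x, y):
    """Matrix product of two 2x2 matrices."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def lin(x, y, s=1.0):
    """Entrywise x + s * y."""
    return [[x[i][j] + s * y[i][j] for j in range(2)] for i in range(2)]

def scale(c, x):
    """Entrywise c * x."""
    return [[c * x[i][j] for j in range(2)] for i in range(2)]

def jordan(x, y):
    """Jordan product {x, y} = (xy + yx) / 2 (assumed normalization)."""
    return scale(0.5, lin(mul(x, y), mul(y, x)))

IDENTITY = [[1.0, 0.0], [0.0, 1.0]]

# Hypothetical positive-definite matrix representing omega(x) = Tr(rho x).
rho = [[0.7, 0.1 - 0.2j], [0.1 + 0.2j, 0.3]]
# Illustrative self-adjoint elements.
a = [[1, 1j], [-1j, 2]]
b = [[0.5, 2], [2, -1]]

def omega(x):
    """omega(x) = Tr(rho x); real whenever x is self-adjoint."""
    m = mul(rho, x)
    return (m[0][0] + m[1][1]).real

def omega_t(t, x):
    """Assumed curve: omega_t(x) = omega(c_t x c_t), c_t = I + (t/2) a."""
    c_t = lin(IDENTITY, scale(t / 2.0, a))
    return omega(mul(c_t, mul(x, c_t)))

h = 1e-3
# Central finite difference of t -> omega_t(b) at t = 0.
finite_difference = (omega_t(h, b) - omega_t(-h, b)) / (2.0 * h)
# Value of the functional eta_a on b.
eta_a_b = omega(jordan(a, b))
```

Since $\omega_t(b)$ is a quadratic polynomial in $t$ for this particular curve, the central difference agrees with $\omega(\{a,b\})$ up to rounding errors; other curves with the same first-order behaviour would give the same tangent vector.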
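We now prove that $\overline{T_\omega\mathcal{P}} \subseteq AC_\omega$. Indeed, suppose $\xi \in T_\omega\mathcal{P}$ and let $\gamma : (-\epsilon, \epsilon) \to (\mathcal{A}_{sa})_*$ be a smooth curve in $\mathcal{P}$ starting at $\omega$ with $\xi$ as tangent vector at $t = 0$. Then, for every projection $p \in \mathcal{A}$ such that $\omega(p) = 0$, the function $f(t) = \omega_t(p) = (\gamma(t))(p)$ is a real-valued smooth function which is non-negative and vanishes when $t = 0$, so that it attains a minimum at $t = 0$ and its derivative $\dot{f}(0)$ vanishes. Direct inspection shows that
$$\dot{f}(0) = \xi(p) = 0, \qquad (21)$$
so that $\xi \in T_\omega\mathcal{P}$ implies $\xi \in AC_\omega$ because of point 3, and thus, $AC_\omega$ being norm-closed (point 1), $\overline{T_\omega\mathcal{P}} \subseteq AC_\omega$.

Proposition 4. Given a n.p.l.f. $\omega$, the Banach space dual of $AC_\omega = \overline{J_\omega}$ can be identified with
$$\mathcal{A}^\omega_{sa} := (\mathcal{A}_{pp} \oplus \mathcal{A}_{pq} \oplus \mathcal{A}_{qp})_{sa}, \qquad (22)$$
where the summands are those of equation (11) for the support projection $p$ of $\omega$.

Proof. Since $AC_\omega$ is a closed subspace of $(\mathcal{A}_{sa})_*$, whose dual is $\mathcal{A}_{sa}$, its dual is the quotient $\mathcal{A}_{sa}/\mathrm{Ann}(AC_\omega)$, where $\mathrm{Ann}(AC_\omega) := \{ a \in \mathcal{A}_{sa} : \eta(a) = 0 \ \forall\, \eta \in AC_\omega \}$ is the annihilator of $AC_\omega$. If $a \in \mathrm{Ann}(AC_\omega)$, then $0 = \eta_a(a) = \omega(a^2)$, which means that $a$ lies in the Gel'fand ideal of $\omega$ (cf. equation (13)). Therefore, since $\omega$ is a n.p.l.f. and $a$ is self-adjoint, equation (14) immediately implies that $\mathrm{Ann}(AC_\omega) = (\mathcal{A}_{qq})_{sa}$, so that $(AC_\omega)^* \cong \mathcal{A}_{sa}/(\mathcal{A}_{qq})_{sa} \cong (\mathcal{A}_{pp} \oplus \mathcal{A}_{pq} \oplus \mathcal{A}_{qp})_{sa} = \mathcal{A}^\omega_{sa}$ as claimed.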

J-regular parametric models
Now, we want to investigate the possibility of introducing additional mathematical structures on a given smooth parametric model of n.p.l.f.s (normal states). Of course, since the parametric models we refer to are defined in the context of W*-algebras, it is only reasonable to look for these additional structures in terms of the algebraic structures of W*-algebras.
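In particular, we will exploit the Jordan product among self-adjoint elements associated with the anti-commutator as in equation (1) in order to induce a (possibly degenerate) Riemannian metric tensor on suitable parametric models. Consider a n.p.l.f. $\omega \in \mathcal{P}$ and define the map $G_\omega : J_\omega \times J_\omega \to \mathbb{R}$ by
$$G_\omega(\eta_a, \eta_b) := \omega(\{a, b\}), \qquad (23)$$
where $\eta_a, \eta_b$ are as in equation (18). Since $\omega(\{a, b\}) = \eta_a(b) = \eta_b(a)$, the value of $G_\omega$ depends only on the functionals $\eta_a$ and $\eta_b$. Note that $G_\omega$ is clearly bilinear, and it is symmetric because the Jordan product $\{\cdot, \cdot\}$ is symmetric. Moreover, recalling equation (18) and equation (13), it follows that $\eta_a$ such that
$$G_\omega(\eta_a, \eta_a) = \omega(a^2) = 0 \qquad (24)$$
implies that $a$ is in the Gel'fand ideal of $\omega$. Therefore, since $a$ is self-adjoint, equation (14) implies that $a \in \mathcal{A}_{qq}$, which means that $\eta_a(b) = \omega(\{a, b\}) = 0$ for all $b \in \mathcal{A}$ because of equation (10) and equation (11). We thus conclude that $\eta_a = 0$, so that $G_\omega$ is positive definite on $J_\omega$ for every n.p.l.f. $\omega$, and $J_\omega$ endowed with $G_\omega$ becomes a real pre-Hilbert space.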
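In the Abelian, finite-dimensional case the inner product built from the Jordan product can be compared directly with the Fisher-Rao expression. The sketch below is an illustration under stated assumptions rather than the construction of the paper: the algebra is the algebra of functions on a two-point set, the Jordan product is the pointwise product, the inner product is taken to be $G_\omega(\eta_a, \eta_b) = \omega(\{a, b\})$, and the hypothetical model is $p(\theta) = (\theta, 1 - \theta)$. The functional $\eta_a$ has density $p_i a_i$, so the tangent vector $\partial_\theta p$ corresponds to the score $a_i = \partial_\theta p_i / p_i$.

```python
# Minimal sketch of the classical case on a two-point outcome space.

def omega(p, f):
    """Functional omega(f) = sum_i p_i f_i determined by the density p."""
    return sum(p_i * f_i for p_i, f_i in zip(p, f))

def jordan(a, b):
    """In an Abelian algebra the Jordan product is the pointwise product."""
    return [a_i * b_i for a_i, b_i in zip(a, b)]

theta = 0.25
p = [theta, 1.0 - theta]      # hypothetical model p(theta) = (theta, 1 - theta)
dp = [1.0, -1.0]              # tangent vector: derivative of p w.r.t. theta

# eta_a has density p_i * a_i, so eta_a = dp exactly when a is the score.
a = [dp_i / p_i for dp_i, p_i in zip(dp, p)]

# Inner product induced by the Jordan product (assumed form).
g_jordan = omega(p, jordan(a, a))
# Usual Fisher-Rao expression sum_i (dp_i)^2 / p_i.
g_fisher_rao = sum(dp_i * dp_i / p_i for dp_i, p_i in zip(dp, p))
# Closed form of the Fisher information for this model.
closed_form = 1.0 / (theta * (1.0 - theta))
```

For this model all three numbers coincide, which is the finite-dimensional shadow of the statement that the Jordan-product metric reduces to the Fisher-Rao metric tensor in the Abelian case.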
In general, $J_\omega$ is closed neither with respect to the norm topology inherited from $(\mathcal{A}_{sa})_*$ nor with respect to the topology determined by the inner product given by equation (23). According to point 1, the norm closure of $J_\omega$ is the space $AC_\omega$ of self-adjoint normal linear functionals which are absolutely continuous with respect to $\omega$. It turns out that the closure of $J_\omega$ with respect to the topology determined by the inner product given by equation (23) (which is a real Hilbert space) can be identified with a subset of $AC_\omega$.

Definition 5. The closure of $J_\omega$ with respect to the topology determined by the inner product $G_\omega$ (cf. equation (23)) is denoted by $\mathbf{J}_\omega$ and it is referred to as the J-Hilbert space of $\omega$.

Proposition 5. Given a n.p.l.f. $\omega$ and the real Hilbert space $\mathbf{J}_\omega$ of definition 5, it holds $\mathbf{J}_\omega \subseteq AC_\omega$ as sets. Moreover, the dual space $(AC_\omega)^* \cong \mathcal{A}^\omega_{sa}$ (cf. proposition 4) can be identified with a dense subset of the dual of $\mathbf{J}_\omega$.
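Proof. Given $\eta$ in the J-Hilbert space $\mathbf{J}_\omega$, we define a linear function $l_\eta : \mathcal{A}_{sa} \to \mathbb{R}$ by
$$l_\eta(a) := G_\omega(\eta, \eta_a) \qquad (25)$$
for every $a \in \mathcal{A}_{sa}$. Then, exploiting the Cauchy-Schwarz inequality, it follows that
$$|l_\eta(a)| \leq \sqrt{G_\omega(\eta, \eta)}\,\sqrt{G_\omega(\eta_a, \eta_a)} = \sqrt{G_\omega(\eta, \eta)}\,\sqrt{\omega(a^2)} \leq \sqrt{\|\omega\|\, G_\omega(\eta, \eta)}\; \|a\|, \qquad (26)$$
which means that $l_\eta$ is a bounded linear map and thus defines an element in $\mathcal{A}^*_{sa}$. With an evident abuse of notation, the element in $\mathcal{A}^*_{sa}$ determined by $l_\eta$ is denoted by $\eta$. Next, it holds
$$\|\eta\| = \sup_{\|a\| \leq 1} |l_\eta(a)| \leq \sqrt{\|\omega\|}\,\sqrt{G_\omega(\eta, \eta)}, \qquad (27)$$
which means that the norm determined by the inner product on $J_\omega$ is stronger than the norm on $J_\omega$ inherited from $(\mathcal{A}_{sa})_*$. It thus follows that $\mathbf{J}_\omega$ is a subset of $(\mathcal{A}_{sa})_*$. Then, following the reasoning below equation (24), if $P$ is any projection such that $\omega(P) = 0$, then $\eta_P = 0$ and thus $\eta(P) = l_\eta(P) = 0$, which means that $\eta \in AC_\omega$ because of point 3.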
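Since the norm determined by the inner product on $J_\omega$ is stronger than the norm on $J_\omega$ inherited from $(\mathcal{A}_{sa})_*$, the identification map $i : J_\omega \to AC_\omega$ can be extended to a bounded linear map $i : \mathbf{J}_\omega \to AC_\omega$, defined on the J-Hilbert space $\mathbf{J}_\omega$ of definition 5, whose image is clearly dense. Therefore, the dual map $i^* : (AC_\omega)^* \cong \mathcal{A}^\omega_{sa} \to \mathbf{J}_\omega^*$ is a bounded linear map acting as
$$(i^*(a))(\eta) = \eta(a) = G_\omega(\eta, \eta_a) \qquad (28)$$
for all $a \in \mathcal{A}^\omega_{sa}$ and $\eta \in \mathbf{J}_\omega$. Given $a, b \in \mathcal{A}^\omega_{sa}$, the equality $i^*(a) = i^*(b)$ is then equivalent to $\eta_{a-b} = 0$, that is, to $\omega((a - b)^2) = 0$, and, in turn, this is equivalent to $(a - b)$ lying in the Gel'fand ideal of $\omega$ (cf. equation (13)). Therefore, since $\omega$ is a n.p.l.f. and $(a - b)$ is self-adjoint, equation (14) immediately implies that $(a - b) \in (\mathcal{A}_{qq})_{sa}$, so that $(a - b) = 0$ because $a, b \in \mathcal{A}^\omega_{sa}$, and thus $i^*$ is injective. Recalling that $\mathbf{J}_\omega$ is isomorphic to its dual because it is a Hilbert space, equation (28) also implies that $i^*$ has a dense range, so that $(AC_\omega)^* \cong \mathcal{A}^\omega_{sa}$ can be identified with a dense subset of the dual of $\mathbf{J}_\omega$ as claimed.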
Once we have the inner product $G_\omega$ on $\mathbf{J}_\omega \subseteq AC_\omega$ for every n.p.l.f. $\omega$, we are ready to refine the notion of parametric model introduced in definition 2 by requiring additional regularity conditions that allow the definition of a (possibly degenerate) Riemannian metric tensor on the manifold of the model capturing the essence of the inner product $G_\omega$ for each $\omega$ in the model.
with X, Y smooth vector fields on M and where the right-hand side is the inner product in J j(m) (cf. definition 5), defines a smooth tensor field on M. Obviously, a J-regular parametric model (M, j, A ) is such that the image T m j(T m M ) of the tangent space at m ∈ M through the tangent map of j is a subset of the real Hilbert space J j(m) (cf. definition 5) for all m ∈ M.
We close this section by showing that a monotonicity property holds for G ω under norm-continuous, ultra-weakly continuous, completely-positive, unital maps.
Proposition 6. Let Φ : A → B be a norm-continuous, ultra-weakly continuous, completely-positive, unital map. Let ρ ∈ P(A ) be such that ρ = Φ * (ω) with ω ∈ P(B) and Φ * : B * → A * the dual map of Φ. For every η ∈ J ω it holds Φ * (η) ∈ J ρ and
G ρ (Φ * (η), Φ * (η)) ≤ G ω (η, η). (30)
Equation (30) is referred to as the monotonicity property of G with respect to (norm-continuous and ultra-weakly continuous) completely-positive, unital maps.
Proof. First of all, we note that the dual map Φ * sends positive linear functionals into positive linear functionals because Φ is positive. Moreover, since Φ is also ultra-weakly continuous, there is a bounded linear map ϕ : B(B) → B(A ) between the predual of B and the predual of A of which Φ is the dual map. Therefore, Φ * sends normal linear functionals into normal linear functionals. In general, if ρ = Φ * (ω), the map Φ does not send A ρ sa into B ω sa (cf. proposition 4). However, denoting with P ω the projection onto B ω sa (which exists because B ω sa is complemented in B sa since ω is a n.p.l.f.), it is clear that the map a → Φ ω (a) := P ω (Φ(a)) (31) is a linear map between A ρ sa and B ω sa . What is interesting is that, when looking at A ρ sa as a dense subspace of J * ρ and at B ω sa as a dense subspace of J * ω according to proposition 5, it defines a bounded linear map between J * ρ and J * ω . Indeed, for every a ∈ A ρ sa , it holds Let us denote by Q ω the complement projection of P ω . Obviously, and ω vanishes on B qq (cf. discussion below equation (10)), we conclude that ω(ab) = ω(P ω (a)P ω (b)) so that Since Φ is completely positive, it follows that Φ(a)Φ(a) ≤ Φ(a 2 ) [16] and thus Equation (34) implies that Φ ω can be extended to a bounded linear map between J * ρ and J * ω as claimed. Moreover, equation (34) also implies that ||Φ ω || ≤ 1 so that Φ ω is a contraction. Now, consider the dual map Φ * ω : J ω → J ρ . Denoting with (|) • the pairing between J • and its dual, it holds for all a ∈ A ρ sa ⊆ J * ρ . According to proposition 5, every η ∈ J ω is also an element of AC ω , and proposition 4 implies that η(Φ(a)) = η(P ω (Φ(a))) + η(Q ω (Φ(a))) = η(P ω (Φ(a))). Accordingly, it follows that which means that Φ * ω coincides with Φ * on J ω ⊆ AC ω . Finally, recalling that Φ ω is a contraction and that the norm of Φ * ω coincides with that of Φ ω , it immediately follows that as claimed.
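The monotonicity property can be checked numerically in the simplest commutative setting. The following is a minimal sketch, assuming A and B are the algebras of functions on finite sets with n and m points, Φ(a) = K a for a row-stochastic matrix K (so Φ is positive and unital), and the squared norm of a tangent vector η at a strictly positive ω is Σ η_j² / ω_j. The names `fisher_norm_sq`, `pullback`, and `K` are illustrative and do not come from the text.

```python
import numpy as np

def fisher_norm_sq(eta, omega):
    """Squared norm sum_j eta_j^2 / omega_j of eta at a strictly positive omega."""
    return float(np.sum(eta**2 / omega))

def pullback(K, xi):
    """Dual map Phi^*: (Phi^* xi)(a) = xi(K a), hence Phi^* xi = K^T xi."""
    return K.T @ xi

rng = np.random.default_rng(0)
n, m = 3, 5                          # A = functions on n points, B = functions on m points
K = rng.random((m, n))
K /= K.sum(axis=1, keepdims=True)    # rows sum to 1: Phi(1) = 1, and Phi is positive

omega = rng.random(m) + 0.1          # strictly positive functional on B
eta = rng.normal(size=m)             # tangent vector at omega
rho = pullback(K, omega)             # rho = Phi^*(omega), a positive functional on A

lhs = fisher_norm_sq(pullback(K, eta), rho)
rhs = fisher_norm_sq(eta, omega)
print(lhs <= rhs)
```

In this commutative case the inequality follows from the Cauchy-Schwarz inequality applied column by column to K, which mirrors the contraction argument used in the proof.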
Remark 5. When Φ is an ultra-weakly continuous automorphism of A , then it is also an automorphism of the Jordan product.Therefore, recalling that ρ = Φ * (ω) and that Φ * ω coincides with Φ * on J ω ⊆ AC ω (cf.proposition 6), a direct computation shows that for all a, b ∈ A sa .It then follows that which reflects the invariance under ultra-weakly continuous automorphisms of the inner products G ω with ω ∈ P.

Examples
In this section we will investigate some meaningful examples that we believe clearly show the unifying perspective of our formalism when dealing with classical and quantum information geometry.

The finite-dimensional case
We briefly discuss the peculiarities of the finite-dimensional case, referring to [20,21,22] for the proofs of all the results mentioned. When A is finite-dimensional, there is a sort of "preferred" family of Jordan regular parametric models naturally emerging when we investigate the Jordan analogue of the Kostant-Kirillov-Souriau theory of coadjoint orbits of a Lie group [45,46,47,66]. Specifically, since A is finite-dimensional, all linear functionals are normal so that A * = A * and A * sa = (A sa ) * . Therefore, in complete analogy with the Lie-algebra case, the algebraic structure of Jordan algebra on A sa translates into a geometric structure on A * sa , namely, a (2, 0)-contravariant tensor R whose value at (a, b) The contravariant tensor R is evidently smooth (because linear in ξ) and symmetric because it is defined in terms of the symmetric Jordan product, and it is referred to, quite obviously, as the Jordan tensor on A * sa . The Jordan tensor R allows us to define a generalized distribution D = {D ξ } ξ∈A * sa on A * sa according to and, when instead of a generic ξ ∈ A * sa we consider a n.p.l.f. ω, equation (41) and equation (17) immediately imply that D ω = J ω = AC ω , where the last equality follows from the fact that J ω is closed when A is finite-dimensional. The distribution D = {D ξ } ξ∈A * sa is referred to as the canonical distribution generated by the tensor R.
According to [22], in analogy with the Kostant-Kirillov-Souriau theory of coadjoint orbits of a Lie group, it is possible to find maximal leaves of the canonical distribution on which the tensor R is invertible, and its inverse is a symmetric analogue of the Kostant-Kirillov-Souriau symplectic form on coadjoint orbits. However, the canonical distribution of R does not generate a foliation of A * sa as instead happens in the case of the symplectic foliation associated with the Poisson tensor on the dual of a Lie algebra in the Kostant-Kirillov-Souriau theory. The tangent space at each point of any of the maximal leaves of D = {D ξ } ξ∈A * sa mentioned before is spanned by tangent vectors of the form η a in equation (18), and the pointwise inverse of R is given by equation (23). The whole space P of n.p.l.f.s of A is decomposed into the disjoint union of such maximal leaves, and the inverse of R on any such leaf is a Riemannian metric tensor which coincides with the Fisher-Rao metric tensor on positive measures when A is commutative, and with the Bures-Helstrom metric tensor on faithful (non-normalized) quantum states when A = B(H) for some finite-dimensional Hilbert space H (a fact also noted in [20,21]).
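For a concrete feeling of the finite-dimensional picture, the following is a minimal numerical sketch for A = B(H) with dim(H) = 3. It assumes the conventions {a, b} = (ab + ba)/2 for the Jordan product, η a = {ρ, a} for the tangent vector associated with a at a faithful ρ, and G ρ (η a , η b ) = Tr(ρ {a, b}) for the inner product; these normalizations, as well as the helper names `jordan`, `sld`, and `metric`, are assumptions made for illustration.

```python
import numpy as np

def jordan(a, b):
    """Jordan product {a, b} = (ab + ba) / 2 (assumed normalization)."""
    return 0.5 * (a @ b + b @ a)

def sld(rho, X):
    """Solve {rho, a} = X for self-adjoint a, assuming rho is strictly positive."""
    p, U = np.linalg.eigh(rho)
    X_eig = U.conj().T @ X @ U                     # X in the eigenbasis of rho
    a_eig = 2.0 * X_eig / (p[:, None] + p[None, :])
    return U @ a_eig @ U.conj().T

def metric(rho, X, Y):
    """G_rho(X, Y) = Tr(rho {a_X, a_Y}) with {rho, a_X} = X and {rho, a_Y} = Y."""
    a, b = sld(rho, X), sld(rho, Y)
    return float(np.real(np.trace(rho @ jordan(a, b))))

rng = np.random.default_rng(1)

def random_hermitian(n):
    m = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return 0.5 * (m + m.conj().T)

n = 3
h = random_hermitian(n)
rho = h @ h.conj().T + 0.1 * np.eye(n)             # faithful, not normalized
X, Y = random_hermitian(n), random_hermitian(n)
g_xy, g_yx, g_xx = metric(rho, X, Y), metric(rho, Y, X), metric(rho, X, X)

# Commutative check: diagonal data should reproduce sum_i u_i v_i / p_i.
p = np.array([0.2, 0.3, 0.5])
u = np.array([1.0, -2.0, 0.5])
v = np.array([0.3, 0.1, -1.0])
g_classical = metric(np.diag(p), np.diag(u), np.diag(v))
fisher_rao = float(np.sum(u * v / p))
```

Under these assumptions the operator a solving {ρ, a} = X plays the role of a symmetric logarithmic derivative, and the diagonal case reduces to the Fisher-Rao expression on positive measures.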
The family of maximal leaves of D = {D ξ } ξ∈A * sa providing a decomposition of P is precisely the "preferred" family of Jordan regular parametric models alluded to in the beginning of this section. It is also worth mentioning that, by suitably adapting the analysis carried out in [24], it could be possible to show that this preferred family provides a Whitney stratification of P.
In the infinite-dimensional case, quite unsurprisingly, things are more complicated and this picture breaks down. That is why, in this work, we proposed a change of perspective to tackle the infinite-dimensional case. Specifically, instead of looking to generalize the foliation picture described before, we decided to simply look at the maximal leaves of n.p.l.f.s of the canonical distribution of R in finite dimensions as parametric models in the sense of definition 2 endowed with a Riemannian metric tensor determined, essentially, by the Jordan product in A sa through the inverse of the tensor R. Then, this reformulation of the finite-dimensional case led us to the notion of Jordan regular parametric model given in definition 6, which works also in the infinite-dimensional case.
Remark 6. Concerning the infinite-dimensional generalization of the foliation picture, a relevant thing to note is that, by focusing on (A sa ) * , the algebraic structure of Jordan algebra of A sa determines a (2, 0)-contravariant tensor R on (A sa ) * again by equation (40) essentially because T * ξ (A sa ) * ∼ = A sa . Moreover, the map : is such that the image of T * ξ (A sa ) * is actually contained in (A sa ) * ⊆ A * sa ∼ = T * * ξ (A sa ) * . Therefore, R satisfies a condition analogous to the one exploited in [10] to define a Banach Lie-Poisson structure on A sa in terms of the antisymmetric tensor built out of the Lie algebra structure on A sa associated with the (scaled) commutator product in A . However, a rigorous analysis of the geometry of the distribution generated by R would reasonably require particular care for technical details because, among other things, the vector space D ξ is in general not closed. We believe it would be very interesting to try to understand how far we can push the finite-dimensional theory developed in [22] into the infinite-dimensional realm to see how many "preferred" Jordan regular parametric models are obtained. We hope we will be able to investigate this issue in the future, and we also hope our current work will inspire someone else to investigate these matters.

Classical statistical models
The aim of this subsection is to briefly check that the formalism developed in this work is a genuine generalization of the formalism of classical information geometry as rigorously formalized in [7,8,9]. Let A = L ∞ (X , µ) be the Abelian W*-algebra recalled in section 2. Recall that normal linear functionals are identified with complex measures through equation (9), and n.p.l.f.s can be identified with the space of non-negative functions in L 1 (X , µ).
Given a n.p.l.f.ω on A , the tangent double cone T ω P is closed and it can be identified with L 1 (X , µ ω ) (cf. remark 4), that is, every ξ ∈ T ω P acts as with which is nothing but the logarithmic derivative of p at m in the direction v m as defined in [8, defn. 3.6, p. 152].
On the other hand, it is clear that every characteristic function 1 A associated with the measurable subset A ⊆ X with µ ω (A) < ∞ is in A = L ∞ (X , µ) and determines an element Accordingly, the Hilbert space J ω introduced in definition 5 can be identified with the real Hilbert space L 2 R (X , µ ω ). Note that this is consistent with the well-known fact that L q (X , µ ω ) ⊆ L p (X , µ ω ) for all 1 ≤ p ≤ q ≤ ∞. It then follows that, if the parametric model (M, p, A ) is Jordan regular in the sense of definition 6, then (M, X , µ, p) is 2-integrable in the language of [8] and the tensor G defined as in equation (29) becomes which coincides with the Fisher-Rao metric tensor as given in [8, eqn. 3.41, p. 136].
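As an illustration of the classical case, the following minimal sketch evaluates the Fisher-Rao expression through the logarithmic derivative on a finite outcome space. It assumes a one-parameter binomial model on {0, ..., N} with counting reference measure, and the standard form Σ_x (∂ log p)² p of the metric; the names `binomial_model` and `fisher_rao` are illustrative, and the derivative is approximated by central finite differences.

```python
import numpy as np
from math import comb

def binomial_model(theta, N=4):
    """p(theta): density of the binomial model with respect to the counting measure."""
    k = np.arange(N + 1)
    weights = np.array([comb(N, int(i)) for i in k], dtype=float)
    return weights * theta**k * (1.0 - theta)**(N - k)

def fisher_rao(model, theta, h=1e-6):
    """Sum over outcomes of (logarithmic derivative)^2 times the density."""
    p = model(theta)
    dp = (model(theta + h) - model(theta - h)) / (2 * h)   # tangent vector T_m p(v_m)
    log_derivative = dp / p          # Radon-Nikodym derivative of dp with respect to p
    return float(np.sum(log_derivative**2 * p))

theta = 0.3
g = fisher_rao(binomial_model, theta)
expected = 4 / (theta * (1 - theta))  # known Fisher information N / (theta (1 - theta))
print(g, expected)
```

The two printed values should agree up to the finite-difference error, which is the familiar closed form for the binomial family.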

Rank-one, strongly-continuous unitary models
Consider the W*-algebra A = B(H) of bounded linear operators on the possibly infinite-dimensional, separable complex Hilbert space H. As recalled in section 2, A = B(H) is the Banach dual of the space T (H) of trace-class operators on H. Therefore, every non-negative trace-class operator ω defines a n.p.l.f. on A = B(H) through ω(a) = Tr(ωa).
In this section, we will discuss a particular type of parametric models stemming from strongly continuous unitary representations of Banach-Lie groups.This family of parametric models is general enough to encompass most of the parametric models used in the literature when dealing with a quantum system described in terms of a separable Hilbert space [31,40,41].
Let G be a Banach-Lie group and π : G → U (H) a strongly-continuous unitary representation of G on H, that is, a group homomorphism between G and the unitary group U (H) such that the map π ψ : G → H given by g → (π(g))(ψ) is continuous with respect to the norm topology in H for every ψ ∈ H.A vector ϕ ∈ H is called smooth for π if the map g → (π(g))(ϕ) is smooth.In particular, if G is a finite-dimensional Lie group, then smooth vectors always exist and are dense in H [32].

Proposition 7. If ϕ is a smooth vector for the strongly-continuous unitary representation π of G, then (G, j, B(H)), with j(g) := |π(g)ϕ⟩⟨π(g)ϕ|, is a J-regular parametric model according to definition 6. Moreover, the tensor G defined as in equation (29) is invariant with respect to the canonical left action of G on itself.
Proof. We first prove that (G, j, B(H)) is a smooth parametric model according to definition 2. It suffices to note that j can be written as the composition between the smooth map π ϕ : G → H given by π ϕ (g) := (π(g))(ϕ), and the smooth map from H to the space T sa (H) of self-adjoint trace-class operators on H given by ψ → F (ψ) = |ψ⟩⟨ψ|.
Then, it holds for an arbitrary tangent vector v g ∈ T g G, and it holds for an arbitrary tangent vector φ ∈ T ψ H ∼ = H, so that where, with an evident abuse of notation, on the right-hand side, we denoted T g π ϕ (v g ) simply by v g . Since ⟨ϕ g |ϕ g ⟩ = C ϕ is a constant depending only on the reference vector ϕ and not on g, it also holds ⟨v g |ϕ g ⟩ + ⟨ϕ g |v g ⟩ = 0 (49) for every g ∈ G and every v g ∈ T g G.
Since G is a Banach-Lie group, there is a natural left action L of G on itself, and it follows that j with Ad π(h) (a) = π(h)aπ(h) † for every a ∈ A . Consequently, it also follows that where the last equality basically follows from the fact that Ad π(h) is linear. Again because G is a Banach-Lie group, in order to write down G following equation (29), we can focus only on left-invariant vector fields. If X is a left-invariant vector field on G, it holds X(g) = T e L g (X(e)) (52) where e is the identity element in G. Accordingly, exploiting equation (51) we obtain Now, we note that Ad π(g) can be thought of as the dual of a norm-continuous, ultra-weakly continuous automorphism of A = B(H), and thus, according to remark 5, it holds G j(g) (T g j(X(g)), T g j(Y (g))) = G j(e) (T e j(X(e)), T e j(Y (e))) (54) for all left-invariant vector fields X, Y on G. Equation (54) implies that we only need to understand what happens to tangent vectors at the identity element e in order to understand G (cf. equation (29)). Let {φ j } j∈N be an orthonormal basis in H such that ϕ = Aφ 0 . According to equation (48), for every left-invariant vector field X on G there is a vector v e ∈ H such that Replacing X(e) with Y (e) and a with X e in equation (59), and exploiting equation (48) for T e j(Y (e)), it follows that where we also used equation (49). Finally, we obtain which clearly shows that G is indeed a smooth tensor field on G which is invariant with respect to the canonical left action of G on itself. We thus conclude that (G, j, B(H)) is a J-regular parametric model according to definition 6 as claimed.
Corollary 1. Let (G, j, B(H)) be a J-regular parametric model as in proposition 7. Assume that the isotropy subgroup G ϕ of ϕ ∈ H with respect to π is such that G/G ϕ ∼ = M is a smooth manifold for which there exists a smooth section σ : M → G. Then (M, j ∘ σ, B(H)) is a J-regular parametric model (cf. definition 6) whose tensor field G is invariant with respect to the canonical action of G on M.
When G = U(H) and π is the identity map, proposition 7 endows U(H) with a left-invariant degenerate Riemannian metric tensor. Moreover, corollary 1 endows the complex projective space CP(H) ∼ = U(H)/U (1) with a Riemannian metric tensor G which is invariant under the canonical action of U(H) on CP(H). This invariance property is strong enough to force G to be a constant multiple of the Fubini-Study metric tensor and we thus obtain the standard Riemannian structure of the space of normal pure states that is well-known in the context of geometric Quantum Mechanics [6,23,25,44].
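The invariance and the Fubini-Study form can be illustrated numerically on a one-parameter unitary model. The following is a minimal sketch, assuming dim(H) = 4, the group G = R acting through π(t) = exp(−itH) for a Hermitian matrix H, a unit reference vector ϕ, and the conventions {a, b} = (ab + ba)/2 and G ρ (η a , η a ) = Tr(ρ {a, a}) with η a = {ρ, a}. The names `unitary`, `metric_at`, and the specific normalization (which yields four times the variance of H) are assumptions made for illustration.

```python
import numpy as np

def unitary(H, t):
    """pi(t) = exp(-i t H) for Hermitian H, via the spectral decomposition."""
    w, U = np.linalg.eigh(H)
    return (U * np.exp(-1j * t * w)) @ U.conj().T

def jordan(a, b):
    """Jordan product {a, b} = (ab + ba) / 2 (assumed normalization)."""
    return 0.5 * (a @ b + b @ a)

def metric_at(H, phi, t):
    """Squared length of the velocity of t -> |pi(t) phi><pi(t) phi| at parameter t."""
    psi = unitary(H, t) @ phi                     # pi(t) phi, a unit vector
    v = -1j * (H @ psi)                           # derivative of t -> pi(t) phi
    rho = np.outer(psi, psi.conj())               # rank-one functional |psi><psi|
    X = np.outer(v, psi.conj()) + np.outer(psi, v.conj())   # tangent vector of the model
    a = 2.0 * X                                   # solves {rho, a} = X for unit psi
    assert np.allclose(jordan(rho, a), X)
    return float(np.real(np.trace(rho @ jordan(a, a))))

rng = np.random.default_rng(2)
m = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
H = 0.5 * (m + m.conj().T)                        # Hermitian generator
phi = rng.normal(size=4) + 1j * rng.normal(size=4)
phi /= np.linalg.norm(phi)                        # unit reference vector

g0 = metric_at(H, phi, 0.0)
g1 = metric_at(H, phi, 0.7)
H_phi = H @ phi
variance = np.vdot(H_phi, H_phi).real - np.vdot(phi, H_phi).real ** 2
print(g0, g1, 4 * variance)
```

With these conventions the three printed numbers should coincide: the value does not depend on t, reflecting left invariance, and equals four times the variance of H in the state ϕ, which is the Fubini-Study expression up to a constant factor.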
When G is a finite-dimensional Lie group and π a strongly continuous unitary representation for which both proposition 7 and corollary 1 are valid, then we obtain parametric models of n.p.l.f.s that are basically the well-known and widely used coherent states [3,60].

Conclusions
The theory of J-regular parametric models of n.p.l.f.s on a W*-algebra A introduced here is only at a preliminary stage, and much remains to be investigated. For instance, the formulation of the Cramér-Rao bound in the context of W*-algebras is definitely something that should be addressed. Some preliminary results in the finite-dimensional case are discussed in [20], from which it follows that an infinite-dimensional reformulation would necessarily entail the investigation of the so-called theory of quantum measurement in the context of information geometry.
Then, it is reasonable to ask what happens in the W*-algebraic framework to other well-known geometrical structures appearing in the classical case, first and foremost, to the Amari-Čencov tensor determining the dualistic structure in the classical case. To this end, let us simply note that, when ω is faithful, the Jordan structure of A sa gives rise to a trilinear map T ω on J ω by means of the so-called Jordan triple product
Radon-Nikodym derivative of µ ξ with respect to µ ω . If (M, p, A ) is a parametric model according to definition 2, then p(m) = p m µ with p m a positive function in L 1 (X , µ), and this, in turn, means that (M, X , µ, p) is a parametrized measure model dominated by µ in the language of classical information geometry [8, defn. 3.4, p. 150]. In particular, every T m p(v m ) ∈ T p(m) P leads to a Radon-Nikodym derivative dT m p(v m ) / dp(m)

Parametric models of normal positive linear functionals
2.3].Once all the technical tools discussed in section 2 are at our disposal, we are ready to define what a smooth parametric model of n.p.l.f.s actually is.
Definition 2. Given a W*-algebra A , a smooth parametric model of n.p.l.f.s is a triple (M, j, A ) where M is a Banach manifold and j : M −→ (A sa ) * is a smooth map such that j(M) ⊂ P. If j(M) ⊂ S ⊂ P, we refer to (M, j, A ) as a smooth parametric model of normal states. A smooth parametric model of n.p.l.f.s (normal states) (M, j, A ) is said to be identifiable if j is injective, and it is said to be locally identifiable if T m j is injective for all m ∈ M.
Remark 2. In the following, in order to avoid burdensome repetitions of words, we often write "parametric model" when we actually mean "smooth parametric model of n.p.l.f.s (normal states)".
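To make local identifiability concrete, the following minimal sketch tests injectivity of the tangent map at a single point for a finite-dimensional toy model. It assumes M = R², a four-point outcome space, and j(θ) = exp(θ 1 f 1 + θ 2 f 2 ) viewed as a strictly positive measure; the names `model`, `jacobian`, and `locally_identifiable_at` are hypothetical, and the Jacobian is approximated by central finite differences. Checking one point is of course weaker than the condition in definition 2, which is required at every m ∈ M.

```python
import numpy as np

def model(theta, features):
    """j(theta) = exp(sum_k theta_k f_k): a strictly positive measure on a finite set."""
    return np.exp(features.T @ theta)

def jacobian(theta, features, h=1e-6):
    """Central finite-difference approximation of the tangent map T_theta j."""
    cols = []
    for k in range(len(theta)):
        e = np.zeros_like(theta)
        e[k] = h
        cols.append((model(theta + e, features) - model(theta - e, features)) / (2 * h))
    return np.stack(cols, axis=1)

def locally_identifiable_at(theta, features):
    """True when the approximate tangent map has full column rank at theta."""
    J = jacobian(theta, features)
    return bool(np.linalg.matrix_rank(J, tol=1e-6) == len(theta))

x = np.array([0.0, 1.0, 2.0, 3.0])
theta = np.array([0.1, -0.2])

independent = np.stack([x, x**2])      # f_1, f_2 linearly independent
degenerate = np.stack([x, 2.0 * x])    # f_2 = 2 f_1: the two directions coincide

print(locally_identifiable_at(theta, independent))
print(locally_identifiable_at(theta, degenerate))
```

In the degenerate family the two parameters enter only through θ 1 + 2θ 2 , so the tangent map has a one-dimensional kernel and the model fails to be locally identifiable.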