Abstract
Background
Hadamard conjugation is part of the standard mathematical armoury in the analysis of molecular phylogenetic methods. For group-based models, the approach provides a one-to-one correspondence between the so-called “edge length” and “sequence” spectrum on a phylogenetic tree. The Hadamard conjugation has been used in diverse phylogenetic applications not only for inference but also as an important conceptual tool for thinking about molecular data leading to generalizations beyond strictly tree-like evolutionary modelling.
Results
For general group-based models of phylogenetic branching processes, we reformulate the problem of constructing a one-one correspondence between pattern probabilities and edge parameters. This takes a classic result previously shown through use of Fourier analysis and presents it in the language of tensors and group representation theory. This derivation makes it clear why the inversion is possible, because, under their usual definition, group-based models are defined for abelian groups only.
Conclusion
We provide an inversion of group-based phylogenetic models that can implemented using matrix multiplication between rectangular matrices indexed by ordered-partitions of varying sizes. Our approach provides additional context for the construction of phylogenetic probability distributions on network structures, and highlights the potential limitations of restricting to group-based models in this setting.
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
Background
Fundamental to evolutionary biology is the development and implementation of molecular phylogenetic methods [1]. These methods provide the means to reconstruct the past evolutionary history of biological entities given present-day molecular data, such as DNA. Considering Kimura’s neutral theory of molecular evolution, it is logical to apply a stochastic model at the level of DNA substitutions to construct probabilistic description of what molecular alignments are expected to be observed, given a proposed evolutionary history (tree topology and edge lengths). is commonly implemented assuming an IID (across sites in the alignment) and Markov process for DNA substitution, leading to a model that has a continuous-time Markov chain at its core (see Semple and Steel [2] for an introduction to the mathematics underlying modern phylogenetic methodology).
In a series of papers, Hendy and colleagues introduced the Hadamard conjugation as a novel tool for phylogenetic analyses [3]–[5]. They found an invertible relationship between a phylogenetic tree, as characterized by its edge length spectrum, and the probability distribution of site patterns (referred to as the sequence spectrum). Originally introduced only for the 2-state symmetric model, the Hadamard conjugation was later extended to the K3ST model [6]–[8] and further to any of the so-called “group-based” models [9]. Hadamard conjugation has been used as both a tool for simulation [10] and to look at statistical properties of methods, exploring the inconsistency of parsimony under a molecular clock [5],[11]. For these sorts of applications, following the notation in Felsenstein [1], we can use the Hadamard transform H to start with an edge length spectrum γ and calculate the sequence spectrum s=H −1 log(H γ). The beauty of Hadamard conjugations is that one can also begin with an observed sequence spectrum and perform the inverse of the conjugation to empirically obtain an edge length spectrum . Although it is not expected that the spectrum will precisely match a tree, Hendy [12] proposed using an optimisation criterion to map from to the “closest tree”.
Several authors have commented that it is potentially a useful feature of Hadamard conjugation that data isn’t forced onto a fixed tree. The conflicting information can be retained and interpreted in the form of a “lentoplot” [13] or a splits-graph [14], with both of these methods implemented in Spectronet [15]. Schliep [16] gives some more statistical justification for such an approach by making a link to modern statistical techniques such as the Lasso and Ridge regression.
von Haeseler and Churchill [17] seems to be the first paper that explicitly suggests using Hadamard conjugation to provide a likelihood framework for networks. The principle idea in this work was to start with an edge length spectrum that encodes a set of incompatible splits, use the Hadamard transformation to get site probabilities and use these to determine a likelihood. This idea was further explored by Bryant [18], and Bryant [19] followed this through defining the “n-taxon process” for group-based models. It should be noted that likelihoods calculated via Hadamard are not equivalent to likelihoods calculated by taking a mixture of trees. Indeed, Matsen and Steel [20], Matsen et al. [21] used Hadamard methods in combination with phylogenetic invariants to show that mixtures of trees with the same topology can exactly mimic another tree under the 2-state model. Considering biological applications, thinking in terms of mixtures of trees or partitions where the data can be thought of as arising on a set of trees [22]–[24] seems more reasonable than the Hadamard conjugation. Strimmer and Moulton [25] suggested using split networks as a spring board to likelihood-based analyses on DAGs, but later identified several problems with the approach [26]; most notably, in split-networks internal nodes do not have a biological interpretation as an ancestor.
In Sumner et al. [27], we gave some additional insight into the interpretation of applying the Hadamard conjugation in a network setting. We showed that permutation group structure inherent to the Hadamard transformation – as for any group-based model – restricts the resulting process from being capable of reproducing truly convergent processes. This is a serious limitation, as one of the biological motivations for explicit network models is the ability to model convergent processes. We also presented an alternative algebraic formalism for the general Markov model, analogous to the n-taxon process, but capable of reproducing convergent processes.
From the point of view of group representation theory, the inversion of group-based models relies on the fact that the irreducible representations of an abelian group are one-dimensional, and the model structure essentially reduces to analysing group characters – hence the standard presentation of a Fourier inversion. In this article, we make this connection concrete. For the general Markov model, it is then immediately apparent that an analogous inversion is not possible because the algebraic structure underlying the model is not abelian and hence the irreducible representations are not one-dimensional. In fact, to obtain one-dimensional representations for the general Markov model, it is necessary to apply higher-degree polynomial maps (beyond the degree 1, linear case), and define “Markov invariants” [28]. These invariants present one-dimensional representations but at the cost of the higher degree – degree 5 in the case of the general Markov model with four states on quartet trees [29],[30]. This connection between Hadamard transformation and Markov invariants is an interesting one, but we do not discuss it further here.
In this paper we approach the inversion of group-based phylogenetic models by taking a representation-theoretic perspective and working explicitly with tensor indices. Our approach rests heavily on the formalism of “phylogenetic tensors”, as presented in Bashford et al. [31], for the binary-symmetric and K3ST model, and Sumner et al. [27],[28], for the general Markov model.
Although the main inversion results presented here are not more general than those in in Székely et al. [7], we think it is important to reformulate them using the language of tensors and representation theory. This viewpoint has already led to new approaches for modeling convergent evolution [27] and for studying non-group-based models [28]. However, in none of our previous work was the link to Hadamard conjugation explicitly discussed. By presenting an old technique (Hadamard conjugation) in a new light we hope to introduce other researchers to the viewpoint of tensor analysis and representation theory.
Methods
Group-based models
We consider the continuous-time formulation of Markov processes, and show how to implement the inversion of a group-based phylogenetic model based on any abelian group G. We note that such an inversion requires a map from tensor product space (where elements are indexed by ordered-n-partitions) to phylogenetic splits (where elements are indexed by bipartitions). We achieve this by finding canonical maps from bipartitions to ordered-n-partitions.
For a group G (not necessarily abelian) with order |G|=d, we write G={σ 1,σ 2,…,σ d }, and, when necessary, write ε∈G to specify the identity element of G. We will discuss the “regular representation” of G shortly, but skipping ahead we find that any rate matrix Q occurring in a group-based Markov model can be written in the form
where each , and the K σ are the permutation matrices corresponding to the (non-identity) group elements σ∈G.
For the reader interested in deriving this result, consider the d-dimensional vector space , with scalar multiplication and vector addition defined via
for all and . The regular representation, , is then defined by setting the group action
for all and σ∈G. If we fix {σ 1,σ 2,…,σ d } as an ordered basis for , it is then clear – via Cayley’s theorem – that each group element σ gets mapped to a permutation matrix K σ :=ρ reg(σ), with . Thus K σ has matrix elements
Consider the unit column vectors
and identify each with , so that the group action becomes σ:ξ i ↦K σ ξ i =ξ j where σ j =σ σ i . Thus the matrix elements have i as the column label and j as the row label.
A group-based Markov model is then obtained by taking a continuous-time Markov chain with state space G={σ 1,σ 2,…,σ d } and using the group multiplication in G to assign a rate α σ to all substitutions σ 1↦σ 2 where σ σ 1=σ 2. Following this through (as is done in detail in [32]) we are led to the formula (1) for rate matrices in any group-based model.
The regular representation is one example of the general concept of a representation of G on a vector space V, defined as a homomorphism ρ:G→G L(V) satisfying ρ(g 1g 2)=ρ(g 1)ρ(g 2) for all g 1,g 2∈G. A representation is said to be reducible if there exists a proper subspace U⊂V satisfying ρ(g)U⊂U, i.e. the set of matrices ρ(G) send vectors in U back to U. In this case, U is called an invariant subspace. The representation ρ is then called irreducible if V does not contain any invariant subspaces.
The reader should note that the usual construction of a “group-based” model [2] stipulates that G be abelian. Although the construction just given using the regular representation allows for non-abelian G, we will nonetheless only consider the abelian case in this paper, because, as discussed in the introduction, it is only in the abelian case that a (linear) inversion of phylogenetic models is possible. In this case the irreducible representations of G are all one-dimensional [33], and hence the analysis reduces to computations with group characters, as is exploited in the previous approaches using Fourier analysis [9],[34].
Phylogenetic tensors
We denote [d]:={1,2,…,d} as the state space for a continuous-time Markov chain. Consider an n-taxa phylogenetic tree and a d-state phylogenetic pattern distribution with the interpretation that is the probability that the observed state at the k th leaf on the tree is i k . As is shown in Sumner and Jarvis [35] and in more detail in Sumner et al. [27], such phylogenetic pattern distributions can be represented abstractly as tensors in the n-fold tensor product space , as follows. If we choose {ξ 1,ξ 2,…,ξ d } as an ordered basis for , and ordered basis for the tensor product space, a “phylogenetic tensor” is then defined as
For readers who are unfamiliar with tensor products, it is possible to understand the general concept via the definition of the “Kronecker” product of a n×m matrix A and a n ′×m ′ matrix B as the n n ′×m m ′ matrix given by
We can index the matrix A⊗B with row indicies i 1j 1=11,12,…,n n ′ and column indices j 1j 2=11,12,…,m m ′, i.e. generically and specifically (A⊗B)12,32=A 13B 22. This point of view is useful if one wants to write out specific matrix representations of tensors, however, in the development that follows will focus heavily on the indexing of tensor components in the various cases discussed.
Suppose represents the state distribution of a single taxa, i.e. π i is the probability that a randomly chosen site in the sequence will be in state i. Now suppose a phylogenetic branching event occurs and the sequence is copied. The corresponding phylogenetic tensor representing the joint distribution of the two-taxa just after the branching event then has the property that if i 2=i 1 and is zero otherwise. Thinking in terms of tensor operations, we find that phylogenetic branching events can be generated by a linear operator determined by δ(π)=P and defined in general using our chosen basis as
The remarkable fact for group-based models, central to the present article, is that the permutation matrices “intertwine” particularly simply with the branching operator:
Thus, for any rate matrix Q arising from a group-based model, we have (via the linearity of δ):
We also note that, since Q can be expressed a linear combination of permutation matrices representing elements in a group G, the matrix powers Q 2,Q 3,Q 4… will also be expressible as linear combinations of the same permutation matrices (although precise expressions for the relevant coefficients may or may not be easily computable). Together with (3), this implies that, for any substitution matrix e Qt arising from matrix exponentiation,
This relation shows that mathematically, and hence conceptually, “Markov evolution on a single followed by a branching event” can be replaced with “Branching event on a single taxon followed by (correlated) Markov evolution of two taxa.” This equivalence is illustrated in Figure 1, and should be compared to the equivalent discussion of the “n-taxa process” given in [18] and [19].
In Sumner et al. [27] we showed how to generalise this intertwining action to the case of the general Markov model. Interestingly, for the general Markov model the appropriate intertwining has quite a different structure from what occurs in group-based models, and hence the simplicity of (4) is somewhat misleading in general. We refer the reader to Sumner et al. [27] for more discussion on this point.
Returning to the case of group-based models, for each subset A⊆[n], we define a linear map on as the tensor product where a i =1 if i∈A and 0 otherwise. For example, if n=5, we have
To develop a phylogenetic tensor on a tree, we root the phylogenetic tree at taxon n, and label edges by subsets ∅≠e⊆[n−1], where i∈e if the path from taxon n to taxon i crosses the edge labelled by e. A five taxon tree with this labelling, is presented in Figure 2. To each edge labelled by ∅≠e⊆[n−1], we assign the rate matrix
where each is the rate of substitution for all states σ 1 to σ 2 satisfying , and . Each edge is then assigned substitution matrix , so that the time parameter for each edge is absorbed into the definition of Q e .
Now iterating (4) multiple times, Bashford et al. [27],[31] show that any phylogenetic tensor can be written as
where , and δ n−1π is the d×d×…×d tensor that represents the “zero edge-length star tree” distribution on n taxa. It is this form of phylogenetic tensors that will do a lot of the heavy lifting in the discussion that follows. The reader should note that under this representation, there is no need for the edge parameters to be chosen to be compatible with a particular tree, hence the possibilities for generalising to non-tree-like or network models, as discussed in the introduction.
The stationary distribution for group-based models is uniform (because the rate matrices are doubly stochastic). In this paper we always assume a stationary distribution, so that:
and δ n−1π has tensor components
This concludes our discussion of the tensor presentation of phylogenetic probability distributions under group-based models. It is important to note that everything discussed so far works for any group-based model, with no requirement that the underlying group G be abelian.
In what follows, we discuss the inversion of abelian group-based models. We present the simplest case with ; the case; the case; the general case; and finally we discuss the case of any abelian group.
Results
The binary-symmetric case
We begin with the inversion of the so-called “binary-symmetric” model. Consider with standard basis
As a group-based model, the binary-symmetric model arises by taking the group
with a generic rate matrix given by
where is the permutation matrix representing σ in the standard basis.
Now , with σ↦K, is the regular representation of , and the character table of given in Table 1 is easily recognised to be the Hadamard matrix
As is an abelian group, the irreducible representations are one-dimensional.
The corresponding projection operators can be read off from the columns of the character table. That is, the operators
project ρ reg=i d⊕s g n onto the id and sgn representations of , respectively.
This observation prompts us to work in the alternative basis:
In this basis the permutation matrix is diagonal:
The representation-theoretic perspective on is to observe that i d(σ)=1 and s g n(σ)=−1.
Referring to (5), we know that we can write a generic phylogenetic tensor as
where .
We index matrix and tensor indices by using and allow multiplication × in the ring of integers . The Hadamard matrix then has matrix elements where j is the row index and i is the column index. Observe that in the diagonal basis, the permutation matrix has elements
Thus we have expressions such as
where .
As we are dealing with tensors of arbitrary size, it is convenient to represent a string such as i 1i 2…i n as an ordered-bipartition μ=μ 0: μ 1 of the set [n], where μ 0,μ 1⊆[n] with j∈μ k if and only if i j =k. For example we have the following equivalences:
and inequivalence:
We then have
Defining h (n):=h (n−1)⊗h where h (1):=h, in the diagonal basis and using our notation h (n) has tensor components
The zero edge-length star-tree initial distribution has tensor components
(where, although it seems we have given preference to taxon 1 in this expression, there are many ways that this distribution can be expressed using the δ ij ). In the diagonal basis with , we have components
which is exactly the statement
Since is diagonal in the transformed basis, we can conclude that
Of course many of these tensor components will be zero and we would like to ignore these.
Take u=u 0: u 1 as an ordered bipartition of the reduced set [n−1], so that u≡i 1i 2…i n−1 where j∈u k if and only if i j =k, and define
and interpret u·γ(u) as a string: u·γ(u) = i 1i 2…i n−1γ(u).
If we make the definitions
then we can write the non-zero components as
with inverses
This is the first part of the inversion.
We would like to go further and actually recover the individual edge weights α e . To do this we define the (square) 2n−1×2n−1 matrix F with components
with e a subset and u an ordered-bipartition of [n−1]. As , we see that F provides its own inverse F −1 with components
Defining the column vectors and , we can write the matrix equations
Together with the first part of the inversion (6), these equations give a one-one map between pattern probabilities and edge weights for the binary-symmetric model.
Inversion of the model
Taking confidence from the previous case we now discuss the inversion of the group-based phylogenetic model with . We take
and, by analogy to the case, index tensors with indices i,j=0,1,2 and allow multiplication × by extending to the ring .
In this case a generic rate matrix is given by
where
are the matrices representing the permutations σ≅(123) and σ 2≅(132) under the regular representation, respectively.
We define ω=e 2πi/3, and present the character table of in Table 2. The decomposition of the regular representation is ρ reg=i d⊕ω⊕ω 2, and the columns of the character table give the projection operators onto the (one-dimensional) irreducible subspaces:
Therefore, the matrix
diagonalizes the generic rate matrix for this model:
or, equivalently,
We recall our basic result (5) that for group-based models, a generic phylogenetic tensor can be expressed as
where . We take the stationary distribution as initial distribution, so .
The matrix elements of f can be expressed as , where we extend to include multiplication × from the ring of integers . Similarly,
More generally, tensorial components can be expressed as
We represent a string i 1i 2…i n as an ordered-tripartition, i 1i 2…i n ≡μ=μ 0: μ 1: μ 2, of the set [n], where j∈μ k if and only if i j =k. For example, if we take n=5, we have:
Taking n = 3, we have
and in general:
Taking the uniform distribution as initial distribution, the initial star-tree distribution can be written as
Defining f (n)=f (n−1)⊗f where f (1)=f, we have
and in the transformed basis, where , we have
Indexing by ordered-tripartitions, we conclude that
Now suppose |μ 1|+2|μ 2|=0 (mod 3), then
If |μ 1|+2|μ 2|=1 (mod 3), then
and if |μ 1|+2|μ 2|=2 (mod 3), then
Thus we have found a basis where all the elements of the initial star-tree tensor are zero unless the tripartion μ satisfies |μ 1|+2|μ 2|=0 (mod 3). Crucially, this statement also holds for the phylogenetic tensor because in this basis the rate matrices of this model are diagonal:
We deal with this condition on μ by taking u=u 0: u 1: u 2 as an ordered-tripartion of the reduced set [n−1] and setting μ=u·γ(u) (considered as the concatenation of strings) where
If we make the definitions
we then have the first part of the inversion
As in the case, we would like to use η u to recover the rate parameters α e ,β e for all ∅≠e⊆[n−1] and thus complete the full inversion for this model. Of course, it is little bit more difficult this time.
Recall that μ=μ 0: μ 1: μ 2 with μ i ⊆[n], whereas u=u 0: u 1: u 2 with u i ⊆[n−1], and ∅≠e⊆[n−1]. Considering
it follows that
and similarly
We make the observation that
and
where F 1 and F 2 are 2n−1×3n−1 matrices.
Thus we may write
Defining the column vectors and , we can write
and define two 3n−1×2n−1 matrices G 1 and G 2 as
where
with f f −1=1.
Considering that
for all ordered-triparitions u,w of [n−1], we have the matrix products
Thus the second part of the inversion for this model is
Together with (7), these equations give a one-one map between pattern probabilities and edge weights for the group-based model with .
Inversion of the K3ST model
We now consider the K3ST model [36] which occurs as the group-based model with
In this model a generic rate matrix is given by
where
We already know that the 2×2 Hadamard matrix h diagonalizes K, so we see immediately that H=h⊗h diagonalizes this model:
Of course H is the character table of and the permutation matrices (8), together with K 00:=1, give the regular representation ρ reg≅i d⊗i d⊕i d⊗s g n⊕s g n⊗i d⊕s g n⊗s g n, where we recall the basic result that the tensor product of two irreducible representations of a group G gives an irreducible representation of G×G.
Simplifying notation, for this model we index tensors with indices given as pairs: ; and we express the individual parts using lower case Roman characters. For example, we write i:=a b=01, with a=0 and b=1. This gives matrix elements:
and more complicated tensor products such as
Again we interpret strings such as μ≡a 1a 2…a n and ν≡b 1b 2…b n as ordered-bipartitions μ=μ 0: μ 1 and ν=ν 0: ν 1 of the set [n]. We can then write matrix elements of tensor products as
Taking the stationary distribution as initial distribution, the zero edge-length star-tree distribution is given by
which in the finer index representation is
Recall that elements of the Hadamard matrix can be written as , where and we allow multiplication × by extending to the ring of integers . In the transformed basis, we have
We recall (5), so under this model we can express a generic phylogenetic tensor as
To exclude the vanishing components we define, for all ordered bipartitions u=u 0: u 1 of the reduced set [n−1],
and intepret u·γ(u) as the string u·γ(u)=a 1a 2…a n−1γ(u). Then, for each pair u,v of ordered-bipartitions of [n−1], we define
and
This gives the inversion
Consider the 2n×2n−1 rectangular matrices F 01, F 10 and F 11 with components
where e⊆[n−1] and u=u 0: u 1 and v=v 0: v 1 are ordered-bipartitions of [n−1]. If we define the column vector indexed by pairs of ordered-bipartitions and the column vectors , and indexed by subsets of [n−1], we then have the matrix equation
Writing H (n)=H (n−1)⊗H with H (1)=H, we note that
and define the 2n−1×2n rectangular matrices G 01,G 10 and G 11 as
Noting that
for all u,v,y,z ordered-bipartitions of [n−1], we then have the matrix identities
and
Writing
completes the inversion for the K3ST model.
Inversion of the model
We now consider the group based model for . For this model the generic rate matrix has the form
where and
so that .
Defining ω=e 2πi/r, we have ω r=1 and 1+ω+ω 2+…+ω r−1=0 and where i,j=0,1,2,…,r−1. Of course, f is the character table of and .
Lemma 1.
where μ,ν,μ ′ are ordered-r-partitions of the set [n] defined by the strings i 1i 2…i n , j 1j 2…j n and k 1k 2…k n , respectively.
Proof.
The result is obvious by the definition of tensor product. However, explicitly we have
which clearly equals 1 if i ℓ −k ℓ =0 for all ℓ, and, by repeatedly applying 1+ω+ω 2+…+ω r−1=0, equals 0 otherwise.
The regular representation contains exactly one copy of every irreducible representation and the irreducible representations of are given by the powers of ω:
Thus the change of basis will give diagonal matrices . Additionally,
Lemma 2.
In the diagonal basis, the matrices have matrix elements given by .
Proof.
Consider the matrix elements . Thus
where we have used .
Now
and
Translating this result using the ordered-r-partitions for indices, we have
Lemma 3.
In the diagonal basis, the uniform initial distribution on the star tree has components
where μ=μ 0: μ 1: μ 2:…: μ r−1 is an ordered-r-partition of the set [n].
Again recall that for this model a generic phylogenetic tensor can be written as
where . In the diagonal basis and as a consequence of Lemma 3 will have many vanishing components. To avoid these we take u=u 0: u 1: u 2:…: u r−1 as an ordered-r-partition of [n−1] and set
If we define and
we then have the first part of the inversion for the model:
For each i∈[r−1], we define the column vectors , and, for each ∅≠e⊆[n−1] and u an ordered- (r−1)-partition of [n−1], we define the rectangular r n−1×2n−1 matrices
so we have the vector equation
We claim that
Lemma 4.
Proof.
We recall that , so, for μ=μ 0: μ 1: μ 2:…: μ r−1 an ordered-r-parition of [n], and e a subset of [n−1] we have
so
because e⊆[n−1]. On the other hand , so
where e appears in the s th position.
Define, for i∈[r−1], the rectangular 2n−1×r n−1 matrices
Of course G i F j =δ ij 1, so we now have the second part of the inversion:
Inversion of any abelian group-based model
Lemma 5.
Any (finitely generated) abelian group G is isomorphic to a direct product of cyclic groups of prime-power order, ie. where each where p i is prime and n i is a positive integer.
Lemma 6.
The group-based model arising from the G is defined only up to group isomorphisms of G.
Proof.
A generic rate matrix for the group-based model arsing from G is given by
Under a group isomorphism ϕ:G→G ′, we have ϕ(σ i σ j )=ϕ(σ i )ϕ(σ j ).
Recall (2), so that the matrix elements is set via the action σ i ↦σ σ i =σ j . If we consider the regular representation of G ′ we then have defined by ϕ(σ i )↦ϕ(σ)ϕ(σ i ). Now ϕ(σ)ϕ(σ i )=ϕ(σ σ i )=ϕ(σ j ) and, because ϕ is a group isomorphism, this occurs if and only if σ σ i =σ j . Thus for all i and j.
This means that we can restrict attention to a single representitive in the isomorphism class of G. Of course, for this purpose we choose the representative guaranteed by Lemma 5.
Thus, for any abelian group G, with generators σ 1,σ 2,…,σ q the corresponding group-based model has rate generators given by
for all , where is the permutation matrix representing the generator . The character table f of G is simply the tensor product of the individual character tables of the :
In the diagonal basis we have matrix elements
where ω k is a k th root of unity. Thus
We write phylogenetic tensors for this model in the form
where 0≤i sj ≤r s for all 0≤s≤q. We simplify notation by writing each group of indices as μ (s):=i s1i s2…i sn where μ (s) is an ordered- r s -partition of [n].
Lemma 7.
In the diagonal basis, the uniform initial distribution on the star tree has components
A generic phylogenetic tensor for this model can be expressed as
where π is the unifrom distribution on states, i.e.
In the diagonal basis , and, as a consequence of the previous lemma, P has many vanishing components. To avoid these, for each i∈[q] we take as an ordered- r i -partition of [n−1] and set
We then define
and
so that we have the first part of the inversion
We define the column vectors and where u i is an ordered- r i -partition of [n−1], and we define the (r 1r 2…r q )n−1×2n−1 matrices
where in each term e appears in the position and the equality follows from Lemma 4.
We can then write the vector equation
If we define the 2n−1×(r 1r 2…r q )n−1 matrices
where in each term e appears in the position, we have the orthogonality relations
This gives us the second part of the inversion of any group-based model:
Conclusion
In this article we have given an alternative derivation of the inversion of group-based phylogenetic models. Primarily our method relies on the remarkable intertwining relation between branching events and Markov evolution (4), and the resulting simplified expression of phylogenetic tensors given in (5). From there we took a representation theoretic approach concentrating on the structure of tensor indices.
Authors’ information
Jeremy G. Sumner is a ARC Research Fellow, Peter D. Jarvis is a Alexander von Humboldt Fellow and Barbara R. Holland is a ARC Future Fellow.
References
Felsenstein J: Inferring Phylogenies . 2004, Sinauer Associates, Sunderland
Semple C, Steel M: Phylogenetics . 2003, Oxford University Press, Oxford
Hendy MD, Penny D: A framework for the quantitative study of evolutionary trees . Syst Zool. 1989, 38: 297-309. 10.2307/2992396.
Hendy MD: The relationship between simple evolutionary tree models and observable sequence data . Syst Zool. 1989, 38: 310-321. 10.2307/2992397.
Hendy MD, Penny D: Spectral analysis of phylogenetic data . J Class. 1993, 10: 1-20. 10.1007/BF02638451.
Steel M, Hendy M, Székely L, Erdös P: Spectral analysis and a closest tree method for genetic sequences . Appl Math Lett. 1992, 5 (6): 63-67. 10.1016/0893-9659(92)90016-3.
Székely LA, Erdos P, Steel M, Penny D: A fourier inversion formula for evolutionary trees . Appl Math Lett. 1993, 6 (2): 13-16. 10.1016/0893-9659(93)90004-7.
Hendy MD, Penny D, Steel M: A discrete Fourier analysis for evolutionary trees . Proc Natl Acad Sci. 1994, 91: 3339-3343. 10.1073/pnas.91.8.3339.
Székely LA, Steel MA, Erdős PL: Fourier calculus on evolutionary trees. Adv Appl Math. 1993, 14: 200-216. 10.1006/aama.1993.1001.
Hendy MD, Charleston MA: Hadamard conjugation: a versatile tool for modelling nucleotide sequence evolution . New Zeal J Bot. 1993, 31: 231-237. 10.1080/0028825X.1993.10419500.
Holland BR, Penny D, Hendy MD: Outgroup misplacement and phylogenetic inaccuracy under a molecular clock – a simulation study . Syst Biol. 2003, 52: 229-238. 10.1080/10635150390192771.
Hendy MD: A combinatorial description of the closest tree algorithm for finding evolutionary trees . Discrete Math. 1991, 96: 51-58. 10.1016/0012-365X(91)90469-I.
Lento GM, Hickson RE, Chambers GK, Penny D: Use of spectral analysis to test hypotheses on the origin of pinninpeds . Mol Biol Evol. 1995, 12: 28-52. 10.1093/oxfordjournals.molbev.a040189.
Huber KT, Watson EE, Hendy MD: An algorithm for constructing local regions in a phylogenetic network . Mol Phylogenet Evol. 2001, 19: 1-8. 10.1006/mpev.2000.0891.
Huber KT, Langton M, Penny D, Moulton V, Hendy M: Spectronet: a package for computing spectra and median networks . Appl Bioinform. 2002, 1: 2041-2059.
Schliep KP: Some applications of statistical phylogenetics. Ph.D. thesis. Massey University; 2009.
von Haeseler A, Churchill GA: Network models for sequence evolution . J Mol Evol. 1993, 37: 77-85. 10.1007/BF00170465.
Bryant D: Extending tree models to split networks . Algebraic Statistics and Computational Biology . Edited by: Pachter L, Sturmfels B. 2005, Cambridge University Press, Cambridge, 297-368.
Bryant D: Hadamard phylogenetic methods and the n -taxon process . Bull Math Biol. 2009, 71: 297-309. 10.1007/s11538-008-9364-8.
Matsen FA, Steel M: Phylogenetic mixtures on a single tree can mimic a tree of another topology . Syst Biol. 2007, 56: 767-775. 10.1080/10635150701627304.
Matsen FA, Mossel E, Steel M: Mixed-up trees: the structure of phylogenetic mixtures . Bull Math Biol. 2008, 70: 1115-1139. 10.1007/s11538-007-9293-y.
Griffiths RC, Majoram P: Ancestral inference from samples of DNA sequences with recombination . J Comput Biol. 1996, 3: 479-502. 10.1089/cmb.1996.3.479.
Griffiths RC, Marjoram P: An ancestral recombination graph . Progress in Population Genetics and Human Evolution, Volume 87 of IMA volumes in mathematics and its applications . 1997, Springer Verlag, Berlin, 257-270.
Jin G, Nakhleh L, Snir S, Tuller T: Maximum likelihood of phylogenetic networks . Bioinformatics. 2006, 21: 2604-2611. 10.1093/bioinformatics/btl452.
Strimmer K, Moulton V: Likelihood analysis of phylogenetic networks using directed graphical models . Mol Biol Evol. 2000, 17: 875-881. 10.1093/oxfordjournals.molbev.a026367.
Strimmer K, Wiuf C, Moulton V: Recombination analysis using directed graphical models . Mol Biol Evol. 2001, 18: 97-99. 10.1093/oxfordjournals.molbev.a003725.
Sumner JG, Holland BR, Jarvis PD: The algebra of the general Markov model on trees and networks . Bull Math Biol. 2012, 74 (4): 858-880. 10.1007/s11538-011-9691-z.
Sumner JG, Charleston MA, Jermiin LS, Jarvis PD: Markov invariants, plethysms, and phylogenetics . J Theor Biol. 2008, 253: 601-615. 10.1016/j.jtbi.2008.04.001.
Sumner JG, Jarvis PD: Markov invariants and the isotropy subgroup of a quartet tree . J Theor Biol. 2009, 258: 302-310. 10.1016/j.jtbi.2009.01.021.
Holland BR, Jarvis PD, Sumner JG: Low-parameter phylogenetic inference under the general markov model . Syst Biol. 2013, 62: 78-92. 10.1093/sysbio/sys072.
Bashford JD, Jarvis PD, Sumner JG, Steel MA: U(1)×U(1)×U(1) symmetry of the Kimura 3ST model and phylogenetic branching processes . J Phys A Math Gen. 2004, 37: L1-L9. 10.1088/0305-4470/37/8/L01.
Sumner JG, Fernández-Sánchez J, Jarvis PD: Lie Markov models . J Theor Biol. 2012, 298: 16-31. 10.1016/j.jtbi.2011.12.017.
Sagan BE: The Symmetric Group: Representations, Combinatorial Algorithms, and Symmetric Functions . 2001, Springer, New York
Evans SN, Speed T: Invariants of some probability models used in phylogenetic inference . Ann Stat. 1993, 21: 355-377. 10.1214/aos/1176349030.
Sumner JG, Jarvis PD: Entanglement invariants and phylogenetic branching . J Math Biol. 2005, 51: 18-36. 10.1007/s00285-004-0309-z.
Kimura M: Estimation of evolutionary distances between homologous nucleotide sequences . Proc Natl Acad Sci. 1981, 78: 1454-1458.
Acknowledgements
This research was supported by Australian Research Council Discovery Grants DP0877447 (JGS and PDJ) and FT100100031 (BRH).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
JGS conducted much of the theoretical work presented, and was primarily responsible for drafting of the paper. PDJ contributed to the theoretical work and assisted in editing of the paper. BRH contributed to the theoretical work and contributed substantially to the writing of the paper. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.
About this article
Cite this article
Sumner, J.G., Jarvis, P.D. & Holland, B.R. A tensorial approach to the inversion of group-based phylogenetic models. BMC Evol Biol 14, 236 (2014). https://doi.org/10.1186/s12862-014-0236-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12862-014-0236-6