1 Introduction and summary

In chiral effective field theories for low-energy quantum chromodynamics [1] one typically works with special unitary matrices U as the field variable, since this allows for a convenient implementation of (chiral and other) symmetry transformations. The excitations on top of the spontaneously broken groundstate, represented by the unit matrix \(\mathbf {1}\), are pseudoscalar Goldstone bosons (pions, kaons and the \(\eta (548)\)-meson). In the two-flavor case the special unitary \(2\times 2\) matrices that constitute the compact Lie group SU(2) are often given in the exponential form

$$\begin{aligned} U = \exp (i \vec \tau \!\cdot \!\vec \varphi \,) = \cos |\vec \varphi \,|\mathbf{1} + i\vec \tau \!\cdot \! {\hat{\varphi }} \sin |\vec \varphi \,|\,, \end{aligned}$$
(1)

with \(\vec \tau =(\tau _1,\tau _2,\tau _3)\) the Pauli matrices and \(|\vec \varphi \,|\) the magnitude of a three-component real vector \(\vec \varphi =(\varphi _1,\varphi _2,\varphi _3)\) related e.g. to the pion-fields. Since the manifold of SU(2) is identified with a three-sphere \(S^3\), one can provide alternative algebraic or rational parametrizations

$$\begin{aligned}&U = \pm \sqrt{1-\vec \pi ^{\,2}}\,\mathbf{1} + i\vec \tau \!\cdot \! \vec \pi , \quad |\vec \pi \,|\le 1, \nonumber \\&U = {(1-\vec \phi ^{\,2}/4)\mathbf{1}+ i\vec \tau \!\cdot \! \vec \phi \over 1+\vec \phi ^{\,2}/4} , \end{aligned}$$
(2)

which are advantageous in specific calculations or for certain applications. In the three-flavor case, where the matrices U belong to the eight-dimensional compact Lie group SU(3), no such alternatives to parametrize the manifold are known and one has to stay with the exponential form \(U = \exp (i \vec \lambda \!\cdot \! \vec v\,)\) in terms of eight Gell-Mann matrices \(\vec \lambda = (\lambda _1,\dots , \lambda _8)\) and eight real parameters \(\vec v=(v_1,\dots , v_8)\). The aim of the present work is solve the corresponding matrix exponential function. A key ingredient to limit the number of matrix powers to few independent ones is the Cayley-Hamilton relation, which states that any matrix \(\Sigma \) gets nullified when inserted into its characteristic polynomial \(P(z)=\mathrm{det}(z {\mathbf {1}}-\Sigma )\). The coefficients [2] of the latter are given by the traces of increasing matrix powers of \(\Sigma \) and ultimately the determinant of \(\Sigma \). When carrying out this procedure for SU(3), one encounters the problem of determining the roots of a cubic polynomial in the so-called irreducible case. It corresponds to the situation when all three roots are real and the Cardano formula exhibits under the cube-root a square-root with a negative radicand. Then the problem gets effectively solved through a trigonometric ansatz and the trisection of an angle. When continuing the solution of the matrix exponential function to SU(4), the four real roots of a quartic polynomial are determined with the help of a cubic resolvent equation that also belongs to the irreducible case. As an interesting byproduct of this analysis one obtains the shape of the allowed region for certain real-valued invariants \(\eta \) and \(\zeta \). Moreover, be imposing the condition related to a quaternionic structure one can treat as a subgroup of SU(4) the symplectic group Sp(2) with ten real parameters. The situation with more elaborate quasi-analytical formulas for SU(5), SU(6) and Sp(3) matrices in the exponential parametrization is discussed in perspective.

2 Special unitary group SU(3)

One starts with the usual exponential representation of an SU(3) matrix

$$\begin{aligned} U = \exp (i \vec \lambda \!\cdot \! \vec v\,)\,, \end{aligned}$$
(3)

in terms of the eight Gell-Mann matricesFootnote 1\(\vec \lambda = (\lambda _1,\dots , \lambda _8)\) normalized to tr\(( \lambda _a \lambda _b) =2 \delta _{ab}\) and an eight-component real parameter vector \(\vec v = (v_1, \dots ,v_8)\). The aim is to give an analytical expression for U that involves the \(3\!\times \!3\) unit matrix \({\mathbf {1}}\) and the rescaled matrix \(\Sigma = \vec \lambda \!\cdot \!{\hat{v}}\) (with \({\hat{v}} = \vec v/|\vec v\,|\) a point on the seven-sphere \(S^7\)), each multiplied with coefficient functions that depend on the magnitude \(|\vec v\,|=\sqrt{v_1^2+\dots +v_8^2}\) and another invariant. Considering the traces tr\(\,\Sigma =0\) and tr\(\,\Sigma ^2=2\) one finds as a pertinent invariant the (real-valued) determinant \(\eta = \mathrm{det}\,\Sigma \). According to the Cayley-Hamilton relation the traceless hermitian matrix \(\Sigma \) (a Lie algebra element) gets nullified when inserted into its characteristic polynomial, which lead to the cubic relation

$$\begin{aligned} \Sigma ^3 = \Sigma + \eta \, {\mathbf {1}}\,. \end{aligned}$$
(4)

Consequently, any power of \(\Sigma \) can be written as a linear combination of \({\mathbf {1}}, \Sigma \), and \(\Sigma ^2\). Starting at order n with \(\Sigma ^n =\alpha _n \, {\mathbf {1}}+\beta _n\, \Sigma +\gamma _n\, \Sigma ^2 \) and multiplying with \(\Sigma \) one obtains via the relation in Eq. (4) the expansion coefficients at order \(n+1\). The resulting linear recursion relation reads in vector notation

$$\begin{aligned} \left( \! \begin{array} {c} \alpha _{n+1} \\ \beta _{n+1} \\ \gamma _{n+1}\\ \end{array}\!\right) = M_3 \left( \!\begin{array} {c}\alpha _n \\ \beta _n \\ \gamma _n \\ \end{array}\!\right) , \, \text { with the matrix} \quad M_3 = \left( \begin{array} {ccc} 0 &{} 0&{} \eta \\ 1&{} 0&{} 1\\ 0&{}1&{} 0 \\ \end{array}\!\right) , \end{aligned}$$
(5)

and the initial values \(\alpha _0=1, \beta _0=0, \gamma _0=0\). By diagonalization of \(M_3\) the exponential series \(\exp ( i|\vec v\,|M_3) = \sum _{n=0}^\infty ( i|\vec v\,|M_3)^n/n!\) can be solvedFootnote 2 and after vector multiplication with \((1,0,0)^t\) from the right and \(( {\mathbf {1}}, \Sigma ,\Sigma ^2)\) from the left, one ends up with the following analytical formula for an SU(3) matrix

$$\begin{aligned} U= & {} \exp (i \vec \lambda \cdot \vec v\,) \nonumber \\= & {} \sum _{j=1}^3 {\exp (i z_j |\vec v\,|) \over 3z_j^2-1} \Big \{ (z_j^2-1) {\mathbf {1}}+ z_j \, \vec \lambda \cdot {\hat{v}}+ (\vec \lambda \cdot {\hat{v}})^2 \Big \},\nonumber \\ \end{aligned}$$
(6)

where the \(\eta \)-dependent quantities \(z_1,z_2,z_3\) subject to the zero-sum constraint \(z_1+z_2+z_3=0\) and \(z_1^2+z_2^2+z_3^2=2\) are the three real roots of the cubic equation

$$\begin{aligned} P_3(z) = z^3-z-\eta = 0\,. \end{aligned}$$
(7)
Fig. 1
figure 1

Generic behavior of the cubic polynomial \(P_3(z) = z^3-z -\eta \)

Note that at the same time \(z_1,z_2,z_3\) are the eigenvalues of the traceless hermitian matrix \(\Sigma = \vec \lambda \!\cdot \! {\hat{v}}\). One also observes that the trace of U is given by a much simpler formula: tr\(\,U = \sum _{j=1}^3 \exp (i z_j |\vec v\,|)\). Another interesting feature is that the denominator in Eq. (6) is the derivative \(P_3'(z) = 3z^2-1\) evaluated at the root \(z_j\). Before turning to the solution of the cubic equation, one should analyze the generic behavior of the polynomial \(P_3(z)= z^3-z-\eta \) as it is sketched in Fig. 1. When having three zero-crossings, the local maximum at \(z= -1/\sqrt{3}\) must be positive, \(2\sqrt{3}/9-\eta \ge 0\), and the local minimum at \(z= 1/\sqrt{3}\) must be negative, \(-2\sqrt{3}/9-\eta \le 0\). Both conditions together fix the range of the determinant \(\eta \) to the interval

$$\begin{aligned} -{2\sqrt{3} \over 9}\le \eta \le {2\sqrt{3} \over 9}\,. \end{aligned}$$
(8)

In the irreducible case at hand, the cubic equation \(z^3-z-\eta =0\) is treated by the substitution \(z ={2 \over \sqrt{3}} \cos \psi \) which leads (via the relation \(\cos 3\psi = 4\cos ^3\psi -3\cos \psi \)) to the auxiliary equation

$$\begin{aligned} \cos 3\psi = {3\sqrt{3} \eta \over 2}\,, \qquad \psi = {1\over 3} \arccos {3\sqrt{3} \eta \over 2}\,, \end{aligned}$$
(9)

that provides an angle \( \psi \) in the range \(0\le \psi \le \pi /3\). It is an important solvability criterion that the argument in Eq. (9) has indeed magnitude less equal to 1 as a consequence of the determinantal range derived before. The three real roots entering the analytical formula in Eq. (6) are given by

$$\begin{aligned}&z_1 = {2\over \sqrt{3}} \cos \psi >0, \qquad z_2 = -\sin \psi -{\cos \psi \over \sqrt{3}}<0,\nonumber \\&z_3 = \sin \psi -{\cos \psi \over \sqrt{3}}, \end{aligned}$$
(10)

where in this (arbitrary) ordering the sign of \(z_3\) is yet undetermined. At the boundary values \(\eta = \pm 2\sqrt{3}/9\) two roots coincide and the formula in Eq. (6) is evaluated by means of a limiting procedure \(\epsilon \rightarrow 0\)

$$\begin{aligned} \eta= & {} {2 \sqrt{3}\over 9}: \quad z_1={2\over \sqrt{3}},\quad z_{2,3}=-{1\over \sqrt{3}}\mp \epsilon ,\nonumber \\&\qquad 3\Sigma ^2- \sqrt{3} \Sigma = {\mathbf {2}} , \end{aligned}$$
(11)
$$\begin{aligned} \eta= & {} -{2 \sqrt{3}\over 9}: \quad z_{1,3}={1\over \sqrt{3}}\pm \epsilon ,\quad z_2=-{2\over \sqrt{3}},\nonumber \\&\qquad 3\Sigma ^2+ \sqrt{3} \Sigma = {\mathbf {2}} , \end{aligned}$$
(12)

and one recognizes that in these exceptional situations the three matrices \({\mathbf {1}}, \Sigma , \Sigma ^2\) are no more linearly independent. The validity of the quasi-analytical formula has been checked numerically in many cases. A disadvantage for applications is its rather inexplicit dependence on \(\eta \) which cannot be expressed in terms of real-valued algebraic functions, but requires a detour via the real roots of a cubic equation in the irreducible case. For comparison in the case of SU(2) the iteration matrix \(M_2\) is the Pauli matrix \(\tau _1\) and \(\exp ( i |\vec \varphi \,|\tau _1)(1,0)^t = (\cos |\vec \varphi \,|, i\sin |\vec \varphi \,|)^t\).

3 Special unitary group SU(4)

One starts again with the exponential representation of an SU(4) matrix

$$\begin{aligned} U = \exp (i \vec \lambda \!\cdot \! \vec v\,)\,, \end{aligned}$$
(13)

in terms of 15 traceless hermitian generatorsFootnote 3\(\vec \lambda = (\lambda _1,\dots , \lambda _{15})\) normalized to tr\(( \lambda _a \lambda _b) =2 \delta _{ab}\) and a 15-component real parameter vector \(\vec v = (v_1, \dots ,v_{15})\). One works with the \(4\!\times \! 4\) unit matrix \({\mathbf {1}}\) and the rescaled matrix \(\Sigma = \vec \lambda \!\cdot \! {\hat{v}}\) that get multiplied by functions depending on the magnitude \(|\vec v\,|=\sqrt{v_1^2+\dots +v_{15}^2}\) and further invariants. Besides the constant traces tr\(\,\Sigma =0\) and tr\(\,\Sigma ^2=2\) one finds now as the two pertinent (real-valued) invariants the determinant and the trace of the cube

$$\begin{aligned} \eta = \mathrm{det}\,\Sigma \,, \qquad \zeta = {1\over 3} \mathrm{tr}\,\Sigma ^3\,, \end{aligned}$$
(14)

where the factor 1/3 is included for convenience. In the present case the Cayley-Hamilton relation sets up an equation for the fourth power of \(\Sigma \) of the form

$$\begin{aligned} \Sigma ^4 = \Sigma ^2 +\zeta \,\Sigma - \eta \, {\mathbf {1}}\,, \end{aligned}$$
(15)

which allows to write any higher power of \(\Sigma \) as a linear combination of \({\mathbf {1}}, \Sigma , \Sigma ^2\), and \(\Sigma ^3\). Starting at order n with \(\Sigma ^n =\alpha _n \, {\mathbf {1}}+\beta _n\, \Sigma +\gamma _n\, \Sigma ^2 +\delta _n\, \Sigma ^3\) and multiplying with \(\Sigma \) one obtains via the relation in Eq. (15) the expansion coefficients at order \(n+1\). The resulting linear recursion relation reads

$$\begin{aligned} \left( \!\begin{array} {c}\alpha _{n+1} \\ \beta _{n+1} \\ \gamma _{n+1} \\ \delta _{n+1} \\ \end{array}\!\right) = M_4 \left( \!\begin{array} {c}\alpha _n \\ \beta _n \\ \gamma _n \\ \delta _n \\ \end{array}\!\right) , \, \text { with the matrix} \, M_4 = \left( \begin{array} {cccc} 0&{} 0 &{} 0&{} -\eta \\ 1&{} 0&{} 0&{} \zeta \\ 0&{}1&{} 0&{}1\\ 0 &{} 0 &{}1&{}0 \\ \end{array}\!\right) , \end{aligned}$$
(16)

and the initial values \(\alpha _0=1, \beta _0=0, \gamma _0=0, \delta _0=0\). Through diagonalization of \(M_4\) (or an application of MatrixExp[, ]) the exponential series \(\exp ( i|\vec v\,|M_4) = \sum _{n=0}^\infty ( i|\vec v\,|M_4)^n/n!\) can again be solved and by dotting with \((1,0,0,0)^t\) from the right and \(( {\mathbf {1}}, \Sigma ,\Sigma ^2, \Sigma ^3)\) from the left, one ends up with the following quasi-analytical formula for an SU(4) matrix

$$\begin{aligned}&U = \exp (i \vec \lambda \!\cdot \! \vec v\,) = \sum _{j=1}^4 {\exp (i z_j |\vec v\,|) \over 4z_j^3-2z_j-\zeta } \Big \{ (z_j^3-z_j-\zeta ){\mathbf {1}} \nonumber \\&\qquad \qquad \qquad \quad + (z_j^2-1) \vec \lambda \!\cdot \! {\hat{v}}+ z_j (\vec \lambda \!\cdot \! {\hat{v}})^2+ (\vec \lambda \!\cdot \! {\hat{v}})^3 \Big \}, \end{aligned}$$
(17)

where the \((\eta ,\zeta )\)-dependent quantities \(z_1,z_2,z_3,z_4\) subject to the zero-sum constraint \(z_1+z_2+z_3+z_4=0\) and \(z_1^2+z_2^2+z_3^2+z_4^2=2\) are now the four real roots of the quartic equation

$$\begin{aligned} P_4(z) = z^4-z^2- \zeta \,z +\eta = 0\,. \end{aligned}$$
(18)

At the same time \(z_1,z_2,z_3,z_4\) are the eigenvalues of the traceless hermitian \(4\!\times \!4\) matrix \(\Sigma = \vec \lambda \!\cdot \! {\hat{v}}\) and one gets again a simpler formula for the trace: tr\(\,U = \sum _{j=1}^4 \exp (i z_j |\vec v\,|)\). The denominator in Eq. (17) stems from the derivative \(P_4'(z) = 4z^3-2z-\zeta \).

Fig. 2
figure 2

Generic behavior of the cubic resolvent polynomial \(R_3(\theta ) = \theta ^3-2\theta ^2+(1-4\eta )\theta -\zeta ^2\)

Borrowing results from advanced algebra [3], the determination of the four roots \(z_j\) of the quartic equation \(P_4(z)=0\) proceeds via three auxiliary quantities \(\theta _1,\theta _2,\theta _3\) in the following way

$$\begin{aligned} z_1= & {} {1\over 2}\Big (\sqrt{\theta _1}+ \sqrt{\theta _2}+ \sqrt{\theta _3} \Big ), \nonumber \\ z_2= & {} {1\over 2}\Big ( \sqrt{\theta _1}- \sqrt{\theta _2}- \sqrt{\theta _3}\Big ), \end{aligned}$$
(19)
$$\begin{aligned} z_3= & {} {1\over 2}\Big (\sqrt{\theta _2}- \sqrt{\theta _1}- \sqrt{\theta _3} \Big ),\nonumber \\ z_4= & {} {1\over 2}\Big (\sqrt{\theta _3}- \sqrt{\theta _1}- \sqrt{\theta _2}\Big ), \end{aligned}$$
(20)

where it has to be noted that there is only a twofold sign ambiguity in taking square roots, since the sign of the product is fixed by the condition \(\sqrt{\theta _1} \sqrt{\theta _2} \sqrt{\theta _3} = \zeta \). The four choices of signs \(+\,+, +\,-, -\,+, -\,-\) correspond merely to a relabeling of the four roots. The \(\theta \)-values derive from the roots through the inverse relations

$$\begin{aligned}&\theta _1 = -(z_1+z_2)(z_3+z_4), \qquad \theta _2 = -(z_1+z_3)(z_2+z_4), \nonumber \\&\theta _3 = -(z_1+z_4)(z_2+z_3), \end{aligned}$$
(21)

and these are all positive, since each is a square in view of the zero-sum \(z_1+z_2+z_3+z_4=0\). As a matter of fact [3] the three \(\theta \)-values are the roots of the cubic resolvent equation [3]

$$\begin{aligned} R_3(\theta ) = \theta ^3 -2 \theta ^2+(1-4\eta )\theta - \zeta ^2 =0\,. \end{aligned}$$
(22)

The generic behavior of \( R_3(\theta )\) as shown in Fig. 2 implies several restrictions on the invariants \(\eta \) and \(\zeta \). In the presence of three zero-crossing on the positive \(\theta \)-axis and \(R_3(\theta<0)<-\zeta ^2\), the local maximum and local minimum must lie in between at positions \(\theta _\text {max,min} = {1\over 3} (2\mp \sqrt{1+12 \eta }) \ge 0\). This implies first \(\eta \ge -1/12\) and secondly \(\eta \le 1/4\), leading to the (narrow) range \(-1/12\le \eta \le 1/4\) for the determinant. The conditions \(R_3(\theta _\text {max}) \ge 0\) and \(R_3(\theta _\text {min}) \le 0\) multiplied together yield after some manipulation the inequality

$$\begin{aligned} \Big ({27\over 2}\zeta ^2+36 \eta -1\Big )^2 \le (1+12\eta )^3 \le 64\,. \end{aligned}$$
(23)

The resulting allowed range for the invariants \(\eta \) and \(\zeta \) is the bounded region shown in Fig. 3 from which one deduces also the extremal values \(\zeta _\pm = \pm 2 \sqrt{6}/9\).

Fig. 3
figure 3

The allowed values of the invariants \(\eta \) and \(\zeta \) lie inside the bounded region. The enclosed area in the \(\eta \zeta \)-plane amounts to \(8\sqrt{2}/105\), and the circumference of the tricorn measures 2.3847

The cubic resolvent equation in Eq. (22) belongs again to the irreducible case such that its solutions are conveniently obtained via the substitution \(\theta = {2\over 3}(1+\sqrt{1+12\eta } \cos \psi )\). The auxiliary angle \(\psi \) lying within the interval \([0,\pi /3]\) is determined from the equation

$$\begin{aligned} \cos 3\psi = {{27\over 2}\zeta ^2+36 \eta -1\over (1+12\eta )^{3/2}}\,, \, \psi = {1\over 3} \arccos {{27\over 2}\zeta ^2+36 \eta -1\over (1+12\eta )^{3/2}}\,, \end{aligned}$$
(24)

where solvability is guaranteed by the inequality derived previously in Eq. (23). The three positive \(\theta \)-values read (up to permutation of the indices)

$$\begin{aligned}&\theta _1 = {2\over 3}\Big (1+\sqrt{1+12\eta } \cos \psi \Big ), \nonumber \\&\theta _2 = {2\over 3}\Big [1-\sqrt{1+12\eta } \sin \Big (\psi +{\pi \over 6}\Big )\Big ] , \nonumber \\&\theta _3 = {2\over 3}\Big [1+\sqrt{1+12\eta } \sin \Big (\psi -{\pi \over 6}\Big )\Big ] , \end{aligned}$$
(25)

and after taking square roots and forming appropriate sums and differences as prescribed in Eqs. (19, 20) one obtains the four real roots \(z_j\) entering the quasi-analytical formula Eq. (17) for an SU(4) matrix.

One can continue the procedure to SU(5) with 24 traceless hermitian generatorsFootnote 4\(\vec \lambda = (\lambda _1,\dots , \lambda _{24})\) normalized to tr\(( \lambda _a \lambda _b) =2 \delta _{ab}\) and a 24-component real parameter vector \(\vec v = (v_1, \dots ,v_{24})\). In terms of the invariants \(\eta =\mathrm{det}\,\Sigma , ~\zeta =\mathrm{tr}\,\Sigma ^3/3\), and a properly chosen new one, namely \(\xi = \mathrm{tr}\,\Sigma ^4/4-1/2\), the Cayley-Hamilton relation for the fifth power of \(\Sigma = \vec \lambda \!\cdot \! {\hat{v}}\) reads now

$$\begin{aligned} \Sigma ^5= \Sigma ^3 +\zeta \, \Sigma ^2+\xi \,\Sigma +\eta \, {\mathbf {1}}\,. \end{aligned}$$
(26)

By setting up the linear recursion for five expansion coefficients and solving the matrix exponential function \(\exp ( i|\vec v\,|M_5)\) one arrives at the following quasi-analytical formula for an SU(5) matrix

$$\begin{aligned}&U = \exp (i \vec \lambda \!\cdot \! \vec v\,) = \!\sum _{j=1}^5 {\exp (i z_j |\vec v\,|) \over 5z_j^4\!-\!3z_j^2\!-\!2\zeta z_j\!-\!\xi } \Big \{ (z_j^4\!-\!z_j^2\!-\!\zeta z_j\!-\!\xi ){\mathbf {1}}\nonumber \\&\qquad +(z_j^3\!-\!z_j\!-\!\zeta )\vec \lambda \!\cdot \! {\hat{v}}+ (z_j^2\!-\!1) (\vec \lambda \!\cdot \! {\hat{v}})^2+ z_j (\vec \lambda \!\cdot \! {\hat{v}})^3+ (\vec \lambda \!\cdot \! {\hat{v}})^4 \Big \}.\nonumber \\ \end{aligned}$$
(27)

Here \(z_1,z_2,z_3,z_4,z_5\) are the five real roots (with zero sum and \(\sum _{j=1}^5z_j^2=2\)) of the quintic polynomial equation

$$\begin{aligned} P_5(z) = z^5 -z^3-\zeta z^2-\xi z-\eta = 0\,, \end{aligned}$$
(28)

whose determination with their detailed \((\eta ,\zeta ,\xi )\)-dependence is a formidable task. As the denominator in Eq. (27) one recognizes again the derivative \(P_5'(z) = 5z^4 -3z^2-2\zeta z-\xi \) evaluated at the root \(z_j\).

In perspective one can consider SU(6) with 35 traceless hermitian generators, \(\vec \lambda = (\lambda _1, \dots ,\lambda _{35})\) normalized to tr\(( \lambda _a \lambda _b) =2 \delta _{ab}\) (the last one reads \(\lambda _{35} =\mathrm{diag}(1,1,1,1,1,-5)/\sqrt{15}\)) and a 35-component real parameter vector \(\vec v = (v_1, \dots ,v_{35})\). The characteristic polynomial that nullifies the rescaled matrix \(\Sigma = \vec \lambda \!\cdot \! {\hat{v}}\) is of degree six

$$\begin{aligned} P_6(z) = z^6-z^4-\zeta z^3-\xi z^2+(\zeta - \chi )z+\eta \,, \end{aligned}$$
(29)

with a new invariant \(\chi =\mathrm{tr}\,\Sigma ^5/5\). The quasi-analytical formula for SU(6) matrices is analogous to Eq. (27) and it involves a sum over the six real roots defined by \(P_6(z_j)=0\). The denominator in the formula is \(P_6'(z_j) = 6z_j^5-4z_j^3-3\zeta z_j^2-2\xi z_j+\zeta - \chi \) and the coefficients of the expansion with respect to \(\mathbf {1}\) and increasing powers of \( \vec \lambda \cdot {\hat{v}}\) (up to the fifth power) are \(z^5-z^3-\zeta z^2-\xi z +\zeta -\chi ,\) \(z^4-z^2-\zeta z-\xi \,,\) \(z^3-z-\zeta \,,\) \(z^2-1\,,\) \(z\,,\) \(1\,,\) respectively, each evaluated at the real root \(z_j\).

4 Symplectic group Sp(2)

The 15-dimensional special unitary group SU(4) contains a particular 10-dimensional subgroup, the so called (compact) symplectic group Sp(2) defined by imposing the condition (of respecting a quaternionic structure [4])

$$\begin{aligned} U^t J \,U = J\,, \qquad J = \left( \begin{array} {cc} \mathbf {0} &{} -\mathbf {1} \\ \mathbf {1}&{} \mathbf {0}\\ \end{array}\!\right) ,\qquad J^2 = -\left( \begin{array} {cc} \mathbf {1} &{} \mathbf {0} \\ \mathbf {0}&{} \mathbf {1}\\ \end{array}\!\right) , \end{aligned}$$
(30)

on \(U\in SU(4)\), where \(^t\) stands for transposition, and \(\mathbf {1}\) and \(\mathbf {0}\) denote momentarily the \(2\!\times \!2\) unit and zero matrix. For the Lie algebra elements \(\vec \lambda \!\cdot \!\vec v\) that depend linearly on 15 parameters \(v_1,\dots ,v_{15}\) this implies the constraint

$$\begin{aligned} (\vec \lambda \!\cdot \!\vec v\,)^t = J (\vec \lambda \!\cdot \!\vec v\,) J\,, \end{aligned}$$
(31)

which as a result eliminates five of the 15 real parameters through the linear relations

$$\begin{aligned}&v_9 = v_6, \qquad v_{10}=v_7,\qquad v_{13}=-v_1, \qquad v_{14}=v_2,\nonumber \\&v_{15}={1 \over \sqrt{2}}\big (v_8-\sqrt{3}v_3\big ). \end{aligned}$$
(32)

The squared magnitude of the yet 15-component parameter vector \(\vec v\) becomes a sum of ten squares

$$\begin{aligned} |\vec v\,|^2= & {} 2v_1^2+2v_2^2+3\tilde{v}_3^2+v_4^2+v_5^2+2v_6^2+2v_7^2\nonumber \\&+{\tilde{v}}_8^2+v_{11}^2+v_{12}^2, \end{aligned}$$
(33)

after introducing the linear combinations \(\tilde{v}_3=(\sqrt{3}v_3-v_8)/2\) and \({\tilde{v}}_8=(v_3+\sqrt{3}v_8)/2\). Moreover, one finds that the trace of the cube vanishes, tr\((\vec \lambda \!\cdot \!\vec v\,)^3=0\), just as a consequence of the five linear relations in Eq. (32) or the underlying constraint in Eq. (31). Thus one is dealing for the subgroup Sp(2) of SU(4) with the special case \(\zeta =0\) and the quartic polynomial \(P_4(z) =z^4-z^2+\eta \) becomes biquadratic, and is effectively equivalent to \({\tilde{P}}_2(x) = x^2-x+\eta \). The four real roots are then given by

$$\begin{aligned}&z_{1,3} = \pm \sqrt{x_1}, \qquad x_1= {1\over 2}\big (1-\sqrt{1-4\eta }\,\big )\ge 0,\nonumber \\&z_{2,4} = \pm \sqrt{x_2}, \qquad x_2= {1\over 2}\big (1+\sqrt{1-4\eta }\,\big ) \ge 0, \end{aligned}$$
(34)

where the determinant \(\eta \) must be confined to the interval \(0\le \eta \le 1/4\). Since for Sp(2) the roots of \(P_4(z)\) occur as pairs of opposite sign, the sum in Eq. (17) can be simplified to

$$\begin{aligned}&U = \exp (i\vec \lambda \!\cdot \!\vec v\,) = \sum _{j=1}^2 {(x_j-1){\mathbf {1}}+(\vec \lambda \!\cdot \!{\hat{v}})^2 \over 2x_j-1}\nonumber \\&\qquad \times \Big \{\cos \left( \sqrt{x_j} |\vec v\,|\right) {\mathbf {1}} +{i\over \sqrt{x_j}} \sin \left( \sqrt{x_j} |\vec v|\right) \, \vec \lambda \!\cdot \!{\hat{v}}\Big \} , \end{aligned}$$
(35)

while the trace of such symplectic matrices is always real-valued: tr\(\,U = 2 \sum _{j=1}^2 \cos ( \sqrt{x_j} |\vec v\,|)\).

In perspective one can consider the 21-dimensional symplectic group Sp(3) by imposing on SU(6) matrices the condition \(U^t\,J \, U = J\), with J constructed from \(3\!\times \!3\) matrices \(\mathbf {1}\) and \(\mathbf {0}\). The condition \((\vec \lambda \!\cdot \!\vec v\,)^t = J (\vec \lambda \!\cdot \!\vec v\,) J\) for the Lie algebra elements eliminates 14 of the original 35 parameters through linear relations, where the more interesting ones associated to the diagonal generators readFootnote 5

$$\begin{aligned}&v_{24}= {1\over \sqrt{5}} \big (\sqrt{3} v_{15} - 2 \sqrt{2} v_3\big )\,,\nonumber \\&v_{35}= {1\over 2\sqrt{5}} \big (2 \sqrt{2} v_{15} - 5 v_8 - \sqrt{3} v_3\big ). \end{aligned}$$
(36)
Fig. 4
figure 4

The allowed values of the invariants \(\xi \) and \(\eta \) for Sp(3) lie between the two curves starting at the point \(\xi =-1/3, \eta =-1/27\). The enclosed area in the \(\xi \eta \)-plane amounts to 1/480, and the circumference of the tricorn measures 0.67763

As a consequence of the imposed condition the traces of odd powers of \(\vec \lambda \!\cdot \!{\hat{v}}\) vanish, and therefore one is dealing for the subgroup Sp(3) of SU(6) with the special case \(\zeta =0,\) \(\chi =0\). The six real roots come as pairs with opposite sign, \(\pm \sqrt{x_j}\), where \(x_j\) are the three positive roots of the cubic polynomial \({\tilde{P}}_3(x) = x^3-x^2-\xi x+\eta \). The further analysis based on a behavior of \({\tilde{P}}_3(x)\) similar to that shown in Fig. 2 leads to the following inequalities

$$\begin{aligned} -{1\over 3}\le \xi \le 0\,,\quad \eta \le 0\,, \qquad (2-27\eta +9\xi )^2\le 4 (1+3\xi )^3\,. \end{aligned}$$
(37)

The resulting allowed range of the invariants \(\xi \) and \(\eta \) is shown in Fig. 4 and one recognizes as the minimal value \(\eta _\mathrm{min} = -1/27\). Making the substitution \(x={1\over 3}(1+2\sqrt{1+3\xi }\cos \psi )\) one obtains for the three positive roots the expressions

$$\begin{aligned}&x_1={1\over 3}\Big (1+2\sqrt{1+3\xi }\cos \psi \Big ), \nonumber \\&x_2={1\over 3}\Big [1-2\sqrt{1+3\xi }\sin \Big (\psi +{\pi \over 6}\Big )\Big ] , \nonumber \\&x_3={1\over 3}\Big [1+2\sqrt{1+3\xi }\sin \Big (\psi -{\pi \over 6}\Big )\Big ], \end{aligned}$$
(38)

with the angle \(\psi \in [0,\pi /3]\) given by

$$\begin{aligned} \psi = {1\over 3}\arccos {2-27\eta +9\xi \over 2(1+3\xi )^{3/2}}\,. \end{aligned}$$
(39)

In the end the semi-analytical formula for Sp(3) matrices reads

$$\begin{aligned}&U = \exp (i\vec \lambda \!\cdot \!\vec v\,) =\sum _{j=1}^3 {1 \over 3x_j^2-2x_j-\xi } \nonumber \\&\qquad \times \Big \{\cos ( \sqrt{x_j} |\vec v\,|) {\mathbf {1}} +{i\over \sqrt{x_j}} \sin (\sqrt{x_j} |\vec v\,|)\,\vec \lambda \!\cdot \!{\hat{v}}\Big \}\nonumber \\&\qquad \times \Big \{ (x_j^2\!-\!x_j\!-\!\xi ) {\mathbf {1}} +(x_j-1)(\vec \lambda \!\cdot \!{\hat{v}})^2+ (\vec \lambda \!\cdot \!{\hat{v}})^4\Big \} , \end{aligned}$$
(40)

where the trace tr\(\,U = 2\sum _{j=1}^3 \cos ( \sqrt{x_j} |\vec v\,|) \) is again real-valued.

In passing one reminds that in low dimensions the spin groups Spin(n), defined as the two-sheeted simply-connected coverings of the special orthogonal goups SO(n), obey the following isomorphisms

$$\begin{aligned}&Spin(3)= SU(2), \qquad Spin(4) = SU(2) \times SU(2), \nonumber \\&Spin(5) =Sp(2), \qquad \, Spin(6) = SU(4), \end{aligned}$$
(41)

together with \(Sp(1)= SU(2)\). For all these compact Lie groups the analytical evaluation of the matrix exponential function has been studied in this work. Actually, the obtained formula can be evaluated most straightforwardly for the symplectic group Sp(2), whereas in the other cases one has to make a (somewhat) cumbersome detour via the three real roots of a cubic polynomial equation.