Abstract
We present a number of new contributions to the topic of constructing efficient higher-order splitting methods for the numerical integration of evolution equations. Particular schemes are constructed by setting up and solving polynomial systems for the splitting coefficients. To this end we use and modify a recent approach for generating these systems for a large class of splittings. In particular, various types of pairs of schemes intended for use in adaptive integrators are constructed.
Introduction
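Operator splitting techniques for the efficient numerical integration of evolution equations
$$\begin{aligned} \partial _t u(t) = F(u(t)), \quad t \ge 0, \quad u(0)~\text {given}, \end{aligned}$$(1.1)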
have become increasingly popular in recent years. Splitting the right-hand side F(u) into two or more components in an appropriate way enables efficient and accurate approximations. In particular, a number of higher-order schemes with real or complex coefficients have been constructed and analyzed. Relevant contributions to this field can be found, e.g., in [7, 8, 10, 12, 13, 15, 19–21, 23]. Furthermore, applications to particular problem classes have been studied in the literature where the vector field F has special properties, such that splitting methods can be tuned for such cases. In [9] and [18], for instance, perturbations of integrable systems have been considered, say \( F(u) = A(u) + \varepsilon \,B(u) \) where \( \varepsilon \) is a small perturbation parameter. Exploiting this perturbation structure allows the construction of more efficient (de facto) higher-order schemes compared to generic ones.
Overview
We present some new contributions to the topic of splitting methods; here we will concentrate on the generic case, i.e., no special properties of the vector field F are assumed. First we review the approach from [1] for the automatic setup of order conditions represented by polynomial equations in the coefficients to be determined. Special cases involving symmetries or composition methods based on lower-order schemes can be treated as well. Splitting of the right-hand side of (1.1) into two or three components is considered.
The goal is to identify good schemes of a desired order p . ‘Good’ refers to a compromise between efficiency (minimizing effort) and accuracy (minimizing a measure for the expected behavior of the local error). In particular, we focus on the construction of pairs of schemes of orders \( (p,p+1) \), where a scheme of order p acts as a ‘worker’, while a related scheme of order \( p+1 \) plays the role of a ‘controller’ for the purpose of practical local error estimation. The idea of using pairs of embedded schemes (an idea related to Runge–Kutta pairs) is due to [17]. Via more flexible embeddings, optimized variants can be constructed. Here, ‘optimization’ means searching for schemes where a reasonable measure for the behavior of the local error becomes minimal among a set of comparable schemes. It is well known that this is a very relevant point, because such local error measures may vary over several orders of magnitude. We also consider alternative ways of choosing \( (p,p+1) \) pairs, e.g., adjoint pairs.
Concerning the search for optimal solutions for a given set of order conditions (see Sect. 4), different techniques were applied, depending on the particular case at hand, including exact, symbolic solution representations using^{Footnote 1} Maple (for lower-order schemes), or numerical searches using optimization tools or straightforward Monte Carlo techniques.
The ultimate purpose is adaptive integration of evolution equations based on a reliable local error control. This topic has been studied in detail, in particular in the context of Schrödinger equations, in [2, 4–6]. In these papers, an alternative method for local error estimation has been constructed and analyzed. It is based on a computable high order approximation of an integral representation of the local error in terms of the defect of the numerical solution. While this approach is rather universal and useful in several cases, the alternative of using optimized pairs of schemes, if applicable, will usually be more efficient.
In Part II of this work we will present a detailed study of adaptive integration, using both approaches for local error estimation, for different types of linear and nonlinear evolution equations.
Remark 1
Recently we became aware of the paper [9], where a method of deriving order conditions has been proposed which is similar to our approach. Both approaches are based on the notion of a Lyndon basis (also called Lyndon–Shirshov basis) in a free Lie algebra. In view of the similarities between our work and [9], we stress that we have implemented a fully automatic computational procedure for deriving order conditions which requires no extra analytical hand work. This is a versatile implementation, and it can easily be adapted to cover special cases like palindromic schemes, flexible embeddings, and also splitting into more than two operators (see Sects. 2, 3).
The procedure for setting up higher order conditions involves the generation of long weighted sums of power products of noncommuting variables representing the components of the split vector fields. These sums can easily be distributed in order to obtain a significant speedup in a parallel environment, and we have realized such a version.
Problem setting and notation
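For an evolution equation (1.1) where the right-hand side is split into two components,
$$\begin{aligned} F(u) = A(u) + B(u), \end{aligned}$$(1.2)
a single step of a multiplicative splitting scheme, starting from u and over a step of length h , is given by^{Footnote 2}
$$\begin{aligned} \mathscr {S}(h,u) = \mathscr {S}_s(h,\mathscr {S}_{s-1}(h,\ldots ,\mathscr {S}_1(h,u))) \approx \phi _F(h,u), \end{aligned}$$(1.3a)
with
$$\begin{aligned} \mathscr {S}_j(h,v) = \phi _B(b_j\,h,\phi _A(a_j\,h,v)), \end{aligned}$$(1.3b)
and appropriate coefficients \( a_j,b_j \). Here \( \phi _F \), \( \phi _A \) and \( \phi _B \) denote the flows associated with F , A and B , respectively. More general schemes based on splitting into three operators are also considered, see Sect. 2.4, and a special case of additive splitting is also included, see Sect. 2.2.
The local error of a splitting step is denoted by
$$\begin{aligned} \mathscr {L}(h,u) = \mathscr {S}(h,u) - \phi _F(h,u). \end{aligned}$$(1.4)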
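To make the composition of stages in (1.3) concrete, the following minimal Python sketch performs one multiplicative splitting step, applying in each stage first the A-flow over \( a_j h \) and then the B-flow over \( b_j h \). The function names, the harmonic-oscillator test problem and the choice of the Strang scheme as a two-stage example are illustrative assumptions and are not taken from the paper.

```python
import math

def splitting_step(u, h, a, b, flow_A, flow_B):
    """One step S(h,u): for j = 1..s apply the A-flow over a_j*h, then the B-flow over b_j*h."""
    for aj, bj in zip(a, b):
        u = flow_B(bj * h, flow_A(aj * h, u))
    return u

# Illustrative test problem (an assumption): q' = p, p' = -q, split into
# A: q' = p (p frozen) and B: p' = -q (q frozen); both sub-flows are exact.
def flow_A(t, u):
    q, p = u
    return (q + t * p, p)

def flow_B(t, u):
    q, p = u
    return (q, p - t * q)

def exact_flow(t, u):
    q, p = u
    return (q * math.cos(t) + p * math.sin(t), -q * math.sin(t) + p * math.cos(t))

def local_error(h, a, b, u=(1.0, 0.0)):
    """Maximum norm of S(h,u) - phi_F(h,u)."""
    s = splitting_step(u, h, a, b, flow_A, flow_B)
    e = exact_flow(h, u)
    return max(abs(x - y) for x, y in zip(s, e))

# Strang splitting written as a two-stage scheme of order p = 2
strang_a, strang_b = (0.5, 0.5), (1.0, 0.0)
ratio = local_error(0.1, strang_a, strang_b) / local_error(0.05, strang_a, strang_b)
print(ratio)  # close to 2**3, consistent with a local error of size O(h^3)
```

Halving the step size reduces the local error by a factor of about 8, as expected for a scheme of order 2.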
Contents
In Sects. 2 and 3 we describe our approach for setting up the order conditions for different types of [pairs of] schemes. Some technical details concerning implementation of this setup procedure are given in Sect. 4. By solving the resulting polynomial systems we have constructed a number of new variants, and we have compiled a collection of practically relevant (old and new) schemes and pairs of schemes up to order \( p=6 \). This collection can be found at
and is also expected to be extended in the future, depending on further investigations on the topic at hand. We will refer to this webpage throughout as reference [3] to avoid listing coefficients in the present paper for the sake of brevity. Some remarks on the schemes collected in [3] are given in Sect. 5; for more detailed information about the properties of the various schemes we also refer to [3]. In Sect. 7 we present a numerical example.
Order conditions
Many authors have contributed to the topic of finding good methods. For an overview on the topic see [7, 20]. Here we do not attempt to describe the relevant approaches and results in detail but mainly refer to work related to our present activity. For the relevant mathematical background we refer to [7, 15, 20].
Among many others, [8, 10, 12, 13] are devoted to the construction of optimal higher-order methods with real or complex coefficients, either via composition or by solving a set of order conditions generated in different ways. Order conditions take the form of a polynomial system in the unknown coefficients or composition weights \( \omega _\mu \), see Sect. 2.2. In the following we recapitulate and illustrate by examples how order conditions can be set up according to [1]; as mentioned before, this is similar to one of the approaches taken in [9]. Later on we will also present optimized schemes and pairs of schemes obtained on the basis of this approach, where ‘optimized’ means that a measure for the local error is chosen as small as possible.
Setup of order conditions
There are different ways to generate a polynomial system representing the conditions on the splitting coefficients for a desired order p . An essential theoretical basis is the well-known Baker–Campbell–Hausdorff (BCH) formula, see for example [15].
The approach proposed in [1], which we follow here, also relies on the BCH formula, but order conditions are set up in a completely automatic way. Most of the schemes and pairs of schemes specified in [3] have been obtained on the basis of the algorithm from [1]. In the following we explain and illustrate this approach by means of examples. For the purpose of generating order conditions it is sufficient to consider the case of a linear operator split into two parts A and B . We denote
$$\begin{aligned} A_j = a_j\,A, \quad B_j = b_j\,B, \quad j=1 \ldots s. \end{aligned}$$
For the linear case the local error (1.4) is of the form \( \mathscr {L}(h)\,u \) with a linear operator \( \mathscr {L}(h) \).
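Consider the Taylor expansion of the local error^{Footnote 3} of a one-step method starting at u ,
$$\begin{aligned} \mathscr {L}(h) = \sum _{q=1}^{p} \frac{h^q}{q!}\,\tfrac{\mathrm{d}^{q}}{\mathrm{d}h^{q}}\,\mathscr {L}(0) + {\mathscr {O}}(h^{p+1}). \end{aligned}$$
The method is of order p iff \( \mathscr {L}(h) = {\mathscr {O}}(h^{p+1}) \); thus the conditions for order p are given by
$$\begin{aligned} \tfrac{\mathrm{d}}{\mathrm{d}h}\,\mathscr {L}(0) = \cdots = \tfrac{\mathrm{d}^{p}}{\mathrm{d}h^{p}}\,\mathscr {L}(0) = 0. \end{aligned}$$(2.2)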
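For the case of a splitting method we have (with \( {\varvec{k}}=(k_1,\ldots ,k_s) \in \mathbb N_0^s \))
$$\begin{aligned} \tfrac{\mathrm{d}^{q}}{\mathrm{d}h^{q}}\,\mathscr {L}(0) = \sum _{|{\varvec{k}}|=q} \left( {\begin{array}{c}q\\ {\varvec{k}}\end{array}}\right) \prod _{j=s \ldots 1} \sum _{l=0}^{k_j} \left( {\begin{array}{c}k_j\\ l\end{array}}\right) B_j^{\,l}\,A_j^{\,k_j-l} - (A+B)^q, \end{aligned}$$(2.3)
where the first factor is the multinomial coefficient \( q!/(k_1! \cdots k_s!) \) and the factors of the product are ordered from \( j=s \) (left) to \( j=1 \) (right). This follows from the Leibniz rule applied to the product of the exponentials \( \mathrm{e}^{hB_j}\,\mathrm{e}^{hA_j} \).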
If the conditions (2.2) are satisfied up to a given order p , then the leading term of the local error is given by \( \frac{h^{p+1}}{(p+1)!}\,\tfrac{\mathrm{d}^{p+1}}{\mathrm{d}h^{p+1}}\,\mathscr {L}(0) \). This leading error term is a linear combination of higher-order commutators of the operators A and B . As explained in [1], a nonredundant set of order conditions can be built in a recursive way by generating the symbolic expressions (2.3) for \( q=1,2,3,\ldots \) in terms of formally linear but noncommuting operators A, B , and identifying coefficients associated with power products of A- and B-factors which uniquely identify commutators out of an appropriate basis of Lie elements. For this purpose we use the so-called Lyndon basis, also called Lyndon–Shirshov basis, of the free Lie algebra generated by A and B . The elements of this basis are represented by the (associative) Lyndon words over the alphabet \( \{\mathtt {A,B}\} \), see Table 1.
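As an illustration of how a table of Lyndon words can be produced, the following Python sketch enumerates all Lyndon words up to a given length with Duval's algorithm. This is a generic textbook method chosen here for illustration; it is not claimed to be the generator used for Table 1, and the function name is hypothetical.

```python
from collections import Counter

def lyndon_words(alphabet, max_len):
    """Yield all Lyndon words of length <= max_len in lexicographic order (Duval's algorithm)."""
    k = len(alphabet)
    w = [-1]
    while w:
        w[-1] += 1                      # increment the last letter
        yield "".join(alphabet[i] for i in w)
        m = len(w)
        while len(w) < max_len:         # extend periodically up to max_len
            w.append(w[-m])
        while w and w[-1] == k - 1:     # strip trailing maximal letters
            w.pop()

words = list(lyndon_words("AB", 5))
counts = Counter(len(w) for w in words)
length5 = [w for w in words if len(w) == 5]
print(dict(sorted(counts.items())))
print(length5)
```

For the alphabet \( \{\mathtt {A,B}\} \) this yields 2, 1, 2, 3 and 6 words of lengths 1 to 5, so a scheme of order 5 generically has to satisfy 14 conditions.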
Let us first illustrate the procedure by means of a simple example.
Example 1
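For \( s=2 \) we have
$$\begin{aligned} \tfrac{\mathrm{d}}{\mathrm{d}h}\,\mathscr {L}(0)&= (a_1+a_2)\,A + (b_1+b_2)\,B - (A+B), \end{aligned}$$(2.4a)
$$\begin{aligned} \tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0)&= (a_1+a_2)^2 A^2 + 2\,a_2\,b_1\,AB + 2\,(a_1 b_1 + a_1 b_2 + a_2 b_2)\,BA + (b_1+b_2)^2 B^2 - (A+B)^2. \end{aligned}$$(2.4b)
The basic consistency condition for order \( p=1 \) is \( \tfrac{\mathrm{d}}{\mathrm{d}h} \mathscr {L}(0)=0 \), which is equivalent to \( a_1+a_2=1 \) and \( b_1+b_2=1 \). Assuming these first-order conditions are satisfied, the second derivative \( \tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0) \), which now represents the leading error term, simplifies to the commutator expression
$$\begin{aligned} \tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0) = (2\,a_2\,b_1 - 1)\,[A,B], \end{aligned}$$(2.5)
giving the additional condition \( 2\,a_2\,b_1 = 1\) for order \( p=2 \). Assuming now that the conditions for \( p=2 \) are satisfied, the third derivative \( \tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0) \), which will now represent the leading error term, is a linear combination of the commutators [A, [A, B]] and [[A, B], B] , namely
$$\begin{aligned} \tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0) = (3\,a_2^2\,b_1 - 1)\,[A,[A,B]] + (3\,a_2\,b_1^2 - 1)\,[[A,B],B]. \end{aligned}$$(2.6)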
This computation can be automatized:

Generate the representation (2.4a) of \( \tfrac{\mathrm{d}}{\mathrm{d}h} \mathscr {L}(0) \) and extract coefficients of the Lyndon words \( \mathtt {A} \) and \( \mathtt {B} \). This gives the first-order conditions \( a_1+a_2=1 \) and \( b_1 + b_2= 1 \).

Generate the representation (2.4b) of \( \tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}} \mathscr {L}(0) \). For a solution of the equations for order 1, the leading local error will have the form \( \frac{h^2}{2}\,\tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0) \) with \( \tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0) \) from (2.5). The coefficient of [A, B] in (2.5) is determined by extracting the coefficient of the Lyndon word \( \mathtt {AB} \) in (2.4b). This gives the equation \( 2\,a_2\,b_1 = 1 \) which, together with the firstorder conditions, represents a set of conditions for order \( p=2 \).

Generate the representation of \( \tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}} \mathscr {L}(0) \) (we do not display it here). For a solution of the equations for order 2, the leading local error will have the form \( \frac{h^3}{6}\,\tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0) \) with \( \tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0) \) from (2.6). The coefficients of [A, [A, B]] and [[A, B], B] in (2.6) are determined by extracting the coefficients of the Lyndon words \( \mathtt {AAB} \) and \( \mathtt {ABB} \) in the expression for \( \tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}} \mathscr {L}(0) \).
In the simple case considered here, there is a one-dimensional manifold of solutions for order \( p=2 \), and for each solution \( \{a_1,a_2,b_1,b_2\} \) the size of the coefficients in (2.6) is a quality measure.
If a scheme of order 3 is desired, the system of equations is augmented by the further equations \( 3\,a_2^2\,b_1 = 1 \) and \( 3\,a_2\,b_1^2 = 1 \). (For the case \( s=2 \) displayed here, the resulting system of equations has no solution; we need \( s \ge 3 \).)
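The coefficient extraction of Example 1 can be checked with a few lines of exact arithmetic. The sketch below is not the Maple implementation described in this paper; it is a minimal stand-in that represents noncommutative polynomials in A and B as dictionaries from words to rational coefficients, with the power of h identified with the word length. All function names are hypothetical, and the Strang coefficients \( a=(\tfrac{1}{2},\tfrac{1}{2}) \), \( b=(1,0) \) are chosen as one particular solution of the conditions for order 2.

```python
from fractions import Fraction
from itertools import product
from math import factorial

def mul(p, q, max_deg):
    """Product of two noncommutative polynomials (dict: word -> coefficient), truncated at max_deg."""
    out = {}
    for u, cu in p.items():
        for v, cv in q.items():
            if len(u) + len(v) <= max_deg:
                out[u + v] = out.get(u + v, 0) + cu * cv
    return {w: c for w, c in out.items() if c != 0}

def exp_letter(letter, coeff, max_deg):
    """Truncated series of exp(h*coeff*letter); the power of h equals the word length."""
    c = Fraction(coeff)
    terms = {letter * k: c ** k / factorial(k) for k in range(max_deg + 1)}
    return {w: v for w, v in terms.items() if v != 0}

def exp_sum(letters, max_deg):
    """Truncated series of exp(h*(A+B)): every word of length k has coefficient 1/k!."""
    return {"".join(w): Fraction(1, factorial(k))
            for k in range(max_deg + 1) for w in product(letters, repeat=k)}

def local_error_series(a, b, max_deg):
    """Series of L(h) = e^{h b_s B} e^{h a_s A} ... e^{h b_1 B} e^{h a_1 A} - e^{h(A+B)}."""
    S = {"": Fraction(1)}
    for aj, bj in zip(a, b):
        stage = mul(exp_letter("B", bj, max_deg), exp_letter("A", aj, max_deg), max_deg)
        S = mul(stage, S, max_deg)  # later stages act from the left
    E = exp_sum("AB", max_deg)
    L = {w: S.get(w, 0) - E.get(w, 0) for w in set(S) | set(E)}
    return {w: c for w, c in L.items() if c != 0}

def lam(L, word):
    """Coefficient of `word` in the derivative of order len(word) of L at h = 0."""
    return factorial(len(word)) * L.get(word, 0)

a = [Fraction(1, 2), Fraction(1, 2)]
b = [Fraction(1), Fraction(0)]
L = local_error_series(a, b, 3)
print(sorted(len(w) for w in L))                     # only words of length 3 remain
print(lam(L, "AAB"), 3 * a[1] ** 2 * b[0] - 1)       # coefficient of the Lyndon word AAB
print(lam(L, "ABB"), 3 * a[1] * b[0] ** 2 - 1)       # coefficient of the Lyndon word ABB
```

All terms of degree 1 and 2 cancel, and the extracted coefficients of \( \mathtt {AAB} \) and \( \mathtt {ABB} \) agree with \( 3\,a_2^2\,b_1 - 1 \) and \( 3\,a_2\,b_1^2 - 1 \), i.e., with the two additional equations for order 3 stated above.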
In general, for arbitrary s and p , this procedure is continued up to the desired order, by ‘implicit recursive elimination’ as described in [1], automatically producing a generically nonredundant set of order conditions for a desired order p . This process is based on a special bijection between (associative) Lyndon words and bracketed, nonassociative versions of these words which, in our context, are identified with higherorder commutators representing basis elements for the free Lie algebra generated by A and B . The expanded version of such a commutator is a Lie polynomial in terms of the noncommutative variables A and B . The essential point is that its leading monomial, with respect to (alphabetically increasing) lexicographical order, is precisely the monomial represented by the corresponding Lyndon word; see [11].
In the following, the relation ‘\(\mathtt{<}\)’ refers to lexicographical order of words over the alphabet \( \{\mathtt {A,B}\} \).
Example 2
Consider a scheme of order \( p=4 \), i.e., assume that the conditions up to order \( p=4 \) are satisfied. Then, \( \tfrac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\,\mathscr {L}(0) \) is a linear combination of commutators, or nonassociative words, listed below and represented by the six Lyndon words of length 5 (see Table 1),
$$\begin{aligned} \mathtt {AAAAB},~~ \mathtt {AAABB},~~ \mathtt {AABAB},~~ \mathtt {AABBB},~~ \mathtt {ABABB},~~ \mathtt {ABBBB}. \end{aligned}$$
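The commutators are bracketed, nonassociative versions of these words,^{Footnote 4}
$$\begin{aligned}&[A,[A,[A,[A,B]]]],~~ [A,[A,[[A,B],B]]],~~ [[A,[A,B]],[A,B]], \\&[A,[[[A,B],B],B]],~~ [[A,B],[[A,B],B]],~~ [[[[A,B],B],B],B]. \end{aligned}$$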
As mentioned above, the leading (lowest) monomials in the expanded commutators, in the sense of lexicographical order, correspond to the Lyndon words. Note that some of these monomials also occur in lower commutators (‘lower’ again in the sense of lexicographical ordering). Let us now denote these six commutators by \( K_k,\, k=1 \ldots 6 \). We a priori know that \( \tfrac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\,\mathscr {L}(0) \) is of the form, with \( \ell _5=6 \),
$$\begin{aligned} \tfrac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\,\mathscr {L}(0) = \sum _{k=1}^{\ell _5} \kappa _k\,K_k, \end{aligned}$$
where the scalars \( \kappa _k \) are multivariate polynomials of degree 5 in the coefficients \( a_j,b_j \) of the underlying scheme of order \( p=4 \). Therefore the additional conditions for order \( p=5 \) are given by
$$\begin{aligned} \kappa _k = 0, \quad k=1 \ldots \ell _5. \end{aligned}$$(2.7a)
Extracting these coefficients \( \kappa _k \) from the expression (2.3) for \( \tfrac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\,\mathscr {L}(0) \) is a combinatorial challenge, but we can do better: We simply extract the coefficients of the Lyndon monomials, which we denote by \( \lambda _k \); this is a standard operation in computer algebra. Now, instead of (2.7a) we require
$$\begin{aligned} \lambda _k = 0, \quad k=1 \ldots \ell _5. \end{aligned}$$(2.7b)
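In our example, for \( {\varvec{\kappa }} = (\kappa _1,\ldots ,\kappa _6)^T \) and \( {\varvec{\lambda }} = (\lambda _1,\ldots ,\lambda _6)^T \) we have
$$\begin{aligned} {\varvec{\lambda }} = M\,{\varvec{\kappa }}, \qquad M = \begin{pmatrix} 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & -2 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & -3 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \end{pmatrix}, \end{aligned}$$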
where the lower diagonal entries correspond to the additional occurrence of the \( \lambda _k \) in nonleading positions. Therefore the systems (2.7a) and (2.7b) are equivalent.
The situation displayed in this example occurs also in the general case. For any order p , the vectors \( {\varvec{\kappa }} \) and \( {\varvec{\lambda }} \) consisting of polynomials of degree \( p+1 \) satisfy \( {\varvec{\lambda }} = M\,{\varvec{\kappa }} \) where M is a lower triangular matrix with unit diagonal. In particular, a Lyndon monomial \( \lambda _k \) never occurs in an expanded commutator \( K_j \) for \( j>k \) because this would contradict the leading position [11] of the Lyndon monomial \( \lambda _j>\lambda _k \) in \( K_j \).
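The triangular structure can be verified directly for words of length 5. The following Python sketch expands the commutator associated with each Lyndon word and reads off the matrix M . It assumes the standard bracketing of Lyndon words (split \( w = uv \) with v the longest proper suffix that is again a Lyndon word), which reproduces [A, [A, B]] and [[A, B], B] for length 3; whether this coincides in every detail with the bijection used in [1] is an assumption. All names are hypothetical.

```python
from itertools import product

def is_lyndon(w):
    """A nonempty word is Lyndon iff it is strictly smaller than each of its proper suffixes."""
    return all(w < w[i:] for i in range(1, len(w)))

def commutator(p, q):
    """[p, q] = p*q - q*p for noncommutative polynomials stored as dict: word -> int."""
    out = {}
    for u, cu in p.items():
        for v, cv in q.items():
            out[u + v] = out.get(u + v, 0) + cu * cv
            out[v + u] = out.get(v + u, 0) - cu * cv
    return {w: c for w, c in out.items() if c != 0}

def bracket(w):
    """Expanded commutator of a Lyndon word w via the standard factorization w = u v,
    where v is the longest proper suffix of w that is itself a Lyndon word."""
    if len(w) == 1:
        return {w: 1}
    i = next(i for i in range(1, len(w)) if is_lyndon(w[i:]))
    return commutator(bracket(w[:i]), bracket(w[i:]))

n = 5
words = sorted("".join(t) for t in product("AB", repeat=n) if is_lyndon("".join(t)))
K = [bracket(w) for w in words]

# M[j][k] = coefficient of the Lyndon word words[j] in the expanded commutator K[k],
# so that lambda_j = sum_k M[j][k] * kappa_k.
M = [[K[k].get(words[j], 0) for k in range(len(words))] for j in range(len(words))]
for row in M:
    print(row)
```

In this sketch each expanded commutator has its Lyndon word as lexicographically smallest monomial with coefficient 1, so M is lower triangular with unit diagonal and the systems \( {\varvec{\kappa }}=0 \) and \( {\varvec{\lambda }}=0 \) have the same solutions.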
Special cases: symmetries
In the sequel,
$$\begin{aligned} \mathscr {S}^*(h,u) = \mathscr {S}^{-1}(-h,u) \end{aligned}$$
denotes the adjoint scheme associated with \( \mathscr {S}\).
The order conditions generated by the algorithm indicated in Sect. 2.1 are generically nonredundant. However, there exist special cases:

Symmetric (or: ‘time-symmetric’) one-step schemes are characterized by the property
$$\begin{aligned} \mathscr {S}(-h,\mathscr {S}(h,u)) = u, \quad \text {i.e.,} \quad \mathscr {S}(h,u) = \mathscr {S}^*(h,u). \end{aligned}$$(2.8)
For symmetric splitting schemes we have either \( a_1=0 \) or \( b_s=0 \), and the remaining coefficient tuples \( (a_j) \) and \( (b_j) \) are both palindromic. Since symmetric schemes have an even order p (cf. [15, Chapter 3]), only odd-order conditions for an appropriately reduced number of free coefficients need to be imposed. The general algorithm described in Sect. 2.1 can easily be adapted to this case.

The following type of schemes seems not to have been considered earlier in the literature:^{Footnote 5} Palindromic schemes, or ‘reflected schemes’ in the terminology of [1], are characterized by \( b_j = a_{s+1-j},\, j=1 \ldots s \), i.e.,
$$\begin{aligned}&(a_1,b_1,a_2,b_2,\ldots ,a_{s-1},b_{s-1},a_s,b_s) \nonumber \\&\quad = (a_1,b_1,a_2,b_2,\ldots ,b_2,~~~\,a_2,~~~\,b_1,a_1). \end{aligned}$$(2.9)
Assume a scheme of order p is given, and consider a splitting step of the form (1.3). Interchanging the roles of A and B , i.e., replacing (1.3) by
$$\begin{aligned} \check{\mathscr {S}}(h,u) = \check{\mathscr {S}}_s(h,\check{\mathscr {S}}_{s-1}(h,\ldots ,\check{\mathscr {S}}_1(h,u))), \end{aligned}$$(2.10a)
with
$$\begin{aligned} \check{\mathscr {S}}_j(h,v) = \phi _A(b_j\,h,\phi _B(a_j\,h,v)), \end{aligned}$$(2.10b)
also results in a scheme of order p . If \( \mathscr {S}\) is palindromic then
$$\begin{aligned} \mathscr {S}(-h,\check{\mathscr {S}}(h,u)) = u, \quad \text {i.e.,} \quad \check{\mathscr {S}}(h,u) = \mathscr {S}^*(h,u). \end{aligned}$$(2.11)
Thus we infer from [15, Theorem II.3.2] that in the palindromic case the local errors \( \mathscr {L}(h,u) = \mathscr {S}(h,u) - \phi _F(h,u) \) and \( \check{\mathscr {L}}(h,u) = \check{\mathscr {S}}(h,u) - \phi _F(h,u) \) are related via
$$\begin{aligned} \mathscr {L}(h,u)&=C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}$$(2.12a)
$$\begin{aligned} \check{\mathscr {L}}(h,u)&= (-1)^p\,C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}$$(2.12b)
with \( C(u) = \tfrac{1}{(p+1)!}\,\tfrac{\mathrm{d}^{p+1}}{\mathrm{d}h^{p+1}}\,\mathscr {L}(0,u) \). For an ansatz with palindromic coefficients, exchanging the roles of A and B in the algorithm from Sect. 2.1 will lead to the identical set of order conditions. Therefore the order conditions associated with ‘Lyndon twins’ are pairwise identical. Here, we call a pair of Lyndon words a twin if one of them is obtained from the other by exchanging the roles of A and B and reading it from right to left, see Table 1. For instance, the 6 words of odd length 5 consist of three twins; the 9 words of even length 6 consist of three twins, the selfie \( \mathtt {AAABBB} \), and two solitary words. Due to this redundancy the number of order conditions is appropriately reduced.

Higher-order one-step schemes can be generated by m-fold composition of lower-order schemes with appropriately chosen substeps \( h_\mu = \omega _\mu h \) satisfying \( \omega _1 + \cdots + \omega _m = 1 \) plus additional conditions guaranteeing that a certain order is obtained.^{Footnote 6}
A popular class of composition methods are symmetric Strang compositions. Schemes of this type of orders 4, 6 and higher were first devised in [23]. Some of the composition coefficients have to be chosen negative, and the local error measures of these composition schemes are rather large. On the other hand, for higher orders, composition beats the generic lower limits on the number s of stages such that a given order p can be expected. For instance, the sevenfold 6th-order symmetric Strang composition [3, ‘Y 86’] recombines into an 8-stage scheme, whereas the generic number of order conditions for a symmetric scheme of order \( p=6 \) is 10, which would require \( s=10 \) stages involving 11 free coefficients.
Evidently, (symmetric) compositions are an attractive option for constructing higherorder schemes. Therefore we have included this class into our considerations concerning the search for optimal variants (see Sect. 4).
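As a small illustration of the composition idea, the following Python sketch uses the classical threefold (‘triple jump’) composition, whose weights satisfy \( \omega _1+\omega _2+\omega _3=1 \) and \( \omega _1^{p+1}+\omega _2^{p+1}+\omega _3^{p+1}=0 \) and thereby raise the order of a symmetric base scheme from p to \( p+2 \). This is a well-known textbook construction and not one of the optimized compositions from [3]; the harmonic-oscillator test problem and all function names are assumptions made for the example.

```python
import math

def triple_jump_weights(p):
    """Weights (w1, w2, w1) lifting a symmetric scheme of even order p to order p + 2."""
    r = 2.0 ** (1.0 / (p + 1))
    w1 = 1.0 / (2.0 - r)
    w2 = -r / (2.0 - r)          # the middle substep is negative
    return (w1, w2, w1)

def strang_step(h, u):
    """Symmetric Strang step (order 2) for the test problem q' = p, p' = -q."""
    q, p = u
    q += 0.5 * h * p
    p -= h * q
    q += 0.5 * h * p
    return (q, p)

def composed_step(h, u, weights):
    """Apply the base scheme with substeps w*h for each composition weight w."""
    for w in weights:
        u = strang_step(w * h, u)
    return u

def local_error(h, weights):
    """Maximum-norm local error after one step from (q, p) = (1, 0)."""
    q, p = composed_step(h, (1.0, 0.0), weights)
    return max(abs(q - math.cos(h)), abs(p + math.sin(h)))

w = triple_jump_weights(2)
ratio = local_error(0.1, w) / local_error(0.05, w)
print(w, sum(w), sum(x ** 3 for x in w))
print(ratio)  # close to 2**5, consistent with order 4
```

The negative middle weight illustrates the remark above that some composition coefficients have to be chosen negative.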
Complex coefficients
Our considerations are not restricted to schemes with real coefficients \( a_j,b_j \). Complex schemes, with coefficients having positive real parts, are appropriate for the application of splitting methods to parabolic problems, since real schemes with positive coefficients do not exist for order \( p \ge 3 \), see [7]. For this class of methods, in particular based on complex compositions, we refer to [8, 13].
Splitting into more than two operators
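We also consider evolution equations where the right-hand side splits into three parts,
$$\begin{aligned} F(u) = A(u) + B(u) + C(u), \end{aligned}$$(2.13)
and corresponding multiplicative splitting schemes,
$$\begin{aligned} \mathscr {S}(h,u) = \mathscr {S}_s(h,\mathscr {S}_{s-1}(h,\ldots ,\mathscr {S}_1(h,u))) \approx \phi _F(h,u), \end{aligned}$$(2.14a)
with
$$\begin{aligned} \mathscr {S}_j(h,v) = \phi _C(c_j\,h,\phi _B(b_j\,h,\phi _A(a_j\,h,v))). \end{aligned}$$(2.14b)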
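The methodology from [1] can be directly generalized to the case of splitting into more than two operators. For the practically relevant case of splitting into three operators A, B, C , as in (2.14), the representation (2.3) generalizes as follows, with \(A_j = a_j\,A,\, B_j=b_j\,B,\, C_j=c_j\,C \), and \( {\varvec{k}}=(k_1,\ldots ,k_s) \in \mathbb N_0^s \), \( {\varvec{l}} = (l_A,l_B,l_C) \in \mathbb N_0^3 \):
$$\begin{aligned} \tfrac{\mathrm{d}^{q}}{\mathrm{d}h^{q}}\,\mathscr {L}(0) = \sum _{|{\varvec{k}}|=q} \left( {\begin{array}{c}q\\ {\varvec{k}}\end{array}}\right) \prod _{j=s \ldots 1} \sum _{|{\varvec{l}}|=k_j} \left( {\begin{array}{c}k_j\\ {\varvec{l}}\end{array}}\right) C_j^{\,l_C}\,B_j^{\,l_B}\,A_j^{\,l_A} - (A+B+C)^q. \end{aligned}$$(2.15)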
On the basis of these identities, the algorithm from Sect. 2.1 generalizes in a straightforward way. The Lyndon basis representing independent commutators now corresponds to Lyndon words over the alphabet \( \{\mathtt {A,B,C}\} \), see Table 2.
Concerning symmetries, similar considerations as in Sect. 2.2 apply.
For a general convergence theory of ABC-splitting for the linear case and some applications we refer to [6]. For example, splitting into three operators can be used to handle evolution equations where the right-hand side splits up into two nonautonomous parts. Introducing the independent variable t as an unknown variable satisfying \( t'=1 \), such a problem can be formally considered as an autonomous system split into three parts. In this case, splitting means that the variable t is frozen over several subintervals comprising an integration step. Since the ODE \( t'=1 \) is trivial, a large number of higher-order commutators vanishes in this case, and therefore the number of necessary order conditions is significantly reduced, a situation to be considered in further work.
Pairs of splitting schemes
For the purpose of efficient local error estimation as a basis for adaptive stepsize selection, using pairs of related schemes is a well-established idea. One of the schemes, of order p , acts as the worker, and the other, of order \( p+1 \), is the controller responsible for local error estimation.^{Footnote 7} Criteria for the selection of pairs of schemes are accuracy and computational efficiency.
Order conditions for pairs of schemes of the types listed below can be generated with minor modifications of the approach described in Sect. 2.

Embedded pairs. In [17], pairs of splitting schemes of orders p and \( p+1 \) are specified. The idea is to select a controller \( \bar{\mathscr {S}}\) of order \( p+1 \) and to construct a worker \( \mathscr {S}\) of order p for which a maximal number of stages \( \mathscr {S}_j \) coincides with those of the controller. Let \( a_j,b_j \) and \( \bar{a}_j,\bar{b}_j \) denote the coefficients of the worker and controller, respectively. The approach adopted in [17] may be called static, finding \( \mathscr {S}\) and \( \bar{\mathscr {S}}\) such that \( a_j={\bar{a}}_j \) and \( b_j={\bar{b}}_j \) for as many \( j=1,2,\ldots \) as possible. In this sense the schemes are related to each other but, in general, the total number of order conditions, and thus the total number of necessary evaluations, is the same as for an arbitrary unrelated \( (p,p+1) \) pair.
Here we develop the idea of embedding further: again we fix a ‘good’ controller of order \( p+1 \) and wish to adjoin to it a ‘good’ worker of order p . Since the number of stages \( \bar{s} \) of \( \bar{\mathscr {S}}\) will be higher than the number of stages s of \( \mathscr {S}\), we can select an optimal embedded worker \( \mathscr {S}\) from a set of candidates obtained by flexible embedding, where the number of coinciding coefficients is not a priori fixed.
Example 3
In [17], an embedded (3, 4) pair was constructed, where the controller is an optimized symmetric scheme of order \( p=4 \) with \( s=7 \) stages due to [10], with local error measure \(\mathrm{LEM}=0.01\) (‘LEM’ in the sense of (4.2b) below). The worker specified in [17] is a scheme of order \( p=3 \) with \( s=6 \) stages, where the coefficients \( a_1,a_2,a_3,a_4 \) and \( b_1,b_2,b_3 \) coincide with those of the controller. This amounts to 7 additional evaluations for the worker, and its local error measure is \(\mathrm{LEM}=0.2\).
For flexible embedding, in contrast, we consider all possible embedded workers, and we find that a scheme of order \( p=3 \) with \( s=4 \) stages is to be preferred, see [3, Emb 4/3 BM PRK/A], where \( a_1,a_2 \) and \( b_1 \) coincide with those of the controller. This amounts to five additional evaluations for the worker, and it has \(\mathrm{LEM}=0.1\).

Milne pairs. In the context of multistep methods for ODEs, the so-called Milne device is a well-established technique for constructing pairs of schemes. In our context, one may aim for finding a pair \( (\mathscr {S},\tilde{\mathscr {S}}) \) of schemes of the same type, with equal s and p , such that their local errors \( \mathscr {L},\tilde{\mathscr {L}}\) are related according to
$$\begin{aligned} \mathscr {L}(h,u)&= C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}$$(3.1a)
$$\begin{aligned} \tilde{\mathscr {L}}(h,u)&= \gamma \,C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}$$(3.1b)
with \( \gamma \not = 1 \). Then, the additive scheme
$$\begin{aligned} {\bar{\mathscr {S}}}(h,u) = -\frac{\gamma }{1-\gamma }\,\mathscr {S}(h,u) + \frac{1}{1-\gamma }\,\tilde{\mathscr {S}}(h,u) \end{aligned}$$
is a method of order \( p+1 \), since its two weights sum to 1 and the leading error terms \( C(u)\,h^{p+1} \) cancel, and
$$\begin{aligned} \mathscr {S}(h,u) - {\bar{\mathscr {S}}}(h,u) = \frac{1}{1-\gamma }\,\big ( \mathscr {S}(h,u) - \tilde{\mathscr {S}}(h,u) \big ) \end{aligned}$$
provides an asymptotically correct local error estimate for \( \mathscr {S}(h,u) \).

Adjoint pairs. Let \( \mathscr {S}\) be a scheme of odd order p and \( \mathscr {S}^*\) its adjoint, see Sect. 2.2. Due to [15, Theorem II.3.2] the leading error terms of \( \mathscr {S}\) and its adjoint \( \mathscr {S}^*\) are identical up to the factor \( -1 \). Therefore, the averaged additive scheme
$$\begin{aligned} {\bar{\mathscr {S}}}(h,u) = \frac{1}{2}\big ( \mathscr {S}(h,u)+\mathscr {S}^*(h,u) \big ) \end{aligned}$$(3.2)
is a method of order \( p+1 \), and
$$\begin{aligned} \mathscr {S}(h,u) - {\bar{\mathscr {S}}}(h,u) = \frac{1}{2}\big ( \mathscr {S}(h,u) - \mathscr {S}^*(h,u) \big ) \end{aligned}$$
provides an asymptotically correct local error estimate for \( \mathscr {S}(h,u) \). In this case the additional effort for computing the local error estimate equals the effort for the worker \( \mathscr {S}\); it is not higher, as is the case for embedded pairs. Examples are palindromic pairs, where \( \mathscr {S}\) is palindromic (of odd order p ), such that \( \mathscr {S}^*= \check{\mathscr {S}}\), see Sect. 2.2.
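The error estimators of Milne and adjoint type can be tried out on a tiny linear example. In the Python sketch below the worker is the Lie–Trotter scheme (order \( p=1 \)) and the second scheme is its adjoint, which corresponds to \( \gamma = -1 \) in the Milne combination. The nilpotent \( 2\times 2 \) matrices A and B are chosen only because all exponentials are available in closed form; they and all function names are assumptions made for this illustration.

```python
import math

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def exp_A(t):   # A = [[0, 1], [0, 0]] is nilpotent, so exp(tA) = I + tA
    return [[1.0, t], [0.0, 1.0]]

def exp_B(t):   # B = [[0, 0], [1, 0]], likewise exp(tB) = I + tB
    return [[1.0, 0.0], [t, 1.0]]

def exp_F(t):   # exact flow for F = A + B = [[0, 1], [1, 0]]
    return [[math.cosh(t), math.sinh(t)], [math.sinh(t), math.cosh(t)]]

def worker(h):  # Lie-Trotter, order p = 1: S(h) = e^{hB} e^{hA}
    return matmul(exp_B(h), exp_A(h))

def adjoint(h):  # S*(h) = S(-h)^{-1} = e^{hA} e^{hB}
    return matmul(exp_A(h), exp_B(h))

def milne_controller(S, S_tilde, gamma):
    """Additive combination with weights -gamma/(1-gamma) and 1/(1-gamma)."""
    c1, c2 = -gamma / (1.0 - gamma), 1.0 / (1.0 - gamma)
    return [[c1 * S[i][j] + c2 * S_tilde[i][j] for j in range(2)] for i in range(2)]

def max_diff(X, Y):
    return max(abs(X[i][j] - Y[i][j]) for i in range(2) for j in range(2))

def controller_error(h):
    S_bar = milne_controller(worker(h), adjoint(h), -1.0)
    return max_diff(S_bar, exp_F(h))

h = 0.1
S = worker(h)
S_bar = milne_controller(S, adjoint(h), -1.0)   # gamma = -1 gives the average (S + S*)/2
true_error = max_diff(S, exp_F(h))
estimate = max_diff(S, S_bar)
ratio = controller_error(h) / controller_error(h / 2)
print(true_error, estimate)
print(ratio)  # close to 2**3: the averaged scheme has order p + 1 = 2
```

The estimate agrees with the true local error of the worker up to a term of higher order in h , while the cost of the estimate is one additional evaluation of a scheme with the same number of stages.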
For detailed comments on a number of new pairs listed in [3], see Sect. 5.
Implementation aspects: constructing schemes and minimizing local error terms
Our approach for setting up order conditions described in Sect. 2.1 has been implemented in Maple 18. We use the Physics package for the manipulation of noncommuting symbols, and tables of Lyndon words generated using an algorithm devised in [14]. Since the number of terms in (2.3) resp. (2.15) rapidly increases with q we have implemented a parallel version relying on Maple’s Grid package. In particular, the job of generating all the terms in the long sums (2.3) and (2.15) can be (equi)distributed over several parallel threads.
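The resulting set of order conditions is a multivariate polynomial system which, for higher orders, requires numerical solution techniques. Once a scheme of order p has been found, its leading local error term is of the form (see Sect. 2)
$$\begin{aligned} \frac{h^{p+1}}{(p+1)!}\,\tfrac{\mathrm{d}^{p+1}}{\mathrm{d}h^{p+1}}\,\mathscr {L}(0) = \frac{h^{p+1}}{(p+1)!}\,\sum _{k=1}^{\ell _{p+1}} \kappa _{p+1,k}\,K_{p+1,k}, \end{aligned}$$(4.1)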
with \( \ell _{p+1} \) commutators \( K_{p+1,k} \) associated with Lyndon words of length \( p+1 \). To compare schemes of equal order p one may consider
as a reasonable measure for the accuracy of a scheme. However, we use the quantity
instead. Using (4.2b) has the advantage that the coefficients \( \lambda _k = \lambda _{p+1,k} \) are exactly those which are generated in the course of the setup of the conditions for order \( p+1 \), see Sect. 2.1, while the coefficients from (4.2a) are more difficult to compute (cf. the discussion in Sect. 2.1). Since different particular solutions to the order conditions typically result in leading local error terms varying over several orders of magnitude, we consider (4.2b) to be as reasonable a measure as (4.2a).
For finding and evaluating solutions and pairs of solutions we follow two different strategies.

For the case where the number of equations equals the number of free coefficients we expect a set of isolated solutions. In this case we use the fsolve function in Maple combined with a Monte Carlo strategy for generating different initial intervals. Higher precision is used to generate solutions that are accurate to double precision. For each detected solution the LEM (4.2b) is computed.

Especially for the case where the number of equations is smaller than the number of free coefficients, the problem is to be considered as a constrained minimization problem: minimize the LEM representing the objective function, with the order conditions imposed as nonlinear equality constraints. To this end we employ state-of-the-art techniques which have also been applied for the construction of special classes of Runge–Kutta methods, see for instance [16]. In particular we have used the MATLAB^{Footnote 8} optimizer fmincon. Again a large number of initial guesses are generated randomly, since this optimization problem is nonconvex in general. The results cannot be guaranteed to be globally optimal, but results from an exhaustive search usually suggest that this is indeed the case. A postprocessing step, i.e., refining the solutions to full double precision, is again performed in Maple using higher precision sfloat arithmetic.
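The second strategy can be illustrated on the smallest nontrivial case, \( s=2 \), \( p=2 \), where the order conditions leave one free parameter. The Python sketch below does not use fsolve or fmincon; it is a deliberately simple stand-in that eliminates the constraint \( 2\,a_2\,b_1=1 \) analytically and minimizes the remaining scalar objective by randomly restarted pattern search. It further assumes that the error measure is the Euclidean norm of the two coefficients \( 3\,a_2^2\,b_1-1 \) and \( 3\,a_2\,b_1^2-1 \) from Sect. 2.1, which may differ from (4.2b) by a normalization. All names are hypothetical.

```python
import math
import random

def lem(a2):
    """Assumed error measure for s = 2, p = 2 after eliminating b1 via 2*a2*b1 = 1."""
    b1 = 1.0 / (2.0 * a2)
    lam_aab = 3.0 * a2 * a2 * b1 - 1.0
    lam_abb = 3.0 * a2 * b1 * b1 - 1.0
    return math.hypot(lam_aab, lam_abb)

def multistart_minimize(f, lo, hi, starts=200, iters=60, seed=0):
    """Random restarts, each followed by a simple 1D pattern search with step halving."""
    rng = random.Random(seed)
    best_x, best_f = None, math.inf
    for _ in range(starts):
        x = rng.uniform(lo, hi)
        fx = f(x)
        step = 0.1 * (hi - lo)
        for _ in range(iters):
            improved = False
            for cand in (x - step, x + step):
                if lo <= cand <= hi and f(cand) < fx:
                    x, fx, improved = cand, f(cand), True
            if not improved:
                step *= 0.5
        if fx < best_f:
            best_x, best_f = x, fx
    return best_x, best_f

a2, value = multistart_minimize(lem, 0.1, 2.0)
b1 = 1.0 / (2.0 * a2)
a1, b2 = 1.0 - a2, 1.0 - b1      # first-order conditions
print((a1, b1, a2, b2), value)
```

Under these assumptions the minimizer is \( a_2 = b_1 = 1/\sqrt{2} \), so that \( b_1=a_2 \) and \( b_2=a_1 \), i.e., the computed scheme is palindromic in the sense of (2.9).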
We have also rechecked a number of known methods, refined their coefficients to full double precision, and computed their LEMs.
Schemes from the collection [3]
This collection is not intended to be exhaustive. It includes some known and quite a number of new schemes, in particular pairs of schemes, up to order \( p=6 \), with their essential properties. Some methods are included mainly for the sake of completeness or their historical significance.
In the following we comment on some of these methods; for complete information, consult [3]. ‘Best’ or ‘optimal’ means that it has minimal LEM (4.2b) among a certain class of methods with comparable effort for a given order p . In some simple cases such optimality properties can be established theoretically; for higher orders we have resorted to more or less exhaustive numerical search.
Methods whose label contains the letter ‘A’ are new, or taken again into consideration in the context of constructing pairs, or their LEM has been computed for the first time.^{Footnote 9} The list also includes some pairs of embedded schemes (‘Emb ...’), pairs of Milne type (‘Milne ...’), and palindromic pairs (‘PP ...’), see Sect. 3.
More detailed information about all these methods can be found on the webpage [3].
Splitting into two operators (‘AB schemes’)
Real coefficients.

The best schemes up to order \( p=5 \) we have found are palindromic:

‘best 2stage 2nd order’ (\( s=p=2 \)).

‘Emb 3/2 AKS’ (palindromic controller with \( s=p=3 \)).

‘Emb 4/3 AKS p’ (palindromic controller with \( s=5 \), \( p=4 \)). In particular, this scheme has essentially the same LEM as the fourth order scheme from [10] which has been used in [17], but it has only 5 stages instead of 7.

‘Emb 5/4 A’ (palindromic controller with \( s=8 \), \( p=5 \)), see also ‘PP 5/6 A’.


‘Emb 5/4 AK (ii)’ is an optimized embedded pair. The controller is a new scheme with \( s=7 \), \( p=5 \), and the worker of order \( p=4 \) is chosen from several dozen candidates of order 4 which share the same computational effort but have LEMs varying over several orders of magnitude.

Palindromic pairs: ‘PP 3/4 A’, ‘PP 5/6 A’.
Complex coefficients (with positive real parts).

Since for order \( p=3 \) we need 5 conditions, the question is whether there exists a third-order scheme with \( s=3 \) and 5 evaluations. It turns out that the only scheme of this type, ‘A 33 c’, has complex coefficients.

‘A 44 c’ (\( s=4 \), \( p=4 \)) is the best complex symmetric Strang composition method of order 4; see also [12, 13].

‘Emb 3/2 A c’ and ‘Emb 4/3 A c’ are embedded pairs with palindromic controller and optimized worker. We note that the controller in ‘Emb 4/3 A c’ (\( s=5 \), \(p=4\)) has a significantly smaller LEM than ‘A 44 c’ (factor \( \approx 20 \)).

‘C 86 c’ (\( s=8 \), \( p=6 \)) is the best symmetric complex Strang composition method of order 6; see also [12, 13].

Palindromic pairs: ‘PP 3/4 A c’, ‘PP 5/6 A c’.
Splitting into three operators (‘ABC schemes’)
Due to the rapidly increasing number of generic order conditions, finding general higher-order schemes is very challenging in this case. For \( p=6 \), for instance, the generic number of order conditions is 196 for the general case and 59 for the symmetric case. For \( p=6 \) we therefore consider only real or complex Strang compositions, which are easier to construct and lead to more compact schemes. Generating the expression for the leading error term \( \tfrac{\mathrm{d}^{7}}{\mathrm{d}h^{7}} \mathscr {L}(0) \) for \( p=6 \), which involves 312 coefficients (see Table 2), is computationally expensive, but the effort is reasonable when the goal is to compute the LEM of a given composition and to compare different variants.
Real coefficients.

‘AK 52’ (\( s=5 \), \( p=2 \), 9 evaluations) appears to be a possible rival of the Strang scheme (\( s=3 \), \( p=2 \), 5 evaluations), with a LEM which is smaller by a factor \( \approx 7 \).

‘PP 3/4 A 3’ is a palindromic pair based on the best palindromic scheme found for \( s=6 \), \( p=3 \).

‘Y 74’ (\( s=7, p=4 \), 13 evaluations) is the best symmetric Strang composition of order \( p=4 \). It is the analog of the AB composition ‘Y 44’, with the same composition weights.

‘AK 114’ (\( s=11 \), \( p=4 \), 21 evaluations) has been found on the basis of 11 conditions for a symmetric ABC scheme of order 4. Its LEM is smaller by a factor \( \approx 13 \) compared to ‘Y 74’.

‘AY 156’ (\( s=15, p=6 \)) is the best symmetric Strang composition of order \( p=6 \). It is the analog of the AB composition ‘Y 86’, with the same composition weights.
Complex coefficients (with positive real parts).

‘AK 74 c’ (\( s=7, p=4 \)) is the best symmetric Strang composition of order \( p=4 \). It is the analog of the AB composition ‘A 44 c’, with the same composition weights.

‘AK 156 c’ (\( s=15, p=6 \)) is the best symmetric Strang composition of order \( p=6 \). It is the analog of the AB composition ‘C 86 c’, with the same composition weights.
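The stage and evaluation counts quoted for the Strang compositions above follow a simple pattern, which the following minimal sketch makes explicit. It assumes that a composition of \( m \) Strang steps merges adjacent flows of the same operator at the interfaces between steps, and that \( s \) counts stages as in the text; the function names are hypothetical and not part of the collection [3].

```python
from typing import Tuple


def ab_strang_composition_stages(m: int) -> int:
    """Stages s of an AB scheme composed of m Strang steps A-B-A.

    Assumption: neighbouring A-flows are merged, which leaves m + 1 A-flows
    and m B-flows, i.e. s = m + 1 with a vanishing last B-coefficient.
    """
    return m + 1


def abc_strang_composition_counts(m: int) -> Tuple[int, int]:
    """Return (stages s, flow evaluations) for m Strang steps A-B-C-B-A.

    Assumption: neighbouring A-flows are merged, so each step contributes
    four new flows after the very first A-flow.
    """
    evaluations = 4 * m + 1
    stages = 2 * m + 1
    return stages, evaluations


if __name__ == "__main__":
    for m in (1, 3, 7):
        print(m, ab_strang_composition_stages(m), abc_strang_composition_counts(m))
```

Under these assumptions, \( m=1 \) reproduces the ABC Strang scheme (\( s=3 \), 5 evaluations) and \( m=3 \) reproduces the counts given for ‘Y 74’ (\( s=7 \), 13 evaluations), while \( m=7 \) gives \( s=15 \) as for ‘AY 156’; in the AB case, \( m=3 \) and \( m=7 \) give \( s=4 \) and \( s=8 \), consistent with ‘A 44 c’ and ‘C 86 c’.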
Palindromic schemes: discussion and open questions
As indicated in Sect. 2.2, one motivation for considering palindromic schemes is the fact that they are easier to construct. Moreover, as already mentioned in Sect. 5, small error constants are usually observed in this case. Apparently, palindromic schemes tend to have minimal LEMs among a set of competitors, for instance the third-order scheme in the pair ‘PP 3/4 A’ (a theoretical explanation for this observation is missing). This is the reason why we have included some adjoint pairs of (optimized) palindromic type of orders \( (p,p+1) \), p odd, in our collection [3].
Palindromic schemes have a certain type of symmetry, but they are not time-symmetric. Investigating special properties of such schemes appears to be of interest in the context of geometric integration, for instance when they are applied to partitioned systems of the form
with the natural splitting of the vector field \( (f, g) \) into \( (f, 0) \) and \( (0, g) \). In this case, splitting methods are equivalent to a subclass of Partitioned Runge–Kutta (PRK) methods, characterized by a pair \( (\varvec{A},\varvec{\alpha }),\,(\varvec{B},\varvec{\beta }) \) of Runge–Kutta schemes. For each AB splitting scheme, the associated PRK coefficients satisfy
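To make the palindromic structure concrete, the following minimal sketch checks the coefficient pattern and estimates the local order of a small AB scheme on a linear test problem. It assumes that ‘palindromic’ means \( b_i = a_{s+1-i} \) at the level of the splitting coefficients (the definition in Sect. 2.2 is not reproduced here), and it uses two nilpotent \( 2\times 2 \) matrices as a hypothetical test problem. The coefficient \( a_1 = 1 - 1/\sqrt{2} \) is chosen here only so that the two-stage example satisfies the order-2 conditions; it is not claimed to coincide with a particular entry of the collection [3].

```python
import math


def is_palindromic(a, b, tol=1e-14):
    """Return True if b_i = a_{s+1-i} for i = 1..s, i.e. b is a reversed."""
    return len(a) == len(b) and all(
        abs(bi - ai) <= tol for bi, ai in zip(b, reversed(a))
    )


def matmul2(X, Y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def splitting_step(a, b, h):
    """One step of the AB scheme for u' = (A + B) u with the nilpotent
    test matrices A = [[0, 1], [0, 0]] and B = [[0, 0], [1, 0]],
    for which exp(tA) = I + tA and exp(tB) = I + tB hold exactly."""
    M = [[1.0, 0.0], [0.0, 1.0]]
    for ai, bi in zip(a, b):
        exp_A = [[1.0, ai * h], [0.0, 1.0]]
        exp_B = [[1.0, 0.0], [bi * h, 1.0]]
        M = matmul2(exp_B, matmul2(exp_A, M))  # A-flow first, then B-flow
    return M


def local_error(a, b, h):
    """Max-norm distance between one splitting step and exp(h (A + B))."""
    S = splitting_step(a, b, h)
    c, s = math.cosh(h), math.sinh(h)
    exact = [[c, s], [s, c]]  # (A + B)^2 = I, hence cosh/sinh entries
    return max(abs(S[i][j] - exact[i][j]) for i in range(2) for j in range(2))


def observed_local_order(a, b, h=0.1):
    """Estimate q in local_error ~ C h^q from the step sizes h and h/2.
    A scheme of order p has local order q = p + 1."""
    return math.log2(local_error(a, b, h) / local_error(a, b, h / 2))


# Hypothetical example: palindromic two-stage scheme with a1 = 1 - 1/sqrt(2).
a1 = 1.0 - 1.0 / math.sqrt(2.0)
a_pal, b_pal = [a1, 1.0 - a1], [1.0 - a1, a1]

if __name__ == "__main__":
    print(is_palindromic(a_pal, b_pal), observed_local_order(a_pal, b_pal))
```

For this example the estimated local order should be close to 3, i.e., \( p=2 \), whereas the Lie–Trotter scheme (\( a_1=b_1=1 \)), which is trivially palindromic, should give a value close to 2. The Strang scheme written as \( a=(1/2,1/2) \), \( b=(1,0) \) is symmetric but not palindromic in this sense.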
As a consequence, all splitting schemes preserve quadratic invariants when applied to a partitioned system (6.1), and they are symplectic when applied to a Hamiltonian system with a separable Hamiltonian \( H(p,q) = T(p) + U(q) \), see [15, Theorem IV.2.4 and Theorem VI.4.6].
Extending these considerations to PRK schemes applied to general partitioned systems
again from [15] we know that under the additional condition \( \varvec{\alpha }_i = \varvec{\beta }_i,~ i=1,\ldots ,s \), the invariance properties mentioned above remain valid for (6.2). The latter condition is satisfied if the scheme is a composition of steps of symplectic Euler type, i.e., for \( a_i=b_i \). In the palindromic case we have \( \varvec{\beta }_i = \varvec{\alpha }_{s+1-i} \), and this does not appear to be a useful property in view of invariance questions. (Cf., for instance, the proof of the general assertion of Theorem IV.2.4 in [15], which does not carry over.)
In summary, palindromic schemes are not yet completely understood and deserve further investigation.
Numerical example
For a numerical illustration, in particular concerning the expected performance of palindromic schemes, we consider an example of a system of coupled nonlinear evolution equations of Schrödinger type (see [22]),
with initial condition chosen such that the exact solution is a pair of solitons,
which is exponentially decreasing with \( x \). We start at \( t=0 \), the parameters are chosen as \( \delta =0.5 \), \( \alpha =1.0 \), \( v=1.1 \), and \( e=0.8 \).
We impose periodic boundary conditions on the interval \( [x_{\min },x_{\max }]=[50,70] \) using an equidistant grid of size 2048. For splitting we choose the time step \( h \) and separately integrate

the kinetic part (‘A’) involving the derivatives w.r.t. \( x \), using a Fourier spectral discretization,

and the nonlinear ‘ODE part’ (‘B’), which can be exactly propagated: at each grid point \( x \), the respective solution \( (\psi _{1,B},\psi _{2,B}) = (\psi _{1,B}(x,t),\psi _{2,B}(x,t)) \) of the ODE system
$$\begin{aligned} \mathrm {i}\,\frac{\mathrm {d} \psi _{1,B}}{\mathrm {d}\,t} + \big ( |\psi _{1,B}|^2 + e\,|\psi _{2,B}|^2 \big ) \psi _{1,B}&= 0, \\ \mathrm {i}\,\frac{\mathrm {d} \psi _{2,B}}{\mathrm {d}\,t} + \big ( e\,|\psi _{1,B}|^2 + |\psi _{2,B}|^2 \big ) \psi _{2,B}&= 0, \end{aligned}$$
starting at \( t_0 \) is given by
$$\begin{aligned} \psi _{1,B}(x,t)&= \mathrm {e}^{\,\mathrm {i}\,(t-t_0)\,\left( |\psi _{1,B}(x,t_0)|^2 + e\,|\psi _{2,B}(x,t_0)|^2\right) } \psi _{1,B}(x,t_0), \\ \psi _{2,B}(x,t)&= \mathrm {e}^{\,\mathrm {i}\,(t-t_0)\,\left( e\,|\psi _{1,B}(x,t_0)|^2 + |\psi _{2,B}(x,t_0)|^2\right) } \psi _{2,B}(x,t_0). \end{aligned}$$
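The exact propagation of the ‘B’ part can be sketched in a few lines. The sketch assumes that the squares in the formulas above denote squared moduli \( |\psi |^2 \), as is usual for nonlinearities of Schrödinger type, and that the exponent contains the time increment \( t-t_0 \); the function names are hypothetical, and the default \( e=0.8 \) is the value used in this example.

```python
import cmath


def propagate_B(psi1, psi2, dt, e):
    """Exact pointwise flow of the nonlinear part over dt = t - t0.

    |psi1| and |psi2| are constant along this flow, so each component is
    only multiplied by a phase factor determined by the initial moduli.
    """
    n1 = abs(psi1) ** 2
    n2 = abs(psi2) ** 2
    new_psi1 = cmath.exp(1j * dt * (n1 + e * n2)) * psi1
    new_psi2 = cmath.exp(1j * dt * (e * n1 + n2)) * psi2
    return new_psi1, new_psi2


def propagate_B_on_grid(psi1_values, psi2_values, dt, e=0.8):
    """Apply the pointwise flow at every grid point (lists of complex values)."""
    pairs = [propagate_B(p1, p2, dt, e)
             for p1, p2 in zip(psi1_values, psi2_values)]
    return [p[0] for p in pairs], [p[1] for p in pairs]
```

Since the moduli are invariant under this flow, two successive substeps of lengths \( \tau _1 \) and \( \tau _2 \) give the same result as a single substep of length \( \tau _1+\tau _2 \), which is a convenient consistency check for an implementation.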
All computations were performed in standard double precision arithmetic. In Tables 3 and 4, ‘err’ refers to a canonically scaled discrete \( L_2 \)-norm, and ‘ord’ refers to the order observed.
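The two quantities ‘err’ and ‘ord’ can be computed along the following lines. This is a sketch under assumptions: ‘canonically scaled’ is interpreted as weighting the sum of squares by the grid spacing \( \Delta x \), so that the discrete norm approximates the continuous \( L_2 \)-norm, and the observed order is estimated from errors at successive step sizes; the function names are hypothetical.

```python
import math


def discrete_l2_norm(values, dx):
    """Discrete L2 norm sqrt(dx * sum |v_j|^2) of (complex) grid values."""
    return math.sqrt(dx * sum(abs(v) ** 2 for v in values))


def observed_orders(step_sizes, errors):
    """Observed orders log(err_k / err_{k+1}) / log(h_k / h_{k+1})
    for consecutive pairs of step sizes h_k and errors err_k."""
    return [
        math.log(errors[k] / errors[k + 1])
        / math.log(step_sizes[k] / step_sizes[k + 1])
        for k in range(len(errors) - 1)
    ]


if __name__ == "__main__":
    hs = [0.1, 0.05, 0.025]
    synthetic_errors = [3.0 * h ** 4 for h in hs]  # model data, not results
    print(observed_orders(hs, synthetic_errors))
```

The synthetic errors in the usage example are model data of the form \( C h^4 \), not results from the tables; for them the estimated order is 4 up to rounding.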
Notes
1. Maple is a product of \( \text {Maplesoft}^{\text {TM}} \).
2. \( \phi _F \) denotes the flow associated with the given evolution equation.
3. By construction, \( \mathscr {L}(0)=0 \) for any consistent scheme.
4. The bracketing can be computed using the SageMath function StandardBracketedLyndonWords, see http://www.sagemath.org.
5. The Lie–Trotter scheme, with \( s=p=1,\; a_1=b_1=1 \), is a trivial special case.
6. We note that the idea of composition is of a general nature and not restricted to the class of splitting methods.
7. Of course, a scheme acting as a controller can also be used as an integrator in a normal way.
8. MATLAB is a trademark of The MathWorks, Inc.
9. Of course, ‘new’ may not be considered as a rigorous statement in each case since the literature on the subject is rather large by now.
References
Auzinger, W., Herfort, W.: Local error structures and order conditions in terms of Lie elements for exponential splitting schemes. Opusc. Math. 34, 243–255 (2014)
Auzinger, W., Hofstätter, H., Koch, O., Thalhammer, M.: Defect-based local error estimators for splitting methods, with application to Schrödinger equations. Part III: the nonlinear case. J. Comput. Appl. Math. 273, 182–204 (2014)
Auzinger, W., Koch, O.: Coefficients of various splitting methods. http://www.asc.tuwien.ac.at/~winfried/splitting/
Auzinger, W., Koch, O., Thalhammer, M.: Defect-based local error estimators for splitting methods, with application to Schrödinger equations. Part I: the linear case. J. Comput. Appl. Math. 236, 2643–2659 (2012)
Auzinger, W., Koch, O., Thalhammer, M.: Defect-based local error estimators for splitting methods, with application to Schrödinger equations. Part II: higher-order methods for linear problems. J. Comput. Appl. Math. 255, 384–403 (2013)
Auzinger, W., Koch, O., Thalhammer, M.: Defect-based local error estimators for high-order splitting methods involving three linear operators. Numer. Algorithms 70, 61–91 (2015)
Blanes, S., Casas, F., Murua, A.: Splitting and composition methods in the numerical integration of differential equations. Bol. Soc. Esp. Mat. Apl. 45, 87–143 (2008)
Blanes, S., Casas, F., Chartier, P., Murua, A.: Optimized high-order splitting methods for some classes of parabolic equations. Math. Comput. 82, 1559–1576 (2013)
Blanes, S., Casas, F., Farrés, A., Laskar, J., Makazaga, J., Murua, A.: New families of symplectic splitting methods for numerical integration in dynamical astronomy. Appl. Numer. Math. 68, 58–72 (2013)
Blanes, S., Moan, P.C.: Practical symplectic partitioned Runge–Kutta and Runge–Kutta–Nyström methods. J. Comput. Appl. Math. 142, 313–330 (2002)
Bokut, L., Sbitneva, L., Shestakov, I.: Lyndon–Shirshov words, Gröbner–Shirshov bases, and free Lie algebras. In: Sabinin, L., Sbitneva, L., Shestakov, I. (eds.) Non-Associative Algebra and Its Applications, chapter 3. Chapman & Hall / CRC, Boca Raton (2006)
Castella, F., Chartier, P., Descombes, S., Vilmart, G.: Splitting methods with complex times for parabolic equations. BIT Numer. Math. 49, 487–508 (2009)
Chambers, J.: Symplectic integrators with complex time steps. AJ 126, 1119–1126 (2003)
Duval, J.-P.: Génération d’une section des classes de conjugaison et arbre des mots de Lyndon de longueur bornée. Theor. Comput. Sci. 60, 255–283 (1988)
Hairer, E., Lubich, C., Wanner, G.: Geometric Numerical Integration, 2nd edn. Springer-Verlag, Berlin (2006)
Ketcheson, D.I., MacDonald, C.B., Ruuth, S.J.: Spatially partitioned embedded Runge–Kutta methods. SIAM J. Numer. Anal. 51, 2887–2910 (2013)
Koch, O., Neuhauser, C., Thalhammer, M.: Embedded split-step formulae for the time integration of nonlinear evolution equations. Appl. Numer. Math. 63, 14–24 (2013)
McLachlan, R.I.: Composition methods in the presence of small parameters. BIT Numer. Math. 35, 258–268 (1995)
McLachlan, R.I.: On the numerical integration of ordinary differential equations by symmetric composition methods. SIAM J. Sci. Comput. 16, 151–168 (1995)
McLachlan, R.I., Quispel, G.R.W.: Splitting methods. Acta Numer. 11, 341–434 (2002)
Omelyan, I.P., Mryglod, I.M., Folk, R.: Construction of high-order force-gradient algorithms for integration of motion in classical and quantum systems. Phys. Rev. E 66, 026701 (2002)
Wadati, M., Izuka, T., Hisakado, M.: A coupled nonlinear Schrödinger equation and optical solitons. J. Phys. Soc. Jpn. 61, 2241–2245 (1992)
Yoshida, H.: Construction of higher order symplectic integrators. Phys. Lett. A 150, 262–268 (1990)
Acknowledgments
Open access funding provided by Technische Universität Wien. This work was supported by the Austrian Science Fund (FWF) under Grant P24157-N13, and by the Vienna Science and Technology Fund (WWTF) under Grant MA14-002. The computational results presented have been achieved in part using the Vienna Scientific Cluster (VSC).
Additional information
Communicated by Christian Lubich.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Auzinger, W., Hofstätter, H., Ketcheson, D. et al. Practical splitting methods for the adaptive integration of nonlinear evolution equations. Part I: Construction of optimized schemes and pairs of schemes. BIT Numer Math 57, 55–74 (2017). https://doi.org/10.1007/s10543-016-0626-9
DOI: https://doi.org/10.1007/s10543-016-0626-9
Keywords
 Evolution equations
 Splitting methods
 Free Lie algebra
 Order conditions
 Local error
 Embedded methods
Mathematics Subject Classification
 65J08
 65M15
 68R15
 68W30