## 1 Introduction

Operator splitting techniques for the efficient numerical integration of evolution equations

\begin{aligned} \partial _t u(t) = F(u(t)), \quad t \ge 0, \quad u(0)~\;\text {given,} \end{aligned}
(1.1)

have become increasingly popular in recent years. Splitting the right-hand side F(u) into two or more components in an appropriate way enables efficient and accurate approximations. In particular, a number of higher-order schemes with real or complex coefficients have been constructed and analyzed. Relevant contributions to this fields can, e.g., be found in [7, 8, 10, 12, 13, 15, 1921, 23]. Furthermore, application to particular problem classes have been studied in the literature where the vector field F has special properties, such that splitting methods can be tuned for such cases. In [9] and [18], for instance, perturbations of integrable systems have been considered, say $$F(u) = A(u) + \varepsilon \,B(u)$$ where $$\varepsilon$$ is s small perturbation parameter. Exploiting this perturbation structure allows the construction of more efficient (de facto) higher-order schemes compared to generic ones.

### 1.1 Overview

We present some new contributions to the topic of splitting methods; here we will concentrate on the generic case, i.e., no special properties of the vector field F are assumed. At first we review the approach from [1] for the automatic setup of order conditions represented by polynomial equations in the coefficients to be determined. Special cases involving symmetries or composition methods based on lower-order schemes can be treated as well. Splitting of the right-hand side of (1.1) into two or three components is considered.

The goal is to identify good schemes of a desired order p . ‘Good’ refers to a compromise between efficiency (minimizing effort) as well as accuracy (minimizing a measure for the expected behavior of the local error). In particular, we focus on the constructions of pairs of schemes of orders $$(p,p+1)$$, where a scheme of order  p acts as a ‘worker’, while a related scheme of order $$p+1$$ plays the role of a ’controller’ for the purpose of practical local error estimation. The idea of using pairs of embedded schemes (an idea related to Runge–Kutta pairs) is due to [17]. Via more flexible embeddings, optimized variants can be constructed. Here, ‘optimization’ means searching for schemes where a reasonable measure for the behavior of the local error becomes minimal among a set of comparable schemes. It is well-known that this is a very relevant point, because such local error measures may vary over several orders of magnitude.We also consider alternative ways of choosing $$(p,p+1)$$-pairs, e.g., adjoint pairs.

Concerning the search for optimal solutions for a given set of order conditions (see Sect. 4), different techniques were applied, depending on the particular case at hand, including exact, symbolic solution representations usingFootnote 1 Maple (for lower-order schemes), or numerical searches using optimization tools or straightforward Monte-Carlo techniques.

The ultimate purpose is adaptive integration of evolution equations based on a reliable local error control. This topic has been studied in detail, in particular in the context of Schrödinger equations, in [2, 46]. In these papers, an alternative method for local error estimation has been constructed and analyzed. It is based on a computable high order approximation of an integral representation of the local error in terms of the defect of the numerical solution. While this approach is rather universal and useful in several cases, the alternative of using optimized pairs of schemes, if applicable, will usually be more efficient.

In Part II of this work we will present a detailed study of adaptive integration, using both approaches for local error estimation, for different types of linear and nonlinear evolution equations.

### Remark 1

Recently we became aware of the paper [9], where a method of deriving order conditions has been proposed which is similar to our approach. Both approaches are based on the notion of a Lyndon basis (also called Lyndon–Shirshov basis) in a free Lie algebra. In view of the similarities between our work and [9], we stress that we have implemented a fully automatic computational procedure for deriving order conditions which requires no extra analytical hand work. This is a versatile implementation, and it can easily be adapted to cover special cases like palindromic schemes, flexible embeddings, and also splitting into more than two operators (see Sects. 2, 3).

The procedure for setting up higher order conditions involves the generation of long weighted sums of power products of noncommuting variables representing the components of the split vector fields. These sums can easily be distributed in order to obtain a significant speed-up in a parallel environment, and we have realized such a version.

### 1.2 Problem setting and notation

For an evolution equation (1.1) where the right-hand side is split into two components,

\begin{aligned} \partial _t u(t) = F(u(t)) = A(u(t)) + B(u(t)), \quad t \ge 0, \end{aligned}
(1.2)

a single step of a multiplicative splitting scheme, starting from u and over a step of length h , is given byFootnote 2

\begin{aligned} \mathscr {S}(h,u) = \mathscr {S}_s(h,\mathscr {S}_{s-1}(h,\ldots ,\mathscr {S}_1(h,u))) \approx \phi _F(h,u), \end{aligned}
(1.3a)

with

\begin{aligned} \mathscr {S}_j(h,v) = \phi _B(b_j\,h,\phi _A(a_j\,h,v)), \end{aligned}
(1.3b)

with appropriate coefficients $$a_j,b_j$$. More general schemes based on splitting into three operators are also considered, see Sect. 2.4, and a special case of additive splitting is also included, see Sect. 2.2.

The local error of a splitting step is denoted by

\begin{aligned} \mathscr {S}(h,u) - \phi _F(h,u) =: \mathscr {L}(h,u). \end{aligned}
(1.4)

### 1.3 Contents

In Sects. 2 and 3 we describe our approach for setting up the order conditions for different types of [pairs of] schemes. Some technical details concerning implementation of this setup procedure are given in Sect. 4. By solving the resulting polynomial systems we have constructed a number of new variants, and we have compiled a collection of practically relevant (old and new) schemes and pairs of schemes up to order $$p=6$$. This collection can be found at

and is also expected to be extended in the future, depending on further investigations on the topic at hand. We will refer to this webpage throughout as reference [3] to avoid listing coefficients in the present paper for the sake of brevity. Some remarks on the schemes collected in [3] are given in Sect. 5; for more detailed information about the properties of the various schemes we also refer to [3]. In Sect. 7 we present a numerical example.

## 2 Order conditions

Many authors have contributed to the topic of finding good methods. For an overview on the topic see [7, 20]. Here we do not attempt to describe the relevant approaches and results in detail but mainly refer to work related to our present activity. For the relevant mathematical background we refer to [7, 15, 20].

Among many others, [8, 10, 12, 13] are devoted to the construction of optimal higher-order methods with real or complex coefficients, either via composition or by solving a set of order conditions generated in different ways. Order conditions take the form of a polynomial system in the unknown coefficients or composition weights $$\omega _\mu$$, see Sect. 2.2. In the following we recapitulate and illustrate by examples how order conditions can be set up according to [1]; as mentioned before, this is similar to one of the approaches taken in [9]. Later on we will also present optimized schemes and pairs of schemes obtained on the basis of this approach, where ‘optimized’ means that a measure for the local error is chosen as small as possible.

### 2.1 Setup of order conditions

There are different ways to generate a polynomial system representing the conditions on the splitting coefficients for a desired order p . An essential theoretical basis is the well-known Baker–Campbell–Hausdorff (BCH) formula, see for example [15].

The approach proposed in [1], which we follow here, also relies on the BCH formula, but order conditions are set up in a completely automatic way. Most of the schemes and pairs of schemes specified in [3] have been obtained on the basis of the algorithm from [1]. In the following we explain and illustrate this approach by means of examples. For the purpose of generating order conditions it is sufficient to consider the case of a linear operator split into two parts A and B . We denote

\begin{aligned} A_j = a_j\,A,\quad B_j = b_j\,B, \quad j=1 \ldots s. \end{aligned}

For the linear case the local error (1.4) is of the form $$\mathscr {L}(h)\,u$$ with a linear operator $$\mathscr {L}(h)$$.

Consider the Taylor expansion of the local errorFootnote 3 of a one-step method starting at  u ,

\begin{aligned} \mathscr {L}(h)\,u = \sum _{q=1}^{p} \frac{h^q}{q!}\,\frac{\mathrm{d}^{q}}{\mathrm{d}h^{q}}\,\mathscr {L}(0)\,u + \frac{h^{p+1}}{(p+1)!}\,\frac{\mathrm{d}^{p+1}}{\mathrm{d}h^{p+1}}\,\mathscr {L}(0)\,u + {\mathscr {O}}(h^{p+2}). \end{aligned}
(2.1)

The method is of order p iff $$\mathscr {L}(h) = {\mathscr {O}}(h^{p+1})$$; thus the conditions for order p are given by

\begin{aligned} \frac{\mathrm{d}}{{\mathrm{d}h}}\,\mathscr {L}(0) = \cdots = \frac{\mathrm{d}^{p}}{\mathrm{d}h^{p}}\,\mathscr {L}(0) = 0. \end{aligned}
(2.2)

For the case of a splitting method we have (with $${\varvec{k}}=(k_1,\ldots ,k_s) \in \mathbb N_0^s$$)

\begin{aligned} \frac{\mathrm{d}^{p}}{\mathrm{d}h^{q}}\,\mathscr {L}(0) = \sum _{|{\varvec{k}}|=q} {q\atopwithdelims (){\varvec{k}}}\cdot \prod \limits _{j=1}^s \sum _{l=0}^{k_{j}} {k_{j}\atopwithdelims ()l}\,B_{j}^{l}\,A_{j}^{k_j-l} - (A+B)^{q}. \end{aligned}
(2.3)

If the conditions (2.2) are satisfied up to a given order p , then the leading term of the local error is given by $$\frac{h^{p+1}}{(p+1)!}\,\tfrac{\mathrm{d}^{p+1}}{\mathrm{d}h^{p+1}}\,\mathscr {L}(0)$$. This leading error term is a linear combination of higher-order commutators of the operators A and B . As explained in [1], a non-redundant set of order conditions can be built in a recursive way by generating the symbolic expressions (2.3) for $$q=1,2,3,\ldots$$ in terms of formally linear but non-commuting operators AB , and identifying coefficients associated with power products of A - and B -factors which uniquely identify commutators out of an appropriate basis of Lie-elements. For this purpose we use the so-called Lyndon basis, also called Lyndon–Shirshov basis, of the free Lie algebra generated by A and B . The elements of this basis are represented by the (associative) Lyndon words over the alphabet $$\{\mathtt {A,B}\}$$, see Table 1.

Let us first illustrate the procedure by means of a simple example.

### Example 1

For $$s=2$$ we have

\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}h} \mathscr {L}(0)&= \underline{(a_1+a_2-1)}\,A + \underline{(b_1+b_2-1)}\,B, \end{aligned}
(2.4a)
\begin{aligned} \frac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0)&= ((a_1+a_2)^2-1)\,A^2 \nonumber \\&\quad + \underline{(2\,a_2\,b_1-1)}\,AB + (2\,a_1\,b_1 + 2\,a_1\,b_2 + 2\,a_2\,b_2 - 1)\,BA \nonumber \\&\quad + ((b_1+b_2)^2-1)\,B^2. \end{aligned}
(2.4b)

The basic consistency condition for order $$p=1$$ is $$\tfrac{\mathrm{d}^{}}{\mathrm{d}h^{}} \mathscr {L}(0)=0$$ which is equivalent to $$a_1+a_2=1$$ and $$b_1+b_2=1$$. Assuming these first-order conditions are satisfied, the second derivative $$\tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0)$$, which now represents the leading error term, simplifies to the commutator expression

\begin{aligned} \frac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0) = \underline{(2\,a_2\,b_1 - 1)}\,[A,B], \end{aligned}
(2.5)

giving the additional condition $$2\,a_2\,b_1 = 1$$ for order $$p=2$$. Assuming now that the conditions for $$p=2$$ are satisfied, the third derivative $$\tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0)$$, which will now represent the leading error term, is a linear combination of the commutators [A, [AB]] and [[AB], B] , namely

\begin{aligned} \frac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0) = \underline{(3\,a_2^2\,b_1 - 1)}\,[A,[A,B]] + \underline{(3\,a_2\,b_1^2 - 1)}\,[[A,B],B]. \end{aligned}
(2.6)

This computation can be automatized:

• Generate the representation (2.4a) of $$\tfrac{\mathrm{d}^{}}{\mathrm{d}h^{}} \mathscr {L}(0)$$ and extract coefficients of the Lyndon words $$\mathtt {A}$$ and $$\mathtt {B}$$. This gives the first-order conditions $$a_1+a_2=1$$ and $$b_1 + b_2= 1$$.

• Generate the representation (2.4b) of $$\tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}} \mathscr {L}(0)$$. For a solution of the equations for order 1, the leading local error will have the form $$\frac{h^2}{2}\,\tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0)$$ with $$\tfrac{\mathrm{d}^{2}}{\mathrm{d}h^{2}}\,\mathscr {L}(0)$$ from (2.5). The coefficient of [AB] in (2.5) is determined by extracting the coefficient of the Lyndon word $$\mathtt {AB}$$ in (2.4b). This gives the equation $$2\,a_2\,b_1 = 1$$ which, together with the first-order conditions, represents a set of conditions for order $$p=2$$.

• Generate the representation of $$\tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}} \mathscr {L}(0)$$ (we do not display it here). For a solution of the equations for order 2, the leading local error will have the form $$\frac{h^3}{6}\,\tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0)$$ with $$\tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}}\,\mathscr {L}(0)$$ from (2.6). The coefficients of [A, [AB]] and [[AB], B] in (2.6) are determined by extracting the coefficients of the Lyndon words $$\mathtt {AAB}$$ and $$\mathtt {ABB}$$ in the expression for $$\tfrac{\mathrm{d}^{3}}{\mathrm{d}h^{3}} \mathscr {L}(0)$$.

In the simple case considered here, there is a one-dimensional manifold of solutions for order $$p=2$$, and for each solution $$\{a_1,a_2,b_1,b_2\}$$ the size of the coefficients in (2.6) is a quality measure.

If a scheme of order 3 is desired, the system of equations is augmented by the further equations $$3\,a_2^2\,b_1 = 1$$ and $$3\,a_2\,b_1^2 = 1$$. (For the case $$s=2$$ displayed here, the resulting system of equations has no solution; we need $$s \ge 3$$.)

In general, for arbitrary s and p , this procedure is continued up to the desired order, by ‘implicit recursive elimination’ as described in [1], automatically producing a generically non-redundant set of order conditions for a desired order  p . This process is based on a special bijection between (associative) Lyndon words and bracketed, non-associative versions of these words which, in our context, are identified with higher-order commutators representing basis elements for the free Lie algebra generated by A and B . The expanded version of such a commutator is a Lie polynomial in terms of the non-commutative variables A and B . The essential point is that its leading monomial, with respect to (alphabetically increasing) lexicographical order, is precisely the monomial represented by the corresponding Lyndon word; see [11].

In the following, the relation ‘$$\mathtt{<}$$’ refers to lexicographical order of words over the alphabet $$\{\mathtt {A,B}\}$$.

### Example 2

Consider a scheme of order $$p=4$$, i.e., assume that the conditions up to order $$p=4$$ are satisfied. Then, $$\tfrac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\,\mathscr {L}(0)$$ is a linear combination of commutators, or non-associative words, listed below and represented by the six Lyndon words of length 5 (see Table 1),

\begin{aligned} {\mathtt {AAAAB< AAABB< AABAB< AABBB< ABABB < ABBBB}}. \end{aligned}

The commutators are bracketed, non-associative versions of these words,Footnote 4

\begin{aligned}{}[A,[A,[A,[A,B]]]]&= \underline{A^4\,B} - 4\,A^3\,B\,A + 6\,A^2\,B\,A^2 - 4\,A\,B\,A^3 + B\,A^4, \\ [A,[A,[[A,B],B]]]&= \underline{A^3\,B^2} - 2\,\underline{\underline{A^2\,B\,A\,B}} + 4\,A\,B\,A\,B\,A - A\,B^2\,A^2 - A^2\,B^2\,A\\&\quad - 2\,B\,A\,B\,A^2 + B^2\,A^3, \\ [[A,[A,B]],[A,B]]&= \underline{A^2\,B\,A\,B} - A^2\,B^2\,A -3\,A\,B\,A^2\,B + 4\,A\,B\,A\,B\,A \\&\quad {} + 2\,B\,A^3\,B - 3\,B\,A^2\,B\,A - A\,B^2\,A^2 + B\,A\,B\,A^2, \\ [A,[[[A,B],B],B]]&= \underline{A^2\,B^3} - 3\,\underline{\underline{A\,B\,A\,B^2}} + 3\,A\,B^2\,A\,B - 2\,A\,B^3\,A\\&\quad + 3\,B\,A\,B^2\,A - 3\,B^2\,A\,B\,A + B^3\,A^2, \\ [[A,B],[[A,B],B]]&= \underline{A\,B\,A\,B^2} - 3\,A\,B^2\,A\,B + 2\,A\,B^3\,A - B\,A^2\,B^2 \\&\quad {} + 4\,B\,A\,B\,A\,B -3\,B\,A\,B^2\,A - B^2\,A^2\,B + B^2\,A\,B\,A, \\ [[[[A,B],B],B],B]&= \underline{A\,B^4} - 4\,B\,A\,B^3 + 6\,B^2\,A\,B^2 - 4\,B^3\,A\,B + B^4\,A. \end{aligned}

As mentioned above, the leading (lowest) monomials in the expanded commutators, in the sense of lexicographical order, correspond to the Lyndon words. Note that $$\underline{\underline{\mathrm{some\; of\; these\; monomials}}}$$ also occur in lower commutators (‘lower’ again in the sense of lexicographical ordering). Let us now denote these six commutators by $$K_k,\, k=1 \ldots 6$$. We a priori know that $$\tfrac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\,\mathscr {L}(0)$$ is of the form, with $$\ell _5=6$$,

\begin{aligned} \frac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\mathscr {L}(0) = \sum _{k=1}^{\ell _5}\,\kappa _k\,K_k \end{aligned}

where the scalars $$\kappa _k$$ are multivariate polynomials of degree 5 in the coefficients $$a_j,b_j$$ of the underlying scheme of order $$p=4$$. Therefore the additional conditions for order $$p=5$$ are given by

\begin{aligned} \kappa _k = 0, \quad k=1 \ldots \ell _5. \end{aligned}
(2.7a)

Extracting these coefficients $$\kappa _k$$ from the expression (2.3) for $$\tfrac{\mathrm{d}^{5}}{\mathrm{d}h^{5}}\,\mathscr {L}(0)$$ is a combinatorial challenge, but we can do better: We simply extract the coefficients of the Lyndon monomials—let us denote them by $$\lambda _k$$—which is a standard operation in computer algebra. Now, instead of (2.7a) we require

\begin{aligned} \lambda _k = 0, \quad k=1 \ldots \ell _5. \end{aligned}
(2.7b)

In our example, for $${\varvec{\kappa }} = (\kappa _1,\ldots ,\kappa _6)^T$$ and $${\varvec{\lambda }} = (\lambda _1,\ldots ,\lambda _6)^T$$ we have

\begin{aligned} {\varvec{\lambda }} = M\,{\varvec{\kappa }}, \quad \text {with} \quad M = \left\lgroup \begin{array}{rrrrrr} 1 &{} &{} &{} &{} &{} \\ &{} 1 &{} &{} &{} &{} \\ &{} -2 &{} 1 &{} &{} &{} \\ &{} &{} &{} 1 &{} &{} \\ &{} &{} &{} -3 &{} 1 &{} \\ &{} &{} &{} &{} &{} 1 \end{array}\right\rgroup , \end{aligned}
(2.7c)

where the lower diagonal entries correspond to the additional occurrence of the $$\lambda _k$$ in non-leading positions. Therefore the systems (2.7a) and (2.7b) are equivalent.

The situation displayed in this example occurs also in the general case. For any order p , the vectors $${\varvec{\kappa }}$$ and $${\varvec{\lambda }}$$ consisting of polynomials of degree $$p+1$$ satisfy $${\varvec{\lambda }} = M\,{\varvec{\kappa }}$$ where M is a lower triangular matrix with unit diagonal. In particular, a Lyndon monomial $$\lambda _k$$ never occurs in an expanded commutator $$K_j$$ for $$j>k$$ because this would contradict the leading position [11] of the Lyndon monomial $$\lambda _j>\lambda _k$$ in $$K_j$$.

### 2.2 Special cases: symmetries

In the sequel,

\begin{aligned} \mathscr {S}^*(h,u) = \mathscr {S}^{-1}(-h,u) \end{aligned}

denotes the adjoint scheme associated with $$\mathscr {S}$$.

The order conditions generated by the algorithm indicated in Sect. 2.1 are generically non-redundant. However, there exist special cases:

• Symmetric  (or: ‘time-symmetric’) one-step schemes are characterized by the property

\begin{aligned} \mathscr {S}(-h,\mathscr {S}(h,u)) = u, \quad \text {i.e.,} \quad \mathscr {S}(h,u) = \mathscr {S}^*(h,u). \end{aligned}
(2.8)

For symmetric splitting schemes we have either $$a_1=0$$ or $$b_s=0$$, and the remaining coefficient tupels $$(a_j)$$ and $$(b_j)$$ are both palindromic. Since symmetric schemes have an even order p (cf. [15, Chapter 3]), only odd-order conditions for an appropriately reduced number of free coefficients need to be imposed. The general algorithm described in Sect. 2.1 can easily be adapted to this case.

• The following type of schemes seems not to have been considered earlier in the literature:Footnote 5 Palindromic  schemes, or ‘reflected schemes’ in the terminology of [1], are characterized by $$b_j = a_{s+1-j},\, j=1 \ldots s$$, i.e.,

\begin{aligned}&(a_1,b_1,a_2,b_2,\ldots ,a_{s-1},b_{s-1},a_s,b_s) \nonumber \\&\quad = (a_1,b_1,a_2,b_2,\ldots ,b_2,~~~\,a_2,~~~\,b_1,a_1). \end{aligned}
(2.9)

Assume a scheme of order p is given, and consider a splitting step of the form (1.3). Interchanging the roles of A and B , i.e., replacing (1.3) by

\begin{aligned} \check{\mathscr {S}}(h,u) = \check{\mathscr {S}}_s(h,\check{\mathscr {S}}_{s-1}(h,\ldots ,\check{\mathscr {S}}_1(h,u))), \end{aligned}
(2.10a)

with

\begin{aligned} \check{\mathscr {S}}_j(h,v) = \phi _A(b_j\,h,\phi _B(a_j\,h,v)), \end{aligned}
(2.10b)

also results in a scheme of order p . If $$\mathscr {S}$$ is palindromic then

\begin{aligned} \mathscr {S}(-h,\check{\mathscr {S}}(h,u)) = u, \quad \text {i.e.,} \quad \check{\mathscr {S}}(h,u) = \mathscr {S}^*(h,u). \end{aligned}
(2.11)

Thus we infer from [15, Theorem II.3.2] that in the palindromic case the local errors $$\mathscr {L}(h,u) = \mathscr {S}(h,u) - \phi _F(h,u)$$ and $$\check{\mathscr {L}}(h,u) = \check{\mathscr {S}}(h,u) - \phi _F(h,u)$$ are related via

\begin{aligned} \mathscr {L}(h,u)&=C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}
(2.12a)
\begin{aligned} \check{\mathscr {L}}(h,u)&= (-1)^p\,C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}
(2.12b)

with $$C(u) = \tfrac{1}{(p+1)!}\,\tfrac{\mathrm{d}^{p+1}}{\mathrm{d}h^{p+1}}\,\mathscr {L}(0,u)$$. For an ansatz with palindromic coefficients, exchanging the roles of A and B in the algorithm from Sect. 2.1 will lead to the identical set of order conditions. Therefore the order conditions associated with ‘Lyndon twins’ are pairwise identical. Here, we call a pair of Lyndon words a twin if one of them is obtained by exchanging the role of A and B and reading it from right to left, see Table 1. For instance, the 6 words of odd length 5 consist of three twins; the 9 words of even length 6 consist of three twins, the selfie $$\mathtt {AAABBB}$$, and two solitary words. Due to this redundancy the number of order conditions is appropriately reduced.

• Higher order one-step schemes can be generated by m -fold composition  of lower-order schemes with appropriately chosen sub-steps $$h_\mu = \omega _\mu h$$ satisfying $$\omega _1 + \cdots + \omega _m = 1$$ plus additional conditions guaranteeing that a certain order is obtained.Footnote 6

A popular class of composition methods are symmetric Strang compositions. Schemes of this type of orders 4, 6 and higher were first devised in [23]. Some of the composition coefficients have to be chosen negative, and the local error measures of these composition schemes are rather large. On the other hand, for higher orders, composition beats the generic lower limits on the number s of stages such that a given order p can be expected. For instance, the sevenfold 6-th order symmetric Strang composition [3, ‘Y 8-6’] recombines into an 8-stage scheme, whereas the generic number of order conditions for a symmetric scheme of order $$p=6$$ is 10, which would require $$s=10$$ stages involving 11 free coefficients.

Evidently, (symmetric) compositions are an attractive option for constructing higher-order schemes. Therefore we have included this class into our considerations concerning the search for optimal variants (see Sect. 4).

### 2.3 Complex coefficients

Our considerations are not restricted to schemes with real coefficients $$a_j,b_j$$. Complex schemes, with coefficients having positive real parts, are appropriate for the application of splitting methods to parabolic problems, since real schemes with positive coefficients do not exist for order $$p \ge 3$$, see [7]. For this class of methods, in particular based on complex compositions, we refer to [8, 13].

### 2.4 Splitting into more than two operators

We also consider evolution equations where the right-hand side splits into three parts,

\begin{aligned} \partial _t u(t) = F(u(t)) = A(u(t)) + B(u(t)) + C(u(t)), \quad t \ge 0, \quad u(0)~\text {given,}\qquad \end{aligned}
(2.13)

and according multiplicative splitting schemes,

\begin{aligned} \mathscr {S}(h,u) = \mathscr {S}_s(h,\mathscr {S}_{s-1}(h,\ldots ,\mathscr {S}_1(h,u))) \approx \phi _F(h,u), \end{aligned}
(2.14a)

with

\begin{aligned} \mathscr {S}_j(h,v) = \phi _C(c_j\,h,\phi _B(b_j\,h,\phi _A(a_j\,h,v))). \end{aligned}
(2.14b)

The methodology from [1] can be directly generalized to the case of splitting into more than two operators. For the practically relevant case of splitting into three operators ABC , as in (2.14), the representation (2.3) generalizes as follows, with $$A_j = a_j\,A,\, B_j=b_j\,B,\, C_j=c_j\,C$$, and $${\varvec{k}}=(k_1,\ldots ,k_s) \in \mathbb N_0^s$$$${\varvec{l}} = (l_A,l_B,l_C) \in \mathbb N_0^3$$:

\begin{aligned} \frac{\mathrm{d}^{q}}{\mathrm{d}h^{q}}\,\mathscr {L}(0) = \sum _{|{\varvec{k}}|=q} {q\atopwithdelims (){\varvec{k}}} \prod \limits _{j=1}^{s} \sum _{|{\varvec{l}}|=k_j} {k_j\atopwithdelims (){\varvec{l}}}\, C_j^{l_C}\,B_j^{l_B}\,A_j^{l_A} - (A+B+C)^q.\qquad \end{aligned}
(2.15)

On the basis of these identities, the algorithm from Sect. 2.1 generalizes in a straightforward way. The Lyndon basis representing independent commutators now corresponds to Lyndon words over the alphabet $$\{\mathtt {A,B,C}\}$$, see Table 2.

Concerning symmetries, similar considerations as in Sect. 2.2 apply.

For a general convergence theory of ABC-splitting for the linear case and some applications we refer to [6]. For example, splitting into three operators can be used to handle evolution equations where the right-hand side splits up into two non-autonomous parts. Introducing the independent variable  t as an unknown variable satisfying $$t'=1$$, such a problem can be formally considered as an autonomous system split into three parts. In this case, splitting means that the variable t is frozen over several subintervals comprising an integration step. Since the ODE $$t'=1$$ is trivial, a large number of higher-order commutators vanishes in this case, and therefore the number of necessary order conditions is significantly reduced, a situation to be considered in further work.

## 3 Pairs of splitting schemes

For the purpose of efficient local error estimation as a basis for adaptive stepsize selection, using pairs of related schemes is a well-established idea. One of the schemes, of order p , acts as the worker, and the other, of order $$p+1$$, is the controller responsible for local error estimation.Footnote 7 Criteria for the selection of pairs of schemes are accuracy and computational efficiency.

Order conditions for pairs of schemes of the types listed below can be generated with minor modifications of the approach described in Sect. 2.

• Embedded pairs. In [17], pairs of splitting schemes of orders p and $$p+1$$ are specified. The idea is to select a controller $$\bar{\mathscr {S}}$$ of order $$p+1$$ and to construct a worker $$\mathscr {S}$$ of order  p for which a maximal number of stages $$\mathscr {S}_j$$ coincides with those of the controller. Let $$a_j,b_j$$ and $$\bar{a}_j,\bar{b}_j$$ denote the coefficients of the worker and controller, respectively. The approach adopted in [17] may be called static, finding $$\mathscr {S}$$ and $$\bar{\mathscr {S}}$$ such that $$a_j={\bar{a}}_j$$ and $$b_j={\bar{b}}_j$$ for as many $$j=1,2,\ldots$$ as possible. In this sense the schemes are related to each other but, in general, the total number of order conditions, and thus the total number of necessary evaluations, is the same as for an arbitrary unrelated $$(p,p+1)$$ pair.

Here we develop the idea of embedding further: again we fix a ‘good’ controller of order $$p+1$$ and wish to adjoin to it a ‘good’ worker of order p . Since the number of stages $$\bar{s}$$ of $$\bar{\mathscr {S}}$$ will be higher than the number of stages s of $$\mathscr {S}$$, we can select an optimal embedded worker $$\mathscr {S}$$ from a set of candidates obtained by flexible embedding, where the number of coinciding coefficients is not a priori fixed.

### Example 3

In [17], an embedded (3, 4) -pair was constructed, where the controller is an optimized symmetric scheme of order $$p=4$$ with $$s=7$$ stages due to [10], with local error measure $$\mathrm{LEM}=0.01$$ (‘LEM’ in the sense of (4.2b) below). The worker specified in [17] is a scheme of order $$p=3$$ with $$s=6$$ stages, where the coefficients $$a_1,a_2,a_3,a_4$$ and $$b_1,b_2,b_3$$ coincide with those of the controller. This amounts to 7 additional evaluations for the worker, and its local error measure is $$\mathrm{LEM}=0.2$$.

For flexible embedding, in contrast, we consider all possible embedded workers, and we find that a scheme of order $$p=3$$ with $$s=4$$ stages is to be preferred, see [3, Emb 4/3 BM PRK/A], where $$a_1,a_2$$ and $$b_1$$ coincide with those of the controller. This amounts to five additional evaluations for the worker, and it has $$\mathrm{LEM}=0.1$$.

• Milne pairs. In the context of multistep methods for ODEs, the so-called Milne device is a well-established technique for constructing pairs of schemes. In our context, one may aim for finding a pair $$(\mathscr {S},\tilde{\mathscr {S}})$$ of schemes of the same type, with equal s and p , such that their local errors $$\mathscr {L},\tilde{\mathscr {L}}$$ are related according to

\begin{aligned} \mathscr {L}(h,u)&= C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}
(3.1a)
\begin{aligned} \tilde{\mathscr {L}}(h,u)&= \gamma \,C(u)\,h^{p+1} + {\mathscr {O}}(h^{p+2}), \end{aligned}
(3.1b)

with $$\gamma \not = 1$$. Then, the additive scheme

\begin{aligned} {\bar{\mathscr {S}}}(h,u) = -\frac{\gamma }{1-\gamma }\,\mathscr {S}(h,u) + \frac{1}{1-\gamma }\,\tilde{\mathscr {S}}(h,u) \end{aligned}

is a method of order $$p+1$$, and

\begin{aligned} \mathscr {S}(h,u) - {\bar{\mathscr {S}}}(h,u) = \frac{1}{1-\gamma }\,\big ( \mathscr {S}(h,u) - \tilde{\mathscr {S}}(h,u) \big ) \end{aligned}

provides an asymptotically correct local error estimate for $$\mathscr {S}(h,u)$$.

• Adjoint pairs. Let $$\mathscr {S}$$ be a scheme of odd order p and and $$\mathscr {S}^*$$ its adjoint, see Sect. 2.2. Due to [15, Theorem II.3.2] the leading error terms of $$\mathscr {S}$$ and its adjoint $$\mathscr {S}^*$$ are identical up to the factor $$-1$$. Therefore, the averaged additive scheme

\begin{aligned} {\bar{\mathscr {S}}}(h,u) = \frac{1}{2}\big ( \mathscr {S}(h,u)+\mathscr {S}^*(h,u) \big ) \end{aligned}
(3.2)

is a method of order $$p+1$$, and

\begin{aligned} \mathscr {S}(h,u) - {\bar{\mathscr {S}}}(h,u) = \frac{1}{2}\big ( \mathscr {S}(h,u) - \mathscr {S}^*(h,u) \big ) \end{aligned}

provides an asymptotically correct local error estimate for $$\mathscr {S}(h,u)$$. In this case the additional effort for computing the local error estimate is identical with the effort for the worker $$\mathscr {S}$$ but not higher as is the case for embedded pairs. An example are palindromic pairs, where $$\mathscr {S}$$ is palindromic (of odd order p ), such that $$\mathscr {S}^*= \check{\mathscr {S}}$$, see Sect. 2.2.

For detailed comments on a number of new pairs listed in [3], see Sect. 5.

## 4 Implementation aspects: constructing schemes and minimizing local error terms

Our approach for setting up order conditions described in Sect. 2.1 has been implemented in Maple 18. We use the Physics package for the manipulation of noncommuting symbols, and tables of Lyndon words generated using an algorithm devised in [14]. Since the number of terms in (2.3) resp. (2.15) rapidly increases with q we have implemented a parallel version relying on Maple’s Grid package. In particular, the job of generating all the terms in the long sums (2.3) and (2.15) can be (equi-)distributed over several parallel threads.

The resulting set of order conditions is a multivariate polynomial system which, for higher orders, requires numerical solution techniques. Once a scheme of order p has been found, its leading local error term is of the form (see Sect. 2)

\begin{aligned} \frac{h^{p+1}}{(p+1)!}\,\frac{\mathrm{d}^{p+1}}{\mathrm{d}h^{p+1}} \mathscr {L}(0) = \sum _{k=1}^{\ell _{p+1}} \kappa _{p+1,k}\,K_{p+1,k}, \end{aligned}
(4.1)

with $$\ell _{p+1}$$ commutators $$K_{p+1,k}$$ associated with Lyndon words of length $$p+1$$. To compare schemes of equal order p one may consider

\begin{aligned} \left( \,\sum _{k=1}^{\ell _{p+1}} |\kappa _{p+1,k}|^2 \right) ^{1/2} \end{aligned}
(4.2a)

as a reasonable measure for the accuracy of a scheme. However, we use the quantity

\begin{aligned} \text {LEM} := \left( \,\sum _{k=1}^{\ell _{p+1}} |\lambda _{p+1,k}|^2 \right) ^{1/2} \end{aligned}
(4.2b)

instead. Using (4.2b) has the advantage that the coefficients $$\lambda _k = \lambda _{p+1,k}$$ are exactly those which are generated in the course of the setup of the conditions for order $$p+1$$, see Sect. 2.1, while the coefficients from (4.2a) are more difficult to compute (cf. the discussion in Sect. 2.1). Since different particular solutions to the order conditions typically result in leading local error terms varying over several orders of magnitudes, we consider (4.2b) equally reasonable as (4.2a).

For finding and evaluating solutions and pairs of solutions we follow two different strategies.

• For the case where the number of equations equals the number of free coefficients we expect a set of isolated solutions. In this case we use the fsolve function in Maple combined with a Monte-Carlo strategy for generating different initial intervals. Higher precision is used to generate solutions with double precision accuracy. For each detected solution the LEM (4.2b) is computed.

• Especially for the case where the number of equations is smaller than the number of free coefficients, the problem is to be considered as a constrained minimization problem: minimize the LEM representing the objective function, with the order conditions imposed as nonlinear equality constraints. To this end we employ state-of-the-art techniques which have also been applied for the construction of special classes of Runge–Kutta methods, see for instance [16]. In particular we have used the MATLABFootnote 8 optimizer fmincon. Again a large number of initial guesses are generated randomly, since this optimization problem is nonconvex in general. The results cannot be guaranteed globally optimal, but results from an exhaustive search usually suggest that this is indeed the case. A post-processing, i.e., refining the solutions to full double precision, is again performed in Maple using higher precision sfloat arithmetic.

We have also re-checked a number of known methods, refined their coefficients to full double precision, and computed their LEMs.

## 5 Schemes from the collection [3]

This collection is not intended to be exhaustive. It includes some known and quite a number of new schemes, in particular pairs of schemes, up to order $$p=6$$, with their essential properties. Some methods are included mainly for the sake of completeness or their historical significance.

In the following we comment on some of these methods; for complete information, consult [3]. ‘Best’ or ‘optimal’ means that it has minimal LEM (4.2b) among a certain class of methods with comparable effort for a given order p . In some simple cases such optimality properties can be established theoretically; for higher orders we have resorted to more or less exhaustive numerical search.

Methods whose label contains the letter ‘A’ are new, or taken again into consideration in the context of constructing pairs, or their LEM has been computed for the first time.Footnote 9 The list also includes some pairs of embedded schemes (‘Emb ...’), pairs of Milne type (‘Milne ...’), and palindromic pairs (‘PP ...’), see Sect. 3.

More detailed information about all these methods can be found on the webpage [3].

### 5.1 Splitting into two operators (‘AB schemes’)

Real coefficients.

• The best schemes up to order $$p=5$$ we have found are palindromic:

• best 2-stage 2nd order’ ($$s=p=2$$).

• Emb 3/2 AKS’ (palindromic controller with $$s=p=3$$).

• Emb 4/3 AKS p’ (palindromic controller with $$s=5$$, $$p=4$$). In particular, this scheme has essentially the same LEM as the fourth order scheme from [10] which has been used in [17], but it has only 5 stages instead of 7.

• Emb 5/4 A’ (palindromic controller with $$s=8$$, $$p=5$$), see also ‘PP 5/6 A’.

• Emb 5/4 AK (ii)’ is an optimized embedded pair. The controller is a new scheme with $$s=7$$, $$p=5$$, and the worker of order $$p=4$$ is chosen out of several dozens of candidates of order 4 which share the same computational effort but have LEMs varying over several orders of magnitudes.

• Palindromic pairs: ‘PP 3/4 A’, ‘PP 5/6 A’.

Complex coefficients (with positive real parts).

• Since for order $$p=3$$ we need 5 conditions, the question is whether there exists a third-order scheme with $$s=3$$ and 5 evaluations. It turns out that the only scheme of this type, ‘A 3-3 c’, has complex coefficients.

• A 4-4 c’ ($$s=4$$, $$p=4$$) is the best complex symmetric Strang composition method of order  4 ; see also [12, 13].

• Emb 3/2 A c’ and ‘Emb 4/3 A c’ are embedded pairs with palindromic controller and optimized worker. We note that the controller in ‘Emb 4/3 A c’ ($$s=5$$, $$p=4$$) has a significantly smaller LEM than ‘A 4-4 c’ (factor $$\approx 20$$).

• C 8-6 c’ ($$s=8$$, $$p=6$$) is the best symmetric complex Strang composition method of order  6 ; see also [12, 13].

• Palindromic pairs: ‘PP 3/4 A c’, ‘PP 5/6 A c’.

### 5.2 Splitting into three operators (‘ABC schemes’)

Due to the rapidly increasing number of generic order conditions, finding general higher order schemes would be a very challenging task for this case. For $$p=6$$, for instance, the generic number of order conditions is 196 for the general case and 59 for the symmetric case. For $$p=6$$ we therefore only consider real or complex Strang compositions which are easier to construct and lead to more compact schemes. Generating the expression for the leading error term $$\tfrac{\mathrm{d}^{7}}{\mathrm{d}h^{7}} \mathscr {L}(0)$$ for the purpose of computing the LEM for $$p=6$$, involving 312 coefficients (see Table 2), is computationally expensive, but it can be done at reasonable effort, for the purpose of computing the LEM of a given composition and comparing different variants.

Real coefficients.

• AK 5-2’ ($$s=5$$, $$p=2$$, 9 evaluations) appears to be a possible rival of the Strang scheme ($$s=3$$, $$p=2$$, 5 evaluations), with a LEM which is smaller by a factor $$\approx 7$$.

• PP 3/4 A 3’ is a palindromic pair based on the best palindromic scheme found for $$s=6$$, $$p=3$$.

• Y 7-4’ ($$s=7, p=4$$, 13 evaluations) is the best symmetric Strang composition of order $$p=4$$. It is the analog of the AB composition ‘Y 4-4’, with the same composition weights.

• AK 11-4’ ($$s=11$$, $$p=4$$, 21 evaluations) has been found on the basis of 11 conditions for a symmetric ABC scheme of order 4. Its LEM is smaller by a factor $$\approx 13$$ compared to ‘Y 7-4’.

• AY 15-6’ ($$s=15, p=6$$) is the best symmetric Strang composition of order $$p=6$$. It is the analog of the AB composition ‘Y 8-6’, with the same composition weights.

Complex coefficients (with positive real parts).

• AK 7-4 c’ ($$s=7, p=4$$) is the best symmetric Strang composition of order $$p=4$$. It is the analog of the AB composition ‘A 4-4-c’, with the same composition weights.

• AK 15-6 c’ ($$s=15, p=6$$) is the best symmetric Strang composition of order $$p=6$$. It is the analog of the AB composition ‘C 8-6-c’, with the same composition weights.

## 6 Palindromic schemes: discussion and open questions

As indicated in Sect. 2.2, one motivation for considering palindromic schemes is the fact that they are easier to construct. Moreover, as already mentioned in Sect. 5, small error constants are usually observed in this case. Apparently, palindromic schemes tend to have minimal LEMs among a set of competitors, for instance the third-order scheme in the pair ‘PP 3/4 A’ (a theoretical explanation for this observation is missing). This is the reason why we have included some adjoint pairs of (optimized) palindromic type of orders $$(p,p+1)$$, p odd, in our collection [3].

Palindromic schemes have a certain type of symmetry, but they are not time-symmetric. Investigating special properties of such schemes appears to be of interest in the context of geometric integration, for instance when they are applied to partitioned systems of the form

\begin{aligned} \partial _t u&= f(v), \nonumber \\ \partial _t v&= g(u), \end{aligned}
(6.1)

with the natural splitting of the vector field (fg) into (f, 0) and (0, g) . In this case, splitting methods are equivalent to a subclass of Partitioned Runge–Kutta (PRK) methods, characterized by a pair $$(\varvec{A},\varvec{\alpha }),\,(\varvec{B},\varvec{\beta })$$ of Runge–Kutta schemes. For each AB splitting scheme, the associated PRK coefficients satisfy

\begin{aligned} \varvec{\alpha }_i\,\varvec{B}_{ij} + \varvec{\beta }_j\,\varvec{A}_{ji} = \varvec{\alpha }_i\,\varvec{\beta }_j, \quad i,j=1 \ldots s. \end{aligned}

As a consequence, all splitting schemes preserve quadratic invariants when applied to a partitioned system (6.1), and they are symplectic when applied to a Hamiltonian system with a separable Hamiltonian $$H(p,q) = T(p) + U(q)$$, see [15, Theorem IV.2.4 and Theorem VI.4.6].

Extending these considerations to PRK schemes applied to general partitioned systems

\begin{aligned} \partial _t u&= f(u,v), \nonumber \\ \partial _t v&= g(u,v), \end{aligned}
(6.2)

again from [15] we know that under the additional condition $$\varvec{\alpha }_i = \varvec{\beta }_i,~ i=1 \ldots s$$, the invariance properties mentioned above remain valid for (6.2). The latter condition is satisfied if the scheme is a composition of steps of symplectic Euler type, i.e., for $$a_i=b_i$$. In the palindromic case we have $$\varvec{\beta }_i = \varvec{\alpha }_{s+1-i}$$, and this does not appear to be a useful property in view of invariance questions. (Cf., for instance, the proof of the general assertion of Theorem IV.2.4 in [15], which does not carry over.)

Summarizing, we may say that palindromic schemes by now have not been completely understood, and this may deserve further investigations.

## 7 Numerical example

For a numerical illustration, in particular concerning the expected performance of palindromic schemes, we consider an example of a system of coupled nonlinear evolution equations of Schrödinger type (see [22]),

\begin{aligned} \begin{aligned} \mathrm {i}\,\left( \frac{\partial \psi _1}{\partial t} + \delta \frac{\partial \psi _1}{\partial x}\right) + \frac{1}{2}\,\frac{\partial ^2 \psi _1}{\partial x^2} + \big ( |\psi _1|^2 + e\,|\psi _2|^2 \big ) \psi _1&= 0, \\ \mathrm {i}\,\left( \frac{\partial \psi _2}{\partial t} - \delta \frac{\partial \psi _2}{\partial x}\right) + \frac{1}{2}\,\frac{\partial ^2 \psi _2}{\partial x^2} + \big ( e\,|\psi _1|^2 + |\psi _2|^2 \big ) \psi _2&= 0, \end{aligned} \end{aligned}
(7.1)

with initial condition chosen such that the exact solution is a pair of solitons,

\begin{aligned} \psi _{1(x,t)}= & {} \frac{\sqrt{2\,\alpha }}{1+e}\,\text {sech}\big ( \sqrt{2\,\alpha }\,(x-v\,t) \big )\, \mathrm {e}^{{\mathrm {i}}\left( (v-\delta )x + (\alpha -({v}^{2}-\delta ^2)/2)\,t\right) }, \\ \psi _{2(x,t)}= & {} \frac{\sqrt{2\,\alpha }}{1+e}\,\text {sech}\big ( \sqrt{2\,\alpha }\,(x-v\,t) \big )\, \mathrm {e}^{{\mathrm {i}}\left( (v+\delta )x + (\alpha -(v^{2}-\delta ^{2})/2)\,t\right) }, \end{aligned}

which is exponentially decreasing with |x| . We start at $$t=0$$, the parameters are chosen as $$\delta =0.5$$, $$\alpha =1.0$$, $$v=1.1$$, and $$e=0.8$$.

We impose periodic boundary conditions on the interval $$x_{min},x_{max}=[-50,70]$$ using an equidistant grid of size 2048. For splitting we choose the time step h and separately integrate

• the kinetic part (‘A’) involving the derivatives w.r.t. x , using a Fourier spectral discretization,

• and the nonlinear ‘ODE part’ (‘B’), which can be exactly propagated: at each grid point  x , the respective solution $$(\psi _{1,B},\psi _{2,B}) = (\psi _{1,B}(x,t),\psi _{2,B}(x,t))$$ of the ODE system

\begin{aligned} \mathrm {i}\,\frac{\mathrm {d} \psi _{1,B}}{\mathrm {d}\,t} + \big ( |\psi _{1,B}|^2 + e\,|\psi _{2,B}|^2 \big ) \psi _{1,B}&= 0, \\ \mathrm {i}\,\frac{\mathrm {d} \psi _{2,B}}{\mathrm {d}\,t} + \big ( e\,|\psi _{1,B}|^2 + |\psi _{2,B}|^2 \big ) \psi _{2,B}&= 0, \end{aligned}

starting at $$t_0$$ is given by

\begin{aligned} \psi _{1,B}(x,t)&= \mathrm {e}^{\,\mathrm {i}\,(t-t_0)\,\left( |\psi _{1,B}(x,t_0)|^2 + e\,|\psi _{2,B}(x,t_0)|^2\right) } \psi _{1,B}(x,t_0), \\ \psi _{2,B}(x,t)&= \mathrm {e}^{\,\mathrm {i}\,(t-t_0)\,\left( e\,|\psi _{1,B}(x,t_0)|^2 + |\psi _{2,B}(x,t_0)|^2\right) } \psi _{2,B}(x,t_0). \end{aligned}

All computations were performed in standard double precision arithmetic. In Tables 3 and 4, ‘err’ refers to a canonically scaled discrete $$L_2$$ - norm, and ‘ord’ refers to the order observed.