1 Introduction

The results in this paper show how Lie polynomials [1] and the combinatorics of words [2] are basic to the study of tree-level scattering amplitudes in field theory and string theory. We give self-contained proofs of the identities that underpin coloured amplitudes and the double copy at tree-level using simple properties of Lie polynomials. We will see that the cleanest description is in terms of the Berends–Giele multiparticle fields of biadjoint scalar theory, with values in Lie polynomials.

The free Lie algebra, \({\mathscr {L}}\), is the space of linear combinations of ‘Lie monomials’, which are nested commutators of ‘letters’; our letters will be taken to be the natural numbers (\(1,2,3,\ldots \)). \({\mathscr {L}}\) is a subspace of the space of linear combinations of ‘words’ formed from the natural numbers. The connection to gauge theory arises because there is a natural map from Lie monomials \(\Gamma \in {\mathscr {L}}\) to the colour structures that appear in gauge theories, for any choice of gauge Lie algebra, and for any (single trace) gauge theory Lagrangian. Moreover, there is also a correspondence between Lie monomials \(\Gamma \in {\mathscr {L}}\) and labelled binary trees with a given root.

The double copy starts by expressing Yang-Mills tree amplitudes in the form [3]

$$\begin{aligned} A= \sum _\Gamma \frac{N_\Gamma c_\Gamma }{s_\Gamma }\,. \end{aligned}$$
(1.1)

Here \(\Gamma \) denotes trivalent graphs, \(s_\Gamma \) denotes the product of denominator propagator factors associated to the graph, and \(c_\Gamma \) denotes the corresponding colour factor.Footnote 1 The numerators \(N_\Gamma \) are functions of momenta and gluon polarization data. These are said to be ‘BCJ numerators’ if they satisfy colour-kinematics duality, which means that they satisfy

$$\begin{aligned} N_{\Gamma }+N_{\Gamma '}+N_{\Gamma ''}=0, \end{aligned}$$

whenever \(\Gamma +\Gamma '+\Gamma ''= 0\). In other words, the \(N_\Gamma \) are ‘BCJ numerators’ if \(\Gamma \mapsto N_\Gamma \) is a homomorphism from \({\mathscr {L}}\) to the space of functions of the kinematic data.Footnote 2 Such numerators exist for Yang-Mills, and the key example of the double copy is that replacing \(c_\Gamma \) in (1.1) by another copy of \(N_\Gamma \) yields gravity amplitudes [4]. BCJ numerators are known for many coloured theories and can be used to obtain the tree amplitudes of any theory known to participate in the double copy. This includes gauge and gravity theories and their relatives, such as brane theories, with and without supersymmetry, as well as tree-level string amplitudes; see [5] for an up-to-date review of progress and references to the literature. The most basic example is to replace \(N_\Gamma \) in (1.1) by \(c_\Gamma \). This yields the amplitudes of the biadjoint scalar theory, which is the backbone of the double copy.

Lie polynomials are ubiquitous in the auxiliary structures that are used to study amplitudes, and with hindsight can be seen in the multiparticle vertex operators in conventional string theory [6], in the geometry of the space of Mandelstam variables [7], and in the CHY formulae and ambitwistor strings [8, 9], where a prominent role is played by \(\mathcal {M}_{0,n}\), the moduli space of n-points on the Riemann sphere [10, 11]. However, our aim here is to prove basic results directly using only the Lie polynomial structure.

The following sections summarize our results.

1.1 Berends–Giele recursion and planar binary trees

In Sect. 2, we review the properties of the space of Lie polynomials, \({\mathscr {L}}\), and its dual \({\mathscr {L}}^*\). Elements of the dual, \({\mathscr {L}}^*\), can be expressed as ‘words modulo shuffles’. For a Lie monomial \(\Gamma \in {\mathscr {L}}\) and a word P, the duality pairing is denoted \((P,\Gamma )\). There is a correspondence between Lie monomials \(\Gamma \in {\mathscr {L}}\) (up to sign) and binary trees, i.e., trivalent rooted tree graphs [12]. If P is a word such that \((P,\Gamma )=1\), then P defines a planar embedding of the tree associated to \(\Gamma \).

Section 3 reviews the Berends–Giele recursion relations for biadjoint scalar theory. For \(P\in {\mathscr {L}}^*\), we reduce this problem to studying the recursion (as in [13])

$$\begin{aligned} b(P)=\frac{1}{s_P} \sum _{XY=P} [b(X),b(Y)]\,, \qquad b(i) = i \end{aligned}$$
(1.2)

where \(s_P\) is the Mandelstam variable associated to the word P. The b(P) are valued in the space of Lie polynomials, \({\mathscr {L}}\). Moreover, the relation (1.2) is solved by

$$\begin{aligned} b(P)=\sum _\Gamma \frac{(P,\Gamma ) \Gamma }{s_\Gamma }, \end{aligned}$$
(1.3)

where the sum is over Lie monomials \(\Gamma \), defined up to sign.

The b(P) naturally define what we call Lie polynomial amplitudes. These are obtained by removing the off-shell external propagator to obtain colour ordered partial amplitudes m(Pn) valued in Lie polynomials:

$$\begin{aligned} m(Pn):=\lim _{s_P\rightarrow 0} s_P b(P). \end{aligned}$$
(1.4)

The pairing of (1.4) with an ordering gives the double colour ordered partial amplitudes of the biadjoint theory \(m(Pn,Qn):=(Q,m(Pn))\) [14].

For a given gauge theory, we interpret BCJ numerators \(N_\Gamma \) as given by homomorphism, N, from the free Lie algebra to functions of their kinematic data, as in [10]. The existence of such a homomorphism is special to those gauge theories that participate in the double copy, and we write down examples for NLSM and SYM theories in § 7. The amplitudes are obtained by acting on m(Pn) with N. They satisfy the Kleiss Kuijf (KK) relations and the Bern Carrasco Johansson (BCJ) relations because of the basic results that we prove about the \({\mathscr {L}}\)-valued b(P).

1.2 BCJ amplitude relations from a new Lie bracket

It was argued in [6,7,8,9,10,11,12,13,14,15] that BCJ amplitude relations could be expressed using the map defined in [6] that was there called the ‘S-map’. We will show that this map defines a Lie bracket in the dual space of Lie polynomials. We will call this Lie bracket the S-bracket and denote it with braces: \(\{\,,\}\).

We prove that the BCJ amplitude relations of [6,7,8,9,10,11,12,13,14,15] follow from the basic identity

$$\begin{aligned} b(\{P,Q\})= \left[ b(P),b(Q)\right] , \end{aligned}$$
(1.5)

which generalizes the off-shell BCJ relations of [16]. Thus b maps the \(\{,\}\)-bracket to the standard Lie bracket. We also show that \(b: P\mapsto b(P)\) is an invertible map, and that \(\{,\}\) is the pullback of [, ] under this map. In particular \(\{,\}\) is a Lie bracket.

The BCJ relations for amplitudes are a consequence of (1.5), which implies that \(b(\{P,Q\})\) does not have pole in \(1/s_{PQ}\), and hence that

$$\begin{aligned} m(\{P,Q\},n)=0, \end{aligned}$$

in the limit as \(s_{PQ}\rightarrow 0\).

1.3 The KLT inner product and its generalized matrix

The Kawai–Lewellen-Tye (KLT) matrix [17,18,19] relates Yang-Mills partial amplitudes to gravity amplitudes. It arises in a natural way from the S-bracket. If \(\Gamma \) is a Lie monomial, let \(\{\Gamma \}\) be obtained by replacing every pair of brackets [, ] with an S-bracket \(\{,\}\). This is well defined because the S-bracket is Lie. We use this to define a KLT map:

$$\begin{aligned} S: \Gamma \mapsto \{\Gamma \}, \end{aligned}$$

which is valued in \({\mathscr {L}}^*\). Using the duality pairing between \({\mathscr {L}}\) and \({\mathscr {L}}^*\), the KLT map defines a symmetric bilinear form on \({\mathscr {L}}\): \(S(\Gamma _1,\Gamma _2):=(\{\Gamma _1\}, \Gamma _2)\). The conventional KLT matrix is recovered by evaluating the matrix elements of this map in a basis. In particular, the ‘generalized KLT matrix’ of [13] is given by

$$\begin{aligned} S(P,Q)= (\{\ell [P]\},\ell [Q])\,, \end{aligned}$$
(1.6)

where \(\ell \) denotes the complete left bracketings:

$$\begin{aligned} \ell [123\ldots n]:=[\ldots [1,2]\ldots n]\,. \end{aligned}$$
(1.7)

Cachazo, He and Yuan [8] showed that biadjoint scalar amplitudes are in some sense the inverse to the KLT matrix, see [20,21,22]. Using the Berends–Giele formula for biadjoint amplitudes, [14], this statement precisely follows from the basic statement that the maps b and S are inverses of each other, which is our main result.

1.4 Numerators and cobrackets

In Sect. 8 we show that the “contact term map” defined in [23] is the Lie co-bracket dual to \(\{~,~\}\); it gives rise to a Lie co-algebra structure on \({\mathscr {L}}^*\). In the context of pure spinor superstrings, [23] the contact-term map encodes the BRST variations of local multiparticle superfield numerators satisfying generalized Jacobi identities [6, 24, 25]. These BRST variations play a central role in the recent developments in the explicit calculation of superstring amplitudes, from tree-level to 3-loops.

Moreover, in Sect. 7 we study BCJ numerators. Our approach shows that BCJ-like numerators always exist, and are given by \(N_\Gamma = B(\{\Gamma \})\), if B(P) are the Berends–Giele currents of the theory. However, there is no guarantee that these \(N_\Gamma \) are local, except in special cases. We review known numerators including for the non-linear sigma model (NLSM), where we give a conjecture that has since been proved elsewhere [26]. We also study the numerators for super–Yang–Mills (sYM), Z-theory and the open superstring.

2 Review of Lie Polynomials, Combinatorics on Words and Colour Factors

Let W be the vector space of linear combinations of words over the natural numbers. The free Lie algebra \({\mathscr {L}}\) is the subspace of W linearly spanned by Lie monomials, \(\Gamma \). A Lie monomial is a complete bracketing of a word, such as such as

$$\begin{aligned} \Gamma = 123 - 132 - 231 + 321 = [1,[2,3]]. \end{aligned}$$
(2.1)

The left- and right-bracketings are surjective maps from W onto \({\mathscr {L}}\) given by

$$\begin{aligned} \begin{aligned} \ell [123\ldots n]&:= [[[1,2],3],\ldots ,n],\\ r[123\ldots n]&:= [1,[2,[3,\ldots ,[n-1,n]\ldots ]]]. \end{aligned} \end{aligned}$$
(2.2)

For a letter i and a word P,

$$\begin{aligned} \ell [Pi] = [\ell [P],i] \end{aligned}$$
(2.3)

(and similarly \(r[iP] = [i,r[P]]\)). This inductively implies Baker’s identity [1]

$$\begin{aligned} \ell [P\ell [Q]] = [\ell (P),\ell [Q]]. \end{aligned}$$
(2.4)

Write |P| for the length of a word P. If \(\overline{P}\) denotes the reverse of P, then the so-called ‘antipode’ can be defined as

$$\begin{aligned} \alpha (P):= (-1)^{|P|} \overline{P}. \end{aligned}$$
(2.5)

The antipode relates \(\ell [P]\) and r[P] by

$$\begin{aligned} \ell [P] = - r[\alpha (P)]. \end{aligned}$$
(2.6)

2.1 \({\mathscr {L}}\) and its dual \({\mathscr {L}}^*\)

This section recalls the duality pairing between \({\mathscr {L}}\) and its dual, \({\mathscr {L}}^*\), which is central to the results of the paper. For words \(P,Q \in W\), write (PQ) for the trivial inner product on W:

$$\begin{aligned} (P,Q):= {\left\{ \begin{array}{ll} 1 &{} \text { if } P=Q;\\ 0 &{} \text { otherwise}. \end{array}\right. } \end{aligned}$$
(2.7)

The shuffle product on W, , is inductively defined by

(2.8)

for letters ij, and words PQ. The base case is . The expression is sometimes referred to as the sum over ordered permutations of P and Q, preserving the ordering of the letters of P and of Q. Ree’s theorem characterizes \({\mathscr {L}}\) in terms of the shuffle product:

Theorem (Ree) [27]. \(\Gamma \in W\) is a Lie polynomial iff for all nonempty \(P,Q\in W\).

Write \(Sh \subset W\) for the subspace spanned by all proper shuffles, , with PQ nonempty. Ree’s theorem implies that \({\mathscr {L}}\) is the orthogonal subspace of Sh, with respect to ( ,  ). Thus the dual vector space to \({\mathscr {L}}\) (with respect to ( ,  )) is given by the vector space quotient,

$$\begin{aligned} {\mathscr {L}}^* = W / Sh. \end{aligned}$$
(2.9)

Elements of \({\mathscr {L}}^*\) are equivalence classes, \(P+Sh\), for some \(P\in W\). If two expressions in W, P and Q, belong to the same equivalence class, write \(P\sim Q\). If \(P\sim Q\), then there exist some words \(A_i,B_i\) and coefficients \(c_{i}\) so that

(2.10)

On account of the ambiguity in how to represent elements of \({\mathscr {L}}^*\) and \({\mathscr {L}}\), it is useful to find bases.

A word P is Lyndon if it is smaller in the dictionary ordering than any of its suffixes: P is Lyndon if \(P=QR\) for nonempty Q and R, then \(P<R\) in the dictionary ordering. The Lyndon words, P, give a basis of \({\mathscr {L}}^*\) [1]. Dually, the set of Lie monomials, \(\ell [P]\), for Lyndon words P, is a basis of \({\mathscr {L}}\). These two bases are dual because, for two Lyndon words P and Q, the smallest letter must come first in both words. But, for any letter i,

$$\begin{aligned} (iP,\ell [iQ]) = (iP,iQ) = (P,Q), \end{aligned}$$
(2.11)

because the only term in the word expansion of \(\ell [iQ]\) that has i at the beginning is iQ.

2.2 Appearance in gauge theory

The algebra recalled above is ubiquitous in gauge theory because of the colour factors. In this context, it will be helpful to write \(W_n \subset W\) for the words in W that have no repeated letters, and only involve letters \(1,2,\ldots ,n\). Likewise write \({\mathscr {L}}_n = {\mathscr {L}}\cap W_n\) for the Lie monomials restricted to letters \(1,2,\ldots ,n\). Its dual is \({\mathscr {L}}_n^* = W_n / (Sh \cap W_n)\).

Fix n elements of a Lie algebra: \(t^a_i \in {\mathfrak {g}}\), for \(i=1,\ldots ,n\). For any Lie monomial \(\Gamma \in {\mathscr {L}}_{n-1}\), let \(\textbf{t}(\Gamma )\) be obtained by writing \(\Gamma \) as a nested bracketing of \(1,\ldots ,n-1\), and replacing i with \(t_i\) and [ ,  ] with the Lie bracket of \({\mathfrak {g}}\). This defines a linear map

$$\begin{aligned} \textbf{t}: {\mathscr {L}}_{n-1} \rightarrow {\mathfrak {g}}. \end{aligned}$$
(2.12)

If \(\textrm{tr}\) is the invariant inner product on \({\mathfrak {g}}\), then for every Lie monomial \(\Gamma \in {\mathscr {L}}_{n-1}\), the associated colour factor is

$$\begin{aligned} c_\Gamma = \textrm{tr}(\textbf{t}(\Gamma )t_n). \end{aligned}$$
(2.13)

The replacement \(\Gamma \mapsto c_\Gamma \) also defines a homomorphism out of \({\mathscr {L}}_{n-1}\).

Fig. 1
figure 1

The product of propagators \({1\over s_\Gamma }\) and the planar binary tree associated to the Lie monomial [[1, 2], [3, 4]] according to the definition (2.17)

The colour factor \(c_\Gamma \) arises in cubic scalar theory as the colour factor for a specific cubic Feynman graph, that we write as \(T_\Gamma \). Regard \(T_\Gamma \) as a rooted binary tree, with root at n. Then \(T_\Gamma \) is defined inductively as follows. \(T_{[1,2]}\) is the tree with two external legs, 1 and 2, connected by one vertex to the root. If \(\Gamma = [\Gamma ',\Gamma '']\), then \(T_\Gamma \) is the tree formed by connecting (or ‘grafting’) the roots of \(T_{\Gamma '}\) and \(T_{\Gamma ''}\) to make a new vertex. Every pair of brackets in \(\Gamma \) corresponds to a vertex in \(T_\Gamma \). An example is shown in Fig. 1. The trees \(T_\Gamma \) and \(T_{-\Gamma }\) are the same. So there is a 1:1 correspondence between Lie monomials up to sign, \(\pm \Gamma \), and binary trees, T.

In massless theories, the contribution of a graph \(T_\Gamma \) to the amplitude is a function of external momenta, \(k_i^\mu \), \(i=1,\ldots ,n\), with \(k_i\cdot k_i = 0\). Write \(s_{ij}\) for the Mandelstam variable

$$\begin{aligned} s_{ij}:= 2k_i\cdot k_j. \end{aligned}$$
(2.14)

For every subset \(I\subset {\mathbb {N}}\) with at least two elements, write

$$\begin{aligned} k_I^\mu := \sum _{i\in I} k_i^\mu , \end{aligned}$$
(2.15)

and

$$\begin{aligned} s_I:= k_I^2 = \sum _{\{i,j\}\subset I} s_{ij}. \end{aligned}$$
(2.16)

Take \(\Gamma \) and \(T_\Gamma \) as above. When written as a nested bracket expression, each pair of brackets in \(\Gamma \) defines a subset of \(\{1,\ldots ,n-1\}\). If I is a subset that appears like this, write \(I\in \Gamma \), and define

$$\begin{aligned} s_\Gamma := \prod _{I\in \Gamma } s_I. \end{aligned}$$
(2.17)

The inverse, \(\frac{1}{s_\Gamma }\), is then the product of propagators of the tree graph, \(T_\Gamma \), associated to \(\Gamma \), including a propagator for the root of \(T_\Gamma \).

Given this, it will be useful to write \({\mathcal {M}}\) for the Laurent ring in the variables \(s_I\), subject to the relation (2.16). Clearly the propogator factors \(1/s_\Gamma \) belong to \({\mathcal {M}}\).

2.3 Identities for amplitudes

The duality between \({\mathscr {L}}_n\) and \({\mathscr {L}}_n^*\) leads to helpful identities that we collect in this section. A Lyndon basis for \({\mathscr {L}}_n^*\) is given by the words of the form 1P, that begin with 1. This basis is dual to the basis of \({\mathscr {L}}_n\) given by the corresponding Lie monomials \(\ell [1P]\). Given that these are dual bases, we have the following basis expansion in \({\mathscr {L}}_n^*\),

$$\begin{aligned} P \sim \sum _Q (P, \ell [1Q]) 1Q \end{aligned}$$
(2.18)

where the sum is over all permutations, Q, of \(23\ldots n\). For \(iP\in W_n\) and some letter i, it can be checked that

(2.19)

This follows from (2.3). Substituting this formula for \(\ell [iP]\) into (2.18) implies that,

(2.20)

for \(XiY\in W_n\) is a permutation of \(12\ldots n\), and some distinguished letter i.Footnote 3 We will later see that this implies the Kleiss-Kuijf (KK) relations among partial amplitudes. Setting Y to be empty in (2.20) gives

$$\begin{aligned} P\sim -(-1)^{|P|}{\bar{P}}, \end{aligned}$$
(2.22)

for any word P.

We also have the dual basis expansion in \({\mathscr {L}}_n\). For \(\Gamma \in {\mathscr {L}}_n\),

$$\begin{aligned} \Gamma = \sum _Q ( 1Q,\Gamma )\,\ell [1Q], \end{aligned}$$
(2.23)

where the sum is over permutations of \(23\ldots n-1\). For example, this gives

$$\begin{aligned} \left[ [1,2],[3,4]\right] = \ell [1234] - \ell [1243], \end{aligned}$$
(2.24)

which follows also from the Jacobi identity.

Finally, we will need to use the adjoints of \(\ell \) and r, which we write as \(\ell ^*\) and \(r^*\):

$$\begin{aligned} (\ell ^*(P),Q) = (P,\ell (Q)),\qquad (r^*(P),Q) = (P,r(Q)). \end{aligned}$$
(2.25)

It follows from (2.3) that the adjoints can be computed recursively as

$$\begin{aligned} \begin{aligned} \ell ^*(123 \ldots n)&= \ell ^*(123 \ldots n-1)n - \ell ^*(23 \ldots n)1,\qquad \ell ^*(i):= i,\\ r^*(123 \ldots n)&= 1r^*(23 \ldots n) - n r^*(123 \ldots n{-}1),\qquad r^*(i):= i,\\\end{aligned} \end{aligned}$$
(2.26)

Likewise, (2.19) implies that

(2.27)

Note that \(\ell ^*\) and \(r^*\) are well-defined on \({\mathscr {L}}^*\) as for nonempty PQ. This follows from , which vanishes by Ree’s theorem.

3 Berends–Giele Recursion and Lie Polynomials

Berends–Giele (BG) is a recursive method to compute tree-level scattering amplitudes. [31] This section formulas Berends–Giele recursion in terms of fields with values in Lie polynomials. This is similar to the ‘perturbiner’ method of [15, 28, 32, 33], and will allow us to make full use of the properties of \({\mathscr {L}}_n\) and \({\mathscr {L}}_n^*\) reviewed in Sect. 2.Footnote 4

3.1 Berends–Giele recursion for biadjoint scalar theories

Consider a biadjoint scalar field \(\Phi \) with values in the tensor product of two Lie algebras \({\mathfrak {g}}\otimes {\tilde{{\mathfrak {g}}}} \). Let these have structure constants \(f^{abc}\) and \({\tilde{f}} ^{{\tilde{a}} {\tilde{b}} {\tilde{c}}}\) and invariant inner products for which we take an orthonormal basis. Then the Lagrangian is

$$\begin{aligned} {{\mathcal {L}}}_{BS} = \frac{1}{2} \nabla _\mu \Phi _{a{\tilde{a}}} \nabla ^\mu \Phi ^{a{\tilde{a}}} + \frac{1}{3!} f_{abc}{\tilde{f}}_{{\tilde{a}} {\tilde{b}} {\tilde{c}}} \Phi ^{a{\tilde{a}}}\Phi ^{b{\tilde{b}}}\Phi ^{c\tilde{c}}\,, \end{aligned}$$
(3.1)

where \(\mu =1,\ldots ,d\) is a space-time index and ab \({\tilde{a}}, {\tilde{b}}\) are Lie algebra indices. The field equations are

$$\begin{aligned} \Box \Phi _{a{\tilde{a}}}= \frac{1}{2} f_{abc}{\tilde{f}}_{{\tilde{a}} {\tilde{b}} {\tilde{c}}} \Phi ^{b{\tilde{b}}}\Phi ^{c{\tilde{c}}}\,. \end{aligned}$$
(3.2)

Our aim is to solve this field equation perturbatively. We will do this by solving a closely related problem. Let \(\phi (x)\) be a field with values in \({\mathscr {L}}_n\otimes {\mathscr {L}}_n\) and subject to the field equaiton

$$\begin{aligned} \Box \phi = \frac{1}{2} [\![ \phi ,\phi ]\!] \,, \end{aligned}$$
(3.3)

where \([\![~,~]\!]\) is the symmetric bracket:

$$\begin{aligned}{}[\![\Gamma _1\otimes {{\tilde{\Gamma }}}_1,\Gamma _2\otimes {{\tilde{\Gamma }}}_2]\!]:=\left[ \Gamma _1,\Gamma _2\right] \otimes \left[ {{\tilde{\Gamma }}}_1,{{\tilde{\Gamma }}}_2\right] \,. \end{aligned}$$
(3.4)

For some null momenta \(k_1,\ldots ,k_n\), the field

$$\begin{aligned} \phi _1=\sum _{j=1}^n e^{ik_j\cdot x}\, j\otimes j\, \in {\mathscr {L}}\otimes {\mathscr {L}}\,, \end{aligned}$$
(3.5)

gives a homogenous solution to (3.3). We use this to seed a recursive solution of (3.3).

Write \({\mathscr {L}}_{\le k}\) for the subspace of \({\mathscr {L}}_n\) spanned by Lie monomials \(\Gamma \) with length \(|\Gamma |\le k\). Given a solution to (3.3) with values in \({\mathscr {L}}_{\le k}\), consider the field

$$\begin{aligned} \phi _{k}= \phi _1+ \textrm{proj}_{\le k}\Box ^{-1} \frac{1}{2} [\![ \phi _{k-1},\phi _{k-1}]\!] \,, \end{aligned}$$
(3.6)

where \(\textrm{proj}_{\le k}\) denotes the projection onto \({\mathscr {L}}_{\le k}\). Let us iteratively applying (3.6), starting with \(\phi _1\). At each step in this recursion, the coefficient of a word P in \(\Phi _k\) has x dependence \(e^{ik_P\cdot x}\). So the inverse wave operator, \(\Box ^{-1}\), acts on such a term to give \(1/k_P^2=1/s_P\). When \(k=n\), we have

$$\begin{aligned} \phi _{n}= \phi _1+ \Box ^{-1} \frac{1}{2} [\![ \phi _{n-1},\phi _{n-1}]\!] \,, \end{aligned}$$
(3.7)

which is a solution to (3.3), with values in \({\mathscr {L}}_n\otimes {\mathscr {L}}_n\). By construction, \(\phi _n\) is symmetric in its two Lie polynomials factors. It follows that, for a word \(P\in W_n\), pairing with the left or right factor of \(\phi _n\) gives the same result: \((P,\phi _n) = (\phi _n,P)\). So write

$$\begin{aligned} \phi (P):= (P,\phi _n) = (\phi _n,P). \end{aligned}$$
(3.8)

The x dependence is given by \(e^{ik_P\cdot x}\). So the Fourier transform of \(\phi (P)\) to momentum space is just

$$\begin{aligned} b(P)= e^{-i k_P\cdot x} \Phi (P), \end{aligned}$$
(3.9)

which plays the same role in our context as a Berends–Giele ‘current’.

Proposition [13]. The Berends–Giele ‘currents’ satisfy

$$\begin{aligned} b(P) = {1\over s_P}\sum _{XY=P}[b(X),b(Y)], \qquad b(i) = i\,, \end{aligned}$$
(3.10)

where the sum is over all deconcatenations, \(P=XY\), of P.

Proof

Consider the identity, for \(\Gamma _1,\Gamma _2\in {\mathscr {L}}_n\),

$$\begin{aligned}{} & {} (P, \left[ \Gamma _{1},\Gamma _{2}\right] ) \left[ \Gamma _{1,}\Gamma _{2}\right] \nonumber \\{} & {} =\sum _{XY=P} \left[ (X,\Gamma _{1})\Gamma _{1}, (Y,\Gamma _{2})\Gamma _{2}\right] + \left[ (X,\Gamma _{2})\Gamma _{2}, (Y,\Gamma _{1})\Gamma _{1}\right] . \end{aligned}$$
(3.11)

This implies that

$$\begin{aligned} \frac{1}{2} [\![ \phi ,\phi ]\!](P) = \sum _{XY=P}[ \phi (X),\phi (Y)], \end{aligned}$$
(3.12)

and the result then follows by pairing (3.6) with P. \(\square \)

Moreover, since b(P) takes values in \({\mathscr {L}}_n\), Ree’s theorem implies that

(3.13)

for nonempty RS. Thus \(b: P \mapsto b(P)\) defines a homomorphism

$$\begin{aligned} b: {\mathscr {L}}_n^* \rightarrow {\mathscr {L}}_n \otimes {\mathcal {M}}, \end{aligned}$$
(3.14)

where \({\mathcal {M}}\) is the Laurent ring of Mandelstam variables.

The field theory \(n+1\)-particle amplitude can be obtained from b(P) by removing the last propagator, \(1/s_{12\ldots n}\). Imposing momentum conservation, \(s_{12\ldots n}\rightarrow 0\). So define the ‘\({\mathscr {L}}_n\)-valued amplitude’ by

$$\begin{aligned} m(P,n+1)=\lim _{s_{P}\rightarrow 0} s_{P}\, b(P), \end{aligned}$$
(3.15)

for a word \(P\in {\mathcal {L}}_{n-1}^*\).

3.2 The tree diagram expansion of b(P)

The recursion relation (3.10) can be solved explicitly, to recover the usual Feynman diagram expansion. Write \(s_\Gamma \) for the product of variables \(s_I\) defined in (2.17). Then we claim that

$$\begin{aligned} b(P) = \sum _\Gamma \frac{(P,\Gamma )\Gamma }{s_\Gamma }, \end{aligned}$$
(3.16)

where the sum is over \(\Gamma \in {\mathscr {L}}_n\).

Lemma

The formula in (3.16) satisfies (3.10).

Proof

For any Lie monomial \(\Gamma \) there are \(\Gamma _1\) and \(\Gamma _2\) so that \(\Gamma =[\Gamma _1,\Gamma _2]\), and these monomials are unique up to sign. So, for a fixed Lie monomial \(\Gamma \),

$$\begin{aligned} \frac{(P,\Gamma )\Gamma }{s_\Gamma }= \frac{(X,\Gamma _1)(Y,\Gamma _2) \left[ \Gamma _1,\Gamma _2\right] }{s_P s_{\Gamma _1}s_{\Gamma _2}} - \frac{(Y,\Gamma _1)(X,\Gamma _2) \left[ \Gamma _1,\Gamma _2\right] }{s_P s_{\Gamma _1}s_{\Gamma _2}}, \end{aligned}$$
(3.17)

where \(P=XY\) and \(|X|=|\Gamma _1|\), \(|Y|=|\Gamma _2|\). Summing over all Lie monomials (up to sign), \(\Gamma _1\) and \(\Gamma _2\), that have length strictly smaller than |P| gives (3.10). \(\square \)

The first few examples of b(P) are:

$$\begin{aligned} \begin{aligned} b(12)&= {[1,2]\over s_{12}},\\ b(123)&= {[[1,2],3]\over s_{12}s_{123}} + {[1,[2,3]]\over s_{23}s_{123}},\\ b(1234)&= {[ [ [ 1, 2 ], 3 ], 4 ] \over s_{12} s_{123} s_{1234}} + {[ [ 1, [ 2, 3 ] ], 4 ] \over s_{123} s_{1234} s_{23}}\\ {}&+ {[ [ 1, 2 ], [ 3, 4 ] ] \over s_{12} s_{1234} s_{34}} + {[ 1, [ [ 2, 3 ], 4 ] ] \over s_{1234} s_{23} s_{234}} + {[ 1, [ 2, [ 3, 4 ] ] ] \over s_{1234} s_{234} s_{34}}. \end{aligned} \end{aligned}$$
(3.18)
Fig. 2
figure 2

The Catalan expansion b(1234) from (3.10). Viewed as cubic graphs and removing the overall propagator \(1/s_{1234}\), they correspond to the expansion of a color-ordered five-point tree amplitude A(12345) [13]. Note the leg 5 does not enter in the Lie elements in the numerators and that the root is unlabelled. By labelling the root and assigning leg 5 to the Catalan expansion of b(5) one recovers the free Lie algebra correspondence (3.21) for the case \(n=5\)

See Fig. 2 for this last example. Since b(P) is valued in \({\mathscr {L}}_n\), it can be expanded in a basis using (2.23), to get

$$\begin{aligned} b(P) = \sum _R b(P|iR)\ell [iR], \end{aligned}$$
(3.19)

where i is the smallest letter in P, and where

$$\begin{aligned} b(P|Q) = (b(P),\ell [Q]). \end{aligned}$$
(3.20)

The usual biadjoint scalar partial tree amplitude is then given by [14]

$$\begin{aligned} m(Pn,Qn)=: \lim _{s_{P}\rightarrow 0}s_{P} b(P|Q) = (Q,m(Pn)). \end{aligned}$$
(3.21)

We will see in Sect. 7 how b(P) can be dressed with BCJ numerators to give multiparticle fields and amplitudes for other gauge theories.

4 A New Lie Bracket for Tree-Level Scattering Amplitude Relations

This section introduces the S-bracket, \(\{~,~\}\), and shows that it is a Lie bracket. The S-bracket is then used to define the ‘generalized KLT matrix’ and to prove the identities conjectured in [13].

4.1 The S-bracket

A bilinear map was introduced in [6,7,8,9,10,11,12,13,14,15] (where it was called the ‘S map’) to express the BCJ relations for super-Yang–Mills amplitudes from its action on Berends–Giele currents \(M_P\) from [34]. It was abstracted to a map acting on words in [13] and the off-shell BCJ relations for b(P) was conjectured, but no general proof was given. Here we will see that this map defines a Lie bracket on \({\mathscr {L}}^*\), the ‘S bracket’, and this will lead to a new formulation of the fundamental BCJ relations.

Definition

(S bracket). Define a multilinear pairing \(\{~,~\}: {\mathcal {L}}^*\otimes {\mathcal {L}}^* \rightarrow {\mathcal {L}}^*\) by [13]

$$\begin{aligned} \{P,Q\}:= r^*(P)\star \ell ^*(Q), \end{aligned}$$
(4.1)

where \(r^*\) and \(\ell ^*\) are defined in (2.26) and

$$\begin{aligned} Ai\star jB:= s_{ij}\, AijB \end{aligned}$$
(4.2)

for words AB and letters ij.

This definition implies that the S-bracket can be recursively computed using

$$\begin{aligned} \begin{aligned} \{iAj,B\}&=i\{Aj,B\}-j\{iA,B\},\\\{B,iAj\}&=\{B,iA\}j-\{B,Aj\}i,\\\{i,j\}&= s_{ij}\,ij \end{aligned} \end{aligned}$$
(4.3)

which follows from (2.27). Altneratively, for \(P,Q\in {\mathscr {L}}^*\), equation (2.27) also gives an explicit closed formula: [6]

(4.4)

For example,

$$\begin{aligned} \begin{aligned} \{1,2\}&=s_{12}12, \\\{1,23\}&= s_{12} 123 - s_{13} 132,\\\{12,34\}&= s_{23}1234 - s_{24}1243 - s_{13} 2134 + s_{14}2143. \end{aligned} \end{aligned}$$
(4.5)

Given that the adjoints \(r^*\) and \(\ell ^*\) annihilate proper shuffles, the definition (4.1) manifestly satisfies .

The S-bracket is antisymmetric. Indeed, by (4.4),

(4.6)

because, in \({\mathscr {L}}^*\), \(X\sim -(-1)^X{\overline{X}}\). It follows that,

$$\begin{aligned} \{P,Q\}\sim -\{Q,P\}. \end{aligned}$$
(4.7)

Moreover, Sect. 5.1, below, shows that \(\{~,~\}\) is a Lie bracket on \({\mathscr {L}}_{\mathcal {S}}^*\), and so it also satisfies the Jacobi identity,

$$\begin{aligned} \{P,\{Q,R\}\}+\{Q,\{R,P\}\}+\{R,\{P,Q\}\} \sim 0. \end{aligned}$$
(4.8)

4.2 The BCJ amplitude relations

This section proves the main property of the S bracket, \(\{~,~\}\), generalising the off-shell fundamental BCJ relation [16]. The on-shell identities follow directly from the off-shell relations, as explained at the end of this section.

Proposition

For \(P,Q\in {\mathscr {L}}^*\), the S bracket \(\{~,~\}\) satisfies

$$\begin{aligned} b(\{P,Q\}) = [b(P),b(Q)], \end{aligned}$$
(4.9)

i.e., b maps the S-bracket to the Lie bracket.

The proposition is proved in appendix B. It is interesting to observe that the property (4.9) mimics the identity obeyed by the Poisson bracket \(\{,\}\) of Hamiltonian vector fields \(X_f\): \(X_{\{f,g\}} = [X_f,X_g]\) for functions f and g.

Corollary

(Off-shell BCJ relations [16]) Taking \(P=i\), a single letter, we obtain

(4.10)

where \(\alpha (X)=(-1)^{|X|}{\bar{X}}\). Equivalently, we also have

$$\begin{aligned} \sum _{Q=XY} s_{i,Y} b(XiY) = \left[ i,b(Q)\right] . \end{aligned}$$
(4.11)

Proof

(4.10) is immediate. (4.11) follows by showing that

$$\begin{aligned} \{i,Q\} \sim \sum _{Q=XY} s_{i,Y} XiY. \end{aligned}$$
(4.12)

To prove this identity, use (2.18) to write the RHS (4.12) in the basis of words beginning with the letter i,

(4.13)

Further manipulations give

(4.14)

where we use the property of the shuffle product, (2.8). \(\square \)

Corollary

(BCJ relations) The tree-level partial amplitudes (3.15) satisfy

$$\begin{aligned} m(\{P,Q\},n) = 0, \end{aligned}$$
(4.15)

where P and Q are words that partition \(1,2,\ldots ,n-1\) into two parts.

Proof

By the definition (3.15),

$$\begin{aligned} m(\{P,Q\},n) =\lim _{s_{PQ}\rightarrow 0} s_{PQ} b(\{P,Q\})= \lim _{s_{PQ}\rightarrow 0}s_{PQ}[b(P),b(Q)] = 0. \end{aligned}$$
(4.16)

The last term vanishes because neither b(P) nor b(Q) has a \(1/s_{PQ}\) pole. \(\square \)

The original so called ‘fundamental BCJ relations’ are [35, 36]

$$\begin{aligned} m(\{i,Q\}n) = \sum _{Q=RS} (k_i\cdot k_S)m(RiSn) = 0, \end{aligned}$$
(4.17)

which follows in this form from (4.11). The first few examples are, [3]

$$\begin{aligned} \begin{aligned} 0&= m(\{1,23\}4) = s_{12} m(1234) - s_{13} m(1324),\\0&=m(\{1,234\},5) = s_{12} m(12345) - s_{13} m(13245) - s_{13} m(13425) + s_{14} m(14325). \end{aligned} \end{aligned}$$
(4.18)

As observed in [6,7,8,9,10,11,12,13,14,15], (4.15) also implies other BCJ relations, such as

$$\begin{aligned} 0= & {} m(\{12,34\},5) = s_{23} m(1234,5) - s_{13} m(2134,5)\nonumber \\ {}{} & {} - s_{24} m(1243,5) + s_{14} m(2143,5), \end{aligned}$$
(4.19)

while similar formulas using the shuffle product appear in [37,38,39]. The BCJ relations for Yang–Mills theory were first proven from the field-theory limit of string theory in [37, 38]. By now these relations have been proven for a variety of theories at tree-level. See the recent review [5] and references therein.

5 The KLT Matrix as Nested Brackets

This section introduces a canonical KLT map S. We find that S is the inverse of b, and that this implies that \(\{~,~\}\) is a Lie bracket. When bases are chosen for \({\mathscr {L}}_n\) and \({\mathscr {L}}_n^*\), the matrix elements of S give the ‘generalized KLT matrix’ S(P|Q) proposed in [13]. We prove the conjectured properties of S(P|Q) as well as additional ones.

5.1 The KLT map

Let \(\Gamma \) be a Lie monomial, and write it as a nested bracketing. Let \(\{\Gamma \} \in {\mathscr {L}}^*\) be obtained by replacing every commutator [ ,  ] in the bracketed expression of \(\Gamma \) with a \(\{~,~\}\). Since \(\{~,~\}\) is antisymmetric, this is well defined. By nested applications of (4.9), it follows from the proposition in the previous section that

Proposition

For any Lie monomial \(\Gamma \),

$$\begin{aligned} \Gamma = b(\{\Gamma \}). \end{aligned}$$
(5.1)

These are surprising identities. For example, one can verify that

$$\begin{aligned} \left[ [1,2],[3,4]\right] = s_{12}s_{34}\bigl (s_{23}b(1234) - s_{24}b(1243) - s_{13} b(2134) + s_{14}b(2143)\bigr ).\nonumber \\ \end{aligned}$$
(5.2)

Define the KLT map \(S:{\mathscr {L}}_n \rightarrow {\mathscr {L}}_n^*\otimes {\mathcal {M}}\) by

$$\begin{aligned} S:\Gamma \mapsto \{\Gamma \}, \end{aligned}$$
(5.3)

for Lie monomials \(\Gamma \). It is not obvious that S is well defined as a map from \({\mathscr {L}}_n\) to \({\mathscr {L}}_n^*\otimes {\mathcal {M}}\). The map S would be well defined provided that \(\{~,~\}\) was a Lie bracket. And if S is well defined, then b is clearly an inverse, since clearly \(b(S(\Gamma )) = \Gamma \). This is verified in the following in the proof of the following:

Proposition

The maps \(b:{\mathscr {L}}^*\rightarrow {\mathscr {L}}\) and \(S:{\mathscr {L}}\rightarrow {\mathscr {L}}^*\) are inverses. In particular, b is invertible.

Proof

Take dual Lyndon bases of \({\mathscr {L}}^*_n\) and \({\mathscr {L}}_n\), as in Sect. 2. Then define a map, \(S'\):

$$\begin{aligned} S': \ell (P) \mapsto \{\ell (P)\}, \end{aligned}$$
(5.4)

for monomials \(\ell (P)\) in the given basis of \({\mathscr {L}}_n\). We show that (i) \(S'\) and b are inverse, and (ii) that the \(S'\) in (5.4) is the map, S, in (5.3). This proves that (5.3) is well-defined.

  1. (i)

    By (5.1),

    $$\begin{aligned} b(S'(\Gamma )) = \sum _{P\in Basis} (\Gamma ,P) b(\{\ell (P)\}) = \sum _{P\in Basis} (\Gamma ,P) \ell (P) = \Gamma . \end{aligned}$$
    (5.5)

    Conversely,

    $$\begin{aligned} S'(b(P)) = \sum _{Q\in Basis} (b(P),Q) \{\ell (Q)\}. \end{aligned}$$
    (5.6)

    Expanding \(\{\ell (Q)\} \in {\mathscr {L}}^*_n\otimes {\mathcal {M}}\) in the given basis gives (by (2.18)),

    $$\begin{aligned} \{\ell (Q)\} = \sum _{R\in Basis} (\{\ell (Q)\},\ell (R)) R. \end{aligned}$$
    (5.7)

    But notice that

    $$\begin{aligned} (\{\Gamma \},\Gamma ') = (\{\Gamma \},b(\{\Gamma '\})) =(b(\{\Gamma \}),\{\Gamma '\}) = (\Gamma ,\{\Gamma '\}), \end{aligned}$$
    (5.8)

    by the self-adjointness of b. Combining (5.7) and (5.8) gives that

    $$\begin{aligned} S'(b(P)) = \sum _{Q,R \in Basis} b(P,Q) (\ell \{Q\},\ell (R)) R = \sum _{R\in Basis} (P,\ell (R)) R = P. \end{aligned}$$
    (5.9)

    So b is invertible, with inverse \(S'\).

  2. (ii)

    But \(b(\{\Gamma \}) = \Gamma \), by (5.1). So

    $$\begin{aligned} S'(\Gamma ) = S'(b(\{\Gamma \})) = \{\Gamma \}, \end{aligned}$$
    (5.10)

    which shows that \(S'\) is the map \(S: \Gamma \mapsto \{\Gamma \}\).

\(\square \)

The self-adjointness of b, (5.8), implies that the KLT map, S, is itself self adjoint:

$$\begin{aligned} (\Gamma ,S(\Gamma ')) = (S(\Gamma ),\Gamma '). \end{aligned}$$
(5.11)

Corollary

\(\{~,~\}\) is a Lie bracket on \({\mathscr {L}}^*\otimes {\mathcal {M}}\).

Proof

Given that S and b are inverse, (4.9) shows that \(\{~,~\}\) is the pull back of [ ,  ] from \({\mathscr {L}}\) to \({\mathscr {L}}^*\) by \(b^{-1}\). So \(\{,\}\) is skew and satisfies the Jacobi relation. \(\square \)

5.2 The generalized and standard KLT matrix

This section finds explicit formulas for the KLT map, S, in terms of the matrix elements of S. Let us again take dual Lyndon bases: \(P\in {\mathscr {L}}_n^*\) for every Lyndon word P, and \(\ell [P] \in {\mathscr {L}}_n\) for every Lyndon word P. Define the generalized KLT matrix (introduced in [13]) to be the matrix elements in this basisFootnote 5

$$\begin{aligned} S^\ell (P|Q):= (S(\ell [P]),\ell [Q]) = (\{\ell [P]\},\ell [Q]). \end{aligned}$$
(5.12)

Some example entries of the generalized KLT matrix are

$$\begin{aligned} \begin{aligned} S^\ell (12|12)&= s_{12}, S^\ell (12|21) = - s_{12},\\ S^\ell (123|123)&= s_{12}(s_{13}+s_{23}), S^\ell (312|123) = - s_{12}s_{13}. \end{aligned} \end{aligned}$$
(5.13)

It follows from (5.8) that the generalized KLT matrix is symmetric:

$$\begin{aligned} S^\ell (P|Q) = S^\ell (Q|P). \end{aligned}$$
(5.14)

Moreover, (5.5) implies that the generalized KLT matrix satisfies

$$\begin{aligned} \ell [P] = \sum _{Q\in Basis} S^\ell (P|Q)\, b(Q). \end{aligned}$$
(5.15)

Or, equivalently,

$$\begin{aligned} (R,\ell [P]) = \sum _{Q\in Basis} S^\ell (P|Q)\, b(Q|R), \end{aligned}$$
(5.16)

and so, also:

$$\begin{aligned} b(P|Q) = \sum _{X,Y \in Basis} \, b(P|X)\, S^\ell (X|Y) \, b(Y|Q). \end{aligned}$$
(5.17)

This is what we might call an ‘off shell KLT relation’. We study it further in the next subsection.

The standard KLT matrix arises, as in [13], as a restriction of the generalized one to a fixed basis, and is conventionally written

$$\begin{aligned} S(P|Q)_i:= S^\ell (iP|iQ), \end{aligned}$$
(5.18)

for some fixed letter i, called the ‘fixed leg’. The identity, (5.16), together with the definition (5.18), amount to a purely algebraic proof that the standard KLT matrix is the inverse to the biadjoint scalar BG double current:

$$\begin{aligned} \sum _R S^\ell (P|R)_i b(iR|iQ) = \delta _{P,Q}. \end{aligned}$$
(5.19)

This is in accord with the discussions of [8, 14]. Moreover, we can easily recover explicit formulas for \(S(P|Q)_i\). Using the definition of \(\{~,~\}\), and (4.12), we get

$$\begin{aligned} S^\ell (12\ldots n| 1A)= \prod _{i=2}^n \left( \sum _{C_i=A_iiB_i} s_{i,A_i}\right) , \end{aligned}$$
(5.20)

in agreement with [17,18,19]. We derive (5.20) in greater detail in appendix C. Finally, the matrix elements can also be efficiently computed using the following recursion relation, originally conjectured in [42].

Lemma

The standard KLT matrix can be recursively computed using

$$\begin{aligned} S(Ai,BiC)_m = k_i\cdot k_{mB}\, S(A,BC)_m, \end{aligned}$$
(5.21)

where m is the fixed leg. The recursion is seeded by \(S(\emptyset |\emptyset )_m:= 1\).

Proof

By definition, \(\ell \{Ai\} = \{\ell \{A\},i\}\). Using the formula for \(\{P,i\}\), (4.12),

$$\begin{aligned} \ell \{Ai\} = \sum _{X,Y} s_{i,X}\,(XY,\ell \{A\}) XiY. \end{aligned}$$
(5.22)

Since \(S(Ai,BiC)_m = (\ell \{mAi\},\ell (mBiC))\), it follows from (5.22) that

$$\begin{aligned} S(Ai,BiC)_m = \sum _{X,Y} s_{i,X} (XY,\ell \{mA\}) (XiY,\ell (mBiC)). \end{aligned}$$
(5.23)

The generalized KLT matrix is the matrix of coefficients that arises in the basis expansion of \(\ell \{P\}\):

$$\begin{aligned} \ell \{P\} \sim \sum _{Q\in Basis} (\ell \{P\}, \ell [Q]) Q=\sum _{Q\in Basis} S^\ell (P|Q)Q, \end{aligned}$$
(5.24)

So expanding \(\ell \{mA\}\) in a basis,

$$\begin{aligned} \ell \{mA\} = \sum _P S(A,P)_m\, mP. \end{aligned}$$
(5.25)

Equation (5.23) becomes

$$\begin{aligned} S(Ai,BiC)_m = \sum _{X,Y,P} s_{i,X} S(A,P)_m (XY,mP) (XiY,\ell (mBiC)). \end{aligned}$$
(5.26)

The only contributions in the sum come from words, \(X=mX'\), that begin with the letter m. Expanding \(\ell (mBiC)\), one sees that \((mX'iY,\ell (mBiC)) = \delta _{X',B}\delta _{Y,C}\). \(\square \)

5.3 The momentum kernel and the KLT gravity formula

The \((n-2)!\) version of the KLT relation, given by [36, 43], is

$$\begin{aligned} M_n^\textrm{gravity} = \lim _{s_{1P}\rightarrow 0} \sum _{P,Q} {1\over s_{1P}}{{\mathcal {A}}}(1Pn)S(P|Q)_1 {\tilde{{\mathcal {A}}}}(1Qn), \end{aligned}$$
(5.27)

where the double sum is over permutations of \(23\ldots n-1\). In this formula, the gauge theory partial amplitudes \(\tilde{{\mathcal {A}}}(1Qn)\) and \({{\mathcal {A}}}(1Pn)\) are understood to have independent sets of polarization vectors, but the same external momenta. It is not immediately obvious that the limit on the RHS of (5.27) is well defined. As we will see, the cancellation of the \(1/s_{1P}\) pole on the RHS is best understood using the derivation of \(S(P|Q)_1\) in Sect. 5.1, above.

A Lie polynomial version of (5.27) follows immediately from our results in Sect. 5.1. Indeed, (5.17) shows that the off-shell KLT matrix satisfies a KLT-like relation:

$$\begin{aligned} b(P|Q) = \sum _{R,U} b(P|iR)S(R|U)_i b(iU|Q). \end{aligned}$$
(5.28)

To obtain a relation of the form (5.27), write

$$\begin{aligned} A(1P,n):= s_{1P} b(1P) = s_{1P}\sum _{R\in S_{n-2}} b(1P|1R)\ell (1R), \end{aligned}$$
(5.29)

for the on-shell ‘Lie polynomial partial amplitude’. We also write

$$\begin{aligned} M_n = \sum _{P,Q\in S_{n-2}} s_{1P} \,\ell (1P)b(1P|1Q)\ell (1Q) \end{aligned}$$
(5.30)

for the Lie polynomial valued gravity-like amplitude. Notice that, by (3.19), \(M_n\) is equivalently written as

$$\begin{aligned} M_n = s_{12\ldots n-1} \sum _{|\Gamma | = n-1} \frac{\Gamma \otimes \Gamma }{s_\Gamma }. \end{aligned}$$
(5.31)

Then (5.28) implies the KLT relation

$$\begin{aligned} M_n = \lim _{s_{1P}=0}\sum _{P,Q\in S_{n-2}} {1\over s_{1P}}\, S(P|Q)_1\, A(1Pn)\, \otimes \, A(1Qn). \end{aligned}$$
(5.32)

All the double poles from the right-hand side of (5.27) are manifestly absent in its free Lie algebra version (5.32) due to the generalized KLT matrix property (5.28). The only poles in (5.28) are those that appear in b(1P|1Q).

Another manifestation of the cancellation of the \(1/s_{1P}\) pole follows from (5.19), above. This implies that

$$\begin{aligned} \sum _Q S^{\ell }(1P|1Q)m(1Q,n|R,n) = 0, \end{aligned}$$
(5.33)

where \(m(1Q,n|R,n) = \lim _{s_{iP}\rightarrow 0} s_{iP} b(1Q|R)\). The expression vanishes because there is no \(1/s_{iP}\) pole in (5.9). This is a remarkable identity, because the components \(S(P,Q)_i\) do not have \(s_{iP}\) as a factor. On the other hand, b(iPiQ) certainly does have a \(1/s_{iP}\) pole. This cancellation is the key reason why the KLT relation, (5.27), is well defined. As discussed below, these results will also make it clear why the existence of a ‘BCJ form’ for gravity (i.e. of the form of \(M_n\)) is equivalent to existence of the KLT relation of this kind.

To illustrate these surprising cancellations, we end this section with an example of the four-point gravity-like amplitude and its KLT relation. On the one hand, \(M_4\) can be written as

$$\begin{aligned} M_4 ={[[1,2],3]\otimes [[1,2],3] \over s_{12}} + {[[1,3],2]\otimes {[}[1,3],2]\over s_{13}} + {[1,[2,3]]\otimes [1,[2,3]]\over s_{23}}.\nonumber \\ \end{aligned}$$
(5.34)

On the other hand, we can present is as a KLT sum using (5.32) as:

$$\begin{aligned} \begin{aligned} M_4&= \lim _{s_{123}\rightarrow 0} \frac{1}{s_{123}} \big [s_{12}s_{23}A(123,4)\otimes A(123,4) - s_{12}s_{13} A(123,4)\otimes A(132,4) \\&\qquad {}-s_{12}s_{13} A(132,4)\otimes A(123,4) + s_{13}s_{23} A(132,4)\otimes A(132,4)\big ],\\\end{aligned} \end{aligned}$$

where, for example,

$$\begin{aligned} A(123,4) = \frac{[[1,2],3]}{s_{12}} + \frac{[1,[2,3]]}{s_{23}}. \end{aligned}$$
(5.36)

The poles and numerators in this sum organise to cancel the \(s_{123}\) pole and give (5.34) in the limit.

5.4 Discussion: generalized Jacobi identities

The main property of the S-bracket \(\{~,~\}\) is that \(b(\{P,Q\}) = [b(P),b(Q)]\). However, since b and S are inverse, this also implies the surprising identity,

$$\begin{aligned} \{P,Q\} = S([b(P),b(Q)]). \end{aligned}$$
(5.37)

The bracket \(\{P,Q\}\) is polynomial in the Mandelstam variables, whereas S([b(P), b(Q)]) is naively a rational function of the Mandelstam variables. As observed in Sect. 5.1, the S-bracket is a Lie bracket, and so satisfies

$$\begin{aligned} \{1,2\}\sim - \{2,1\}\,\qquad \{\{1,2\},3\}+\{\{2,3\},1\}+\{\{3,1\},2\} \sim 0, \end{aligned}$$
(5.38)

as identities in \({\mathscr {L}}^*\otimes {{\mathcal {M}}}\). It follows that \(\{~,~\}\) also satisfies

$$\begin{aligned} \ell \{PiQ\}\sim -\ell \{i\ell [P]Q\} \end{aligned}$$
(5.39)

which can be deduced from two applications of (2.4). This makes it clear that the generalized KLT matrix, \(S^\ell (P,Q)\), given in (5.12), satisfies

$$\begin{aligned} S(XiY|Q) = - S(i\ell [X]Y|Q), \end{aligned}$$
(5.40)

which is called the ‘generalized Jacobi identities’ in [13]. This property of the generalized KLT matrix has no analog for the standard KLT matrix, \(S(XiY|Q)_j\), because of the fixed leg j.Footnote 6

6 The Contact Term Map as a Lie Co-bracket

A series of studies of string theory correlators and BCJ numerators experimentally discovered the so-called contact term map, which appears in the action of the BRST operator and in studies of Yang-Mills BCJ numerators [23] in the context of Berends Giele recursion. This section identifies the contact term map as the Lie co-bracket dual to \(\{~,~\}\). The main properties of the contact map follow directly.

Definition

(Contact term map). The contact term map, C, is the dual of \(\{~,~\}\). For a Lie monomial, \(\Gamma \),

$$\begin{aligned} (P\otimes Q, C(\Gamma )):= (\{P,Q\}, \Gamma ). \end{aligned}$$
(6.1)

An explicit formula for \(C(\Gamma )\) follows from (6.1) by choosing a basis, for example:

$$\begin{aligned} C(\Gamma ) = \sum _{P,Q} (\{P,Q\},\Gamma )\; \ell [P]\otimes \ell [Q], \end{aligned}$$
(6.2)

where the sum is over a Lyndon basis of \({\mathscr {L}}^*\).

It is convenient to write

$$\begin{aligned} P\wedge Q:= P\otimes Q - Q\otimes P. \end{aligned}$$
(6.3)

Then the first few examples of the map C are

$$\begin{aligned} \begin{aligned} ~&C([1,2]) = (k_1\cdot k_2) (1\wedge 2 ),\\~&C([1,[2,3]]) = (k_2\cdot k_3)\left( [1,2]\wedge 3 + 2\wedge [1,3]\right) + (k_{1}\cdot k_{23}) \left( 1 \wedge [2,3]\right) . \end{aligned} \end{aligned}$$
(6.4)

C satisfies the dual Jacobi identity,

$$\begin{aligned} (C\otimes Id)\circ C - (Id\otimes C)\circ C - (Id\otimes A) \circ (C\otimes 1) \circ C = 0, \end{aligned}$$
(6.5)

where A is the swap map: \(X\otimes Y\mapsto Y\otimes X\). This dual Jacobi identity follows immediately from the Jacobi identity satisfied by the S-bracket. Moreover, recall that

$$\begin{aligned} b(\{P,Q\}) = [b(P),b(Q)]. \end{aligned}$$
(6.6)

This can be used to show:

Lemma

C satisfies [23]

$$\begin{aligned} C(b(P)) = \sum _{P=XY} b(X)\wedge b(Y). \end{aligned}$$
(6.7)

Proof

(6.6) implies that

$$\begin{aligned} (P\otimes Q, C(b(R))) = ([b(P),b(Q)], R). \end{aligned}$$
(6.8)

The RHS can be expanded by deconcatenation (as in the derivation of (3.10)):

$$\begin{aligned} RHS = \sum _{R=XY} (b(P),X)(b(Q),Y) - (X\leftrightarrow Y). \end{aligned}$$
(6.9)

But b is self-adjoint, and so

$$\begin{aligned} (P\otimes Q,C(b(R))) = \sum _{R=XY} (P,b(X))(Q,b(Y)) - (X\leftrightarrow Y), \end{aligned}$$
(6.10)

and this is equivalent to (6.7). (6.7) can also be checked using the formula, (6.2). \(\square \)

We finish this section by deriving a recursive formula for C. First define the standard extension of the adjoint representation of \({\mathscr {L}}\) to \({\mathscr {L}}\otimes {\mathscr {L}}\):

$$\begin{aligned} \begin{aligned} \left[ P,X\otimes Y\right]&:= \left[ P,X\right] \otimes Y + X\otimes [P,Y],\\\left[ X\otimes Y,Q\right]&:= [X,Q]\otimes Y + X\otimes [Y,Q]. \end{aligned} \end{aligned}$$

This makes \({\mathscr {L}}\otimes {\mathscr {L}}\) into an adjoint representation of \({\mathscr {L}}\).

Lemma

(Recursion). For \(\Gamma _1,\Gamma _2\in {\mathscr {L}}\), the action of C on \(\left[ \Gamma _1,\Gamma _2\right] \) is given by

$$\begin{aligned} C([\Gamma _1,\Gamma _2]):= k_{1}\cdot k_{2}\, \Gamma _1\wedge \Gamma _2+[C(\Gamma _1), \Gamma _2] + \left[ \Gamma _1,C(\Gamma _2)\right] , \end{aligned}$$
(6.11)

where \(k_1\) and \(k_2\) are the momenta associated to \(\Gamma _1\) and \(\Gamma _2\). With \(C(i):=0\), (6.11) can be taken as a definition of C, as in [23].

Proof

This is a consequence of the identity, (B.5). We again use deconcatenation, as in (6.8), to write

$$\begin{aligned} (\{P,Q\},[\Gamma _1,\Gamma _2]) = (\delta \{P,Q\}, \Gamma _1\otimes \Gamma _2) - (1\leftrightarrow 2). \end{aligned}$$
(6.12)

Then (B.5) gives

$$\begin{aligned} \begin{aligned} (P\otimes Q,C(\left[ \Gamma _1,\Gamma _2\right] ))&= k_{P}\cdot k_{Q} (P,\Gamma _1)(Q,\Gamma _2) \\&+ \sum _{P=XY} (X,\Gamma _1)(\{Y,Q\},\Gamma _2)\\ {}&- \sum _{Q=XY} (\{P,X\},\Gamma _1)(Y,\Gamma _2) - (1\leftrightarrow 2). \end{aligned} \end{aligned}$$
(6.13)

But \((\{Y,Q\},\Gamma _2) = (Y\otimes Q,C(\Gamma _2))\) and \((\{P,X\},\Gamma _1) = (P\otimes X, C(\Gamma _1))\), and so (6.13) is equivalent to (6.11). \(\square \)

The recursive relation, (6.11), can be solved to find an explicit formula for \(C(\Gamma )\). To see this, write

$$\begin{aligned} D(\Gamma ):= k_1\cdot k_2\, \Gamma _1\wedge \Gamma _2, \end{aligned}$$
(6.14)

for Lie monomials \(\Gamma = \left[ \Gamma _1,\Gamma _2\right] \). Nesting (6.11) then leads to a sum over the edges in the tree \(T_\Gamma \). For \(I \in \Gamma \) an edge in the tree \(T_\Gamma \), let \(\Gamma _I\) be the associated Lie monomial. For example, if \(\Gamma =[[1,2],[3,4]]\), then \(\Gamma _{12} = [1,2]\). Then the solution to the recursion, (6.11), is

$$\begin{aligned} C(\Gamma ) = \sum _{I\in \Gamma } \Gamma /\Gamma _{I}[D(\Gamma _I)], \end{aligned}$$
(6.15)

where \(\Gamma /\Gamma _{I}[D(\Gamma _I)]\) denotes the replacement, in \(\Gamma \), of \(\Gamma _I\) by \(D(\Gamma _I)\). For example, if \(\Gamma =[[1,2],[3,4]]\), then \(C(\Gamma )\) is

$$\begin{aligned} C(\Gamma ) = D(\Gamma ) + \Gamma /\Gamma _{12}[D(\Gamma _{12})] + \Gamma /\Gamma _{34}[D(\Gamma _{34})], \end{aligned}$$
(6.16)

where

$$\begin{aligned} \Gamma /\Gamma _{34}[D(\Gamma _{34})] = s_{34} [[1,2],3\otimes 4] = s_{34} \left( [[1,2],3]\otimes 4 + 3\otimes [[1,2],4] \right) , \end{aligned}$$
(6.17)

and

$$\begin{aligned} D(\Gamma ) = k_{12}\cdot k_{34} \, [1,2]\wedge [3,4]. \end{aligned}$$
(6.18)

We will see applications of these structures in the discussion of numerators in Sect. 7.5.

7 BCJ Numerators

In the previous sections we used Berends–Giele recursion and the properties of Lie polynomials to relations for our Lie polynomial version of biadjoint scalar theory. When dressed with BCJ numerators these results apply to other gauge theories. In this section, we first argue that any gauge theory that admits the Berends–Giele framework can be understood in this way. There are clear distinctions between the behaviour of numerators for on-shell amplitudes, versus those for the partially off-shell Berends–Giele multiparticle fields; for a given Berends–Giele description we will see that numerators are unique, but ambiguities arise due to gauge transformations and field redefinitions. We then use this to study examples of numerators \(N^\textrm{theory}_\Gamma \) for different theories.

7.1 BCJ numerators off-shell

Many gauge theories have perturbation expansions that can be expanded in colour factors of the form (2.13); this is the case for any gauge theory whose Lagrangian is second order and single trace in the Lie algebra. The tree amplitudes for such gauge theory can be solved using Berends–Giele recursion for \({\mathscr {L}}\)-valued fields. The recursion produces colour-ordered Berends–Giele currents, \(B^\textrm{theory}(P)\), that give the partial amplitudes of the theory as

$$\begin{aligned} \mathcal {A}^\textrm{theory}(Pn)=\lim _{s_P\rightarrow 0}s_P B^\textrm{theory}(P)\cdot \epsilon _n, \end{aligned}$$
(7.1)

where \(\epsilon _n\) is the polarization of the nth particle. These partial amplitudes satisfy shuffle relations as a consequence of the B(P) being defined as functions of \(P\in {\mathscr {L}}^*\). The amplitudes \(\mathcal {A}(Pn)\) obtained this way are necessarily invariant under both field redefinitions and gauge transformations. However, the Berends–Giele currents, \(B^\textrm{theory}(P)\), are not invariant. Different choices of gauge fixing and field redefinition lead to different Berends–Giele recursions and different associated currents.

We now study the properties of these \(B^\textrm{theory}(P)\) for a general gauge theory, and in the next section derive necessary conditions for this theory to have KLT relations. Dropping the superscript ‘theory’, fix some gauge theory, and let B(P) be the BG currents of the theory obtained using some choices of gauge fixing and field redefinitions. We derive B(P) using the \({\mathscr {L}}\)-valued recursion method given for biadjoint scalar in Sect. 3 and for Yang-Mills in appendix A. This means that the B(P) are functions of \({\mathscr {L}}^*\) and so satisfy

(7.2)

for \(R,S\ne \emptyset \). This allows us to define

$$\begin{aligned} {\tilde{N}}_\Gamma := B(\{\Gamma \})\,, \end{aligned}$$
(7.3)

where we recall that \(\{\Gamma \}\) is only well-defined as an element of \({\mathscr {L}}^*\). We call the \({\tilde{N}}_\Gamma \) ‘off-shell BCJ numerators’ for the currents B(P). The map, \({\tilde{N}}: {\mathscr {L}}\rightarrow {\mathcal {K}}\), given by

$$\begin{aligned} {\tilde{N}}: \Gamma \mapsto {\tilde{N}}_\Gamma \end{aligned}$$
(7.4)

is a homomorphism, since the S-bracket is a Lie bracket. Moreover, the \({\tilde{N}}_\Gamma \) defined by (7.3) are unique. Note that the numerators \({\tilde{N}}_\Gamma \) may have free indices that we do not write (e.g. for YM, \({\tilde{N}}_\Gamma \) has a free gluon polarization index). It follows from the results of Sect. 5 that \({\tilde{N}}\) relates B(P) to the biadjoint scalar b(P) by

$$\begin{aligned} B(P)={\tilde{N}}(b(P)). \end{aligned}$$
(7.5)

To be more explicit, we write (7.5) and (7.3) in a basis. Using (5.24), the \({\tilde{N}}_\Gamma \) for a basis of \({\mathscr {L}}_n\) is given by

$$\begin{aligned} {\tilde{N}}_{\ell [P]}= B(\ell \{P\})=\sum _{Q\in Basis}S^\ell (P|Q) B(Q)\,. \end{aligned}$$
(7.6)

If we multiply this by \(s_P\), contract free indices with the nth particle polarization, and take the limit \(s_P\rightarrow 0\), (7.6) becomes the formula for on-shell amplitude BCJ numerators given in [44] (see also [42]). Also, in this basis, the relation (7.5) reads

$$\begin{aligned} \sum _{Q\in Basis} {\tilde{N}}_{\ell [Q]} \, b(P|Q)= \sum _{Q,R\in Basis} S^\ell (Q|R) \, B(R)\, b(P|Q)\,. \end{aligned}$$
(7.7)

The off-shell BCJ numerators defined by \({\tilde{N}}\) in turn define ‘on-shell BCJ numerators’ for the amplitude, given by

$$\begin{aligned} N_\Gamma = \lim _{s_P\rightarrow 0} {\tilde{N}}_\Gamma \cdot \epsilon _n. \end{aligned}$$
(7.8)

This again defines a homomorphism \(N: \Gamma \mapsto N_\Gamma \) out of \({\mathscr {L}}_n\). Moreover, by (7.1) and (7.5), the partial amplitudes of the theory are given by

$$\begin{aligned} {{\mathcal {A}}}(Pn) = N(m(Pn)), \end{aligned}$$
(7.9)

where m(Pn) are the \({\mathscr {L}}\)-valued biadjoint scalar amplitudes (\(m(Pn) = s_P b(P)\)). Unlike the \({\tilde{N}}_\Gamma \), the numerators \(N_\Gamma \) are subject to a new gauge freedom, given by replacements

$$\begin{aligned} N_\Gamma \mapsto N_\Gamma + N^\textrm{gauge}_\Gamma , \end{aligned}$$
(7.10)

where the gauge freedom is spanned by ‘trivial’ numerators of the form

$$\begin{aligned} N^\textrm{gauge}_\Gamma = \sum _{R,S} C_{R,S} (\{R,S\},\Gamma ), \end{aligned}$$
(7.11)

for some arbitrary kinematic functions \(C_{R,S} \in \mathcal {K}\). These trivial numerators give

$$\begin{aligned} N_\Gamma ^\textrm{gauge} ( b(P))= \sum _{R,S} C_{R,S} b(P|\{R,S\})= \sum _{R,S} C_{R,S} (P, [b(R),b(S)])\,. \end{aligned}$$
(7.12)

This contribution vanishes on-shell, because the RHS (7.12) has no \(1/s_P\) pole, and so vanishes when multiplied by \(s_P\), in the \(s_P\rightarrow 0\) limit. This gauge freedom, in a different guise, led to the original discovery of the BCJ relations in [3], where it was argued that there are \((n-2)!-(n-3)!\) independent pure gauge numerators of this form. However, (7.12) shows that these no longer vanish off-shell. The off-shell numerators are therefore not subject to the freedom, (7.11), and are unique once a choice of Berends Giele formulation has been made.

7.2 BCJ numerators and KLT

Two gauge theories with BG currents B(P) and \(B'(P)\) satisfy an off-shell KLT relation if we can write

$$\begin{aligned} M=\sum _{P,Q\in Basis} S^\ell (P|Q)B(P) B'(Q)\,, \end{aligned}$$
(7.13)

for M the current for some gravity-like theory. If M arises from a local field theory, the current M must have the appropriate kinematic poles for factorization. This places constraints on the RHS of (7.13), which must have the same poles. Note that

$$\begin{aligned} \sum _P S^\ell (P|Q) B(P) = {\tilde{N}} (\ell [Q]). \end{aligned}$$
(7.14)

So the only poles on the RHS of (7.13) come from \(B'(Q)=N'(b(Q))\) and \({\tilde{N}}\). To guarantee the correct poles, we demand that the off-shell numerators \({\tilde{N}}\) and \({\tilde{N}}'\) have no kinematic poles. We will say that \({\tilde{N}}\) is local if it has no poles in the Mandelstam variables.

For a generic choice of gauge theory and currents B(P), there is no reason to expect that the off-shell numerators obtained from (7.3) will be local. For example, an obstruction to locality for Yang-Mills Berends Giele currents in Lorenz gauge is identified in [15, 23, 28].Footnote 7 However, if the off-shell numerators \({\tilde{N}}\) are local, they restrict to give local on-shell numerators for the amplitudes of the theory, using (7.8).Footnote 8 Local BCJ numerators are known for both the nonlinear sigma model (NLSM), [42,43,44] and for (super-)Yang-Mills [11, 15, 23, 44,45,46,47]. See [5] for a review.

If the off-shell numerators, \({\tilde{N}}\), of a gauge theory are local, this implies that the theory satisfies the on-shell BCJ relations. Indeed,

$$\begin{aligned} B(\{P,Q\})={\tilde{N}}(b(\{P,Q\})). \end{aligned}$$
(7.15)

If \({\tilde{N}}\) is local, \(B(\{P,Q\})\) will have no \(1/s_{PQ}\) pole, since \(b(\{P,Q\})\) has no \(1/s_{PQ}\) pole. It follows that the partial amplitudes associated to B(P) satisfy

$$\begin{aligned} A^\textrm{theory}(\{P,Q\},n)= \lim _{s_{PQ}\rightarrow 0} s_{PQ} B^\textrm{theory} (\{P,Q\})\cdot \epsilon _n=0, \end{aligned}$$
(7.16)

which are the fundamental BCJ relations. Given this, we propose:

Conjecture: If a theory’s partial amplitudes satisfy the BCJ relations, then there exists a field redefinition and gauge fixing of its Berends–Giele recursion so that it has local off-shell numerators.

The following subsections review what is known in the cases of the biadjoint scalar, NLSM, Yang-Mills, and Z theory.

7.3 Biadjoint scalar

The Berends–Giele recursion for the biadjoint scalar theory was studied in Sect. 3, where we obtained ‘currents’,

$$\begin{aligned} b(P|Q) = (b(P),Q), \end{aligned}$$
(7.17)

where recall that b(P) takes values in \({\mathscr {L}}\), and ( ,  ) is the pairing between \({\mathscr {L}}\) and \({\mathscr {L}}^*\). This can be written as

$$\begin{aligned} b(P|Q) = N(Q) (b(P)), \end{aligned}$$
(7.18)

where N(Q) is the homomorphism \(\Gamma \mapsto (Q,\Gamma )\). In other words, the off-shell numerators for these currents is

$$\begin{aligned} {\tilde{N}}_\Gamma (Q) = (Q,\Gamma ). \end{aligned}$$
(7.19)

The on-shell numerators are then

$$\begin{aligned} N_\Gamma (Q) = (Q,\Gamma ). \end{aligned}$$
(7.20)

With these numerators, the biadjoint amplitudes m(Pn|Qn) are then given by,

$$\begin{aligned} m(Pn|Qn) = \lim _{s_P\rightarrow 0}s_P(P, b(Q))) = \lim _{s_P\rightarrow 0} s_P \sum _\Gamma \frac{(P,\Gamma ) N_\Gamma (Q)}{s_\Gamma }, \end{aligned}$$
(7.21)

As observed in Sect. 4.2, the Kleiss-Kuijf relations follow from Ree’s theorem as because b(Q) is a Lie polynomial, while the BCJ relations follow from the \(\{,\}\)-bracket.

Examples of biadjoint amplitudes with the numerators N(Q) are obtained from b(123) and b(1234):

$$\begin{aligned} \begin{aligned} m(1234|1234)&= {\langle 123,[[1,2],3]\rangle \over s_{12}} + {\langle 123,[1,[2,3]]\rangle \over s_{23}} ={1\over s_{12}}+{1\over s_{23}}, \\m(12345|14235)&= {\langle 1234, [ [ [ 1, 4 ], 2 ], 3 ] \rangle \over s_{14} s_{124}} + {\langle 1234, [ [ 1, [ 4, 2 ] ], 3 ] \rangle \over s_{124} s_{24}}\\&+ {\langle 1234, [ [ 1, 4 ], [ 2, 3 ] ] \rangle \over s_{14} s_{23}}\\&+ {\langle 1234, [ 1, [ [ 4, 2 ], 3 ] ] \rangle \over s_{24} s_{234}} + {\langle 1234, [ 1, [ 4, [ 2, 3 ] ] ] \rangle \over s_{234} s_{23}}= - {1\over s_{23}s_{234}}, \end{aligned} \end{aligned}$$
(7.22)

where the expansion of b(123) and b(1234) can be found in (3.18).

Remark. This perspective on biadjoint scalar theory was developed in a series of papers. [14] showed that these amplitudes could be derived from solving the biadjoint scalar field equations to get Berends–Giele ‘currents’ b(P|Q). In [13], these ‘currents’ were rewritten in terms of b(P), with values in planar binary trees. Following [8], it was also pointed out in [14] that this Berends–Giele multiparticle field b(iP|iQ) gives rise to an efficient algorithm to compute the inverse of the KLT matrix \(S(P|Q)_i\), but no proof was given for this statement. The statement that the KLT matrix is the inverse to the “biadjoint amplitudes” had already been argued on general grounds in [20,21,22]. The direct proofs of this statement and of the recursion for \(S(P|Q)_i\) conjectured in [14], are given above in Sect. 5.

7.4 NLSM

NLSM amplitudes [42] can be studied by BG recursion, as above for biadjoint scalar theory. Several authors have suggested the following formula the (off-shell) BCJ numerators for NLSM:

$$\begin{aligned} {\tilde{N}}^\textrm{NLSM}_\Gamma := \sum _P (P,\Gamma )S^\ell (P|P), \quad \hbox {for } {|\Gamma |}=n \end{aligned}$$
(7.23)

where the sum is over a basis. It is clear that \({\tilde{N}}^\textrm{NLSM}\) is a homomorphism out of \({\mathscr {L}}_n\). We refer to [26] for a proof of (7.23) using methods adapted to \(\mathcal{M}_{0,n}\); alternatively, this result also follows from the Berends–Giele currents produced by the Lagrangian for NLSM introduced in [48]. The NLSM amplitudes are then given by

$$\begin{aligned} A^\textrm{NLSM}(Pn) = \lim _{s_P\rightarrow 0}s_P {\tilde{N}}^\textrm{NLSM}\big (b(P)\big ). \end{aligned}$$
(7.24)

The numerators at \(n=4\) points are given by

$$\begin{aligned} \begin{aligned} N^\textrm{NLSM}_{[[1,2],3]}&= s_{12}(s_{23}+s_{13}),\\N^\textrm{NLSM}_{[1,[2,3]]}&= s_{12}(s_{23}+s_{13}) - s_{13}(s_{23}+s_{12}) = s_{23}(s_{12}-s_{13}). \end{aligned} \end{aligned}$$
(7.25)

Substituting this into (7.24) then gives the 4-point amplitude:

$$\begin{aligned} A^\textrm{NLSM}(1234) = s_{123}{{\tilde{N}}}^\textrm{NLSM}( b(123)) = s_{12}+s_{23}. \end{aligned}$$
(7.26)

From the existence of the map \({\tilde{N}}^\textrm{NLSM}\), and the results in Sect. 4, it follows that the KK and BCJ relations are automatically satisfied by the NLSM amplitudes, as was first proved using amplitudes methods in [49]. In [42], master BCJ numerators with fixed legs 1 and n of the NLSM amplitudes were conjectured to be \(N_{1|P|n}=(-1)^{n/2}S(P|P)_1\) for even n. This follows from (7.23).

7.5 Super-Yang–Mills

String theory OPEs (or supersymmetric BG recursion) can be used to recursively compute local SYM multiparticle superfields \(\{A^\Gamma _\alpha ,A^\mu _\Gamma ,W^\alpha _\Gamma ,F^{\mu \nu }_\Gamma \}\), \(\mu ,\nu =1,\ldots ,10\) in the BCJ gauge which are labelled by Lie monomials \(\Gamma \in {\mathscr {L}}\) [6, 23, 28]. As demonstrated in [23, 28], the words labelling these superfields satisfy ‘generalized Jacobi identities’ (as in Sect. 5.4). For example,

$$\begin{aligned} A^\mu _{[[1,2],[3,4]]} = A^\mu _{[[[1,2],3],4]} -A^\mu _{[[[1,2],4],3]}. \end{aligned}$$
(7.27)

This leads to a proposal for local off-shell BCJ numerators \(\tilde{N}_\mu ^\textrm{SYM}\) from which SYM tree amplitudes arise from

$$\begin{aligned} \mathcal {A}^\textrm{SYM}(Pn) =A^\mu _n\lim _{s_P\rightarrow 0} s_P {\tilde{N}}_\mu ^\textrm{SYM} \big ( b(P)\big ), \end{aligned}$$
(7.28)

where \(A^\mu _n\) is the polarization vector of the nth particle while the action of \({{\tilde{N}}}_\mu ^\textrm{SYM}\) on the Lie polynomials \(\Gamma \) in (3.10) is given by

$$\begin{aligned} A^\mu _n{{\tilde{N}}}_{\mu \Gamma }^\textrm{SYM}:= A_{n\mu } A^\mu _\Gamma \,, \end{aligned}$$
(7.29)

in terms of the \(\theta =0\) component of the superfield \(A^m_\Gamma \). This representation manifestly satisfies the BCJ identities. For example, the five-point color-ordered amplitudes in the Kleiss–Kuijf basis following from the maps (7.28) and (7.29) are given by

$$\begin{aligned} \begin{aligned} \mathcal {A}(12345) = \Big ({A^\mu _{[ [ [ 1, 2 ], 3 ], 4 ]} \over s_{12} s_{45}} + {A^\mu _{[ 1, [ [ 2, 3 ], 4 ] ]} \over s_{23} s_{51}} + {A^\mu _{[ [ 1, 2 ], [ 3, 4 ] ]} \over s_{12} s_{34}} + {A^\mu _{[ [ 1, [ 2, 3 ] ], 4 ]} \over s_{45} s_{23}} + {A^\mu _{[ 1, [ 2, [ 3, 4 ] ] ]} \over s_{51} s_{34}}\Big ) A_{5\mu } \end{aligned}\nonumber \\ \end{aligned}$$
(7.30)

together with the other 5 permutations of 2, 3, 4. The BCJ numerator identities are manifestly satisfied. For example, comparing the above parametrization with the one in [3] leads to

$$\begin{aligned} n_3 = A^m_{[ [ 1, 2 ], [ 3, 4 ] ]}A^m_5,\qquad n_5 = A^m_{[ 1, [ 2, [ 3, 4 ] ] ]}A^m_5,\qquad n_8 = A^m_{[ [ 1, [ 4, 3 ] ], 2 ]}A^m_5,\qquad \end{aligned}$$
(7.31)

from which the identity \(n_3-n_5+n_8=0\) can easily be verified.

In section 6.1.2 of [23] two alternatives were presented for the map \({{\tilde{N}}}_\mu ^\textrm{SYM}\) writing for \(\Gamma =[\Gamma _1,\Gamma _2]\)

$$\begin{aligned} A^\mu _n{{\tilde{N}}}_{\mu \Gamma }^\textrm{SYM}:= \langle V_{\Gamma _1}V_{\Gamma _2}V_n\rangle = H'_{\Gamma _1,\Gamma _2,n} \end{aligned}$$
(7.32)

where the superfields \(V_\Gamma \) are defined in [6, 23] and are related to the unintegrated vertex operator in the pure spinor formalism and \(\langle \cdot \rangle \) represents the pure spinor bracket from [24]. The superfields \(H'_{P,Q,R}\) are computed to all orders in [23].

The contact term map C plays a key role in the definition of the \(V_\Gamma \) [6, 23, 28]. Associated to the \(V_\Gamma \) are SYM Berends–Giele currents, \(M_P = V(b(P))\), where \(V(\Gamma ):= V_\Gamma \). For example,

$$\begin{aligned} M_1 = V_1, \quad M_{12} = {V_{[1,2]}\over s_{12}},\quad M_{123} = {V_{[[1,2],3]}\over s_{12}s_{123}} + {V_{[1,[2,3]]}\over s_{23}s_{123}}. \end{aligned}$$
(7.33)

The equation of motion of \(M_{P}\) under the action of the pure spinor BRST charge Q is computed in examples and is conjecturally

$$\begin{aligned} QM_P = \sum _{XY=P}M_XM_Y. \end{aligned}$$
(7.34)

Whereas the equation of motion for \(V_\Gamma \) is conjecturally

$$\begin{aligned} QV_\Gamma = (V\otimes V, C(\Gamma )) = \sum _{P,Q} V_{\ell (P)}V_{\ell (Q)} (P\otimes Q,C(\Gamma )), \end{aligned}$$
(7.35)

where the sum is over a basis. For example,

$$\begin{aligned} \begin{aligned} QV_{[1,2]}&=(k_1\cdot k_2) V_1V_2,\\QV_{[[1,2],3]}&= (k_1\cdot k_2)(V_{[1,3]}V_2 + V_1V_{[2,3]}) + (k_{12}\cdot k_3)V_{[1,2]}V_3,\\QV_{[1,[2,3]]}&= (k_2\cdot k_3)\big (V_{[1,2]}V_3 + V_2V_{[1,3]}\big ) + (k_1\cdot k_{23})V_1V_{[2,3]}, \end{aligned} \end{aligned}$$
(7.36)

Equation (6.7) above, allows one to prove (7.34) as an immediate consequence of equation (7.35), as explained in [23]. This was previously only known to be true examples. It remains to prove (7.35) itself, using string theory methods.

7.6 Z-theory and the open superstring

In order to upgrade the discussion in the previous subsection to the open superstring with \({\alpha ^{\prime }}\) corrections we will exploit non-abelian Z-theory to evaluate \({\alpha ^{\prime }}\) expansions of open string disk integrals. Z-theory disk integrals can be computed via the Berends–Giele method as

$$\begin{aligned} Z(P,n|Q,n) = \lim _{s_P\rightarrow 0}s_P b^{\alpha ^{\prime }}(P|Q) \end{aligned}$$
(7.37)

where the Berends–Giele currents \(b^{\alpha ^{\prime }}(P|Q)\) are computed using the equations of motion of the non-abelian Z-theory [50]. In principle, these can be obtained from ‘currents’ \(b^{\alpha ^{\prime }}(P)\) with values in \({\mathscr {L}}_n\) by computing \({\alpha ^{\prime }}\) corrections to the Catalan expansion (3.10) and defining

$$\begin{aligned} b^{\alpha ^{\prime }}(P|Q) = (b^{\alpha ^{\prime }}(P), Q) \end{aligned}$$
(7.38)

which together with (7.37) implies that Z-theory admits a free Lie algebra representation. The \(b^{\alpha ^{\prime }}(P|Q)\) are known up to \({\alpha ^{\prime }}^7\) order following [30]. Using this result, we can compute \(b^{\alpha ^{\prime }}(P)\) to the same order. The first few orders are given by

$$\begin{aligned} \begin{aligned} s_P b^{\alpha ^{\prime }}(P)&= \sum _{XY=P}[b^{\alpha ^{\prime }}(X),b^{\alpha ^{\prime }}(Y)] \\&\quad + {\alpha ^{\prime }}^2\zeta _2\sum _{XYZ=P}k_X\cdot k_Y[b^{\alpha ^{\prime }}(X),[b^{\alpha ^{\prime }}(Z),b^{\alpha ^{\prime }}(Y)]]\\&\quad - {\alpha ^{\prime }}^2\zeta _2\sum _{XYZ=P}k_Y\cdot k_Z[[b^{\alpha ^{\prime }}(X),b^{\alpha ^{\prime }}(Y)],b^{\alpha ^{\prime }}(Z)]\\&\quad + {\alpha ^{\prime }}^2\zeta _2\sum _{XYZW=P}[[b^{\alpha ^{\prime }}(X),b^{\alpha ^{\prime }}(Y)],[b^{\alpha ^{\prime }}(W),b^{\alpha ^{\prime }}(Z)]]\\&\quad - {\alpha ^{\prime }}^2\zeta _2\sum _{XYZW=P}[[b^{\alpha ^{\prime }}(X),b^{\alpha ^{\prime }}(Z)],[b^{\alpha ^{\prime }}(W),b^{\alpha ^{\prime }}(Y)]] +{{\mathcal {O}}}({\alpha ^{\prime }}^3) \end{aligned} \end{aligned}$$
(7.39)

and so on. It remains to discover a simple way to extend this calculation to all orders in \({\alpha ^{\prime }}\).

Note that \(b^{\alpha ^{\prime }}(P|Q) \ne b^{\alpha ^{\prime }}(Q|P)\). Indeed, \(b^{\alpha ^{\prime }}(P)\) is not a function of \(P\in {\mathscr {L}}_n^*\), but rather satisfies shuffle-like relations that are twisted by the monodromies of the Z-theory disk integrals, as explained in [50]. Whereas we take the second factor in \(b^{\alpha ^{\prime }}(P|Q)\) to belong to \(Q\in {\mathscr {L}}_n^*\). It follows that , because \(b^{\alpha ^{\prime }}(P)\) is valued in \({\mathscr {L}}_n\).

Finally, we conjecture that the full open superstring disk amplitudes including \({\alpha ^{\prime }}\) corrections are given by using the super-Yang-Mills numerators \({{\tilde{N}}}^\textrm{SYM}\) from the previous subsection:

$$\begin{aligned} \begin{aligned} \mathcal {A}^\textrm{string}(P,n)&= A_n^\mu \lim _{s_P\rightarrow 0} s_P {\tilde{N}}_\mu ^\textrm{SYM}( b^{\alpha ^{\prime }}(P) ). \end{aligned} \end{aligned}$$

This proposal has been verified for the bosonic components of the amplitudes at low n.

8 Conclusions

We have seen that many nontrivial properties of gauge theory tree amplitudes follow from the properties of \({\mathscr {L}}\) and its dual. Moreover, these results are particularly important for the study of gauge theories that have KLT relations. This lead us to our main conjecture:

Conjecture: If a theory’s partial amplitudes satisfy the BCJ relations, then there exists a field redefinition and gauge fixing of its Berends–Giele recursion so that it has local off-shell numerators.

If a theory has local off-shell BCJ numerators, we showed above that this immediately implies that it satisfies the KLT relations to give the amplitudes of some gravity-like theory. We showed that, for a given gauge theory, every formulation of Berends–Giele recursion of that theory produce a unique set of off-shell numerators \({\tilde{N}}\). It is a well-posed question to ask whether there exists a field redefinition and gauge fixing of BG recursion such that the \(\tilde{N}\) are local, and we hope that this question can be answered systematically.

We conclude by explaining how our results bear on three outstanding problems in this area: the existence of so-called ‘kinematic algebras’, the existence of BCJ numerators for string theory, and the extension of KLT relations to all orders in perturbation theory.

Kinematic algebras. An ongoing topic of research is the identification of the kinematic algebra for a given theory, see the review [5]. Our results in 7 suggests a possible route to defining the kinematic algebra systematically. For a given gauge theory, BG recursion leads to some currents, B(P), and their associated off-shell numerators \({\tilde{N}}_\Gamma \). These \({\tilde{N}}_\Gamma \) satisfy

$$\begin{aligned} {\tilde{N}}_{[\Gamma _1,\Gamma _2]} = B(\{[\Gamma _1,\Gamma _2]\}). \end{aligned}$$
(8.1)

The \({\tilde{N}}_\Gamma \) take values in some (off-shell) kinematic space \(\mathcal {K}\). Motivated by (8.1), we can define a bracket

$$\begin{aligned} \{{\tilde{N}}_{\Gamma _1},{\tilde{N}}_{\Gamma _2}\}_\mathcal {K}:= B(\{[\Gamma _1,\Gamma _2]\}), \end{aligned}$$
(8.2)

where \(\{~,~\}_\mathcal {K}\) is defined the image of \({\tilde{N}}\) in \(\mathcal {K}\). This \(\{~,~\}_\mathcal {K}\) is a Lie bracket because \(\{~,~\}\) is Lie and \({\tilde{N}}\) is a homomorphism by construction. Imposing that \(\{~,~\}_\mathcal {K}\) is linear in Mandelstam variables, this definition implies a BCJ-like relation,

$$\begin{aligned} B(\{P,Q\}) = \{B(P),B(Q)\}_\mathcal {K}, \end{aligned}$$
(8.3)

where, by linearity,

$$\begin{aligned} \{B(P),B(Q)\}_\mathcal {K}:= \sum \frac{(P,\Gamma _1)}{s_{\Gamma _1}} \frac{(Q,\Gamma _2)}{s_{\Gamma _2}} \{{\tilde{N}}_{\Gamma _1},\tilde{N}_{\Gamma _2}\}_\mathcal {K}. \end{aligned}$$
(8.4)

If the off-shell numerators \({\tilde{N}}\) are local, then The definition (8.2), together with (8.1), implies that the numerators in a basis are given by complete bracketings of \(\{~,~\}_\mathcal {K}\):

$$\begin{aligned} {\tilde{N}}_{\Gamma } = \{\Gamma \}_\mathcal {K}. \end{aligned}$$
(8.5)

Moreover, if the \({\tilde{N}}\) are local, then the numerators of the amplitude are given by

$$\begin{aligned} N_{\Gamma } = \lim _{s_P\rightarrow 0} \{\Gamma \}_\mathcal {K}\cdot \epsilon _n. \end{aligned}$$
(8.6)

In this case, it would therefore be reasonable to call \(\{~,~\}_\mathcal {K}\) the “kinematic algebra” of the theory. This suggests that, for a theory satisfying BCJ relations, the existence of a kinematic algebra is another consequence of our main conjecture proposed above. (Although see [51] for an off-shell BCJ relation for NLSM that is not of the form (8.3).)

Strings and \({\alpha ^{\prime }}\) expansions. The duality between \({\mathscr {L}}_n\) and \({\mathscr {L}}_n^*\) exploited in this paper is closely related to the geometry of \({{\mathcal {M}}}_{0,n}\), the moduli space of n points on the Riemann sphere. This is because the top-dimensional homology of \({{\mathcal {M}}}_{0,n}\) is generated by cycles naturally labelled by Lie polynomials; conversely, the top-dimensional cohomology is generated by cocycles naturally labelled by the elements of \({\mathscr {L}}_{n-1}^*\). This helps to explain the success of using \({{\mathcal {M}}}_{0,n}\) integrals, which arise in CHY formulas and in ambitwitor strings, to study tree level BCJ numerators for amplitudes. See for example [10, 52, 53], and details of the connection to \({\mathscr {L}}_n\) in [26]. This connection also strongly suggests an extension of our methods to string theory at tree level.

A perturbative method for computing the \({\alpha ^{\prime }}\) expansions of tree level string amplitudes using the ‘Drinfeld associator’ was given in [54]. The calculations reviewed in Sect. 7.6 can be related to this method by the equation of motion for \(QV_P\) [55]. This equation of motion involves the contact term map, C, which is the dual of the S-bracket studied in this paper. This suggests that our results about the S-bracket will be useful for advancing the efforts to compute \({\alpha ^{\prime }}\) corrections in [50]. Moreover, the method in [54] is based on the Drinfeld associator, which is itself is a Lie series. This suggests that the duality between \({\mathscr {L}}\) and \({\mathscr {L}}^*\) will be central to advancing the use of this method to all orders in \({\alpha ^{\prime }}\).

Beyond tree level. It is an open question to discover whether KLT-like relations hold at higher loop order in the gauge theory perturbation series. As we have seen, tree level colour factors are labelled by Lie monomials, and partial tree amplitudes are labelled by permutations modulo shuffle relations. This is the leading order avatar of the more general story, at arbitrary orders in the perturbation series, in which colour factors are associated to ribbon graphs, and partial amplitudes are labelled by marked surfaces with boundary (possibly with genus \(g>0\)). The results in the present paper are essentially all derived from the Jacobi identity satisfied by Lie monomials is. Colour factors labelled by ribbon graphs at higher order satisfy analogous identities; as studied in [26]. This raises the possibility that the biadjoint scalar amplitudes at higher orders in perturbation theory play a role similar to role played by the \({\mathscr {L}}\)-valued b(P) at tree level, in giving rise to KLT-relations.