Gröbner–Shirshov bases and their calculation

Bokut, L. A.; Chen, Yuqun

doi:10.1007/s13373-014-0054-6

Gröbner–Shirshov bases and their calculation

Article
Open access
Published: 09 September 2014

Volume 4, pages 325–395, (2014)
Cite this article

Download PDF

You have full access to this open access article

Bulletin of Mathematical Sciences

Gröbner–Shirshov bases and their calculation

Download PDF

L. A. Bokut^1,2 &
Yuqun Chen²

10k Accesses
72 Citations
1 Altmetric
Explore all metrics

Abstract

In this survey we give an exposition of the theory of Gröbner–Shirshov bases for associative algebras, Lie algebras, groups, semigroups, $Ω$ -algebras, operads, etc. We mention some new Composition-Diamond lemmas and applications.

Division Algebras, Clifford Algebras, Periodicity

Article 05 February 2018

Schur inequality for Murray–von Neumann algebras and its applications

Article 18 April 2024

On the generalized n-strong Drazin inverses and block matrices in Banach algebras

Article 21 April 2024

1 Introduction

In this survey we review the method of Gröbner–Shirshov^{Footnote 1} (GS for short) bases for different classes of linear universal algebras, together with an overview of calculation of these bases in a variety of specific cases.

Shirshov (also spelled Širšov) in his pioneering work ([207], 1962) posed the following fundamental question:

How to find a linear basis of a Lie algebra defined by generators and relations?

He gave an infinite algorithm to solve this problem using a new notion of the composition (later the ‘ $s$ -polynomial’ in Buchberger’s terminology [65, 66]) of two Lie polynomials and a new notion of completion of a set of Lie polynomials (adding nontrivial compositions; the critical pair/completion (cpc-) algorithm in the later terminology of Knuth and Bendix [138] and Buchberger [67, 68]).

Shirshov’s algorithm goes as follows. Consider a set $S \subset L i e (X)$ of Lie polynomials in the free algebra $k ⟨ X ⟩$ on $X$ over a field $k$ (the algebra of non-commutative polynomials on $X$ over $k$ ). Denote by $S^{'}$ the superset of $S$ obtained by adding all non-trivial Lie compositions (‘Lie $s$ -polynomials’) of the elements of $S$ . The problem of triviality of a Lie polynomial modulo a finite (or recursive) set $S$ can be solved algorithmically using Shirshov’s Lie reduction algorithm from his previous paper [203], 1958. In general, an infinite sequence

\begin{matrix} S \subseteq S^{'} \subseteq S^{''} \subseteq \dots \subseteq S^{(n)} \subseteq \dots \end{matrix}

of Lie multi-compositions arises. The union $S^{c}$ of this sequence has the property that every Lie composition of elements of $S^{c}$ is trivial modulo $S^{c}$ . This is what is now called a Lie GS basis.

Then a new ‘Composition-Diamond lemma^{Footnote 2} for Lie algebras’ (Lemma 3 in [207]) implies that the set $I r r (S^{c})$ of all $S^{c}$ -irreducible (or $S^{c}$ -reduced) basic Lie monomials $[u]$ in $X$ is a linear basis of the Lie algebra $L i e (X | S)$ generated by $X$ with defining relations $S$ . Here a basic Lie monomial means a Lie monomial in a special linear basis of the free Lie algebra $L i e (X) \subset k ⟨ X ⟩$ , known as the Lyndon–Shirshov (LS for short) basis (Shirshov [207] and Chen–Fox–Lyndon [72], see below). An LS monomial $[u]$ is called $S^{c}$ -irreducible (or $S^{c}$ -reduced) whenever $u$ , the associative support of $[u]$ , avoids the word $\bar{s}$ for all $s \in S$ , where $\bar{s}$ is the maximal word of $s$ as an associative polynomial (in the deg-lex ordering). To be more precise, Shirshov used his reduction algorithm at each step $S$ , $S^{'}$ , $S^{''}, \dots$ . Then we have a direct system $S \to S^{'} \to S^{''} \to \dots$ and $S^{c} = \underset{\to}{l i m} S^{(n)}$ is what is now called a minimal GS basis (a minimal GS basis is not unique, but a reduced GS basis is, see below). As a result, Shirshov’s algorithm gives a solution to the above problem for Lie algebras.

Shirshov’s algorithm, dealing with the word problem, is an infinite algorithm like the Knuth–Bendix algorithm [138], 1970 dealing with the identity problem for every variety of universal algebras.^{Footnote 3} The initial data for the Knuth–Bendix algorithm is the defining identities of a variety. The output of the algorithm, if any, is a ‘Knuth–Bendix basis’ of identities of the variety in the class of all universal algebras of a given signature (not a GS basis of defining relations, say, of a Lie algebra).

Shirshov’s algorithm gives linear bases and algorithmic decidability of the word problem for one-relation Lie algebras [207], (recursive) linear bases for Lie algebras with (finite) homogeneous defining relations [207], and linear bases for free products of Lie algebras with known linear bases [208]. He also proved the Freiheitssatz (freeness theorem) for Lie algebras [207] (for every one-relation Lie algebra $L i e (X | f)$ , the subalgebra $⟨ X \ {x_{i_{0}}} ⟩$ , where $x_{i_{0}}$ appears in $f$ , is a free Lie algebra). The Shirshov problem [207] of the decidability of the word problem for Lie algebras was solved negatively in [21]. More generally, it was proved [21] that some recursively presented Lie algebras with undecidable word problem can be embedded into finitely presented Lie algebras (with undecidable word problem). It is a weak analogue of the Higman embedding theorem for groups [115]. The problem [21] whether an analogue of the Higman embedding theorem is valid for Lie algebras is still open. For associative algebras a similar problem [21] was solved positively by Belyaev [10]. A simple example of a Lie algebra with undecidable word problem was given by Kukin [142].

Actually, a similar algorithm for associative algebras is implicit in Shirshov’s paper [207]. The reason is that he treats $L i e (X)$ as the subspace of Lie polynomials in the free associative algebra $k ⟨ X ⟩$ . Then to define a Lie composition ${⟨ f, g ⟩}_{w}$ of two Lie polynomials relative to an associative word $w = lcm (\bar{f}, \bar{g})$ , he defines firstly the associative composition (non-commutative ‘ $s$ -polynomial’) ${(f, g)}_{w} = f b - a g$ , with $a, b \in X^{*}$ . Then he inserts some brackets ${⟨ f, g ⟩}_{w} = {[f b]}_{\bar{f}} - {[a g]}_{\bar{g}}$ by applying his special bracketing lemma of [203]. We can obtain $S^{c}$ for every $S \subset k ⟨ X ⟩$ in the same way as for Lie polynomials and in the same way as for Lie algebras (‘CD-lemma for associative algebras’) to infer that $I r r (S^{c})$ is a linear basis of the associative algebra $k ⟨ X | S ⟩$ generated by $X$ with defining relations $S$ . All proofs are similar to those in [207] but much easier.

Moreover, the cases of semigroups and groups presented by generators and defining relations are just special cases of associative algebras via semigroup and group algebras. To summarize, Shirshov’s algorithm gives linear bases and normal forms of elements of every Lie algebra, associative algebra, semigroup or group presented by generators and defining relations! The algorithm works in many cases (see below).

The theory of Gröbner bases and Buchberger’s algorithm were initiated by Buchberger (Thesis [65] 1965, paper [66] 1970) for commutative associative algebras. Buchberger’s algorithm is a finite algorithm for finitely generated commutative algebras. It is one of the most useful and famous algorithms in modern computer science.

Shirshov’s paper [207] was in the spirit of the program of Kurosh (1908–1972) to study non-associative (relatively) free algebras and free products of algebras, initiated in Kurosh’s paper [143], 1947. In that paper he proved non-associative analogs of the Nielsen–Schreier and Kurosh theorems for groups. It took quite a few years to clarify the situation for Lie algebras in Shirshov’s papers [200], 1953 and [207], 1962 closely related to his theory of GS bases. It is important to note that Kurosh’s program quite unexpectedly led to Shirshov’s theory of GS bases for Lie and associative algebras [207].

A step in Kurosh’s program was made by his student Zhukov in his Ph.D. Thesis [226], 1950. He algorithmically solved the word problem for non-associative algebras. In a sense, it was the beginning of the theory of GS bases for non-associative algebras. The main difference with the future approach of Shirshov is that Zhukov did not use a linear ordering of non-associative monomials. Instead he chose an arbitrary monomial of maximal degree as the ‘leading’ monomial of a polynomial. Also, for non-associative algebras there is no ‘composition of intersection’ (‘ $s$ -polynomial’). In this sense it cannot be a model for Lie and associative algebras.^{Footnote 4}

Shirshov, also a student of Kurosh’s, defended his Candidate of Sciences Thesis [199] at Moscow State University in 1953. His thesis together with the paper that followed [203], 1958 may be viewed as a background for his later method of GS bases. In the thesis, he proved the free subalgebra theorem for free Lie algebras (now known as Shirshov–Witt theorem, see also Witt [218], 1956) using the elimination process rediscovered by Lazard [149], 1960. He used the elimination process later [203], 1958 as a general method to prove the properties of regular (LS) words, including an algorithm of (special) bracketing of an LS word (with a fixed LS subword). The latter algorithm is of some importance in his theory of GS bases for Lie algebras (particularly in the definition of the composition of two Lie polynomials). Shirshov also proved the free subalgebra theorem for (anti-) commutative non-associative algebras [202], 1954. He used that later in [206], 1962 for the theory of GS bases of (commutative, anti-commutative) non-associative algebras. Shirshov (Thesis [199], 1953) found the (‘Hall–Shirshov’) series of bases of a free Lie algebra (see also [205] 1962, the first issue of Malcev’s Algebra and Logic).^{Footnote 5}

The LS basis is a particular case of the Shirshov or Hall–Shirshov series of bases (cf. Reutenauer [190], where this series is called the ‘Hall series’). In the definition of his series, Shirshov used Hall’s inductive procedure (see Ph. Hall [114], 1933, Hall [113], 1950): a non-associative monomial $w = ((u) (v))$ is a basic monomial whenever

(1)
$(u), (v)$ are basic;
(2)
$(u) > (v)$ ;
(3)
if $(u) = ((u_{1}) (u_{2}))$ then $(u_{2}) \leq (v)$ .

However, instead of ordering by the degree function (Hall words), he used an arbitrary linear ordering of non-associative monomials satisfying

\begin{matrix} ((u) (v)) > (v) . \end{matrix}

For example, in his Thesis [199], 1953 he used the ordering by the content of monomials (the content of, say, the monomial $(u) = ((x_{2} x_{1}) ((x_{2} x_{1}) x_{1}))$ is the vector $(x_{2}, x_{2}, x_{1}, x_{1}, x_{1})$ ). Actually, the content $\hat{u}$ of $(u)$ may be viewed as a commutative associative word that equals $u$ in the free commutative semigroup. Two contents are compared lexicographically (a proper prefix of a content is greater than the content).

If we use the lexicographic ordering, $(u) ≻ (v)$ if $u ≻ v$ lexicographically (with the condition $u ≻ u v, v \neq 1$ ), then we obtain the LS basis.^{Footnote 6} For example, for the alphabet $x_{1}$ , $x_{2}$ with $x_{2} ≻ x_{1}$ we obtain basic Lyndon–Shirshov monomials by induction:

\begin{matrix} x_{2}, x_{1}, [x_{2} x_{1}], [x_{2} [x_{2} x_{1}]] = [x_{2} x_{2} x_{1}], [[x_{2} x_{1}] x_{1}] = [x_{2} x_{1} x_{1}], \\ [x_{2} [x_{2} x_{2} x_{1}]] = [x_{2} x_{2} x_{2} x_{1}], [x_{2} [x_{2} x_{1} x_{1}]] = [x_{2} x_{2} x_{1} x_{1}], \\ [[x_{2} x_{1} x_{1}] x_{1}] = [x_{2} x_{1} x_{1} x_{1}], [[x_{2} x_{1}] [x_{2} x_{1} x_{1}]] = [x_{2} x_{1} x_{2} x_{1} x_{1}], \end{matrix}

and so on. They are exactly all Shirshov regular (LS) Lie monomials and their associative supports are exactly all Shirshov regular words with a one-to-one correspondence between two sets given by the Shirshov elimination (bracketing) algorithm for (associative) words.

Let us recall that an elementary step of Shirshov’s elimination algorithm is to join the minimal letter of a word to previous ones by bracketing and to continue this process with the lexicographic ordering of the new alphabet. For example, suppose that $x_{2} ≻ x_{1}$ . Then we have the succession of bracketings

\begin{matrix} x_{2} x_{1} x_{2} x_{1} x_{1} x_{2} x_{1} x_{1} x_{1} x_{1} x_{2} x_{1} x_{1}, \\ [x_{2} x_{1}] [x_{2} x_{1} x_{1}] [x_{2} x_{1} x_{1} x_{1}] [x_{2} x_{1} x_{1}], \\ [x_{2} x_{1}] [[x_{2} x_{1} x_{1}] [x_{2} x_{1} x_{1} x_{1}]] [x_{2} x_{1} x_{1}], \\ [[x_{2} x_{1}] [[x_{2} x_{1} x_{1}] [x_{2} x_{1} x_{1} x_{1}]]] [x_{2} x_{1} x_{1}], \\ [[[x_{2} x_{1}] [[x_{2} x_{1} x_{1}] [x_{2} x_{1} x_{1} x_{1}]]] [x_{2} x_{1} x_{1}]]; \\ x_{2} x_{1} x_{1} x_{1} x_{2} x_{1} x_{1} x_{2} x_{1} x_{2} x_{2} x_{1}, \\ [x_{2} x_{1} x_{1} x_{1}] [x_{2} x_{1} x_{1}] [x_{2} x_{1}] x_{2} [x_{2} x_{1}], \\ [x_{2} x_{1} x_{1} x_{1}] [x_{2} x_{1} x_{1}] [x_{2} x_{1}] [x_{2} [x_{2} x_{1}]]; \\ x_{2} x_{1} x_{1} x_{1} ≺ x_{2} x_{1} x_{1} ≺ x_{2} x_{1} ≺ x_{2} x_{2} x_{1} . \end{matrix}

By the way, the second series of partial bracketings illustrates Shirshov’s factorization theorem [203] of 1958 that every word is a non-decreasing product of LS words (it is often mistakenly called Lyndon’s theorem, see [12]).

The Shirshov special bracketing [203] goes as follows. Let us give as an example the special bracketing of the LS word $w = x_{2} x_{2} x_{1} x_{1} x_{2} x_{1} x_{1} x_{1}$ with the LS subword $u = x_{2} x_{2} x_{1}$ . The Shirshov standard bracketing is

\begin{matrix} [w] = [x_{2} [[[x_{2} x_{1}] x_{1}] [x_{2} x_{1} x_{1} x_{1}]]] . \end{matrix}

The Shirshov special bracketing is

\begin{matrix} {[w]}_{u} = [[[u] x_{1}] [x_{2} x_{1} x_{1} x_{1}]] . \end{matrix}

In general, if $w = a u b$ then the Shirshov standard bracketing gives $[w] = [a [u c] d]$ , where $b = c d$ . Now, $c = c_{1} \dots c_{t}$ , each $c_{i}$ is an LS-word, and $c_{1} ⪯ \dots ⪯ c_{t}$ in the lex ordering (Shirshov’s factorization theorem). Then we must change the bracketing of $[u c]$ :

\begin{matrix} {[w]}_{u} = [a [\dots [[u] [c_{1}]] \dots [c_{t}]] d] \end{matrix}

The main property of ${[w]}_{u}$ is that ${[w]}_{u}$ is a monic associative polynomial with the maximal monomial $w$ ; hence, $\bar{{[w]}_{u}} = w$ .

Actually, Shirshov [207], 1962 needed a ‘double’ relative bracketing of a regular word with two disjoint LS subwords. Then he implicitly used the following property: every LS subword of $c = c_{1} \dots c_{t}$ as above is a subword of some $c_{i}$ for $1 \leq i \leq t$ .

Shirshov defined regular (LS) monomials [203], 1958, as follows: $(w) = ((u) (v))$ is a regular monomial iff:

(1)
$w$ is a regular word;
(2)
$(u)$ and $(v)$ are regular monomials (then automatically $u ≻ v$ in the lex ordering);
(3)
if $(u) = ((u_{1}) (u_{2}))$ then $u_{2} ⪯ v$ .

Once again, if we formally omit all Lie brackets in Shirshov’s paper [207] then essentially the same algorithm and essentially the same CD-lemma (with the same but much simpler proof) yield a linear basis for associative algebra presented by generators and defining relations. The differences are the following:

no need to use LS monomials and LS words, since the set $X^{*}$ is a linear basis of the free associative algebra $k ⟨ X ⟩$ ;
the definition of associative composition for monic polynomials $f$ and $g$ ,
$\begin{matrix} {(f, g)}_{w} = f b - a g, w = \bar{f} b = a \bar{g}, d e g (w) < d e g (\bar{f}) + d e g (\bar{g}), \end{matrix}$
or
$\begin{matrix} {(f, g)}_{w} = f - a g b, w = \bar{f} = a \bar{g} b, w, a, b \in X^{*}, \end{matrix}$
are much simpler than the definition of Lie composition for monic Lie polynomials $f$ and $g$ ,
$\begin{matrix} {⟨ f, g ⟩}_{w} = {[f b]}_{\bar{f}} - {[a g]}_{\bar{g}}, w = \bar{f} b = a \bar{g}, d e g (w) < d e g (\bar{f}) + d e g (\bar{g}), \end{matrix}$
or
$\begin{matrix} {⟨ f, g ⟩}_{w} = f - {[a g b]}_{\bar{g}}, w = \bar{f} = a \bar{g} b, w, a, b, \bar{f}, \bar{g} \in X^{*}, \end{matrix}$
where ${[f b]}_{\bar{f}}$ , ${[a g]}_{\bar{g}}$ , and ${[a g b]}_{\bar{g}}$ are the Shirshov special bracketings of the LS words $w$ with fixed LS subwords $\bar{f}$ and $\bar{g}$ respectively.
The definition of elimination of the leading word $\bar{s}$ of an associative monic polynomial $s$ is straightforward: $a \bar{s} b \to a (r_{s}) b$ whenever $s = \bar{s} - r_{s}$ and $a, b \in X^{*}$ . However, for Lie polynomials, it is much more involved and uses the Shirshov special bracketing: $f \to f - {[a g b]}_{\bar{g}}$ whenever $\bar{f} = a \bar{g} b$ .

We can formulate the main idea of Shirshov’s proof as follows. Consider a complete set $S$ of monic Lie polynomials (all compositions are trivial). If $w = a_{1} \bar{s_{1}} b_{1} = a_{2} \bar{s_{2}} b_{2}$ , where $w$ , $a_{i}$ , $b_{i} \in X^{*}$ and $w$ is an LS word, while $s_{1}, s_{2} \in S$ , then the Lie monomials ${[a_{1} s_{1} b_{1}]}_{\bar{s_{1}}}$ and ${[a_{2} s_{2} b_{2}]}_{\bar{s_{2}}}$ are equal modulo the smaller Lie monomials in $I d (S)$ :

\begin{matrix} {[a_{1} s_{1} b_{1}]}_{\bar{s_{1}}} = {[a_{2} s_{2} b_{2}]}_{\bar{s_{2}}} + \sum_{i > 2} α_{i} {[a_{i} s_{i} b_{i}]}_{\bar{s_{i}}}, \end{matrix}

where $α_{i} \in k, s_{i} \in S$ and $\bar{{[a_{i} s_{i} b_{i}]}_{\bar{s_{i}}}} = a_{i} \bar{s_{i}} b_{i} < w$ . Actually, Shirshov proved a more general result: if $\bar{(a_{1} s_{1} b_{1})} = a_{1} \bar{s_{1}} b_{1}$ and $\bar{(a_{2} s_{2} b_{2})} = a_{2} \bar{s_{2}} b_{2}$ with $w = a_{1} \bar{s_{1}} b_{1} = a_{2} \bar{s_{2}} b_{2}$ then

\begin{matrix} (a_{1} s_{1} b_{1}) = (a_{2} s_{2} b_{2}) + \sum_{i > 2} α_{i} (a_{i} s_{i} b_{i}), \end{matrix}

where $α_{i} \in k, s_{i} \in S$ and $\bar{(a_{i} s_{i} b_{i})} = a_{i} \bar{s_{i}} b_{i} < w$ . Below we call a Lie polynomial $(a s b)$ a Lie normal $S$ -word provided that $\bar{(a s b)} = a \bar{s} b$ .

This is precisely where he used the notion of composition and other notions and properties mentioned above.

It is much easier to prove an analogue of this property for associative algebras (as well as commutative associative algebras): given a complete monic set $S$ in $k ⟨ X ⟩$ ( $k [X]$ ), for $w = a_{1} \bar{s_{1}} b_{1} = a_{2} \bar{s_{2}} b_{2}$ with $a_{i}, b_{i} \in X^{*}$ and $s_{1}, s_{2} \in S$ we have

\begin{matrix} a_{1} s_{1} b_{1} = a_{2} s_{2} b_{2} + \sum_{i > 2} α_{i} a_{i} s_{i} b_{i}, \end{matrix}

where $α_{i} \in k, s_{i} \in S$ and $a_{i} \bar{s_{i}} b_{i} < w$ .

Summarizing, we can say with confidence that the work (Shirshov [207]) implicitly contains the CD-lemma for associative algebras as a simple exercise that requires no new ideas. The first author, Bokut, can confirm that Shirshov clearly understood this and told him that “the case of associative algebras is the same”. The lemma was formulated explicitly in Bokut [22], 1976 (with a reference to Shirshov’s paper [207]), Bergman [11], 1978, and Mora [171], 1986.

Let us emphasize once again that the CD-Lemma for associative algebras applies to every semigroup $P = sgp ⟨ X | S ⟩$ , and in particular to every group, by way of the semigroup algebra $k P$ over a field $k$ . The latter algebra has the same generators and defining relations as $P$ , or $k P = k ⟨ X | S ⟩$ . Every composition of the binomials $u_{1} - v_{1}$ and $u_{2} - v_{2}$ is a binomial $u - v$ . As a result, applying Shirshov’s algorithm to a set of semigroup relations $S$ gives rise to a complete set of semigroup relations $S^{c}$ . The $S^{c}$ -irreducible words in $X$ constitute the set of normal forms of the elements of $P$ .

Before we go any further, let us give some well-known examples of algebra, group, and semigroup presentations by generators and defining relations together with linear bases, normal forms, and GS bases for them (if known). Consider a field $k$ and a commutative ring or commutative $k$ -algebra $K$ .

The Grassman algebra over $K$ is
$\begin{matrix} K ⟨ X | x_{i}^{2} = 0, x_{i} x_{j} + x_{j} x_{i} = 0, i > j ⟩ . \end{matrix}$
The set of defining relations is a GS basis with respect to the deg-lex ordering. A $K$ -basis is
$\begin{matrix} {x_{i_{1}} \dots x_{i_{n}} | x_{i_{j}} \in X, j = 1, \dots, n, i_{1} < \dots < i_{n}, n \geq 0} . \end{matrix}$
The Clifford algebra over $K$ is
$\begin{matrix} K ⟨ X | x_{i} x_{j} + x_{j} x_{i} = a_{i j}, 1 \leq i, j \leq n ⟩, \end{matrix}$
where $(a_{i j})$ is an $n \times n$ symmetric matrix over $K$ . The set of defining relations is a GS basis with respect to the deg-lex ordering. A $K$ -basis is
$\begin{matrix} {x_{i_{1}} \dots x_{i_{n}} | x_{i_{j}} \in X, j = 1, \dots, n, n \geq 0, i_{1} < \dots < i_{n}} . \end{matrix}$
The universal enveloping algebra of a Lie algebra $L$ is
$\begin{matrix} U_{K} (L) = K 〈X | x_{i} x_{j} - x_{j} x_{i} = \sum α_{i j}^{k} x_{k}, i > j〉 . \end{matrix}$
If $L$ is a free $K$ -module with a well-ordered $K$ -basis
$\begin{matrix} X = {x_{i} | i \in I}, [x_{i} x_{j}] = \sum α_{i j}^{k} x_{k}, i > j, i, j \in I, \end{matrix}$
then the set of defining relations is a GS basis of $U_{K} (L)$ . The PBW theorem follows: $U_{K} (L)$ is a free $K$ -module with a $K$ -basis,
$\begin{matrix} {x_{i_{1}} \dots x_{i_{n}} | i_{1} \leq \dots \leq i_{n}, i_{t} \in I, t = 1, \dots, n, n \geq 0} . \end{matrix}$
Kandri-Rody and Weispfenning [122] invented an important class of (noncommutative polynomial) ‘algebras of solvable type’, which includes universal enveloping algebras. An algebra of solvable type is
$\begin{matrix} R = k ⟨ X | s_{i j} = x_{i} x_{j} - x_{j} x_{i} - p_{i j}, i > j, p_{i j} < x_{i} x_{j} ⟩, \end{matrix}$
and the compositions ${(s_{i j}, s_{j k})}_{w} = 0$ modulo $(S, w)$ , where $w = x_{i} x_{j} x_{k}$ with $i > j > k$ . Here $p_{i j}$ is a noncommutative polynomial with all terms less than $x_{i} x_{j}$ . They created a theory of GS bases for every algebra of this class; thus, they found a linear basis of every quotient of $R$ .
A general presentation $U_{k} (L) = k ⟨ X | S^{(-)} ⟩$ of a universal enveloping algebra over a field $k$ , where $L = L i e (X | S)$ with $S \subset L i e (X) \subset k ⟨ X ⟩$ and $S^{(-)}$ is $S$ as a set of associative polynomials. PBW theorem in a Shirshov’s form. The following conditions are equivalent:
1. (i)
  the set $S$ is a Lie GS basis;
2. (ii)
  the set $S^{(-)}$ is a GS basis for $k ⟨ X ⟩$ ;
3. (iii)
  a linear basis for $U_{k} (L)$ consists of words $u_{1} u_{2} \dots u_{n}$ , where $u_{i}$ are $S$ -irreducible LS words with $u_{1} ⪯ u_{2} ⪯ \dots ⪯ u_{n}$ (in the lex-ordering), see [56, 57];
4. (iv)
  a linear basis for $L$ consists of the $S$ -irreducible LS Lie monomials $[u]$ in $X$ ;
5. (v)
  a linear basis for $U_{k} (L)$ consists of the polynomials $u = [u_{1}] \dots [u_{n}]$ , where $u_{1} ⪯ \dots ⪯ u_{n}$ in the lex ordering, $n \geq 0$ , and each $[u_{i}]$ is an $S$ -irreducible non-associative LS word in $X$ .
Free Lie algebras $L i e_{K} (X)$ over $K$ . Hall, Shirshov, and Lyndon provided different linear $K$ -bases for a free Lie algebra (the Hall–Shirshov series of bases, in particular, the Hall basis, the Lyndon–Shirshov basis, the basis compatible with the free solvable (polynilpotent) Lie algebra) [194], see also [15]. Two anticommutative GS bases of $L i e_{K} (X)$ were found in [34, 37], which yields the Hall and Lyndon–Shirshov linear bases respectively.
The Lie $k$ -algebras presented by Chevalley generators and defining relations of types $A_{n}$ , $B_{n}$ , $C_{n}$ , $D_{n}$ , $G_{2}$ , $F_{4}$ , $E_{6}$ , $E_{7}$ , and $E_{8}$ . Serre’s theorem provides linear bases and multiplication tables for these algebras (they are finite dimensional simple Lie algebras over $k$ ). Lie GS bases for these algebras are found in [49–51].
The Coxeter group
$\begin{matrix} W = sgp ⟨ S | s_{i}^{2} = 1, m_{i j} (s_{i}, s_{j}) = m_{j i} (s_{j}, s_{i}) ⟩ \end{matrix}$
for a given Coxeter matrix $M = (m_{i j})$ . Tits [210] (see also [14]) algorithmically solved the word problem for Coxeter groups. Finite Coxeter groups are presented by ‘finite’ Coxeter matrices $A_{n}$ , $B_{n}$ , $D_{n}$ , $G_{2}$ , $F_{4}$ , $E_{6}$ , $E_{7}$ , $E_{8}$ , $H_{3}$ , and $H_{4}$ . Coxeter’s theorem provides normal forms and Cayley tables (these are finite groups generated by reflections). GS bases for finite Coxeter groups are found in [58].
The Iwahory–Hecke (Hecke) algebras $H$ over $K$ differ from the group algebras $K (W)$ of Coxeter groups in that instead of $s_{i}^{2} = 1$ there are relations $(s_{i} - q_{i}^{1 / 2}) (s_{i} + q_{i}^{1 / 2}) = 0$ or $(s_{i} - q_{i}) (s_{i} + 1) = 0$ , where $q_{i}$ are units of $K$ . Two $K$ -bases for $H$ are known; one is natural, and the other is the Kazhdan–Lusztig canonical basis [155]. The GS bases for the Iwahory–Hecke algebras are known for the finite Coxeter matrices. A deep connection of the Iwahory–Hecke algebras of type $A_{n}$ and braid groups (as well as link invariants) was found by Jones [116].
Affine Kac–Moody algebras [117]. The Kac–Gabber theorem provides linear bases for these algebras under the symmetrizability condition on the Cartan matrix. Using this result, Poroshenko found the GS bases of these algebras [178–180].
Borcherds–Kac–Moody algebras [61–63, 117]. GS bases are not known.
Quantum enveloping algebras (Drinfeld, Jimbo). Lusztig’s theorem [154] provides linear canonical bases of these algebras. Different approaches were developed by Ringel [191, 192], Green [110], and Kharchenko [131–135]. GS bases of quantum enveloping algebras are unknown except for the case $A_{n}$ , see [55, 86, 195, 220].
Koszul algebras. The quadratic algebras with a basis of standard monomials, called PBW-algebras, are always Koszul (Priddy [184]), but not conversely. In different terminology, PBW-algebras are algebras with quadratic GS bases. See [177].
Elliptic algebras (Feigin, Odesskii) These are associative algebras presented by $n$ generators and $n (n - 1) / 2$ homogeneous quadratic relations for which the dimensions of the graded components are the same as for the polynomial algebra in $n$ variables. The first example of this type was Sklyanin algebra (1982) generated by $x_{1}$ , $x_{2}$ , and $x_{3}$ with the defining relations $[x_{3}, x_{2}] = x_{1}^{2}$ , $[x_{2}, x_{1}] = x_{3}^{2}$ , and $[x_{1}, x_{3}] = x_{2}^{2}$ . See [175]. GS bases are not known.
Leavitt path algebras. GS bases for these algebras are found in Alahmedi et al. [2] and applied by the same authors to determine the structure of the Leavitt path algebras of polynomial growth in [3].
Artin braid group $B r_{n}$ . The Markov–Artin theorem provides the normal form and semi-direct structure of the group in the Burau generators. Other normal forms of $B r_{n}$ were obtained by Garside, Birman–Ko–Lee, and Adjan–Thurston. GS bases for $B r_{n}$ in the Artin–Burau, Artin–Garside, Birman–Ko–Lee, and Adjan–Thurston generators were found in [23–25, 89] respectively.
Artin–Tits groups. They differ from Coxeter groups in the absence of the relations $s_{i}^{2} = 1$ . Normal forms are known in the spherical case, see Brieskorn, Saito [64]. GS bases are not known except for braid groups (the Artin–Tits groups of type $A_{n}$ ).
The groups of Novikov–Boon type (Novikov [173], Boon [60], Collins [97], Kalorkoti [118–121]) with unsolvable word or conjugacy problem. They are groups with standard bases (standard normal forms or standard GS bases), see [16–18, 77].
Adjan’s [1] and Rabin’s [187] constructions of groups with unsolvable isomorphism problem and Markov properties. A GS basis is known for Adjan’s construction [26].
Markov’s [161] and Post’s [183] semigroups with unsolvable word problem. The GS basis of Post’s semigroup is found in [223].
Markov’s construction of semigroups with unsolvable isomorphism problem and Markov properties. The GS basis for the construction is not known.
Plactic monoids. A theorem due to Richardson, Schensted, and Knuth provides a normal form of the elements of these monoids (see Lothaire [151]). New approaches to plactic monoids via GS bases in the alphabets of row and column generators are found in [29].
The groups of quotients of the multiplicative semigroups of power series rings with topological quadratic relations of the type $k ⟨ ⟨ x, y, z, t | x y = z t ⟩ ⟩$ embeddable (without the zero element) into groups but in general not embeddable into division algebras (settling a problem of Malcev). The relative standard normal forms of these groups found in [19, 20] are the reduced words for what was later called a relative GS basis [59].

To date, the method of GS bases has been adapted, in particular, to the following classes of linear universal algebras, as well as for operads, categories, and semirings. Unless stated otherwise, we consider all linear algebras over a field $k$ . Following the terminology of Higgins and Kurosh, we mean by a ((differential) associative) $Ω$ -algebra a linear space ((differential) associative algebra) with a set of multi-linear operations $Ω$ :

Associative algebras, Shirshov [207], Bokut [22], Bergman [11];
Associative algebras over a commutative algebra, Mikhalev and Zolotykh [170];
Associative $Γ$ -algebras, where $Γ$ is a group, Bokut and Shum [59];
Lie algebras, Shirshov [207];
Lie algebras over a commutative algebra, Bokut et al. [31];
Lie p-algebras over $k$ with $char k = p$ , Mikhalev [166];
Lie superalgebras, Mikhalev [165, 167];
Metabelian Lie algebras, Chen and Chen [75];
Quiver (path) algebras, Farkas et al. [101];
Tensor products of associative algebras, Bokut et al. [30];
Associative differential algebras, Chen et al. [76];
Associative $(n -)$ conformal algebras over $k$ with $char k = 0$ , Bokut et al. [45], Bokut et al. [43];
Dialgebras, Bokut et al. [38];
Pre-Lie (Vinberg–Koszul–Gerstenhaber, right (left) symmetric) algebras, Bokut et al. [35],
Associative Rota–Baxter algebras over $k$ with $char k = 0$ , Bokut et al. [32];
$L$ -algebras, Bokut et al. [33];
Associative $Ω$ -algebras, Bokut et al. [41];
Associative differential $Ω$ -algebras, Qiu and Chen [185];
$Ω$ -algebras, Bokut et al. [33];
Differential Rota–Baxter commutative associative algebras, Guo et al. [111];
Semirings, Bokut et al. [40];
Modules over an associative algebra, Golod [108], Green [109], Kang and Lee [123, 124], Chibrikov [90];
Small categories, Bokut et al. [36];
Non-associative algebras, Shirshov [206];
Non-associative algebras over a commutative algebra, Chen et al. [81];
Commutative non-associative algebras, Shirshov [206];
Anti-commutative non-associative algebras, Shirshov [206];
Symmetric operads, Dotsenko and Khoroshkin [98].

At the heart of the GS method for a class of linear algebras lies a CD-lemma for a free object of the class. For the cases above, the free objects are the free associative algebra $k ⟨ X ⟩$ , the doubly free associative $k [Y]$ -algebra $k [Y] ⟨ X ⟩$ , the free Lie algebra $L i e (X)$ , and the doubly free Lie $k [Y]$ -algebra $L i e_{k [Y]} (X)$ . For the tensor product of two associative algebras we need to use the tensor product of two free algebras, $k ⟨ X ⟩ \otimes k ⟨ Y ⟩$ . We can view every semiring as a double semigroup with two associative products $\cdot$ and $\circ$ . So, the CD-lemma for semirings is the CD-lemma for the semiring algebra of the free semiring $R i g (X)$ . The CD-lemma for modules is the CD-lemma for the doubly free module ${Mod}_{k ⟨ Y ⟩} (X)$ , a free module over a free associative algebra. The CD-lemma for small categories is the CD-lemma for the ‘free partial $k$ -algebra’ $k C ⟨ X ⟩$ generated by an oriented graph $X$ (a sequence $z_{1} z_{2} \dots z_{n}$ , where $z_{i} \in X$ , is a partial word in $X$ iff it is a path; a partial polynomial is a linear combination of partial words with the same source and target).

All CD-lemmas have essentially the same statement. Consider a class $V$ of linear universal algebras, a free algebra $V (X)$ in $V$ , and a well-ordered $k$ -basis of terms $N (X)$ of $V (X)$ . A subset $S \subset V (X)$ is called a GS basis if every composition of the elements of $S$ is trivial (vanishes upon the elimination of the leading terms $\bar{s}$ for $s \in S$ ). Then the following conditions are equivalent:

(i)
$S$ is a GS basis.
(ii)
If $f \in I d (S)$ then the leading term $\bar{f}$ contains the subterm $\bar{s}$ for some $s \in S$ .
(iii)
The set of $S$ -irreducible terms is a linear basis for the $V$ -algebra $V ⟨ X | S ⟩$ generated by $X$ with defining relations $S$ .

In some cases ( $(n -)$ conformal algebras, dialgebras), conditions (i) and (ii) are not equivalent. To be more precise, in those cases we have $(i) \Rightarrow (i i) \Leftrightarrow (i i i)$ .

Typical compositions are compositions of intersection and inclusion. Shirshov [206, 207] avoided inclusion composition. He suggested instead that a GS basis must be minimal (the leading words do not contain each other as subwords). In some cases, new compositions must be defined, for example, the composition of left (right) multiplication. Also, sometimes we need to combine all these compositions. We present here a new approach to the definition of a composition, based on the concept of the least common multiple $lcm (u, v)$ of two terms $u$ and $v$ .

In some cases (Lie algebras, ( $n$ -) conformal algebras) the ‘leading’ term $\bar{f}$ of a polynomial $f \in V (X)$ lies outside $V (X)$ . For Lie algebras, we have $\bar{f} \in k ⟨ X ⟩$ , for ( $n$ -) conformal algebras $\bar{f}$ belongs to an ‘ $Ω$ -semigroup’.

Almost all CD-lemmas require the new notion of a ‘normal $S$ -term’. A term $(a s b)$ in ${X, Ω}$ , where $s \in S$ , with only one occurrence of $s$ is called a normal $S$ -term whenever $\bar{(a s b)} = (a (\bar{s}) b)$ . Given $S \subset k ⟨ X ⟩$ , every $S$ -word (that is, an $S$ -term) is a normal $S$ -word. Given $S \subset L i e (X)$ , every Lie $S$ -monomial (Lie $S$ -term) is a linear combination of normal Lie $S$ -terms (Shirshov [207]).

One of the two key lemmas asserts that if $S$ is complete under compositions of multiplication then every element of the ideal generated by $S$ is a linear combination of normal $S$ -terms. Another key lemma says that if $S$ is a GS basis and the leading words of two normal $S$ -terms are the same then these terms are the same modulo lower normal $S$ -terms. As we mentioned above, Shirshov proved these results [207] for $L i e (X)$ (there are no compositions of multiplication for Lie and associative algebras).

This survey continues our surveys with Kolesnikov, Fong, Ke, and Shum [27, 28, 42, 46, 52, 53], Ufnarovski’s survey [213], and the book of the first named author and Kukin [54].

The paper is organized as follows. Section 2 is for associative algebras, Sect. 3 is for semigroups and groups, Sect. 4 is for Lie algebras, and the short Sect. 5 is for $Ω$ -algebras and operads.^{Footnote 7}

To conclude this introduction, we give some information about the work of Shirshov; for more on this, see the book [209]. Shirshov (1921–1981) was a famous Russian mathematician. His name is associated with notions and results on the Gröbner–Shirshov bases, the Composition-Diamond lemma, the Shirshov–Witt theorem, the Lazard–Shirshov elimination, the Shirshov height theorem, Lyndon–Shirshov words, Lyndon–Shirshov basis (in a free Lie algebra), the Hall–Shirshov series of bases, the Cohn–Shirshov theorem for Jordan algebras, Shirshov’s theorem on the Kurosh problem, and the Shirshov factorization theorem. Shirshov’s ideas were used by his students Efim Zelmanov to solve the restricted Burnside problem and Aleksander Kemer to solve the Specht problem.

1.1 Digression on the history of Lyndon–Shirshov bases and Lyndon–Shirshov words

Lyndon [156], 1954, defined standard words, which are the same as Shirshov’s regular words [203], 1958. Unfortunately, the papers (Lyndon [156]) and (Chen et al. [72], 1958) were practically unknown before 1983. As a result, at that time almost all authors (except four who used the names Shirshov and Chen–Fox–Lyndon, see below) refer to the basis and words as Shirshov regular basis and words, cf. for instance [8, 9, 96, 188, 212, 224]. To the best of our knowledge, none of the authors mentioned Lyndon’s paper [156] as a source of ‘Lyndon words’ before 1983(!).

In the following papers the authors mentioned both (Chen et al. [72]) and (Shirshov [203]) as a source of ‘Lyndon–Shirshov basis’ and ‘Lyndon–Shirshov words’:

Schützenberger and Sherman [196], 1963;
Schützenberger [197], 1965;
Viennot [217], 1978;
Michel [163], 1975; [164], 1976.

The authors of [196] thank Cohn for pointing out Shirshov’s paper [203]. They also formulate Shirshov’s factorization theorem [203]. They mention [72, 203] as a source of ‘LS words’. Schützenberger also mentions [197] Shirshov’s factorization theorem, but in this case he attributes it to both Chen et al. [72] and Shirshov [203]. Actually, he cites [72] by mistake, as that result is absent from the paper, see Berstel and Perrin [12].^{Footnote 8}

Starting with the book of Lothaire, Combinatorics on words ([151], 1983), some authors called the words and basis ‘Lyndon words’ and ‘Lyndon basis’; for instance, see Reutenauer, Free Lie algebras ([190], 1993).

2 Gröbner–Shirshov bases for associative algebras

In this section we give a proof of Shirshov’s CD-lemma for associative algebras and Buchberger’s theorem for commutative algebras. Also, we give the Eisenbud–Peeva–Sturmfels lifting theorem, the CD-lemmas for modules (following Kang and Lee [124] and Chibrikov [90]), the PBW theorem and the PBW theorem in Shirshov’s form, the CD-lemma for categories, the CD-lemma for associative algebras over commutative algebras and the Rosso–Yamane theorem for $U_{q} (A_{n})$ .

2.1 Composition-Diamond lemma for associative algebras

Let $k$ be a field, $k ⟨ X ⟩$ be the free associative algebra over $k$ generated by $X$ and $X^{*}$ be the free monoid generated by $X$ , where the empty word is the identity, denoted by 1. Suppose that $X^{*}$ is a well-ordered set. Take $f \in k ⟨ X ⟩$ with the leading word $\bar{f}$ and $f = α \bar{f} - r_{f}$ , where $0 \neq α \in k$ and $\bar{r_{f}} < \bar{f}$ . We call $f$ monic if $α = 1$ .

A well-ordering $>$ on $X^{*}$ is called a monomial ordering whenever it is compatible with the multiplication of words, that is, for all $u, v \in X^{*}$ we have

\begin{matrix} u > v \Rightarrow w_{1} u w_{2} > w_{1} v w_{2}, for all w_{1}, w_{2} \in X^{*} . \end{matrix}

A standard example of monomial ordering on $X^{*}$ is the deg-lex ordering, in which two words are compared first by the degree and then lexicographically, where $X$ is a well-ordered set.

Fix a monomial ordering $<$ on $X^{*}$ and take two monic polynomials $f$ and $g$ in $k ⟨ X ⟩$ . There are two kinds of compositions:

(i)
If $w$ is a word such that $w = \bar{f} b = a \bar{g}$ for some $a, b \in X^{*}$ with $| \bar{f} | + | \bar{g} | > | w |$ then the polynomial ${(f, g)}_{w} = f b - a g$ is called the intersection composition of $f$ and $g$ with respect to $w$ .
(ii)
If $w = \bar{f} = a \bar{g} b$ for some $a, b \in X^{*}$ then the polynomial ${(f, g)}_{w} = f - a g b$ is called the inclusion composition of $f$ and $g$ with respect to $w$ .

Then $\bar{{(f, g)}_{w}} < w$ and ${(f, g)}_{w}$ lies in the ideal $I d {f, g}$ of $k ⟨ X ⟩$ generated by $f$ and $g$ .

In the composition ${(f, g)}_{w}$ , we call $w$ an ambiguity (or the least common multiple $lcm (\bar{f}, \bar{g})$ , see below).

Consider $S \subset k ⟨ X ⟩$ such that very $s \in S$ is monic. Take $h \in k ⟨ X ⟩$ and $w \in X^{*}$ . Then $h$ is called trivial modulo $(S, w)$ , denoted by

\begin{matrix} h \equiv 0 mod (S, w), \end{matrix}

if $h = \sum α_{i} a_{i} s_{i} b_{i}$ , where $α_{i} \in k$ , $a_{i}, b_{i} \in X^{*}$ , and $s_{i} \in S$ with $a_{i} \bar{s_{i}} b_{i} < w$ .

The elements $a s b$ , $a, b \in X^{*}$ , and $s \in S$ are called $S$ -words.

A monic set $S \subset k ⟨ X ⟩$ is called a GS basis in $k ⟨ X ⟩$ with respect to the monomial ordering $<$ if every composition of polynomials in $S$ is trivial modulo $S$ and the corresponding $w$ .

A set $S$ is called a minimal GS basis in $k ⟨ X ⟩$ if $S$ is a GS basis in $k ⟨ X ⟩$ avoiding inclusion compositions; that is, given $f, g \in S$ with $f \neq g$ , we have $\bar{f} \neq a \bar{g} b$ for all $a, b \in X^{*}$ .

Put

\begin{matrix} I r r (S) = {u \in X^{*} | u \neq a \bar{s} b, s \in S, a, b \in X^{*}} . \end{matrix}

The elements of $I r r (S)$ are called $S$ -irreducible or $S$ -reduced.

A GS basis $S$ in $k ⟨ X ⟩$ is reduced provided that $supp (s) \subseteq I r r (S \ {s})$ for every $s \in S$ , where $supp (s) = {u_{1}, u_{2}, \dots, u_{n}}$ whenever $s = \sum_{i = 1}^{n} α_{i} u_{i}$ with $0 \neq α_{i} \in k$ and $u_{i} \in X^{*}$ . In other words, each $u_{i}$ is an $S \ {s}$ -irreducible word.

The following lemma is key for proving the CD-lemma for associative algebras.

Lemma 1

If $S$ is a GS basis in $k ⟨ X ⟩$ and $w = a_{1} \bar{s_{1}} b_{1} = a_{2} \bar{s_{2}} b_{2}$ , where $a_{1}, b_{1}, a_{2}, b_{2} \in X^{*}$ and $s_{1}, s_{2} \in S$ , then $a_{1} s_{1} b_{1} \equiv a_{2} s_{2} b_{2} mod (S, w)$ .

Proof

There are three cases to consider.

Case 1 Assume that the subwords ${\bar{s}}_{1}$ and ${\bar{s}}_{2}$ of $w$ are disjoint, say, $| a_{2} | \geq | a_{1} | + | {\bar{s}}_{1} |$ . Then, $a_{2} = a_{1} {\bar{s}}_{1} c$ and $b_{1} = c {\bar{s}}_{2} b_{2}$ for some $c \in X^{*}$ , and so $w_{1} = a_{1} {\bar{s}}_{1} c {\bar{s}}_{2} b_{2}$ . Now,

\begin{matrix} a_{1} s_{1} b_{1} - a_{2} s_{2} b_{2} & = a_{1} s_{1} c {\bar{s}}_{2} b_{2} - a_{1} {\bar{s}}_{1} c s_{2} b_{2} \\ = a_{1} s_{1} c ({\bar{s}}_{2} - s_{2}) b_{2} + a_{1} (s_{1} - {\bar{s}}_{1}) c s_{2} b_{2} . \end{matrix}

Since $\bar{\bar{s_{2}} - s_{2}} < {\bar{s}}_{2}$ and $\bar{s_{1} - \bar{s_{1}}} < {\bar{s}}_{1}$ , we conclude that

\begin{matrix} a_{1} s_{1} b_{1} - a_{2} s_{2} b_{2} = \sum_{i} α_{i} u_{i} s_{1} v_{i} + \sum_{j} β_{j} u_{j} s_{2} v_{j} \end{matrix}

with $α_{i}, β_{j} \in k$ and $S$ -words $u_{i} s_{1} v_{i}$ and $u_{j} s_{2} v_{j}$ satisfying $u_{i} {\bar{s}}_{1} v_{i}, u_{j} {\bar{s}}_{2} v_{j} < w .$

Case 2 Assume that the subword ${\bar{s}}_{1}$ of $w$ contains ${\bar{s}}_{2}$ as a subword. Then ${\bar{s}}_{1} = a {\bar{s}}_{2} b$ with $a_{2} = a_{1} a$ and $b_{2} = b b_{1}$ , that is, $w = a_{1} a {\bar{s}}_{2} b b_{1}$ for some $S$ -word $a s_{2} b$ . We have

\begin{matrix} a_{1} s_{1} b_{1} - a_{2} s_{2} b_{2} = a_{1} s_{1} b_{1} - a_{1} a s_{2} b b_{1} = a_{1} (s_{1} - a s_{2} b) b_{1} = a_{1} {(s_{1}, s_{2})}_{\bar{s_{1}}} b_{1} . \end{matrix}

The triviality of compositions implies that $a_{1} s_{1} b_{1} \equiv a_{2} s_{2} b_{2} mod (S, w) .$

Case 3 Assume that the subwords ${\bar{s}}_{1}$ and ${\bar{s}}_{2}$ of $w$ have a nonempty intersection. We may assume that $a_{2} = a_{1} a$ and $b_{1} = b b_{2}$ with $w = {\bar{s}}_{1} b = a {\bar{s}}_{2}$ and $| w | < | {\bar{s}}_{1} | + | {\bar{s}}_{2} |$ . Then, as in Case 2, we have $a_{1} s_{1} b_{1} \equiv a_{2} s_{2} b_{2} mod (S, w) .$ $□$

Lemma 2

Consider a set $S \subset k ⟨ X ⟩$ of monic polynomials. For every $f \in k ⟨ X ⟩$ we have

\begin{matrix} f = \sum_{u_{i} \leq \bar{f}} α_{i} u_{i} + \sum_{a_{j} \bar{s_{j}} b_{j} \leq \bar{f}} β_{j} a_{j} s_{j} b_{j} \end{matrix}

where $α_{i}, β_{j} \in k$ , $u_{i} \in I r r (S)$ , and $a_{j} s_{j} b_{j}$ are $S$ -words. So, $I r r (S)$ is a set of linear generators of the algebra $k ⟨ X | S ⟩$ .

Proof

Induct on $\bar{f}$ . $□$

Theorem 1

(The CD-lemma for associative algebras) Choose a monomial ordering $<$ on $X^{*}$ . Consider a monic set $S \subset k ⟨ X ⟩$ and the ideal $I d (S)$ of $k ⟨ X ⟩$ generated by $S$ . The following statements are equivalent:

(i)
$S$ is a Gröbner–Shirshov basis in $k ⟨ X ⟩$ .
(ii)
$f \in I d (S) \Rightarrow \bar{f} = a \bar{s} b$ for some $s \in S$ and $a, b \in X^{*}$ .
(iii)
$I r r (S) = {u \in X^{*} | u \neq a \bar{s} b, s \in S, a, b \in X^{*}}$ is a linear basis of the algebra $k ⟨ X | S ⟩$ .

Proof

(i) $\Rightarrow$ (ii). Assume that $S$ is a GS basis and take $0 \neq f \in I d (S)$ . Then, we have $f = \sum_{i = 1}^{n} α_{i} a_{i} s_{i} b_{i}$ where $α_{i} \in k$ , $a_{i}, b_{i} \in X^{*}$ , and $s_{i} \in S$ . Suppose that $w_{i} = a_{i} \bar{s_{i}} b_{i}$ satisfy

\begin{matrix} w_{1} = w_{2} = \dots = w_{l} > w_{l + 1} \geq \dots . \end{matrix}

Induct on $w_{1}$ and $l$ to show that $\bar{f} = a \bar{s} b$ for some $s \in S and a, b \in X^{*}$ . To be more precise, induct on $(w_{1}, l)$ with the lex ordering of the pairs.

If $l = 1$ then $\bar{f} = \bar{a_{1} s_{1} b_{1}} = a_{1} \bar{s_{1}} b_{1}$ and hence the claim holds. Assume that $l \geq 2$ . Then $w_{1} = a_{1} \bar{s_{1}} b_{1} = a_{2} \bar{s_{2}} b_{2}$ . Lemma 1 implies that $a_{1} s_{1} b_{1} \equiv a_{2} s_{2} b_{2} mod (S, w_{1}) .$ If $α_{1} + α_{2} \neq 0$ or $l > 2$ then the claim follows by induction on $l$ . For the case $α_{1} + α_{2} = 0$ and $l = 2$ , induct on $w_{1}$ . Thus, (ii) holds.

(ii) $\Rightarrow$ (iii). By Lemma 2, $I r r (S)$ generates $k ⟨ X | S ⟩$ as a linear space. Suppose that $\sum_{i} α_{i} u_{i} = 0$ in $k ⟨ X | S ⟩$ , where $0 \neq α_{i} \in k$ and $u_{i} \in I r r (S)$ . It means that $\sum_{i} α_{i} u_{i} \in I d (S)$ in $k ⟨ X ⟩$ . Then $\bar{\sum_{i} α_{i} u_{i}} = u_{j} \in I r r (S)$ for some $j$ , which contradicts (ii).

(iii) $\Rightarrow$ (i). Given $f, g \in S$ , Lemma 2 and (iii) yield ${(f, g)}_{w} \equiv 0 mod (S, w) .$ Therefore, $S$ is a GS basis. $□$

A new exposition of the proof of Theorem 1 (CD-lemma for associative algebras).

Let us start with the concepts of non-unique common multiple and least common multiple of two words $u, v \in X^{*}$ . A common multiple $cm (u, v)$ means that $cm (u, v) = a_{1} u b_{1} = a_{2} v b_{2}$ for some $a_{i}, b_{i} \in X^{*}$ . Then $lcm (u, v)$ means that some $cm (u, v)$ contains some $lcm (u, v)$ as a subword: $cm (u, v) = c \cdot lcm (u, v) \cdot d$ with $c, d \in X^{*}$ , where $u$ and $v$ are the same subwords in both sides. To be precise,

\begin{matrix} lcm (u, v) \in {u c v, c \in X^{*} (a trivial lcm (u, v)); \\ u = a v b, a, b \in X^{*} (an inclusion lcm (u, v)); \\ u b = a v, a, b \in X^{*}, | u b | < | u | + | v | (an intersection lcm (u, v))} . \end{matrix}

Define the general composition ${(f, g)}_{lcm (\bar{f}, \bar{g})}$ of monic polynomials $f, g \in k ⟨ X ⟩$ as

\begin{matrix} {(f, g)}_{lcm (\bar{f}, \bar{g})} = lcm (\bar{f}, \bar{g}) {|_{\bar{f} \mapsto f} - lcm (\bar{f}, \bar{g}) |}_{\bar{g} \mapsto g} . \end{matrix}

The only difference with the previous definition of composition is that we include the case of trivial $lcm (\bar{f}, \bar{g})$ . However, in this case the composition is trivial,

\begin{matrix} {(f, g)}_{\bar{f} c \bar{g}} \equiv 0 mod ({f, g}, \bar{f} c \bar{g}) . \end{matrix}

It is clear that if $a_{1} \bar{f} b_{1} = a_{2} \bar{g} b_{2}$ then, up to the ordering of $f$ and $g$ ,

\begin{matrix} a_{1} f b_{1} - a_{2} g b_{2} = c \cdot {(f, g)}_{lcm (\bar{f}, \bar{g})} \cdot d . \end{matrix}

This implies Lemma 1. The main claim (i) $\Rightarrow$ (ii) of Theorem 1 follows from Lemma 1.

Shirshov algorithm. If a monic subset $S \subset k ⟨ X ⟩$ is not a GS basis then we can add to $S$ all nontrivial compositions, making them monic. Iterating this process, we eventually obtain a GS basis $S^{c}$ that contains $S$ and generates the same ideal, $I d (S^{c}) = I d (S)$ . This $S^{c}$ is called the GS completion of $S$ . Using the reduction algorithm (elimination of the leading words of polynomials), we may obtain a minimal GS basis $S^{c}$ or a reduced GS basis.

The following theorem gives a linear basis for the ideal $I d (S)$ provided that $S \subset k ⟨ X ⟩$ is a GS basis.

Theorem 2

If $S \subset k ⟨ X ⟩$ is a Gröbner–Shirshov basis then, given $u \in X^{*} \ I r r (S)$ , by Lemma 2 there exists $\hat{u} \in k I r r (S)$ with $\bar{\hat{u}} < u$ (if $\hat{u} \neq 0$ ) such that $u - \hat{u} \in I d (S)$ and the set ${u - \hat{u} | u \in X^{*} \ I r r (S)}$ is a linear basis for the ideal $I d (S)$ of $k ⟨ X ⟩$ .

Proof

Take $0 \neq f \in I d (S)$ . Then by the CD-lemma for associative algebras, $\bar{f} = a_{1} \bar{s_{1}} b_{1} = u_{1}$ for some $s_{1} \in S$ and $a_{1}, b_{1} \in X^{*}$ , which implies that $\bar{f} = u_{1} \in X^{*} \ I r r (S)$ . Put $f_{1} = f - α_{1} (u_{1} - \hat{u_{1}})$ , where $α_{1}$ is the coefficient of the leading term of $f$ and $\bar{\hat{u_{1}}} < u_{1}$ or $\hat{u_{1}} = 0$ . Then $f_{1} \in I d (S)$ and $\bar{f_{1}} < \bar{f}$ . By induction on $\bar{f}$ , the set ${u - \hat{u} | u \in X^{*} \ I r r (S)}$ generates $I d (S)$ as a linear space. It is clear that ${u - \hat{u} | u \in X^{*} \ I r r (S)}$ is a linearly independent set. $□$

Theorem 3

Choose a monomial ordering $>$ on $X^{*}$ . For every ideal $I$ of $k ⟨ X ⟩$ there exists a unique reduced Gröbner–Shirshov basis $S$ for $I$ .

Proof

Clearly, a Gröbner–Shirshov basis $S \subset k ⟨ X ⟩$ for the ideal $I = I d (S)$ exists; for example, we may take $S = I$ . By Theorem 1, we may assume that the leading terms of the elements of $S$ are distinct. Given $g \in S$ , put

\begin{matrix} Δ_{g} = {f \in S | f \neq g and \bar{f} = a \bar{g} b for some a, b \in X^{*}} \end{matrix}

and $S_{1} = S \ \cup_{g \in S} Δ_{g}$ .

For every $f \in I d (S)$ we show that there exists an $s_{1} \in S_{1}$ such that $\bar{f} = a \bar{s_{1}} b for some a, b \in X^{*}$ .

In fact, Theorem 1 implies that $\bar{f} = a^{'} \bar{h} b^{'}$ for some $a^{'}, b^{'} \in X^{*}$ and $h \in S$ . Suppose that $h \in S \ S_{1}$ . Then we have $h \in \cup_{g \in S} Δ_{g}$ , say, $h \in Δ_{g}$ . Therefore, $h \neq g$ and $\bar{h} = a \bar{g} b$ for some $a, b \in X^{*}$ . We claim that $\bar{h} > \bar{g}$ . Otherwise, $\bar{h} < \bar{g}$ . It follows that $\bar{h} = a \bar{g} b > a \bar{h} b$ and so we have the infinite descending chain

\begin{matrix} \bar{h} > a \bar{h} b > a^{2} \bar{h} b^{2} > a^{3} \bar{h} b^{3} > \dots, \end{matrix}

which contradicts the assumption that $>$ is a well ordering.

Suppose that $g \notin S_{1}$ . Then, by the argument above, there exists $g_{1} \in S$ such that $g \in Δ_{g_{1}}$ and $\bar{g} > \bar{g_{1}}$ . Since $>$ is a well ordering, there must exist $s_{1} \in S_{1}$ such that $\bar{f} = a_{1} \bar{s_{1}} b_{1} for some a_{1}, b_{1} \in X^{*}$ .

Put $f_{1} = f - α_{1} a_{1} s_{1} b_{1}$ , where $α_{1}$ is the coefficient of the leading term of $f$ . Then $f_{1} \in I d (S)$ and $\bar{f} > \bar{f_{1}}$ .

By induction on $\bar{f}$ , we know that $f \in I d (S_{1})$ , and hence $I = I d (S_{1})$ . Moreover, Theorem 1 implies that $S_{1}$ is clearly a minimal GS basis for the ideal $I d (S)$ .

Assume that $S$ is a minimal GS basis for $I$ .

For every $s \in S$ we have $s = s^{'} + s^{''}$ , where $supp (s^{'}) \subseteq I r r (S \ {s})$ and $s^{''} \in I d (S \ {s})$ . Since $S$ is a minimal GS basis, it follows that $\bar{s} = \bar{s^{'}}$ for every $s \in S$ .

We claim that $S_{2} = {s^{'} | s \in S}$ is a reduced GS basis for $I$ . In fact, it is clear that $S_{2} \subseteq I d (S) = I$ . By Theorem 1, for every $f \in I d (S)$ we have $\bar{f} = a_{1} \bar{s_{1}} b_{1} = a_{1} \bar{s_{1}^{'}} b_{1}$ for some $a_{1}, b_{1} \in X^{*}$ .

Take two reduced GS bases $S$ and $R$ for the ideal $I$ . By Theorem 1, for every $s \in S$ ,

\begin{matrix} \bar{s} = a \bar{r} b, \bar{r} = c \bar{s_{1}} d \end{matrix}

for some $a, b, c, d \in X^{*}$ , $r \in R$ , and $s_{1} \in S$ , and hence $\bar{s} = a c \bar{s_{1}} d b$ . Since $\bar{s} \in supp (s) \subseteq I r r (S \ {s})$ , we have $s = s_{1}$ . It follows that $a = b = c = d = 1$ , and so $\bar{s} = \bar{r}$ .

If $s \neq r$ then $0 \neq s - r \in I = I d (S) = I d (R)$ . By Theorem 1, $\bar{s - r} = a_{1} \bar{r_{1}} b_{1} = c_{1} \bar{s_{2}} d_{1}$ for some $a_{1}, b_{1}, c_{1}, d_{1} \in X^{*}$ with $\bar{r_{1}}, \bar{s_{2}} < \bar{s} = \bar{r}$ . This means that $s_{2} \in S \ {s}$ and $r_{1} \in R \ {r}$ . Noting that $\bar{s - r} \in supp (s) \cup supp (r)$ , we have either $\bar{s - r} \in supp (s)$ or $\bar{s - r} \in supp (r)$ . If $\bar{s - r} \in supp (s)$ then $\bar{s - r} \in I r r (S \ {s})$ , which contradicts $\bar{s - r} = c_{1} \bar{s_{2}} d_{1}$ ; if $\bar{s - r} \in supp (r)$ then $\bar{s - r} \in I r r (R \ {r})$ , which contradicts $\bar{s - r} = a_{1} \bar{r_{1}} b_{1}$ . This shows that $s = r$ , and then $S \subseteq R$ . Similarly, $R \subseteq S$ . $□$

Remark 1

In fact, a reduced GS basis is unique (up to the ordering) in all possible cases below.

Remark 2

Both associative and Lie CD-lemmas are valid when we replace the base field $k$ by an arbitrary commutative ring $K$ with identity because we assume that all GS bases consist of monic polynomials. For example, consider a Lie algebra $L$ over $K$ which is a free $K$ -module with a well-ordered $K$ -basis ${a_{i} | i \in I}$ . With the deg-lex ordering on ${a_{i} | i \in I}^{*}$ , the universal enveloping associative algebra $U_{K} (L)$ has a (monic) GS basis

\begin{matrix} \{a_{i} a_{j} - a_{j} a_{i} = \sum α_{i j}^{t} a_{t} | i > j, i, j \in I\}, \end{matrix}

where $α_{i j}^{t} \in K$ and $[a_{i}, a_{j}] = \sum α_{i j}^{t} a_{t}$ in $L$ , and the CD-lemma for associative algebras over $K$ implies that $L \subset U_{K} (L)$ and

\begin{matrix} {a_{i_{1}} \dots a_{i_{n}} | i_{1} \leq \dots \leq i_{n}, n \geq 0, i_{1}, \dots, i_{n} \in I} \end{matrix}

is a $K$ -basis for $U_{K} (L)$ .

In fact, for the same reason, all CD-lemmas in this survey are valid if we replace the base field $k$ by an arbitrary commutative ring $K$ with identity. If this is the case then claim (iii) in the CD-lemma should read: $K (X | S)$ is a free $K$ -module with a $K$ -basis $I r r (S)$ . But in the general case, Shirshov’s algorithm fails: if $S$ is a monic set then $S^{'}$ , the set obtained by adding to $S$ all non-trivial compositions, is not a monic set in general, and the algorithm may stop with no result.

2.2 Gröbner bases for commutative algebras and their lifting to Gröbner–Shirshov bases

Consider the free commutative associative algebra $k [X]$ . Given a well ordering $<$ on $X = {x_{i} | i \in I}$ ,

\begin{matrix} [X] = {x_{i_{1}} \dots x_{i_{t}} | i_{1} \leq \dots \leq i_{t}, i_{1}, \dots, i_{t} \in I, t \geq 0} \end{matrix}

is a linear basis for $k [X]$ .

Choose a monomial ordering $<$ on $[X]$ . Take two monic polynomials $f$ and $g$ in $k [X]$ such that $w = lcm (\bar{f}, \bar{g}) = \bar{f} a = \bar{g} b$ for some $a, b \in [X]$ with $| \bar{f} | + | \bar{g} | > | w |$ (so, $\bar{f}$ and $\bar{g}$ are not coprime in $[X]$ ). Then ${(f, g)}_{w} = f a - g b$ is called the $s$ -polynomial of $f$ and $g$ .

A monic subset $S \subseteq k [X]$ is called a Gröbner basis with respect to the monomial ordering $<$ whenever all $s$ -polynomials of two arbitrary polynomials in $S$ are trivial modulo $S$ and corresponding $w$ .

An argument similar to the proof of the CD-lemma for associative algebras justifies the following theorem due to Buchberger.

Theorem 4

(Buchberger Theorem) Choose a monomial ordering $<$ on $[X]$ . Consider a monic set $S \subset k [X]$ and the ideal $I d (S)$ of $k [X]$ generated by $S$ . The following statements are equivalent:

(i)
$S$ is a Gröbner basis in $k [X]$ .
(ii)
$f \in I d (S) \Rightarrow \bar{f} = \bar{s} a$ for some $s \in S$ and $a \in [X]$ .
(iii)
$I r r (S) = {u \in [X] | u \neq \bar{s} a, s \in S, a \in [X]}$ is a linear basis for the algebra $k [X | S] = k [X] / I d (S)$ .

Proof

Denote by $lcm (u, v)$ be the usual (unique) least common multiple of two commutative words $u, v \in [X]$ :

\begin{matrix} lcm (u, v) \in {u v (the trivial lcm (u, v)); \\ a u = b v, a, b \in [X], | a u | < | u | + | v | (the nontrivial lcm (u, v))} . \end{matrix}

If $cm (u, v) = a_{1} u = a_{2} v$ is a common multiple of $u$ and $v$ then $cm (u, v) = b \cdot lcm (u, v)$ .

The $s$ -polynomial of two monic polynomials $f$ and $g$ is

\begin{matrix} {(f, g)}_{_{lcm (\bar{f}, \bar{g})}} = lcm (\bar{f}, \bar{g}) {|_{\bar{f} \mapsto f} - lcm (\bar{f}, \bar{g}) |}_{\bar{g} \mapsto g} . \end{matrix}

An analogue of Lemma 1 is valid for $k [X]$ because if $a_{1} {\bar{s}}_{1} = a_{2} {\bar{s}}_{2}$ for two monic polynomials $s_{1}$ and $s_{2}$ then

\begin{matrix} a_{1} s_{1} - a_{2} s_{2} = b \cdot {(s_{1}, s_{2})}_{lcm ({\bar{s}}_{1}, {\bar{s}}_{2})} . \end{matrix}

Lemma 1 implies the main claim (i) $\Rightarrow$ (ii) of Buchberger’s theorem. $□$

Theorem 5

Given an ideal $I$ of $k [X]$ and a monomial ordering $<$ on $[X]$ , there exists a unique reduced Gröbner basis $S$ for $I$ . Moreover, if $X$ is finite then so is $S$ .

Eisenbud et al. [99] constructed a GS basis in $k ⟨ X ⟩$ by lifting a commutative Gröbner basis for $k [X]$ and adding all commutators. Write $X = {x_{1}, x_{2}, \dots, x_{n}}$ and put

\begin{matrix} S_{1} = {h_{i j} = x_{i} x_{j} - x_{j} x_{i} | i > j} \subset k ⟨ X ⟩ . \end{matrix}

Consider the natural map $γ : k ⟨ X ⟩ \to k [X]$ carrying $x_{i}$ to $x_{i}$ and the lexicographic splitting of $γ$ , which is defined as the $k$ -linear map

\begin{matrix} δ : k [X] \to k ⟨ X ⟩, x_{i_{1}} x_{i_{2}} \dots x_{i_{r}} \mapsto x_{i_{1}} x_{i_{2}} \dots x_{i_{r}} if i_{1} \leq i_{2} \dots \leq i_{r} . \end{matrix}

Given $u \in [X]$ , we express it as $u = x_{1}^{l_{1}} x_{2}^{l_{2}} \dots x_{n}^{l_{n}}$ , where $l_{i} \geq 0$ , using an arbitrary monomial ordering on $[X]$ .

Following [99], define an ordering on $X^{*}$ using the ordering $x_{1} < x_{2} < \dots < x_{n}$ as follows: given $u, v \in X^{*}$ , put

\begin{matrix} u > v if γ (u) > γ (v) in [X] or (γ (u) = γ (v) and u >_{l e x} v) . \end{matrix}

It is easy to check that this is a monomial ordering on $X^{*}$ and $\bar{δ (s)} = δ (\bar{s})$ for every $s \in k [X]$ . Moreover, $v \geq δ (u)$ for every $v \in γ^{- 1} (u)$ .

Consider an arbitrary ideal $L$ of $k [X]$ generated by monomials. Given $m = x_{i_{1}} x_{i_{2}} \dots x_{i_{r}} \in L, i_{1} \leq i_{2} \dots \leq i_{r}$ , denote by $U_{L} (m)$ the set of all monomials $u \in [x_{i_{1} + 1}, \dots, x_{i_{r} - 1}]$ such that neither $u x_{i_{2}} \dots x_{i_{r}}$ nor $u x_{i_{1}} \dots x_{i_{r - 1}}$ lie in $L$ .

Theorem 6

([99]) Consider the orderings on $[X]$ and $X^{*}$ defined above. If $S$ is a minimal Gröbner basis in $k [X]$ then $S^{'} = {δ (u s) | s \in S, u \in U_{L} (\bar{s})} \cup S_{1}$ is a minimal Gröbner–Shirshov basis in $k ⟨ X ⟩$ , where $L$ is the monomial ideal of $k [X]$ generated by $\bar{S}$ .

Jointly with Yongshan Chen [30], we generalized this result to lifting a GS basis $S \subset k [Y] \otimes k ⟨ X ⟩$ , see Mikhalev and Zolotykh [170], to a GS basis of $I d (S, [y_{i}, y_{j}] for all (i, j)$ ) of $k ⟨ Y ⟩ \otimes k ⟨ X ⟩$ .

Recall that for a prime number $p$ the Gauss ordering on the natural numbers is described as $s \leq_{p} t$ whenever $(\binom{t}{s}) ≢ 0 mod p$ . Let $\leq_{0} = \leq$ be the usual ordering on the natural numbers. A monomial ideal $L$ of $k [X]$ is called $p$ -Borel-fixed whenever it satisfies the following condition: for each monomial generator $m$ of $L$ , if $m$ is divisible by $x_{j}^{t}$ but no higher power of $x_{j}$ then ${(x_{i} / x_{j})}^{s} m \in L$ for all $i < j$ and $s \leq_{p} t$ .

Thus, we have the following Eisenbud–Peeva–Sturmfels lifting theorem.

Theorem 7

([99]) Given an ideal $I$ of $k [X]$ , take $L = I d (\bar{f}, f \in I)$ and $J = γ^{- 1} (I) \subset k ⟨ X ⟩$ .

(i)
If $L$ is $0$ -Borel-fixed then a minimal Gröbner–Shirshov basis of $J$ is obtained by applying $δ$ to a minimal Gröbner basis of $I$ and adding commutators.
(ii)
If $L$ is $p$ -Borel-fixed for some $p$ then $J$ has a finite Gröbner–Shirshov basis.

Proof

Assume that $L$ is $p$ -Borel-fixed for some $p$ . Take a generator $m = x_{i_{1}} x_{i_{2}} \dots x_{i_{r}}$ of $L$ , where $x_{i_{1}} \leq x_{i_{2}} \leq \dots \leq x_{i_{r}}$ , and suppose that $x_{i_{r}}^{t}$ is the highest power of $x_{i_{r}}$ dividing $m$ . Since $t \leq_{p} t$ , it follows that $x_{l}^{t} m / x_{i_{r}}^{t} \in L$ for $l < i_{r}$ . This implies that $x_{l}^{t} m / x_{i_{r}} \in L$ for $l < i_{r}$ , and hence, every monomial in $U_{L} (m)$ satisfies $d e g_{x_{l}} (u) < t$ for $i_{1} < l < i_{r}$ . Thus, $U_{L} (m)$ is a finite set, and the result follows from Theorem 6. In particular, if $p = 0$ then $U_{L} (m) = 1$ . $□$

In characteristic $p \geq 0$ observe that if the field $k$ is infinite then after a generic change of variables $L$ is $p$ -Borel-fixed. Then Theorems 6 and 7 imply

Corollary 1

([99]) Consider an infinite field $k$ and an ideal $I \subset k [X]$ . After a general linear change of variables, the ideal $γ^{- 1} (I)$ in $k ⟨ X ⟩$ has a finite Gröbner–Shirshov basis.

2.3 Composition-Diamond lemma for modules

Consider $S$ , $T \subset k ⟨ X ⟩$ and $f$ , $g \in k ⟨ X ⟩$ . Kang and Lee define [123] the composition of $f$ and $g$ as follows.

Definition 1

([123, 127])

(a)
If there exist $a, b \in X^{*}$ such that $w = \bar{f} a = b \bar{g}$ with $| w | < | \bar{f} | + | \bar{g} |$ then the intersection composition is defined as ${(f, g)}_{w} = f a - b g$ .
(b)
If there exist $a$ , $b \in X^{*}$ such that $w = a \bar{f} b = \bar{g}$ then the inclusion composition is defined as ${(f, g)}_{w} = a f b - g$ .
(c)
The composition ${(f, g)}_{w}$ is called right-justified whenever $w = \bar{f} = a \bar{g}$ for some $a \in X^{*}$ .

If $f - g = \sum α_{i} a_{i} s_{i} b_{i} + \sum β_{j} c_{j} t_{j}$ , where $α_{i}, β_{j} \in k$ , $a_{i}, b_{i}, c_{j} \in X^{*}$ , $s_{i} \in S$ , and $t_{j} \in T$ with $a_{i} {\bar{s}}_{i} b_{i} < w$ and $c_{j} {\bar{t}}_{j} < w$ for all $i$ and $j$ , then we call $f - g$ trivial with respect to $S$ and $T$ and write $f \equiv g mod (S, T; w)$ .

Definition 2

([123, 124]) A pair $(S, T)$ of monic subsets of $k ⟨ X ⟩$ is called a GS pair if $S$ is closed under composition, $T$ is closed under right-justified composition with respect to $S$ , and given $f \in S$ , $g \in T$ , and $w \in X^{*}$ such that if ${(f, g)}_{w}$ is defined, we have ${(f, g)}_{w} \equiv 0 mod (S, T; w)$ . In this case, say that $(S, T)$ is a GS pair for the $A$ -module $_{A} M =_{A} k ⟨ X ⟩ / (k ⟨ X ⟩ T + I d (S))$ , where $A = k ⟨ X | S ⟩$ .

Theorem 8

(Kang and Lee [123, 124], the CD-lemma for cyclic modules) Consider a pair $(S, T)$ of monic subsets of $k ⟨ X ⟩$ , the associative algebra $A = k ⟨ X | S ⟩$ defined by $S$ , and the left cyclic module $_{A} M =_{A} k ⟨ X ⟩ / (k ⟨ X ⟩ T + I d (S))$ defined by $(S, T)$ . Suppose that $(S, T)$ is a Gröbner–Shirshov pair for the $A$ -module $_{A} M$ and $p \in k ⟨ X ⟩ T + I d (S)$ . Then $\bar{p} = a \bar{s} b$ or $\bar{p} = c \bar{t}$ , where $a, b, c \in X^{*}$ , $s \in S$ , and $t \in T$ .

Applications of Theorem 8 appeared in [125–127].

Take two sets $X$ and $Y$ and consider the free left $k ⟨ X ⟩$ -module ${Mod}_{k ⟨ X ⟩} ⟨ Y ⟩$ with $k ⟨ X ⟩$ -basis $Y$ . Then ${Mod}_{k ⟨ X ⟩} ⟨ Y ⟩ = \oplus_{y \in Y} k ⟨ X ⟩ y$ is called a double-free module. We now define the GS basis in ${Mod}_{k ⟨ X ⟩} ⟨ Y ⟩$ . Choose a monomial ordering $<$ on $X^{*}$ , and a well-ordering $<$ on $Y$ . Put $X^{*} Y = {u y | u \in X^{*}, y \in Y}$ and define an ordering $<$ on $X^{*} Y$ as follows: for any $w_{1} = u_{1} y_{1}$ , $w_{2} = u_{2} y_{2} \in X^{*} Y$ ,

\begin{matrix} w_{1} < w_{2} \Leftrightarrow u_{1} < u_{2} or u_{1} = u_{2}, y_{1} < y_{2} \end{matrix}

Given $S \subset {Mod}_{k ⟨ X ⟩} ⟨ Y ⟩$ with all $s \in S$ monic, define composition in $S$ to be only inclusion composition, which means that $\bar{f} = a \bar{g}$ for some $a \in X^{*}$ , where $f, g \in S$ . If ${(f, g)}_{\bar{f}} = f - a g = \sum α_{i} a_{i} s_{i}$ , where $α_{i} \in k$ , $a_{i} \in X^{*}$ , $s_{i} \in S$ , and $a_{i} {\bar{s}}_{i} < \bar{f}$ , then this composition is called trivial modulo $(S, \bar{f})$ .

Theorem 9

(Chibrikov [90], see also [78], the CD-lemma for modules) Consider a non-empty set $S \subset m o d_{k ⟨ X ⟩} ⟨ Y ⟩$ with all $s \in S$ monic and choose an ordering $<$ on $X^{*} Y$ as before. The following statements are equivalent:

(i)
$S$ is a Gröbner–Shirshov basis in ${Mod}_{k ⟨ X ⟩} ⟨ Y ⟩$ .
(ii)
If $0 \neq f \in k ⟨ X ⟩ S$ then $\bar{f} = a \bar{s}$ for some $a \in X^{*}$ and $s \in S$ .
(iii)
$I r r (S) = {w \in X^{*} Y | w \neq a \bar{s}, a \in X^{*}, s \in S}$ is a linear basis for the quotient ${Mod}_{k ⟨ X ⟩} ⟨ Y | S ⟩ = {Mod}_{k ⟨ X ⟩} ⟨ Y ⟩ / k ⟨ X ⟩ S$ .

Outline of the proof. Take $u \in X^{*} Y$ and express it as $u = u^{X} y_{u}$ with $u^{X} \in X^{*}$ and $y_{u} \in Y$ . Put

\begin{matrix} cm (u, v) = a^{X} u = b^{X} v, lcm (u, v) = u = d^{X} v, \end{matrix}

where $y_{u} = y_{v}$ . Up to the order of $u$ and $v$ , we have $c m (u, v) = c \cdot lcm (u, v)$ .

The composition of two monic elements $f, g \in {Mod}_{k ⟨ X ⟩} (Y)$ is

\begin{matrix} {(f, g) |}_{lcm (\bar{f}, \bar{g})} = lcm (\bar{f}, \bar{g}) {|_{\bar{f} \mapsto f} - lcm (\bar{f}, \bar{g}) |}_{\bar{g} \mapsto g} . \end{matrix}

If $a_{1} {\bar{s}}_{1} = a_{2} {\bar{s}}_{2}$ for monic $s_{1}$ and $s_{2}$ then $a_{1} s_{1} - a_{2} s_{2} = c \cdot {(s_{1}, s_{2})}_{lcm ({\bar{s}}_{1}, {\bar{s}}_{2})}$ . This gives an analogue of Lemma 1 for modules and the implication (i) $\Rightarrow$ (ii) of Theorem 9.

Given $S \subset k ⟨ X ⟩$ , put $A = k ⟨ X | S ⟩$ . We can regard every left $A$ -module $_{A} M$ as a $k ⟨ X ⟩$ -module in a natural way: $f m : = (f + I d (S)) m$ for $f \in k ⟨ X ⟩$ and $m \in M$ . Observe that $_{A} M$ is an epimorphic image of some free $A$ -module. Assume now that $_{A} M = {Mod}_{A} ⟨ Y | T ⟩ = {Mod}_{A} ⟨ Y ⟩ / A T$ , where $T \subset {Mod}_{A} ⟨ Y ⟩$ . Put

\begin{matrix} T_{1} = \{\sum f_{i} y_{i} \in {Mod}_{k ⟨ X ⟩} ⟨ Y ⟩ | \sum (f_{i} + I d (S)) y_{i} \in T\} \end{matrix}

and $R = S X^{*} Y \cup T_{1}$ . Then $_{A} M = mod_{k ⟨ X ⟩} ⟨ Y | R ⟩$ as $k ⟨ X ⟩$ -modules.

Theorem 10

Given a submodule $I$ of ${Mod}_{k ⟨ X ⟩} ⟨ Y ⟩$ and a monomial ordering $<$ on $X^{*} Y$ as above, there exists a unique reduced Gröbner–Shirshov basis $S$ for $I$ .

Corollary 2

(Cohn) Every left ideal $I$ of $k ⟨ X ⟩$ is a free left $k ⟨ X ⟩$ -module.

Proof

Take a reduced Gröbner–Shirshov basis $S$ of $I$ as a $k ⟨ X ⟩$ -submodule of the cyclic $k ⟨ X ⟩$ -module. Then $I$ is a free left $k ⟨ X ⟩$ -module with a $k ⟨ X ⟩$ -basis $S$ . $□$

As an application of the CD-lemma for modules, we give GS bases for the Verma modules over the Lie algebras of coefficients of free Lie conformal algebras. We find linear bases for these modules.

Let $B$ be a set of symbols. Take the constant locality function $N : B \times B \to Z_{+}$ ; that is, $N (a, b) \equiv N$ for all $a, b \in B$ . Put $X = {b (n) | b \in B, n \in Z}$ and consider the Lie algebra $L = L i e (X | S)$ over a field $k$ of characteristic 0 generated by $X$ with the relations

\begin{matrix} S = \{\sum_{s} {(- 1)}^{s} (\binom{N}{s}) [b (n - s) a (m + s)] = 0 | a, b \in B, m, n \in Z\} . \end{matrix}

For every $b \in B$ , put $\tilde{b} = \sum_{n} b (n) z^{- n - 1} \in L [[z, z^{- 1}]]$ . It is well-known that these elements generate a free Lie conformal algebra $C$ with data $(B, N)$ (see [194]). Moreover, the coefficient algebra of $C$ is just $L$ .

Suppose that $B$ is linearly ordered. Define an ordering on $X$ as

\begin{matrix} a (m) < b (n) \Leftrightarrow m < n or (m = n and a < b) . \end{matrix}

We use the deg-lex ordering on $X^{*}$ . It is clear that the leading term of each polynomial in $S$ is $b (n) a (m)$ with

\begin{matrix} n - m > N or (n - m = N and (b > a or (b = a and N is odd))) . \end{matrix}

The following lemma is essentially from [194].

Lemma 3

([78]) With the deg-lex ordering on $X^{*}$ , the set $S$ is a GS basis in $L i e (X)$ .

Corollary 3

([78]) A linear basis of the universal enveloping algebra $U = U (L)$ of $L$ consists of the monomials

\begin{matrix} a_{1} (n_{1}) a_{2} (n_{2}) \dots a_{k} (n_{k}) \end{matrix}

with $a_{i} \in B$ and $n_{i} \in Z$ such that for every $1 \leq i < k$ we have

\begin{matrix} n_{i} - n_{i + 1} \leq \{\begin{matrix} N - 1 & if a_{i} > a_{i + 1} or (a_{i} = a_{i + 1} and N is odd) \\ N & otherwise . \end{matrix} \end{matrix}

An $L$ -module $M$ is called restricted if for all $a \in C$ and $v \in M$ there is some integer $T$ such that $a (n) v = 0$ for $n \geq T$ .

An $L$ -module $M$ is called a highest weight module whenever it is generated over $L$ by a single element $m \in M$ satisfying $L_{+} m = 0$ , where $L_{+}$ is the subspace of $L$ generated by ${a (n) | a \in C, n \geq 0}$ . In this case $m$ is called a highest weight vector.

Let us now construct a universal highest weight module $V$ over $L$ , which is often called the Verma module. Take the trivial $1$ -dimensional $L_{+}$ -module $k I_{v}$ generated by $I_{v}$ ; hence, $a (n) I_{v} = 0$ for all $a \in B, n \geq 0$ . Clearly,

\begin{matrix} V = I n d_{L_{+}}^{L} k I_{v} = U (L) \otimes_{U (L_{+})} k I_{v} ≅ U (L) / U (L) L_{+} . \end{matrix}

Then $V$ has the structure of the highest weight module over $L$ with the action given by multiplication on $U (L) / U (L) L_{+}$ and a highest weight vector $I \in U (L)$ . In addition, $V = U (L) / U (L) L_{+}$ is the universal enveloping vertex algebra of $C$ and the embedding $φ : C \to V$ is given by $a \mapsto a (- 1) I$ (see also [194]).

Theorem 11

([78]) With the above notions, a linear basis of $V$ consists of the elements

\begin{matrix} a_{1} (n_{1}) a_{2} (n_{2}) \dots a_{k} (n_{k}), a_{i} \in B, n_{i} \in Z \end{matrix}

satisfying the condition in Corollary 3 and $n_{k} < 0$ .

Proof

Clearly, as $k ⟨ X ⟩$ -modules, we have

\begin{matrix} _{U} V =_{U} (U (L) / U (L) L_{+}) = {Mod}_{k ⟨ X ⟩} ⟨ I | S^{(-)} X^{*} I, a (n) I, n \geq 0 ⟩ =_{k ⟨ X ⟩} ⟨ I | S^{'} ⟩, \end{matrix}

where $S^{'} = {S^{(-)} X^{*} I, a (n) I, n \geq 0}$ . In order to show that $S^{'}$ is a Gröbner–Shirshov basis, we only need to verify that $w = b (n) a (m) I$ , where $m \geq 0$ . Take

\begin{matrix} f = \sum_{s} {(- 1)}^{s} (\binom{n}{s}) (b (n - s) a (m + s) - a (m + s) b (n - s)) I and g = a (m) I . \end{matrix}

Then ${(f, g)}_{w} = f - b (n) a (m) I \equiv 0 mod (S^{'}, w)$ since $n - m \geq N$ , $m + s \geq 0$ , $n - s \geq 0$ , and $0 \leq s \leq N$ . It follows that $S^{'}$ is a Gröbner–Shirshov basis. Now, the result follows from the CD-lemma for modules. $□$

2.4 Composition-Diamond lemma for categories

Denote by $X$ an oriented multi-graph. A path

\begin{matrix} a_{n} \to a_{n - 1} \to \dots \to a_{1} \to a_{0}, n \geq 0, \end{matrix}

in $X$ with edges $x_{n}, \dots, x_{2}, x_{1}$ is a partial word $u = x_{1} x_{2} \dots x_{n}$ on $X$ with source $a_{n}$ and target $a_{0}$ . Denote by $C (X)$ the free category generated by $X$ (the set of all partial words (paths) on $X$ with partial multiplication, the free ‘partial path monoid’ on $X$ ). A well-ordering on $C (X)$ is called monomial whenever it is compatible with partial multiplication.

A polynomial $f \in k C (X)$ is a linear combination of partial words with the same source and target. Then $k C (X)$ is the partial path algebra on $X$ (the free associative partial path algebra generated by $X$ ).

Given $S \subset k C (X)$ , denote by $I d (S)$ the minimal subset of $k C (X)$ that includes $S$ and is closed under the partial operations of addition and multiplication. The elements of $I d (S)$ are of the form $\sum α_{i} a_{i} s_{i} b_{i}$ with $α_{i} \in k$ , $a_{i}, b_{i} \in C (X)$ , and $s_{i} \in S$ , and all $S$ -words have the same source and target.

Both inclusion and intersection compositions are possible.

With these differences, the statement and proof of the CD-lemma are the same as for the free associative algebra.

Theorem 12

([36], the CD-lemma for categories) Consider a nonempty set $S \subset k C (X)$ of monic polynomials and a monomial ordering $<$ on $C (X)$ . Denote by $I d (S)$ the ideal of $k C (X)$ generated by $S$ . The following statements are equivalent:

(i)
The set $S$ is a Gröbner–Shirshov basis in $k C (X)$ .
(ii)
$f \in I d (S) \Rightarrow \bar{f} = a \bar{s} b$ for some $s \in S$ and $a, b \in C (X)$ .
(iii)
the set $I r r (S) = {u \in C (X) | u \neq a \bar{s} b a, b \in C (X), s \in S}$ is a linear basis for $k C (X) / I d (S)$ , which is denoted by $k C (X | S)$ .

Outline of the proof.

Define $w = lcm (u, v), u, v \in C (X)$ and the general composition ${(f, g)}_{w}$ for $f, g \in k C (X)$ and $w = lcm (\bar{f}, \bar{g})$ by the same formulas as above. Under the conditions of the analogue of Lemma 1, we again have $a_{1} s_{1} b_{1} - a_{2} s_{2} b_{2} = c {(s_{1}, s_{2})}_{w} d \equiv 0 mod (S, w)$ , where $w = lcm ({\bar{s}}_{1}, {\bar{s}}_{2})$ and $c, d \in C (X) .$ This implies the analogue of Lemma 1 and the main assertion (i) $\Rightarrow$ (ii) of Theorem 12.

Let us present some applications of CD-lemma for categories.

For each non-negative integer $p$ , denote by $[p]$ the set ${0, 1, 2, \dots, p}$ of integers in their usual ordering. A (weakly) monotonic map $μ : [q] \to [p]$ is a function from $[q]$ to $[p]$ such that $i \leq j$ implies $μ (i) \leq μ (j)$ . The objects $[p]$ with weakly monotonic maps as morphisms constitute the category $Δ$ called the simplex category. It is convenient to use two special families of monotonic maps,

\begin{matrix} ε_{q}^{i} : [q - 1] \to [q], η_{q}^{i} : [q + 1] \to [q] \end{matrix}

defined for $i = 0, 1, \dots q$ (and for $q > 0$ in the case of $ε^{i}$ ) by

\begin{matrix} ε_{q}^{i} (j) & = \{\begin{matrix} j & if i > j, \\ j + 1 & if i \leq j, \end{matrix} \\ η_{q}^{i} (j) & = \{\begin{matrix} j & if i \geq j, \\ j - 1 & if j > i . \end{matrix} \end{matrix}

Take the oriented multi-graph $X = (V (X), E (X))$ with

\begin{matrix} V (X) = {[p] | p \in Z^{+} \cup {0}}, \\ E (X) = {ε_{p}^{i} : [p - 1] \to [p], η_{q}^{j} : [q + 1] \to [q] | p > 0, 0 \leq i \leq p, 0 \leq j \leq q} & . \end{matrix}

Consider the relation $S \subseteq C (X) \times C (X)$ consisting of:

\begin{matrix} f_{_{q + 1, q}} : ε_{q + 1}^{i} ε_{q}^{j - 1} = ε_{q + 1}^{j} ε_{q}^{i} for j > i; \\ g_{_{q, q + 1}} : η_{q}^{j} η_{q + 1}^{i} = η_{q}^{i} η_{q + 1}^{j + 1} for j \geq i; \\ h_{_{q - 1, q}} : η_{q - 1}^{j} ε_{q}^{i} = \{\begin{matrix} ε_{q - 1}^{i} η_{q - 2}^{j - 1} & for j > i, \\ 1_{q - 1} & for i = j or i = j + 1, \\ ε_{q - 1}^{i - 1} η_{q - 2}^{j} & for i > j + 1 . \end{matrix} \end{matrix}

This yields a presentation $Δ = C (X | S)$ of the simplex category $Δ$ .

Order now $C (X)$ as follows.

Firstly, for $η_{p}^{i}, η_{q}^{j} \in {η_{p}^{i} | p \geq 0, 0 \leq i \leq p}$ put $η_{p}^{i} > η_{q}^{j}$ iff $p > q$ or ( $p = q$ and $i < j$ ).

Secondly, for

\begin{matrix} u = η_{p_{1}}^{i_{1}} η_{p_{2}}^{i_{2}} \dots η_{p_{n}}^{i_{n}} \in {η_{p}^{i} | p \geq 0, 0 \leq i \leq p}^{*} \end{matrix}

(these are all possible words on ${η_{p}^{i} | p \geq 0, 0 \leq i \leq p}$ , including the empty word $1_{v}$ , where $v \in O b (X)$ ), define

\begin{matrix} wt (u) = (n, η_{p_{n}}^{i_{n}}, η_{p_{n - 1}}^{i_{n - 1}}, \dots, η_{p_{1}}^{i_{1}}) . \end{matrix}

Then, for $u, v \in {η_{p}^{i} | p \geq 0, 0 \leq i \leq p}^{*}$ put $u > v$ iff $wt (u) > wt (v)$ lexicographically.

Thirdly, for $ε_{p}^{i}, ε_{q}^{j} \in {ε_{p}^{i}, | p \in Z^{+}, 0 \leq i \leq p}$ , put $ε_{p}^{i} > ε_{q}^{j}$ iff $p > q$ or ( $p = q$ and $i < j$ ).

Finally, for $u = v_{0} ε_{p_{1}}^{i_{1}} v_{1} ε_{p_{2}}^{i_{2}} \dots ε_{p_{n}}^{i_{n}} v_{n} \in C (X)$ , where $n \geq 0$ , and $v_{j} \in {η_{p}^{i} | p \geq 0, 0 \leq i \leq p}^{*}$ put $wt (u) = (n, v_{0}, v_{1}, \dots, v_{n}, ε_{p_{1}}^{i_{1}}, \dots, ε_{p_{n}}^{i_{n}})$ . Then for every $u, v \in C (X)$ ,

\begin{matrix} u ≻_{_{1}} v \Leftrightarrow wt (u) > wt (v) lexicographically . \end{matrix}

It is easy to check that $≻_{_{1}}$ is a monomial ordering on $C (X)$ . Then we have

Theorem 13

([36]) For $X$ and $S$ defined above, with the ordering $≻_{1}$ on $C (X)$ , the set $S$ is a Gröbner–Shirshov basis for the simplex partial path algebra $k C (X | S)$ .

Corollary 4

([157]) Every morphism $μ : [q] \to [p]$ of the simplex category has a unique expression of the form

\begin{matrix} ε_{p}^{i_{1}} \dots ε_{p - m + 1}^{i_{m}} η_{q - n}^{j_{1}} \dots η_{q - 1}^{j_{n}} \end{matrix}

with $p \geq i_{1} > \dots > i_{m} \geq 0$ , $0 \leq j_{1} < \dots < j_{n} < q$ , and $q - n + m = p$ .

The cyclic category is defined by generators and relations as follows, see [104]. Take the oriented (multi) graph $Y = (V (Y), E (Y))$ with $V (Y) = {[p] | p \in Z^{+} \cup {0}}$ and

\begin{matrix} E (Y) & = {ε_{p}^{i} : [p - 1] \to [p], η_{q}^{j} : [q + 1] \to [q], t_{q} : [q] \\ \to [q] | p > 0, 0 \leq i \leq p, 0 \leq j \leq q} . \end{matrix}

Consider the relation $S \subseteq C (Y) \times C (Y)$ consisting of:

\begin{matrix} f_{_{q + 1, q}} : ε_{q + 1}^{i} ε_{q}^{j - 1} = ε_{q + 1}^{j} ε_{q}^{i} for j > i; \\ g_{_{q, q + 1}} : η_{q}^{j} η_{q + 1}^{i} = η_{q}^{i} η_{q + 1}^{j + 1} for j \geq i; \\ h_{_{q - 1, q}} : η_{q - 1}^{j} ε_{q}^{i} = \{\begin{matrix} ε_{q - 1}^{i} η_{q - 2}^{j - 1} & for j > i, \\ 1_{q - 1} & for i = j or i = j + 1, \\ ε_{q - 1}^{i - 1} η_{q - 2}^{j} & for i > j + 1, \end{matrix} \\ ρ_{1} : t_{q} ε_{q}^{i} = ε_{q}^{i - 1} t_{q - 1} for i = 1, \dots, q; \\ ρ_{2} : t_{q} η_{q}^{i} = η_{q}^{i - 1} t_{q + 1} for i = 1, \dots, q; \\ ρ_{3} : t_{q}^{q + 1} = 1_{q} . \end{matrix}

The category $C (Y | S)$ is called the cyclic category and denoted by $Λ$ .

Define an ordering on $C (Y)$ as follows.

Firstly, for $t_{p}^{i}$ , $t_{q}^{j} \in {t_{q} | q \geq 0}^{*}$ put ${(t_{p})}^{i} > {(t_{q})}^{j}$ iff $i > j$ or ( $i = j$ and $p > q$ ).

Secondly, for $η_{p}^{i}, η_{q}^{j} \in {η_{p}^{i} | p \geq 0, 0 \leq i \leq p}$ put $η_{p}^{i} > η_{q}^{j}$ iff $p > q$ or ( $p = q$ and $i < j$ ).

Thirdly, for

\begin{matrix} u = w_{0} η_{p_{1}}^{i_{1}} w_{1} η_{p_{2}}^{i_{2}} \dots w_{n - 1} η_{p_{n}}^{i_{n}} w_{n} \in {t_{q}, η_{p}^{i} | q, p \geq 0, 0 \leq i \leq p}^{*}, \end{matrix}

where $w_{i} \in {t_{q} | q \geq 0}^{*}$ , put

\begin{matrix} wt (u) = (n, w_{0}, w_{1}, \dots, w_{n}, η_{p_{n}}^{i_{n}}, η_{p_{n - 1}}^{i_{n - 1}}, \dots, η_{p_{1}}^{i_{1}}) . \end{matrix}

Then for every $u, v \in$ ${t_{q}, η_{p}^{i} | q, p \geq 0, 0 \leq i \leq p}^{*}$ put $u > v$ iff $wt (u) > wt (v)$ lexicographically.

Fourthly, for $ε_{p}^{i}, ε_{q}^{j} \in$ ${ε_{p}^{i}, | p \in Z^{+}, 0 \leq i \leq p}$ , $ε_{p}^{i} > ε_{q}^{j}$ iff $p > q$ or ( $p = q$ and $i < j$ ).

Finally, for $u = v_{0} ε_{p_{1}}^{i_{1}} v_{1} ε_{p_{2}}^{i_{2}} \dots ε_{p_{n}}^{i_{n}} v_{n} \in C (Y)$ and $v_{j} \in {t_{q}, η_{p}^{i} | q, p \geq 0, 0 \leq i \leq p}^{*}$ define $wt (u) = (n, v_{0}, v_{1}, \dots, v_{n}, ε_{p_{1}}^{i_{1}}, \dots, ε_{p_{n}}^{i_{n}}) .$

Then for every $u, v \in C (Y)$ put $u ≻_{_{2}} v \Leftrightarrow wt (u) < wt (v) lexicographically .$

It is also easy to verify that $≻_{_{2}}$ is a monomial ordering on $C (Y)$ which extends $≻_{_{1}}$ . Then we have

Theorem 14

([36]) Consider $Y$ and $S$ defined as the above. Put $ρ_{4} : t_{q} ε_{q}^{0} = ε_{q}^{q}$ and $ρ_{5} : t_{q} η_{q}^{0} = η_{q}^{q} t_{q + 1}^{2}$ . Then

(1)
With the ordering $≻_{_{2}}$ on $C (Y)$ , the set $S \cup {ρ_{4}, ρ_{5}}$ is a Gröbner–Shirshov basis for the cyclic category $C (Y | S)$ .
(2)
Every morphism $μ : [q] \to [p]$ of the cyclic category $Λ = C (Y | S)$ has a unique expression of the form
$\begin{matrix} ε_{p}^{i_{1}} \dots ε_{p - m + 1}^{i_{m}} η_{q - n}^{j_{1}} \dots η_{q - 1}^{j_{n}} t_{q}^{k} \end{matrix}$
with $p \geq i_{1} > \dots > i_{m} \geq 0$ , $0 \leq j_{1} < \dots < j_{n} < q$ , $0 \leq k \leq q$ , and $q - n + m = p$ .

2.5 Composition-Diamond lemma for associative algebras over commutative algebras

Given two well-ordered sets $X$ and $Y$ , put

\begin{matrix} N = [X] Y^{*} = {u = u^{X} u^{Y} | u^{X} \in [X] a n d u^{Y} \in Y^{*}} \end{matrix}

and denote by $k N$ the $k$ -space spanned by $N$ . Define the multiplication of words as

\begin{matrix} u = u^{X} u^{Y}, v = v^{X} v^{Y} \in N \Rightarrow u v = u^{X} v^{X} u^{Y} v^{Y} \in N . \end{matrix}

This makes $k N$ an algebra isomorphic to the tensor product $k [X] \otimes k ⟨ Y ⟩$ , called a‘double free associative algebra’. It is a free object in the category of all associative algebras over all commutative algebras (over $k$ ): every associative algebra $_{K} A$ over a commutative algebra $K$ is isomorphic to $k [X] \otimes k ⟨ Y ⟩ / I d (S)$ as a $k$ -algebra and a $k [X]$ -algebra.

Choose a monomial ordering $>$ on $N$ . The following definitions of compositions and the GS basis are taken from [170].

Take two monic polynomials $f$ and $g$ in $k [X] \otimes k ⟨ Y ⟩$ and denote by $L$ the least common multiple of ${\bar{f}}^{X}$ and ${\bar{g}}^{X}$ .

1.
Inclusion. Assume that ${\bar{g}}^{Y}$ is a subword of ${\bar{f}}^{Y}$ , say, ${\bar{f}}^{Y} = c {\bar{g}}^{Y} d$ for some $c, d \in Y^{*}$ . If ${\bar{f}}^{Y} = {\bar{g}}^{Y}$ then ${\bar{f}}^{X} \geq {\bar{g}}^{X}$ and if ${\bar{g}}^{Y} = 1$ then we set $c = 1$ . Put $w = L {\bar{f}}^{Y} = L c {\bar{g}}^{Y} d$ . Define the composition $C_{1} {(f, g, c)}_{w} = \frac{L}{{\bar{f}}^{X}} f - \frac{L}{{\bar{g}}^{X}} c g d$ .
2.
Overlap. Assume that a non-empty beginning of ${\bar{g}}^{Y}$ is a non-empty ending of ${\bar{f}}^{Y}$ , say, ${\bar{f}}^{Y} = c c_{0}$ , ${\bar{g}}^{Y} = c_{0} d$ , and ${\bar{f}}^{Y} d = c {\bar{g}}^{Y}$ for some $c, d, c_{0} \in Y^{*}$ and $c_{0} \neq 1$ . Put $w = L {\bar{f}}^{Y} d = L c {\bar{g}}^{Y}$ . Define the composition $C_{2} {(f, g, c_{0})}_{w} = \frac{L}{{\bar{f}}^{X}} f d - \frac{L}{{\bar{g}}^{X}} c g$ .
3.
External. Take a (possibly empty) associative word $c_{0} \in Y^{*}$ . In the case that the greatest common divisor of ${\bar{f}}^{X}$ and ${\bar{g}}^{X}$ is non-empty and both ${\bar{f}}^{Y}$ and ${\bar{g}}^{Y}$ are non-empty, put $w = L {\bar{f}}^{Y} c_{0} {\bar{g}}^{Y}$ and define the composition $C_{3} {(f, g, c_{0})}_{w} = \frac{L}{{\bar{f}}^{X}} f c_{0} {\bar{g}}^{Y} - \frac{L}{{\bar{g}}^{X}} {\bar{f}}^{Y} c_{0} g$ .

A monic subset $S$ of $k [X] \otimes k ⟨ Y ⟩$ is called a GS basis whenever all compositions of elements of $S$ , say ${(f, g)}_{w}$ , are trivial modulo $(S, w)$ :

\begin{matrix} {(f, g)}_{w} = \sum_{i} α_{i} a_{i} s_{i} b_{i}, \end{matrix}

where $a_{i}, b_{i} \in N$ , $s_{i} \in S$ , $α_{i} \in k$ , and $a_{i} \bar{s_{i}} b_{i} < w$ for all $i$ .

Theorem 15

(Mikhalev and Zolotykh [170, 228], the CD-lemma for associative algebras over commutative algebras) Consider a monic subset $S \subseteq k [X] \otimes k ⟨ Y ⟩$ and a monomial ordering $<$ on $N$ . The following statements are equivalent:

(i)
The set $S$ is a Gröbner–Shirshov basis in $k [X] \otimes k ⟨ Y ⟩$ .
(ii)
For every element $f \in I d (S)$ , the monomial $\bar{f}$ contains $\bar{s}$ as its subword for some $s \in S$ .
(iii)
The set $I r r (S) = {w \in N | w \neq a \bar{s} b, a, b \in N, s \in S}$ is a linear basis for the quotient $k [X] \otimes k ⟨ Y ⟩$ .

Outline of the proof. For

\begin{matrix} w = lcm (u, v) = lcm (u^{X}, v^{X}) lcm (u^{Y}, v^{Y}) \end{matrix}

the general composition is

\begin{matrix} {(s_{1}, s_{2})}_{w} = (lcm (u^{X}, v^{X}) / u^{X}) w {|_{u \mapsto s_{1}} - (lcm (u^{X}, v^{X}) / v^{X}) w |}_{v \mapsto s_{2}}, \end{matrix}

where $s_{1}, s_{2} \in k [X] ⟨ Y ⟩$ are $k$ -monic with $u = {\bar{s}}_{1}$ and $v = {\bar{s}}_{2}$ . Moreover, ${(s_{1}, s_{2})}_{w} \equiv 0 mod ({s_{1}, s_{2}}, w)$ whenever $w = u^{X} v^{X} u^{Y} c^{Y} v^{Y}$ with $c^{Y} \in Y^{*}$ , that is, $w$ is a trivial least common multiple relative to both $X$ -words and $Y$ -words. This implies the analog of Lemma 1 and the claim (i) $\Rightarrow$ (ii) in Theorem 15.

We apply this lemma in Sect. 4.3.

2.6 PBW-theorem for Lie algebras

Consider a Lie algebra $(L, [])$ over a field $k$ with a well-ordered linear basis $X = {x_{i} | i \in I}$ and multiplication table $S = {[x_{i} x_{j}] = [| x_{i} x_{j} |] | i > j, i, j \in I}$ , where for every $i, j \in I$ we write $[| x_{i} x_{j} |] = Σ_{t} α_{i j}^{t} x_{t}$ with $α_{i j}^{t} \in k$ . Then $U (L) = k ⟨ X | S^{(-)} ⟩$ is called the universal enveloping associative algebra of $L$ , where $S^{(-)} = {x_{i} x_{j} - x_{j} x_{i} = [| x_{i} x_{j} |] | i > j, i, j \in I}$ .

Theorem 16

(PBW Theorem) In the above notation and with the deg-lex ordering on $X^{*}$ , the set $S^{(-)}$ is a Gröbner–Shirshov basis of $k ⟨ X ⟩$ . Then by the CD-lemma for associative algebras, the set $I r r (S^{(-)})$ consists of the elements

\begin{matrix} x_{i_{1}} \dots x_{i_{n}} with i_{1} \leq \dots \leq i_{n}, i_{1}, \dots, i_{n} \in I, n \geq 0, \end{matrix}

and constitutes a linear basis of $U (L)$ .

Theorem 17

(The PBW Theorem in Shirshov’s form) Consider $L = L i e (X | S)$ with $S \subset L i e (X) \subset k ⟨ X ⟩$ and $U (L) = k ⟨ X | S^{(-)} ⟩$ . The following statements are equivalent.

(i)
For the deg-lex ordering, $S$ is a GS basis of $L i e (X)$ .
(ii)
For the deg-lex ordering, $S^{(-)}$ is a GS basis of $k ⟨ X ⟩$ .
(iii)
A linear basis of $U (L)$ consists of the words $u = u_{1} \dots u_{n}$ , where $u_{1} ⪯ \dots ⪯ u_{n}$ in the lex ordering, $n \geq 0$ , and every $u_{i}$ is an $S^{(-)}$ -irreducible associative Lyndon–Shirshov word in $X$ .
(iv)
A linear basis of $L$ is the set of all $S$ -irreducible Lyndon–Shirshov Lie monomials $[u]$ in $X$ .
(v)
A linear basis of $U (L)$ consists of the polynomials $u = [u_{1}] \dots [u_{n}]$ , where $u_{1} ⪯ \dots ⪯ u_{n}$ in the lex ordering, $n \geq 0$ , and every $[u_{i}]$ is an $S$ -irreducible non-associative Lyndon–Shirshov word in $X$ .

The PBW theorem, Theorem 33, the CD-lemmas for associative and Lie algebras, Shirshov’s factorization theorem, and property (VIII) of Sect. 4.2 imply that every LS-subword of $u$ is a subword of some $u_{i}$ .

Makar–Limanov gave [158] an interesting form of the PBW theorem for a finite dimensional Lie algebra.

2.7 Drinfeld–Jimbo algebra $U_{q} (A)$ , Kac–Moody enveloping algebra $U (A)$ , and the PBW basis of $U_{q} (A_{N})$

Take an integral symmetrizable $N \times N$ Cartan matrix $A = (a_{i j})$ . Hence, $a_{i i} = 2$ , $a_{i j} \leq 0$ for $i \neq j$ , and there exists a diagonal matrix $D$ with diagonal entries $d_{i}$ , which are nonzero integers, such that the product $D A$ is symmetric. Fix a nonzero element $q$ of $k$ with $q^{4 d_{i}} \neq 1$ for all $i$ . Then the Drinfeld–Jimbo quantum enveloping algebra is

\begin{matrix} U_{q} (A) = k ⟨ X \cup H \cup Y | S^{+} \cup K \cup T \cup S^{-} ⟩, \end{matrix}

where

\begin{matrix} X & = {x_{i}}, H = {h_{i}^{\pm 1}}, Y = {y_{i}}, \\ S^{+} & = \{\sum_{ν = 0}^{1 - a_{i j}} {(- 1)}^{ν} {(\begin{matrix} 1 - a_{i j} \\ ν \end{matrix})}_{t} x_{i}^{1 - a_{i j} - ν} x_{j} x_{i}^{ν}, where i \neq j, t = q^{2 d_{i}}\}, \\ S^{-} & = \{\sum_{ν = 0}^{1 - a_{i j}} {(- 1)}^{ν} {(\begin{matrix} 1 - a_{i j} \\ ν \end{matrix})}_{t} y_{i}^{1 - a_{i j} - ν} y_{j} y_{i}^{ν}, where i \neq j, t = q^{2 d_{i}}\}, \\ K & = {h_{i} h_{j} - h_{j} h_{i}, h_{i} h_{i}^{- 1} - 1, h_{i}^{- 1} h_{i} - 1, x_{j} h_{i}^{\pm 1} - q^{\mp 1} d_{i} a_{i j} h^{\pm 1} x_{j}, \\ h_{i}^{\pm 1} y_{j} - q^{\mp 1} y_{j} h^{\pm 1}}, \\ T & = \{x_{i} y_{j} - y_{j} x_{i} - δ_{i j} \frac{h_{i}^{2} - h_{i}^{- 2}}{q^{2 d_{i}} - q^{- 2 d_{i}}}\}, \end{matrix}

and

\begin{matrix} {(\begin{matrix} m \\ n \end{matrix})}_{t} = \{\begin{matrix} \prod_{i = 1}^{n} \frac{t^{m - i + 1} - t^{i - m - 1}}{t^{i} - t^{- i}} & (for m > n > 0), \\ 1 & (for n = 0 or m = n) . \end{matrix} \end{matrix}

Theorem 18

([55]) For every symmetrizable Cartan matrix $A$ , with the deg-lex ordering on ${X \cup H \cup Y}^{*}$ , the set $S^{+ c} \cup T \cup K \cup S^{- c}$ is a Gröbner–Shirshov basis of the Drinfeld–Jimbo algebra $U_{q} (A)$ , where $S^{+ c}$ and $S^{- c}$ are the Shirshov completions of $S^{+}$ and $S^{-}$ .

Corollary 5

(Rosso [195], Yamane [220]) For every symmetrizable Cartan matrix $A$ we have the triangular decomposition

\begin{matrix} U_{q} (A) = U_{q}^{+} (A) \otimes k [H] \otimes U_{q}^{-} (A) \end{matrix}

with $U_{q}^{+} (A) = k ⟨ X | S^{+} ⟩$ and $U_{q}^{-} (A) = k ⟨ Y | S^{-} ⟩$ .

Similar results are valid for the Kac–Moody Lie algebras $g (A)$ and their universal enveloping algebras

\begin{matrix} U (A) = k ⟨ X \cup H \cup Y | S^{+} \cup H \cup K \cup S^{-} ⟩, \end{matrix}

where $S^{+}, S^{-}$ are the same as for $U_{q} (A)$ ,

\begin{matrix} K = {h_{i} h_{j} - h_{j} h_{i}, x_{j} h_{i} - h_{i} x_{j} + d_{i} a_{i j} x_{i}, h_{i} y_{i} - y_{i} h_{i} + d_{i} a_{i j} y_{j}}, \end{matrix}

and $T = {x_{i} y_{j} - y_{j} x_{i} - δ_{i j} h_{i}}$ .

Theorem 19

([55]) For every symmetrizable Cartan matrix $A$ , the set $S^{+ c} \cup T \cup K \cup S^{- c}$ is a Gröbner–Shirshov basis of the universal enveloping algebra $U (A)$ of the Kac–Moody Lie algebra $g (A)$ .

The PBW theorem in Shirshov’s form implies

Corollary 6

(Kac [117]) For every symmetrizable Cartan matrix $A$ , we have the triangular decomposition

\begin{matrix} U (A) = U^{+} (A) \otimes k [H] \otimes U^{-} (A), g (A) = g^{+} (A) \oplus k [H] \oplus g^{-} (A) . \end{matrix}

Poroshenko [179, 180] found GS bases for the Kac–Moody algebras of types $\tilde{A_{n}}$ , $\tilde{B_{n}}$ , $\tilde{C_{n}}$ , and $\tilde{D_{n}}$ . He used the available linear bases of the algebras [117].

Consider now

\begin{matrix} A = A_{N} = (\begin{matrix} 2 & - 1 & 0 & \dots & 0 \\ - 1 & 2 & - 1 & \dots & 0 \\ 0 & - 1 & 2 & \dots & 0 \\ \cdot & \cdot & \cdot & \cdot & \cdot \\ 0 & 0 & 0 & \dots & 2 \end{matrix}) \end{matrix}

and assume that $q^{8} \neq 1$ . Introduce new variables, defined by Jimbo (see [220]), which generate $U_{q} (A_{N})$ :

\begin{matrix} \tilde{X} = {x_{i j}, 1 \leq i < j \leq N + 1}, \end{matrix}

where

\begin{matrix} x_{i j} = \{\begin{matrix} x_{i} & j = i + 1, \\ q x_{i, j - 1} x_{j - 1, j} - q^{- 1} x_{j - 1, j} x_{i, j - 1} & j > i + 1 . \end{matrix} \end{matrix}

Order the set $\tilde{X}$ as follows: $x_{m n} > x_{i j} ⟺ (m, n) >_{l e x} (i, j) .$ Recall from Yamane [220] the notation

\begin{matrix} C_{1} & = {((i, j), (m, n)) | i = m < j < n}, C_{2} = {((i, j), (m, n)) | i < m < n < j}, \\ C_{3} & = {((i, j), (m, n)) | i < m < j = n}, C_{4} = {((i, j), (m, n)) | i < m < j < n}, \\ C_{5} & = {((i, j), (m, n)) | i < j = m < n}, C_{6} = {((i, j), (m, n)) | i < j < m < n} . \end{matrix}

Consider the set ${\tilde{S}}^{+}$ consisting of Jimbo’s relations:

\begin{matrix} x_{m n} x_{i j} & - q^{- 2} x_{i j} x_{m n} ((i, j), (m, n)) \in C_{1} \cup C_{3}, \\ x_{m n} x_{i j} & - x_{i j} x_{m n} ((i, j), (m, n)) \in C_{2} \cup C_{6}, \\ x_{m n} x_{i j} & - x_{i j} x_{m n} + (q^{2} - q^{- 2}) x_{i n} x_{m j} ((i, j), (m, n)) \in C_{4}, \\ x_{m n} x_{i j} & - q^{2} x_{i j} x_{m n} + q x_{i n} ((i, j), (m, n)) \in C_{5} . \end{matrix}

It is easy to see that $U_{q}^{+} (A_{N}) = k ⟨ \tilde{X} | \tilde{S^{+}} ⟩$ .

A direct proof [86] shows that ${\tilde{S}}^{+}$ is a GS basis for $k ⟨ \tilde{X} | \tilde{S^{+}} ⟩ = U_{q}^{+} (A_{N})$ [55]. The proof is different from the argument of Bokut and Malcolmson [55]. This yields

Theorem 20

([55]) In the above notation and with the deg-lex ordering on ${\tilde{X} \cup H \cup \tilde{Y}}^{*}$ , the set ${\tilde{S}}^{+} \cup T \cup K \cup {\tilde{S}}^{-}$ is a Gröbner–Shirshov basis of

\begin{matrix} U_{q} (A_{N}) = k ⟨ \tilde{X} \cup H \cup \tilde{Y} | {\tilde{S}}^{+} \cup T \cup K \cup {\tilde{S}}^{-} ⟩ . \end{matrix}

Corollary 7

([195, 220]) For $q^{8} \neq 1$ , a linear basis of $U_{q} (A_{n})$ consists of

\begin{matrix} y_{m_{1} n_{1}} \dots y_{m_{l} n_{l}} h_{1}^{s_{1}} \dots h_{N}^{s_{N}} x_{i_{1} j_{1}} \dots x_{i_{k} j_{k}} \end{matrix}

with $(m_{1}, n_{1}) \leq \dots \leq (m_{l}, n_{l})$ , $(i_{1}, j_{1}) \leq \dots \leq (i_{k}, j_{k})$ , $k, l \geq 0$ and $s_{t} \in Z$ .

3 Gröbner–Shirshov bases for groups and semigroups

In this section we apply the method of GS bases for braid groups in different sets of generators, Chinese monoids, free inverse semigroups, and plactic monoids in two sets of generators (row words and column words).

Given a set $X$ consider $S \subseteq X^{*} \times X^{*}$ the congruence $ρ (S)$ on $X^{*}$ generated by $S$ , the quotient semigroup

\begin{matrix} A = sgp ⟨ X | S ⟩ = X^{*} / ρ (S), \end{matrix}

and the semigroup algebra $k (X^{*} / ρ (S))$ . Identifying the set ${u = v | (u, v) \in S}$ with $S$ , it is easy to see that

\begin{matrix} σ : k ⟨ X | S ⟩ \to k (X^{*} / ρ (S)), \sum α_{i} u_{i} + I d (S) \mapsto \sum α_{i} \bar{u_{i}} \end{matrix}

is an algebra isomorphism.

The Shirshov completion $S^{c}$ of $S$ consists of semigroup relations, $S^{c} = {u_{i} - v_{i}, i \in I}$ . Then $I r r (S^{c})$ is a linear basis of $k ⟨ X | S ⟩$ , and so $σ (I r r (S^{c}))$ is a linear basis of $k (X^{*} / ρ (S))$ . This shows that $I r r (S^{c})$ consists precisely of the normal forms of the elements of the semigroup $sgp ⟨ X | S ⟩$ .

Therefore, in order to find the normal forms of the semigroup $sgp ⟨ X | S ⟩$ , it suffices to find a GS basis $S^{c}$ in $k ⟨ X | S ⟩$ . In particular, consider a group $G = g p ⟨ X | S ⟩$ , where $S = {(u_{i}, v_{i}) \in F (X) \times F (X) | i \in I}$ and $F (X)$ is the free group on a set $X$ . Then $G$ has a presentation

\begin{matrix} G = sgp ⟨ X \cup X^{- 1} | S, x^{ε} x^{- ε} = 1, ε = \pm 1, x \in X ⟩, X \cap X^{- 1} = \emptyset \end{matrix}

as a semigroup.

3.1 Gröbner–Shirshov bases for braid groups

Consider the Artin braid group $B_{n}$ of type $A_{n - 1}$ (Artin [5]). We have

\begin{matrix} B_{n} = g p ⟨ σ_{1}, \dots, σ_{n} | σ_{j} σ_{i} = σ_{i} σ_{j} (j - 1 > i), σ_{i + 1} σ_{i} σ_{i + 1} = σ_{i} σ_{i + 1} σ_{i}, 1 \leq i \leq n - 1 ⟩ . \end{matrix}

3.1.1 Braid groups in the Artin–Burau generators

Assume that $X = Y \dot{\cup} Z$ with $Y^{*}$ and $Z$ well-ordered and that the ordering on $Y^{*}$ is monomial. Then every word in $X$ has the form $u = u_{0} z_{1} u_{1} \dots z_{k} u_{k}$ , where $k \geq 0$ , $u_{i} \in Y^{*}$ , and $z_{i} \in Z$ . Define the inverse weight of the word $u \in X^{*}$ as

\begin{matrix} inwt (u) = (k, u_{k}, z_{k}, \dots, u_{1}, z_{1}, u_{0}) \end{matrix}

and the inverse weight lexicographic ordering as

\begin{matrix} u > v \Leftrightarrow inwt (u) > inwt (v) . \end{matrix}

Call this ordering the inverse tower ordering for short. Clearly, it is a monomial ordering on $X^{*}$ .

When $X = Y \dot{\cup} Z$ , $Y = T \dot{\cup} U$ , and $Y^{*}$ is endowed with the inverse tower ordering, define the inverse tower ordering on $X^{*}$ with respect to the presentation $X = (T \dot{\cup} U) \dot{\cup} Z .$ In general, for

\begin{matrix} X = (\dots (X^{(n)} \dot{\cup} X^{(n - 1)}) \dot{\cup} \dots) \dot{\cup} X^{(0)} \end{matrix}

with $X^{(n)}$ -words equipped with a monomial ordering we can define the inverse tower ordering of $X$ -words.

Introduce a new set of generators for the braid group $B_{n}$ , called the Artin–Burau generators. Put

\begin{matrix} s_{i, i + 1} = σ_{i}^{2}, s_{i, j + 1} = σ_{j} \dots σ_{i + 1} σ_{i}^{2} σ_{i + 1}^{- 1} \dots σ_{j}^{- 1}, 1 \leq i < j \leq n - 1; \\ σ_{i, j + 1} = σ_{i}^{- 1} \dots σ_{j}^{- 1}, 1 \leq i \leq j \leq n - 1; σ_{i i} = 1, {a, b} = b^{- 1} a b . \end{matrix}

Form the sets

\begin{matrix} S_{j} = {s_{i, j}, s_{i, j}^{- 1}, 1 \leq i, j < n} and Σ^{- 1} = {σ_{1}^{- 1}, \dots σ_{n - 1}^{- 1}} . \end{matrix}

Then the set

\begin{matrix} S = S_{n} \cup S_{n - 1} \cup \dots \cup S_{2} \cup Σ^{- 1} \end{matrix}

generates $B_{n}$ as a semigroup.

Order now the alphabet $S$ as

\begin{matrix} S_{n} < S_{n - 1} < \dots < S_{2} < Σ^{- 1}, \end{matrix}

and

\begin{matrix} s_{1, j}^{- 1} < s_{1, j} < s_{2, j}^{- 1} < \dots < s_{j - 1, j}, σ_{1}^{- 1} < σ_{2}^{- 1} < \dots σ_{n - 1}^{- 1} . \end{matrix}

Order $S_{n}$ -words by the deg-inlex ordering; that is, first compare words by length and then by the inverse lexicographic ordering starting from their last letters. Then we use the inverse tower ordering of $S$ -words.

Lemma 4

(Artin [6], Markov [160]) The following Artin–Markov relations hold in the braid group $B_{n}$ :

\begin{matrix} σ_{k}^{- 1} s_{i, j}^{δ} = s_{i, j}^{δ} σ_{k}^{- 1} for k \neq i - 1, i, j - 1, j, \end{matrix}

(1)

\begin{matrix} σ_{i}^{- 1} s_{i, i + 1}^{δ} = s_{i, i + 1}^{δ} σ_{1}^{- 1}, \end{matrix}

(2)

\begin{matrix} σ_{i - 1}^{- 1} s_{i, j}^{δ} = s_{i - 1, j}^{δ} σ_{i - 1}^{- 1}, \end{matrix}

(3)

\begin{matrix} σ_{i}^{- 1} s_{i, j}^{δ} = {s_{i + 1, j}^{δ}, s_{i, i + 1}} σ_{i}^{- 1}, \end{matrix}

(4)

\begin{matrix} σ_{j - 1}^{- 1} s_{i, j}^{δ} = s_{i, j - 1}^{δ} σ_{j - 1}^{- 1}, \end{matrix}

(5)

\begin{matrix} σ_{j}^{- 1} s_{i, j}^{δ} = {s_{i, j + 1}^{δ}, s_{j, j + 1}} σ_{j}^{- 1}, \end{matrix}

(6)

where $δ = \pm 1$ ;

\begin{matrix} s_{j, k}^{- 1} s_{k, l}^{ε} = {s_{k, l}^{ε}, s_{j, l}^{- 1}} s_{j, k}^{- 1}, \end{matrix}

(7)

\begin{matrix} s_{j, k} s_{k, l}^{ε} = {s_{k, l}^{ε}, s_{j, l} s_{k, l}} s_{j, k}, \end{matrix}

(8)

\begin{matrix} s_{j, k}^{- 1} s_{j, l}^{ε} = {s_{j, l}^{ε}, s_{k, l}^{- 1} s_{j, l}^{- 1}} s_{j, k}^{- 1}, \end{matrix}

(9)

\begin{matrix} s_{j, k} s_{j, l}^{ε} = {s_{j, l}^{ε}, s_{k, l}} s_{j, k}, \end{matrix}

(10)

\begin{matrix} s_{i, k}^{- 1} s_{j, l}^{ε} = {s_{j, l}^{ε}, s_{k, l} s_{i, l} s_{k, l}^{- 1} s_{i, l}^{- 1}} s_{i, k}^{- 1}, \end{matrix}

(11)

\begin{matrix} s_{i, k} s_{j, l}^{ε} = {s_{j, l}^{ε}, s_{i, l}^{- 1} s_{k, l}^{- 1} s_{i, l} s_{k, l}} s_{i, k}, \end{matrix}

(12)

where $i < j < k < l$ and $ε = \pm 1$ ;

\begin{matrix} s_{i, k}^{δ} s_{j, l}^{ε} = s_{j, l}^{ε} s_{i, k}^{δ}, \end{matrix}

(13)

\begin{matrix} σ_{j}^{- 1} σ_{k}^{- 1} = σ_{k}^{- 1} σ_{j}^{- 1} for j < k - 1 \end{matrix}

(14)

\begin{matrix} σ_{j, j + 1} σ_{k, j + 1} = σ_{k, j + 1} σ_{j - 1, j} for j < k, \end{matrix}

(15)

\begin{matrix} σ_{i}^{- 2} = s_{i, i + l}^{- 1}, \end{matrix}

(16)

\begin{matrix} s_{i, j}^{\pm 1} s_{i, j}^{\mp 1} = 1, \end{matrix}

(17)

where $j < i < k < l$ or $i < k < j < l$ , and $ε$ , $δ = \pm 1$ .

Theorem 21

([25]) The Artin–Markov relations (1)–(13) form a Gröbner–Shirshov basis of the braid group $B_{n}$ in terms of the Artin–Burau generators with respect to the inverse tower ordering of words.

It is claimed in [25] that some compositions are trivial. Processing all compositions explicitly, [82] supported the claim.

Corollary 8

(Markov–Ivanovskii [6]) The following words are normal forms of the braid group $B_{n}$ :

\begin{matrix} f_{n} f_{n - 1} \dots f_{2} σ_{i_{n} n} σ_{i_{n - 1} n - 1} \dots σ_{i_{2} 2}, \end{matrix}

where all $f_{j}$ for $2 \leq j \leq n$ are free irreducible words in ${s_{i j}, i < j}$ .

3.1.2 Braid groups in the Artin–Garside generators

The Artin–Garside generators of the braid group $B_{n + 1}$ are $σ_{i}, 1 \leq i \leq n, △, △^{- 1}$ (Garside [103] 1969), where $△ = Λ_{1} \dots Λ_{n}$ with $Λ_{i} = σ_{1} \dots σ_{i}$ .

Putting $△^{- 1} < △ < σ_{1} < \dots < σ_{n}$ , order ${△^{- 1}, △, σ_{1}, \dots, σ_{n}}^{*}$ by the deg-lex ordering.

Denote by $V (j, i)$ , $W (j, i), \dots$ for $j \leq i$ positive words in the letters $σ_{j}, σ_{j + 1}, \dots, σ_{i}$ , assuming that $V (i + 1, i) = 1$ , $W (i + 1, i) = 1, \dots$ .

Given $V = V (1, i)$ , for $1 \leq k \leq n - i$ denote by $V^{(k)}$ the result of shifting the indices of all letters in $V$ by $k$ : $σ_{1} \mapsto σ_{k + 1}, \dots, σ_{i} \mapsto σ_{k + i}$ , and put $V^{'} = V^{(1)}$ . Define $σ_{i j} = σ_{i} σ_{i - 1} \dots σ_{j}$ for $j \leq i - 1$ , while $σ_{i i} = σ_{i}$ and $σ_{i i + 1} = 1$ .

Theorem 22

([23, 47]) A Gröbner–Shirshov basis $S$ of $B_{n + 1}$ in the Artin–Garside generators consists of the following relations:

\begin{matrix} σ_{i + 1} σ_{i} V (1, i - 1) W (j, i) σ_{i + 1 j} = σ_{i} σ_{i + 1} σ_{i} V (1, i - 1) σ_{i j} W {(j, i)}^{'}, \\ σ_{s} σ_{k} = σ_{k} σ_{s} for s - k \geq 2, \\ σ_{1} V_{1} σ_{2} σ_{1} V_{2} \dots V_{n - 1} σ_{n} \dots σ_{1} = △ V_{1}^{(n - 1)} V_{2}^{(n - 2)} \dots V_{(n - 1)}^{'}, \\ σ_{l} △ = △ σ_{n - l + 1} for 1 \leq l \leq n, \\ σ_{l} △^{- 1} = △^{- 1} σ_{n - l + 1} for 1 \leq l \leq n, \\ △ △^{- 1} = 1, △^{- 1} △ = 1, \end{matrix}

where $1 \leq i \leq n - 1$ and $1 \leq j \leq i + 1$ ; moreover, $W (j, i)$ begins with $σ_{i}$ unless it is empty, and $V_{i} = V_{i} (1, i)$ .

There are corollaries.

Corollary 9

The $S$ -irreducible normal form of each word of $B_{n + 1}$ coincides with its Garside normal form [103].

Corollary 10

(Garside [103]) The semigroup $B_{n + 1}^{+}$ of positive braids can be embedded into a group.

3.1.3 Braid groups in the Birman–Ko–Lee generators

Recall that the Birman–Ko–Lee generators $σ_{t s}$ of the braid group $B_{n}$ are

\begin{matrix} σ_{t s} = (σ_{t - 1} σ_{t - 2} \dots σ_{s + 1}) σ_{s} (σ_{s + 1}^{- 1} \dots σ_{t - 2}^{- 1} σ_{t - 1}^{- 1}) \end{matrix}

and we have the presentation

\begin{matrix} B_{n} & = g p ⟨ σ_{t s}, n \geq t > s \geq 1 | σ_{t s} σ_{r q} = σ_{r q} σ_{t s}, (t - r) (t - q) (s - r) (s - q) > 0, \\ σ_{t s} σ_{s r} = σ_{t r} σ_{t s} = σ_{s r} σ_{t r}, n \geq t > s > r \geq 1 ⟩ . \end{matrix}

Denote by $δ$ the Garside word, $δ = σ_{n n - 1} σ_{n - 1 n - 2} \dots σ_{21}$ .

Define the order as $δ^{- 1} < δ < σ_{t s} < σ_{r q}$ iff $(t, s) < (r, q)$ lexicographically. Use the deg-lex ordering on ${δ^{- 1}, δ, σ_{t s}, n \geq t > s \geq 1}^{*}$ .

Instead of $σ_{i j}$ , we write simply $(i, j)$ or $(j, i)$ . We also set

\begin{matrix} (t_{m}, t_{m - 1}, \dots, t_{1}) = (t_{m}, t_{m - 1}) (t_{m - 1}, t_{m - 2}) \dots (t_{2}, t_{1}), \end{matrix}

where $t_{j} \neq t_{j + 1}, 1 \leq j \leq m - 1$ . In this notation, we can write the defining relations of $B_{n}$ as

\begin{matrix} (t_{3}, t_{2}, t_{1}) = (t_{2}, t_{1}, t_{3}) = (t_{1}, t_{3}, t_{2}) for t_{3} > t_{2} > t_{1}, \\ (k, l) (i, j) = (i, j) (k, l) for k > l, i > j, k > i, \end{matrix}

where either $k > i > j > l$ or $k > l > i > j$ .

Denote by $V_{[t_{2}, t_{1}]}$ , where $n \geq t_{2} > t_{1} \geq 1$ , a positive word in $(k, l)$ satisfying $t_{2} \geq k > l \geq t_{1}$ . We can use any capital Latin letter with indices instead of $V$ , and appropriate indices (for instance, $t_{3}$ and $t_{0}$ with $t_{3} > t_{0}$ ) instead of $t_{2}$ and $t_{1}$ . Use also the following equalities in $B_{n}$ :

\begin{matrix} V_{[t_{2} - 1, t_{1}]} (t_{2}, t_{1}) = (t_{2}, t_{1}) V_{[t_{2} - 1, t_{1}]}^{'} \end{matrix}

for $t_{2} > t_{1}$ , where $V_{[t_{2} - 1, t_{1}]}^{'} = (V_{[t_{2} - 1, t_{1}]}) |_{(k, l) \mapsto (k, l), if l \neq t_{1}; (k, t_{1}) \mapsto (t_{2}, k)};$

\begin{matrix} W_{[t_{2} - 1, t_{1}]} (t_{1}, t_{0}) = (t_{1}, t_{0}) W_{[t_{2} - 1, t_{1}]}^{⋆} \end{matrix}

for $t_{2} > t_{1} > t_{0}$ , where $W_{[t_{2} - 1, t_{1}]}^{⋆} = (W_{[t_{2} - 1, t_{1}]}) |_{(k, l) \mapsto (k, l), if l \neq t_{1}; (k, t_{1}) \mapsto (k, t_{0})} .$

Theorem 23

([24]) A Gröbner–Shirshov basis of the braid group $B_{n}$ in the Birman–Ko–Lee generators consists of the following relations:

\begin{matrix} (k, l) (i, j) = (i, j) (k, l) for k > l > i > j, \\ (k, l) V_{[j - 1, 1]} (i, j) = (i, j) (k, l) V_{[j - 1, 1]} for k > i > j > l, \\ (t_{3}, t_{2}) (t_{2}, t_{1}) = (t_{2}, t_{1}) (t_{3}, t_{1}), \\ (t_{3}, t_{1}) V_{[t_{2} - 1, 1]} (t_{3}, t_{2}) = (t_{2}, t_{1}) (t_{3}, t_{1}) V_{[t_{2} - 1, 1]}, \\ (t, s) V_{[t_{2} - 1, 1]} (t_{2}, t_{1}) W_{[t_{3} - 1, t_{1}]} (t_{3}, t_{1}) = (t_{3}, t_{2}) (t, s) V_{[t_{2} - 1, 1]} (t_{2}, t_{1}) W_{[t_{3} - 1, t_{1}]}^{'}, \\ (t_{3}, s) V_{[t_{2} - 1, 1]} (t_{2}, t_{1}) W_{[t_{3} - 1, t_{1}]} (t_{3}, t_{1}) = (t_{2}, s) (t_{3}, s) V_{[t_{2} - 1, 1]} (t_{2}, t_{1}) W_{[t_{3} - 1, t_{1}]}^{'}, \\ (2, 1) V_{2 [2, 1]} (3, 1) \dots V_{n - 1 [n - 1, 1]} (n, 1) = δ V_{2 [2, 1]}^{'} \dots V_{n - 1 [n - 1, 1]}^{'}, \\ (t, s) δ = δ (t + 1, s + 1), (t, s) δ^{- 1} = δ^{- 1} (t - 1, s - 1) with t \pm 1, s \pm 1 (mod n), \\ δ δ^{- 1} = 1, δ^{- 1} δ = 1, \end{matrix}

where $V_{[k, l]}$ means, as above, a word in $(i, j)$ satisfying $k \geq i > j \geq l$ , $t > t_{3}$ , and $t_{2} > s$ .

There are two corollaries.

Corollary 11

(Birman et al. [13]) The semigroup $B_{n}^{+}$ of positive braids in the Birman–Ko–Lee generators embeds into a group.

Corollary 12

(Birman et al. [13]) The $S$ -irreducible normal form of a word in $B_{n}$ in the Birman–Ko–Lee generators coincides with the Birman–Ko–Lee–Garside normal form $δ^{k} A$ , where $A \in B_{n}^{+}$ .

3.1.4 Braid groups in the Adjan–Thurston generators

The symmetric group $S_{n + 1}$ has the presentation

\begin{matrix} S_{n + 1} = g p ⟨ s_{1}, \dots, s_{n} | s_{i}^{2} = 1, s_{j} s_{i} = s_{i} s_{j} (j - 1 > i), s_{i + 1} s_{i} s_{i + 1} = s_{i} s_{i + 1} s_{i} ⟩ . \end{matrix}

Bokut and Shiao [58] found the normal form for $S_{n + 1}$ in the following statement: the set $N = {s_{1 i_{1}} s_{2 i_{2}} \dots s_{n i_{n}} | i_{j} \leq j + 1}$ is a Gröbner–Shirshov normal form for $S_{n + 1}$ in the generators $s_{i} = (i, i + 1)$ relative to the deg-lex ordering, where $s_{j i} = s_{j} s_{j - 1} \dots s_{i}$ for $j \geq i$ and $s_{j j + 1} = 1$ .

Take $α \in S_{n + 1}$ with the normal form $\bar{α} = s_{1 i_{1}} s_{2 i_{2}} \dots s_{n i_{n}} \in N$ . Define the length of $α$ as $| \bar{α} | = l (s_{1 i_{1}} s_{2 i_{2}} \dots s_{n i_{n}})$ and write $α ⊥ β$ whenever $| \bar{α β} | = | \bar{α} | + | \bar{β} |$ . Moreover, every $\bar{α} \in N$ has a unique expression $\bar{α} = s_{_{l_{1} i_{l_{1}}}} s_{_{l_{2} i_{l_{2}}}} \dots s_{_{l_{t} i_{l_{t}}}}$ with all $s_{_{l_{j} i_{l_{j}}}} \neq 1$ . The number $t$ is called the breadth of $α$ .

Now put

\begin{matrix} B_{n + 1}^{'} = g p ⟨ r (\bar{α}), α \in S_{n + 1} \ {1} | r (\bar{α}) r (\bar{β}) = r (\bar{α β}), α ⊥ β ⟩, \end{matrix}

where $r (\bar{α})$ stands for a letter with index $\bar{α}$ .

Then for the braid group with $n$ generators we have $B_{n + 1} ≅ B_{n + 1}^{'}$ . Indeed, define

\begin{matrix} θ : B_{n + 1} \to B_{n + 1}^{'}, σ_{i} \mapsto r (s_{i}), \\ θ^{'} : B_{n + 1}^{'} \to B_{n + 1}, r (\bar{α}) \mapsto \bar{α} |_{s_{i} \mapsto σ_{i}} . \end{matrix}

These mappings are homomorphism satisfying $θ θ^{'} = l_{B_{n + 1}^{'}}$ and $θ^{'} θ = l_{B_{n + 1}}$ . Hence,

\begin{matrix} B_{n + 1} = g p ⟨ r (\bar{α}), α \in S_{n + 1} \ {1} | r (\bar{α}) r (\bar{β}) = r (\bar{α β}), α ⊥ β ⟩ . \end{matrix}

Put $X = {r (\bar{α}), α \in S_{n + 1} \ {1}}$ . These generators of $B_{n + 1}$ are called the Adjan–Thurston generators.

Then the positive braid semigroup generated by $X$ is

\begin{matrix} B_{n + 1}^{+} = sgp ⟨ X | r (\bar{α}) r (\bar{β}) = r (\bar{α β}), α ⊥ β ⟩ . \end{matrix}

Assume that $s_{1} < s_{2} < \dots < s_{n}$ . Define $r (\bar{α}) < r (\bar{β})$ if and only if $| \bar{α} | > | \bar{β} |$ or $| \bar{α} | = | \bar{β} |$ and $\bar{α} <_{l e x} \bar{β}$ . Clearly, this is a well-ordering on $X$ . We will use the deg-lex ordering on $X^{*}$ .

Theorem 24

([89]) The Gröbner–Shirshov basis of $B_{n + 1}^{+}$ in the Adjan–Thurston generator $X$ relative to the deg-lex ordering on $X^{*}$ consists of the relations

\begin{matrix} r (\bar{α}) r (\bar{β}) = r (\bar{α β}) for α ⊥ β; r (\bar{α}) r (\bar{β γ}) = r (\bar{α β}) r (\bar{γ}) for α ⊥ β ⊥ γ . \end{matrix}

Theorem 25

([89]) The Gröbner–Shirshov basis of $B_{n + 1}$ in the Adjan–Thurston generator $X$ with respect to the deg-lex ordering on $X^{*}$ consists of the relations

(1)
$r (\bar{α}) r (\bar{β}) = r (\bar{α β}) for α ⊥ β,$
(2)
$r (\bar{α}) r (\bar{β γ}) = r (\bar{α β}) r (\bar{γ}) for α ⊥ β ⊥ γ,$
(3)
$r (\bar{α}) Δ^{ε} = Δ^{ε} r ({\bar{α}}^{'}) for {\bar{α}}^{'} = \bar{α} |_{s_{i} \mapsto s_{n + 1 - i}},$
(4)
$r (\bar{α β}) r (\bar{γ μ}) = Δ r ({\bar{α}}^{'}) r (\bar{μ}) for α ⊥ β ⊥ γ ⊥ μ with r (\bar{β γ}) = Δ,$
(5)
$Δ^{ε} Δ^{- ε}$ =1.

Corollary 13

(Adjan–Thurston) The normal forms for $B_{n + 1}$ are $Δ^{k} r (\bar{α_{1}}) \dots r (\bar{α_{s}})$ for $k \in Z$ , where $r (\bar{α_{1}}) \dots r (\bar{α_{s}})$ is minimal in the deg-lex ordering.

3.2 Gröbner–Shirshov basis for the Chinese monoid

The Chinese monoid $C H (X, <)$ over a well-ordered set $(X, <)$ has the presentation $C H (X) = sgp ⟨ X | S ⟩$ , where $X = {x_{i} | i \in I}$ and $S$ consists of the relations

\begin{matrix} x_{i} x_{j} x_{k} = x_{i} x_{k} x_{j} = x_{j} x_{i} x_{k} for i > j > k, \\ x_{i} x_{j} x_{j} = x_{j} x_{i} x_{j}, x_{i} x_{i} x_{j} = x_{i} x_{j} x_{i} for i > j . \end{matrix}

Theorem 26

([85]) With the deg-lex ordering on $X^{*}$ , the following relations (1)–(5) constitute a Gröbner–Shirshov basis of the Chinese monoid $C H (X)$ :

(1)
$x_{i} x_{j} x_{k} - x_{j} x_{i} x_{k}$ ,
(2)
$x_{i} x_{k} x_{j} - x_{j} x_{i} x_{k}$ ,
(3)
$x_{i} x_{j} x_{j} - x_{j} x_{i} x_{j}$ ,
(4)
$x_{i} x_{i} x_{j} - x_{i} x_{j} x_{i}$ ,
(5)
$x_{i} x_{j} x_{i} x_{k} - x_{i} x_{k} x_{i} x_{j}$ ,

where $x_{i}, x_{j}, x_{k} \in X$ and $i > j > k$ .

Denote by $Λ$ the set consistsing of the words on $X$ of the form $u_{n} = w_{1} w_{2} \dots w_{n}$ with $n \geq 0$ , where

\begin{matrix} w_{1} & = x_{1}^{t_{11}} \\ w_{2} & = {(x_{2} x_{1})}^{t_{21}} x_{2}^{t_{22}} \\ w_{3} & = {(x_{3} x_{1})}^{t_{31}} {(x_{3} x_{2})}^{t_{32}} x_{3}^{t_{33}} \\ \dots \\ w_{n} & = {(x_{n} x_{1})}^{t_{n 1}} {(x_{n} x_{2})}^{t_{n 2}} \dots {(x_{n} x_{n - 1})}^{t_{n (n - 1)}} x_{n}^{t_{n n}} \end{matrix}

for $x_{i} \in X$ with $x_{1} < x_{2} < \dots < x_{n}$ , and all exponents are non-negative.

Corollary 14

([71]) This $Λ$ is a set of normal forms of elements of the Chinese monoid $C H (X)$ .

3.3 Gröbner–Shirshov basis for free inverse semigroup

Consider a semigroup $S$ . An element $s \in S$ is called an inverse of $t \in S$ whenever $s t s = s$ and $t s t = t$ . An inverse semigroup is a semigroup in which every element $t$ has a unique inverse, denoted by $t^{- 1}$ .

Given a set $X$ , put $X^{- 1} = {x^{- 1} | x \in X}$ . On assuming that $X \cap X^{- 1} = \emptyset$ , denote $X \cup X^{- 1}$ by $Y$ . Define the formal inverses of the elements of $Y^{*}$ as

\begin{matrix} 1^{- 1} = 1, {(x^{- 1})}^{- 1} = x (x \in X), \\ {(y_{1} y_{2} \dots y_{n})}^{- 1} = y_{n}^{- 1} \dots y_{2}^{- 1} y_{1}^{- 1} (y_{1}, y_{2}, \dots, y_{n} \in Y) . \end{matrix}

It is well known that

\begin{matrix} FI (X) = sgp ⟨ Y | a a^{- 1} a = a, a a^{- 1} b b^{- 1} = b b^{- 1} a a^{- 1}, a, b \in Y^{*} ⟩ \end{matrix}

is the free inverse semigroup (with identity) generated by $X$ .

Introduce the notions of a formal idempotent, a (prime) canonical idempotent, and an ordered (prime) canonical idempotent in $Y^{*}$ . Assume that $<$ is a well-ordering on $Y$ .

(i)
The empty word 1 is an idempotent.
(ii)
If $h$ is an idempotent and $x \in Y$ then $x^{- 1} h x$ is both an idempotent and a prime idempotent.
(iii)
If $e_{1}, e_{2}, \dots, e_{m}$ , where $m > 1$ , are prime idempotents then $e = e_{1} e_{2} \dots e_{m}$ is an idempotent.
(iv)
An idempotent $w \in Y^{*}$ is called canonical whenever $w$ avoids subwords of the form $x^{- 1} e x f x^{- 1}$ , where $x \in Y$ , both $e$ and $f$ are idempotents.
(v)
A canonical idempotent $w \in Y^{*}$ is called ordered if every subword $e = e_{1} e_{2} \dots e_{m}$ of $w$ with $m > 2$ and $e_{i}$ being idempotents satisfies $fir (e_{1}) < fir (e_{2}) < \dots < fir (e_{m})$ , where $fir (u)$ is the first letter of $u \in Y^{*}$ .

Theorem 27

([44]) Denote by $S$ the subset of $k ⟨ Y ⟩$ consisting two kinds of polynomials:

$e f - f e$ , where $e$ and $f$ are ordered prime canonical idempotents with $e f > f e$ ;
$x^{- 1} e^{'} x f^{'} x^{- 1} - f^{'} x^{- 1} e^{'}$ , where $x \in Y$ , $x^{- 1} e^{'} x$ , and $x f^{'} x^{- 1}$ are ordered prime canonical idempotents.

Then, with the deg-lex ordering on $Y^{*}$ , the set $S$ is a Gröber–Shirshov basis of the free inverse semigroup $sgp ⟨ Y | S ⟩$ .

Theorem 28

([44]) The normal forms of elements of the free inverse semigroup $sgp ⟨ Y | S ⟩$ are

\begin{matrix} u_{0} e_{1} u_{1} \dots e_{m} u_{m} \in Y^{*}, \end{matrix}

where $m \geq 0$ , $u_{1}, \dots, u_{m - 1} \neq 1$ and $u_{0} u_{1} \dots u_{m}$ avoids subwords of the form $y y^{- 1}$ for $y \in Y$ , while $e_{1}, \dots, e_{m}$ are ordered canonical idempotents such that the first (respectively last) letter of $e_{i}$ , for $1 \leq i \leq m$ is not equal to the first (respectively last) letter of $u_{i}$ (respectively $u_{i - 1}$ ).

The above normal form is analogous to the semi-normal forms of Poliakova and Schein [176], 2005.

3.4 Approaches to plactic monoids via Gröbner–Shirshov bases in row and column generators

Consider the set $X = {x_{1}, \dots, x_{n}}$ of $n$ elements with the ordering $x_{1} < \dots < x_{n}$ . Schützenberger called $P_{n} = sgp ⟨ X | T ⟩$ a plactic monoid (see also Lothaire [153], Chapter 5), where $T$ consists of the Knuth relations

\begin{matrix} x_{i} x_{k} x_{j} = x_{k} x_{i} x_{j} for x_{i} \leq x_{j} < x_{k}, \\ x_{j} x_{i} x_{k} = x_{j} x_{k} x_{i} for x_{i} < x_{j} \leq x_{k} . \end{matrix}

A nondecreasing word $R \in X^{*}$ is called a row and a strictly decreasing word $C \in X^{*}$ is called a column; for example, $x_{1} x_{1} x_{3} x_{5} x_{5} x_{5} x_{6}$ is a row and $x_{6} x_{4} x_{2} x_{1}$ is a column.

For two rows $R, S \in A^{*}$ say that $R$ dominates $S$ whenever $| R | \leq | S |$ and every letter of $R$ is greater than the corresponding letter of $S$ , where $| R |$ is the length of $R$ .

A (semistandard) Young tableau on $A$ (see [152]) is a word $w = R_{1} R_{2} \dots R_{t}$ in $U^{*}$ such that $R_{i}$ dominates $R_{i + 1}$ for all $i = 1, \dots, t - 1$ . For example,

\begin{matrix} x_{4} x_{5} x_{5} x_{6} \cdot x_{2} x_{2} x_{3} x_{3} x_{5} x_{7} \cdot x_{1} x_{1} x_{1} x_{2} x_{4} x_{4} x_{4} \end{matrix}

is a Young tableau.

Cain et al. [69] use the Schensted–Knuth normal form (the set of (semistandard) Young tableaux) to prove that the multiplication table of column words, $u v = u^{'} v^{'}$ , forms a finite GS basis of the finitely generated plactic monoid. Here the Young tableaux $u^{'} v^{'}$ is the output of the column Schensted algorithm applied to $u v$ , but $u^{'} v^{'}$ is not made explicit.

In this section we give new explicit formulas for the multiplication tables of row and column words. In addition, we give independent proofs that the resulting sets of relations are GS bases in row and column generators respectively. This yields two new approaches to plactic monoids via their GS bases.

3.4.1 Plactic monoids in the row generators

Consider the plactic monoid $P_{n} = sgp ⟨ X | T ⟩$ , where $X = {1, 2, \dots, n}$ with $1 < 2 < \dots < n$ . Denote by $N$ the set of non-negative integers. It is convenient to express the rows $R \in X^{*}$ as $R = (r_{1}, r_{2}, \dots, r_{n})$ , where $r_{i}$ for $i = 1, 2, \dots, n$ is the number of occurrences of the letter $i$ . For example, $R = 111225 = (3, 2, 0, 0, 1, 0, \dots, 0)$ .

Denote by $U$ the set of all rows in $X^{*}$ and order $U^{*}$ as follows. Given $R = (r_{1}, r_{2}, \dots, r_{n}) \in U$ , define the length $| R | = r_{1} + \dots + r_{n}$ of $R$ in $X^{*}$ .

Firstly, order $U$ : for every $R, S \in U$ , put $R < S$ if and only if $| R | < | S |$ or $| R | = | S |$ and $(r_{1}, r_{2}, \dots, r_{n}) > (s_{1}, s_{2}, \dots, s_{n})$ lexicographically. Clearly, this is a well-ordering on $U$ . Then, use the deg-lex ordering on $U^{*}$ .

Lemma 5

([29]) Take $Φ = (ϕ_{1}, \dots, ϕ_{n}) \in U$ . For $1 \leq p \leq n$ put

\begin{matrix} Φ_{p} = \sum_{i = 1}^{p} ϕ_{i}, \end{matrix}

where $ϕ_{i}$ ( $w_{i}$ , $z_{i}$ , $w_{i}^{'}$ , and $z_{i}^{'}$ , see below) stands for a lowercase symbol, and $Φ_{p}$ ( $W_{p}$ , $Z_{p}$ , $W_{p}^{'}$ , and $Z_{p}^{'}$ , see below) for the corresponding uppercase symbol. Take $W = (w_{1}, w_{2}, \dots, w_{n})$ and $Z = (z_{1}, z_{2}, \dots, z_{n})$ in $U$ . Put $W^{'} = (w_{1}^{'}, w_{2}^{'}, \dots, w_{n}^{'})$ and $Z^{'} = (z_{1}^{'}, z_{2}^{'}, \dots, z_{n}^{'})$ , where

\begin{matrix} w_{1}^{'} = 0, w_{p}^{'} = min (Z_{p - 1} - W_{p - 1}^{'}, w_{p}), z_{q}^{'} = w_{q} + z_{q} - w_{q}^{'} \end{matrix}

(*)

for $n \geq p \geq 2$ and $n \geq q \geq 1$ .

Then $W \cdot Z = W^{'} \cdot Z^{'}$ in $P_{n} = sgp ⟨ X | T ⟩$ and $W^{'} \cdot Z^{'}$ is a Young tableau on $X$ , which could have only one row, that is, $Z^{'} = (0, 0, \dots, 0)$ . Moreover,

\begin{matrix} P_{n} = sgp ⟨ X | T ⟩ ≅ sgp ⟨ U | Γ ⟩, \end{matrix}

where $Γ = {W \cdot Z = W^{'} \cdot Z^{'} | W, Z \in V}$ .

We should emphasize that $(*)$ gives explicitly the product of two rows obtained by the Schensted row algorithm.

Jointly with our students Weiping Chen and Jing Li we proved [29], independently of Knuth’s normal form theorem [137], that $Γ$ is a GS basis of the plactic monoid algebra in row generators with respect to the deg-lex ordering. In particular, this yields a new proof of Knuth’s theorem.

3.4.2 Plactic monoids in the column generators

Consider the plactic monoid $P_{n} = sgp ⟨ X | T ⟩$ , where $X = {1, 2, \dots, n}$ with $1 < 2 < \dots < n$ . Every Young tableaux is a product of columns. For example,

\begin{matrix} 4, 556 \cdot 223, 357 \cdot 1, 112, 444 = (421) (521) (531) (632) (54) (74) (4) \end{matrix}

is a Young tableau.

Given a column $C \in X^{*}$ , denote by $c_{i}$ the number of occurrences of the letter $i$ in $C$ . Then $c_{i} \in {0, 1}$ for $i = 1, 2, \dots, n$ . We write $C = (c_{1}; c_{2}; \dots; c_{n})$ . For example, $C = 6, 421 = (1; 1; 0; 1; 0; 1; 0; \dots; 0) .$

Put $V = {C | C is a column in X^{*}}$ . For $R = (r_{1}; r_{2}; \dots; r_{n}) \in V$ define $wt (R) = (| R |, r_{1}, \dots, r_{n})$ . Order $V$ as follows: for $R, S \in V$ , put $R < S$ if and only if $wt (R) > wt (S)$ lexicographically. Then, use the deg-lex ordering on $V^{*}$ .

For $Φ = (ϕ_{1}; \dots; ϕ_{n}) \in V$ , put $Φ_{p} = \sum_{i = 1}^{p} ϕ_{i}, 1 \leq p \leq n$ , where $ϕ$ stands for some lowercase symbol defined above and $Φ$ stands for the corresponding uppercase symbol.

Lemma 6

([29]) Take $W = (w_{1}; w_{2}; \dots; w_{n})$ , $Z = (z_{1}; z_{2}; \dots; z_{n}) \in V$ . Define $W^{'} = (w_{1}^{'}; w_{2}^{'}; \dots; w_{n}^{'})$ and $Z^{'} = (z_{1}^{'}; z_{2}^{'}; \dots; z_{n}^{'})$ , where

\begin{matrix} z_{1}^{'} = min (w_{1}, z_{1}), z_{p}^{'} = min (W_{p} - Z_{p - 1}^{'}, z_{p}), w_{q}^{'} = w_{q} + z_{q} - z_{q}^{'} \end{matrix}

(**)

for $n \geq p \geq 2$ and $n \geq q \geq 1$ . Then $W^{'}, Z^{'} \in V$ and $W \cdot Z = W^{'} \cdot Z^{'}$ in $P_{n} = sgp ⟨ X | T ⟩$ , and $W^{'} \cdot Z^{'}$ is a Young tableau on $X$ . Moreover,

\begin{matrix} P_{n} = sgp ⟨ X | T ⟩ ≅ sgp ⟨ V | Λ ⟩, \end{matrix}

where $Λ = {W \cdot Z = W^{'} \cdot Z^{'} | W, Z \in V}$ .

Equation $(* *)$ gives explicitly the product of two columns obtained by the Schensted column algorithm.

Jointly with our students Weiping Chen and Jing Li we proved [29], independently of Knuth’s normal form theorem [137], that $Λ$ is a GS basis of the plactic monoid algebra in column generators with respect to the deg-lex ordering. In particular, this yields another new proof of Knuth’s theorem. Previously Cain, Gray, and Malheiro [69] established the same result using Knuth’s theorem, and they did not find $Λ$ explicitly.

Remark All results of [29] are valid for every plactic monoid, not necessarily finitely generated.

4 Gröbner–Shirshov bases for Lie algebras

In this section we first give a different approach to the LS basis and the Hall basis of a free Lie algebra by using Shirshov’s CD-lemma for anti-commutative algebras. Then, using the LS basis, we construct the classical theory of GS bases for Lie algebras over a field. Finally, we mention GS bases for Lie algebras over a commutative algebra and give some applications.

4.1 Lyndon–Shirshov basis and Lyndon–Shirshov words in anti-commutative algebras

A linear space $A$ equipped with a bilinear product $x \cdot y$ is called an anti-commutative algebra if it satisfies the identity $x^{2} = 0$ , and so $x \cdot y = - y \cdot x$ for every $x, y \in A$ .

Take a well-ordered set $X$ and denote by $X^{* *}$ the set of all non-associative words. Define three orderings $≻_{l e x}$ , $>_{_{d e g - l e x}}$ , and $>_{_{n - d e g - l e x}}$ (non-associative deg-lex) on $X^{* *}$ . For $(u), (v) \in X^{* *}$ put

$(u) = ((u_{1}) (u_{2})) ≻_{l e x} (v) = ((v_{1}) (v_{2}))$ (here $(u_{2})$ or $(v_{2})$ is empty when $| (u) | = 1$ or $| (v) | = 1$ ) iff one of the following holds:
1. (a)
  $u_{1} u_{2} > v_{1} v_{2}$ in the lex ordering;
2. (b)
  $u_{1} u_{2} = v_{1} v_{2}$ and $(u_{1}) ≻_{l e x} (v_{1})$ ;
3. (c)
  $u_{1} u_{2} = v_{1} v_{2}$ , $(u_{1}) = (v_{1})$ , and $(u_{2}) ≻_{l e x} (v_{2})$ ;
$(u) = ((u_{1}) (u_{2})) >_{d e g - l e x} (v) = ((v_{1}) (v_{2}))$ iff one of the following holds:
1. (a)
  $u_{1} u_{2} > v_{1} v_{2}$ in the deg-lex ordering;
2. (b)
  $u_{1} u_{2} = v_{1} v_{2}$ and $(u_{1}) >_{d e g - l e x} (v_{1})$ ;
3. (c)
  $u_{1} u_{2} = v_{1} v_{2}$ , $(u_{1}) = (v_{1})$ , and $(u_{2}) >_{d e g - l e x} (v_{2})$ ;
$(u) >_{_{n - d e g - l e x}} (v)$ iff one of the following holds:
1. (a)
  $| (u) | > | (v) |$ ;
2. (b)
  if $| (u) | = | (v) |$ , $(u) = ((u_{1}) (u_{2}))$ , and $(v) = ((v_{1}) (v_{2}))$ then $(u_{1}) >_{_{n - d e g - l e x}} (v_{1})$ or ( $(u_{1}) = (v_{1})$ and $(u_{2}) >_{_{n - d e g - l e x}} (v_{2})$ ).

Define regular words $(u) \in X^{* *}$ by induction on $| (u) |$ :

(i)
$x_{i} \in X$ is a regular word.
(ii)
$(u) = ((u_{1}) (u_{2}))$ is regular if both $(u_{1})$ and $(u_{2})$ are regular and $(u_{1}) ≻_{l e x} (u_{2})$ .

Denote $(u)$ by $[u]$ whenever $(u)$ is regular.

The set $N (X)$ of all regular words on $X$ constitutes a linear basis of the free anti-commutative algebra $A C (X)$ on $X$ .

The following result gives an alternative approach to the definition of LS words as the radicals of associative supports $u$ of the normal words $[u]$ .

Theorem 29

([37]) Suppose that $[u]$ is a regular word of the anti-commutative algebra $A C (X)$ . Then $u = v^{m}$ , where $v$ is a Lyndon–Shirshov word in $X$ and $m \geq 1$ . Moreover, the set of associative supports of the words in $N (X)$ includes the set of all Lyndon–Shirshov words in $X$ .

Fix an ordering $>_{d e g - l e x}$ on $X^{* *}$ and choose monic polynomials $f$ and $g$ in $A C (X)$ . If there exist $a, b \in X^{*}$ such that $[w] = [\bar{f}] = [a [\bar{g}] b]$ then the inclusion composition of $f$ and $g$ is defined as ${(f, g)}_{[w]} = f - [a [g] b]$ .

A monic subset $S$ of $A C (X)$ is called a GS basis in $A C (X)$ if every inclusion composition ${(f, g)}_{[w]}$ in $S$ is trivial modulo $(S, [w])$ .

Theorem 30

(Shirshov’s CD-lemma for anti-commutative algebras, cf. [206]) Consider a nonempty set $S \subset A C (X)$ of monic polynomials with the ordering $>_{d e g - l e x}$ on $X^{* *}$ . The following statements are equivalent:

(i)
The set $S$ is a Gröbner–Shirshov basis in $A C (X)$ .
(ii)
If $f \in I d (S)$ then $[\bar{f}] = [a [\bar{s}] b]$ for some $s \in S a n d a, b \in X^{*}$ , where $[a s b]$ is a normal $S$ -word.
(iii)
The set
$\begin{matrix} I r r (S) & = {[u] \in N (X) | [u] \neq [a [\bar{s}] b] a, b \in X^{*}, s \in S \\ a n d [a s b] i s a n o r m a l S - w o r d} \end{matrix}$
is a linear basis of the algebra $A C (X | S) = A C (X) / I d (S)$ .

Define the subset $S_{1}$ the free anti-commutative algebra $A C (X)$ as

\begin{matrix} S_{1} & = {([u] [v]) [w] - ([u] [w]) [v] - [u] ([v] [w]) | \\ [u], [v], [w] \in N (X) and [u] ≻_{l e x} [v] ≻_{l e x} [w]} . \end{matrix}

It is easy to prove that the free Lie algebra admits a presentation as an anti-commutative algebra: $L i e (X) = A C (X) / I d (S_{1})$ .

The next result gives an alternating approach to the definition of the LS basis of a free Lie algebra $L i e (X)$ as a set of irreducible non-associative words for an anti-commutative GS basis in $A C (X)$ .

Theorem 31

([37]) Under the ordering $>_{d e g - l e x}$ , the subset $S_{1}$ of $A C (X)$ is an anti-commutative Gröbner–Shirshov basis in $A C (X)$ . Then $I r r (S_{1})$ is the set of all non-associative LS words in $X$ . So, the LS monomials constitute a linear basis of the free Lie algebra $L i e (X)$ .

Theorem 32

([34]) Define $S_{2}$ by analogy with $S_{1}$ , but using $>_{_{n - d e g - l e x}}$ instead of $≻_{l e x}$ . Then with the ordering $>_{_{n - d e g - l e x}}$ the subset $S_{2}$ of $A C (X)$ is also an anti-commutative GS basis. The set $I r r (S_{2})$ amounts to the set of all Hall words in $X$ and forms a linear basis of a free Lie algebra $L i e (X)$ .

4.2 Composition-Diamond lemma for Lie algebras over a field

We start with some concepts and results from the literature concerning the theory of GS bases for the free Lie algebra $L i e (X)$ generated by $X$ over a field $k$ .

Take a well-ordered set $X = {x_{i} | i \in I}$ with $x_{i} > x_{t}$ whenever $i > t$ , for all $i, t \in I$ . Given $u = x_{i_{1}} x_{i_{2}} \dots x_{i_{m}} \in X^{*}$ , define the length (or degree) of $u$ to be $m$ and denote it by $| u | = m$ or $d e g (u) = m$ , put $fir (u) = x_{i_{1}}$ , and introduce

\begin{matrix} x_{β} = min (u) = min {x_{i_{1}}, x_{i_{2}}, \dots, x_{i_{m}}}, \\ X^{'} (u) = {x_{i}^{j} = x_{i} \underset{j}{\underset{⏟}{x_{β} \dots x_{β}}} | i > β, j \geq 0} . \end{matrix}

Order the new alphabet $X^{'} (u)$ as follows:

\begin{matrix} x_{i_{1}}^{j_{1}} > x_{i_{2}}^{j_{2}} \Leftrightarrow i_{1} > i_{2} or i_{1} = i_{2} and j_{2} > j_{1} . \end{matrix}

Assuming that

\begin{matrix} u = x_{r_{1}} \underset{m_{1}}{\underset{⏟}{x_{β} \dots x_{β}}} \dots x_{r_{t}} \underset{m_{t}}{\underset{⏟}{x_{β} \dots x_{β}}}, \end{matrix}

where $r_{i} > β$ , define the Shirshov elimination

\begin{matrix} u^{'} = x_{r_{1}}^{m_{1}} \dots x_{r_{t}}^{m_{t}} \in {(X^{'} (u))}^{*} . \end{matrix}

We use two linear orderings on $X^{*}$ :

(i)
the lex ordering (or lex-antideg ordering): $1 ≻ v$ if $v \neq 1$ and, by induction, if $u = x_{i} u_{1}$ and $v = x_{j} v_{1}$ then $u ≻ v$ if and only if $x_{i} > x_{j}$ or $x_{i} = x_{j}$ and $u_{1} ≻ v_{1}$ ;
(ii)
the deg-lex ordering: $u > v$ if $| u | > | v |$ or $| u | = | v |$ and $u ≻ v$ .

Remark In commutative algebras, the lex ordering is understood to be the lex-deg ordering with the condition $v > 1$ for $v \neq 1$ .

We cite some useful properties of ALSWs and NLSWs (see below) following Shirshov [203, 204, 207], see also [209]. Property (X) was given by Shirshov [204] and Chen et al. [72]. Property (VIII) was implicitly used in Shirshov [207], see also Chibrikov [94].

We regard $L i e (X)$ as the Lie subalgebra of the free associative algebra $k ⟨ X ⟩$ generated by $X$ with the Lie bracket $[u, v] = u v - v u$ . Below we prove that $L i e (X)$ is the free Lie algebra generated by $X$ for every commutative ring $k$ (Shirshov [203]). For a field, this follows from the PBW theorem because the free Lie algebra $L i e (X) = L i e (X | \emptyset)$ has the universal enveloping associative algebra $k ⟨ X ⟩ = k ⟨ X | \emptyset ⟩$ .

Given $f \in k ⟨ X ⟩$ , denote by $\bar{f}$ the leading word of $f$ with respect to the deg-lex ordering and write $f = α_{\bar{f}} \bar{f} - r_{_{f}}$ with $α_{\bar{f}} \in k$ .

Definition 3

([156, 203]) Refer to $w \in X^{*} \ {1}$ as an associative Lyndon–Shirshov word, or ALSW for short, whenever

\begin{matrix} (\forall u, v \in X^{*}, u, v \neq 1) w = u v \Rightarrow w > v u . \end{matrix}

Denote the set of all ALSWs on $X$ by $A L S W (X)$ .

Associative Lyndon–Shirshov words enjoy the following properties (Lyndon [156], Chen et al. [72], Shirshov [203, 204]).

(I) Put $x_{β} = min (u v)$ . If $fir (u) \neq x_{β}$ and $fir (v) \neq x_{β}$ then

\begin{matrix} u ≻ v (in the lex ordering on X^{*}) \Leftrightarrow u^{'} ≻ v^{'} (in the lex ordering on {(X_{u v}^{'})}^{*}) . \end{matrix}

(II) (Shirshov’s key property of ALSWs) A word $u$ is an ALSW in $X^{*}$ if and only if $u^{'}$ is an ALSW in ${(X^{'} (u))}^{*}$ .

Properties (I) and (II) enable us to prove the properties of ALSWs and NLSWs (see below) by induction on length.

(III) (down-to-up bracketing) $u \in A L S W (X) \Leftrightarrow (\exists k) | u^{(k)} |_{_{{(X (u))}^{(k)}}} = 1$ , where $u^{(k)} = {(u^{'})}^{(k - 1)}$ and ${(X (u))}^{(k)} = {(X^{'} (u))}^{(k - 1)}$ . In the process $u \to u^{'} \to u^{''} \to \dots$ we use the algorithm of joining the minimal letters of $u$ , $u^{'} \dots$ to the previous words.

(IV) If $u, v \in A L S W (X)$ then $u v \in A L S W (X) \Leftrightarrow u ≻ v$ .

(V) $w \in A L S W (X) \Leftrightarrow$ (for every $u, v \in X^{*} \ {1}$ and $w = u v \Rightarrow w ≻ v$ ).

(VI) If $w \in A L S W (X)$ then an arbitrary proper prefix of $w$ cannot be a suffix of $w$ and $w x_{_{β}} \in A L S W (X)$ if $x_{_{β}} = min (w)$ .

(VII) (Shirshov’s factorization theorem) Every associative word $w$ can be uniquely represented as $w = c_{1} c_{2} \dots c_{n}$ , where $c_{1}, \dots, c_{n} \in A L S W (X)$ and $c_{1} ⪯ c_{2} ⪯ \dots ⪯ c_{n}$ .

Actually, if we apply to $w$ the algorithm of joining the minimal letter to the previous word using the Lie product, $w \to w^{'} \to w^{''} \to \dots$ , then after finitely many steps we obtain $w^{(k)} = [c_{1}] [c_{2}] \dots [c_{n}]$ , with $c_{1} ⪯ c_{2} ⪯ \dots ⪯ c_{n}$ , and $w = c_{1} c_{2} \dots c_{n}$ would be the required factorization (see an example in the Introduction).

(VIII) If an associative word $w$ is represented as in (VII) and $v$ is a LS subword of $w$ then $v$ is a subword of one of the words $c_{1}$ , $c_{2}, \dots, c_{n}$ .

(IX) If $u_{1} u_{2}$ and $u_{2} u_{3}$ are ALSWs then so is $u_{1} u_{2} u_{3}$ provided that $u_{2} \neq 1$ .

(X) If $w = u v$ is an ALSW and $v$ is its longest proper ALSW ending, then $u$ is an ALSW as well (Chen et al. [72], Shirshov [204]).

Definition 4

(down-to-up bracketing of ALSW, Shirshov [203]) For an ALSW $w$ , there is the down-to-up bracketing $w \to w^{'} \to w^{''} \to \dots \to w^{(k)} = [w]$ , where each time we join the minimal letter of the previous word using Lie multiplication. To be more precise, we use the induction $[w] = {[w^{'}]}_{_{x_{i}^{j} \mapsto [[x_{i} x_{β}] \dots x_{β}]}}$ .

Definition 5

(up-to-down bracketing of ALSW, Shirshov [204], Chen et al. [72]) For an ALSW $w$ , we define the up-to-down Lie bracketing $[[w]]$ by the induction $[[w]] = [[[u]] [[v]]]$ , where $w = u v$ as in (X).

(XI) If $w \in A L S W (X)$ then $[w] = [[w]]$ .

(XII) Shirshov’s definition of a NLSW (non-associative LS word) $(w)$ below is the same as $[w]$ and $[[w]]$ ; that is, $(w) = [w] = [[w]]$ . Chen et al. [72] used $[[w]]$ .

Definition 6

(Shirshov[203]) A non-associative word $(w)$ in $X$ is a NLSW if

(i)
$w$ is an ALSW;
(ii)
if $(w) = ((u) (v))$ then both $(u)$ and $(v)$ are NLSWs (then (IV) implies that $u ≻ v$ );
(iii)
if $(w) = (((u_{1}) (u_{2})) (v))$ then $u_{2} ⪯ v$ .

Denote the set of all NLSWs on $X$ by $N L S W (X)$ .

(XIII) If $u \in A L S W (X)$ and $[u] \in N L S W (X)$ then $\bar{[u]} = u$ in $k ⟨ X ⟩$ .

(XIV) The set $N L S W (X)$ is linearly independent in $L i e (X) \subset k ⟨ X ⟩$ for every commutative ring $k$ .

(XV) $N L S W (X)$ is a set of linear generators in every Lie algebra generated by $X$ over an arbitrary commutative ring $k$ .

(XVI) $L i e (X) \subset k ⟨ X ⟩$ is the free Lie algebra over the commutative ring $k$ with the $k$ -basis $N L S W (X)$ .

(XVII) (Shirshov’s special bracketing [203]) Consider $w = a u b$ with $w, u \in A L S W (X)$ . Then

(i)
$[w] = [a [u c] d],$ where $b = c d$ and possibly $c = 1$ .
(ii)
Express $c$ in the form $c = c_{1} c_{2} \dots c_{n},$ where $c_{1}, \dots, c_{n} \in A L S W (X)$ and $c_{1} ⪯ c_{2} ⪯ \dots ⪯ c_{n}$ . Replacing $[u c]$ by $[\dots [[u] [c_{1}]] \dots [c_{n}]]$ , we obtain the word
$\begin{matrix} {[w]}_{u} = [a [\dots [[[u] [c_{1}]] [c_{2}]] \dots [c_{n}]] d] \end{matrix}$
which is called the Shirshov special bracketing of $w$ relative to $u$ .
(iii)
${[w]}_{u} = a [u] b + \sum_{i} α_{i} a_{i} [u] b_{i}$ in $k ⟨ X ⟩$ with $α_{i} \in k$ and $a_{i}, b_{i} \in X^{*}$ satisfying $a_{i} u b_{i} < a u b$ , and hence ${\bar{[w]}}_{u} = w$ .

Outline of the proof. Put $x_{β} = min (w)$ . Then $w^{'} = a^{'} {(u x_{β}^{m})}^{'} {(b_{1})}^{'}$ in ${(X {(w)}^{'})}^{*}$ , where $b = x_{β}^{m} b_{1}$ and $u x_{β}^{m}$ is an ALSW. Claim (i) follows from (II) by induction on length. The same applies to claim (iii).

(XVIII) (Shirshov’s Lie elimination of the leading word) Take two monic Lie polynomials $f$ and $s$ with $\bar{f} = a \bar{s} b$ for some $a, b \in X^{*}$ . Then $f_{1} = f - {[a s b]}_{\bar{s}}$ is a Lie polynomial with smaller leading word, and so ${\bar{f}}_{1} < \bar{f}$ .

(XIX) (Shirshov’s double special bracketing) Assume that $w = a u b v c$ with $w, u, v \in A L S W (X)$ . Then there exists a bracketing ${[w]}_{u, v}$ such that ${[w]}_{u, v} = {[a [u] b [v] c]}_{u, v}$ and $\bar{{[w]}_{u, v}} = w$ .

More precisely, ${[w]}_{u, v} = [a {[u p]}_{u} q {[v r]}_{v} s]$ if $[w] = [a [u p] q [v r] s]$ , and

\begin{matrix} {[w]}_{u, v} = [a [\dots [\dots [[u] [c_{1}]] \dots {[c_{i}]}_{v}] \dots [c_{n}]] p] \end{matrix}

if $[w] = [a [u c] p]$ , where $c = c_{1} \dots c_{n}$ is the Shirshov factorization of $c$ and $v$ is a subword of $c_{i}$ . In both cases ${[w]}_{u, v} = a [u] b [v] d + \sum α_{i} a_{i} [u] b_{i} [v] d_{i}$ in $k ⟨ X ⟩$ , where $a_{i} u b_{i} v d_{i} < w$ .

(XX) (Shirshov’s algorithm for recognizing Lie polynomials, cf. the Dynkin–Specht–Wever and Friedrich algorithms). Take $s \in L i e (X) \subset k ⟨ X ⟩$ . Then $\bar{s}$ is an ALSW and $s_{1} = s - α_{\bar{s}} [\bar{s}]$ is a Lie polynomial with a smaller maximal word (in the deg-lex ordering), ${\bar{s}}_{1} < \bar{s}$ , where $s = α_{\bar{s}} [\bar{s}] + \dots$ . Then $s_{2} = s_{1} - α_{{\bar{s}}_{1}} [{\bar{s}}_{1}], \bar{s_{2}} < \bar{s_{1}}$ . Consequently, $s \in L i e (X)$ if and only if after finitely many steps we obtain

\begin{matrix} s_{m + 1} = s - α_{\bar{s}} [\bar{s}] - α_{{\bar{s}}_{1}} [{\bar{s}}_{1}] - \dots - α_{{\bar{s}}_{m}} [{\bar{s}}_{m}] = 0 . \end{matrix}

Here $k$ can be an arbitrary commutative ring.

Definition 7

Consider $S \subset L i e (X)$ with all $s \in S$ monic. Take $a, b \in X^{*}$ and $s \in S$ . If $a \bar{s} b$ is an ALSW then we call ${[a s b]}_{\bar{s}} = {[a \bar{s} b]}_{\bar{s}} |_{[\bar{s}] \mapsto s}$ a special normal $S$ -word (or a special normal $s$ -word), where ${[a \bar{s} b]}_{\bar{s}}$ is defined in (XVII) (ii). A Lie $S$ -word $(a s b)$ is called a normal $S$ -word whenever $\bar{(a s b)} = a \bar{s} b$ . Every special normal $s$ -word is a normal $s$ -word by (XVII) (iii).

For $f, g \in S$ there are two kinds of Lie compositions:

(i)
If $w = \bar{f} = a \bar{g} b$ for some $a, b \in X^{*}$ then the polynomial ${⟨ f, g ⟩}_{w} = f - {[a g b]}_{\bar{g}}$ is called the inclusion composition of $f$ and $g$ with respect to $w$ .
(ii)
If $w$ is a word satisfying $w = \bar{f} b = a \bar{g}$ for some $a, b \in X^{*}$ with $d e g (\bar{f}) + d e g (\bar{g}) > d e g (w)$ then the polynomial ${⟨ f, g ⟩}_{w} = {[f b]}_{\bar{f}} - {[a g]}_{\bar{g}}$ is called the intersection composition of $f$ and $g$ with respect to $w$ , and $w$ is an ALSW by (IX).

Given a Lie polynomial $h$ and $w \in X^{*}$ , say that $h$ is trivial modulo $(S, w)$ and write $h \equiv_{L i e} 0 m o d (S, w)$ whenever $h = \sum_{i} α_{i} (a_{i} s_{i} b_{i})$ , where each $α_{i} \in k, (a_{i} s_{i} b_{i})$ is a normal $S$ -word and $a_{i} \bar{s_{i}} b_{i} < w$ .

A set $S$ is called a GS basis in $L i e (X)$ if every composition ${(f, g)}_{w}$ of polynomials $f$ and $g$ in $S$ is trivial modulo $S$ and $w$ .

(XXI) If $s \in L i e (X)$ is monic and $(a s b)$ is a normal $S$ -word then $(a s b) = a s b + \sum_{i} α_{i} a_{i} s b_{i}$ , where $a_{i} \bar{s} b_{i} < a \bar{s} b$ .

A proof of (XXI) follows from the CD-lemma for associative algebras since ${s}$ is an associative GS basis by (IV).

(XXII) Given two monic Lie polynomials $f$ and $g$ , we have

\begin{matrix} {⟨ f, g ⟩}_{w} - {(f, g)}_{w} \equiv_{a s s} 0 mod ({f, g}, w) . \end{matrix}

Proof

If ${⟨ f, g ⟩}_{w}$ and ${(f, g)}_{w}$ are intersection compositions, where $w = \bar{f} b = a \bar{g}$ , then (XIII) and (XVII) yield

\begin{matrix} {⟨ f, g ⟩}_{w} = {[f b]}_{\bar{f}} - {[a g]}_{\bar{g}} = f b + \sum_{I_{1}} α_{i} a_{i} f b_{i} - a g - \sum_{I_{2}} β_{j} a_{j} g b_{j}, \end{matrix}

where $a_{i} \bar{f} b_{i}, a_{j} \bar{g} b_{j} < \bar{f} b = a \bar{g} = w$ . Hence,

\begin{matrix} {⟨ f, g ⟩}_{w} - {(f, g)}_{w} \equiv_{a s s} 0 mod ({f, g}, w) . \end{matrix}

In the case of inclusion compositions we arrive at the same conclusion. $□$

Theorem 33

(PBW Theorem in Shirshov’s form [56, 57], see Theorem 17) A nonempty set $S \subset L i e (X) \subset k ⟨ X ⟩$ of monic Lie polynomials is a Gröbner–Shirshov basis in $L i e (X)$ if and only if $S$ is a Gröbner–Shirshov basis in $k ⟨ X ⟩$ .

Proof

Observe that, by definition, for any $f, g \in S$ the composition lies in $L i e (X)$ if and only if it lies $k ⟨ X ⟩$ .

Assume that $S$ is a GS basis in $L i e (X)$ . Then we can express every composition ${⟨ f, g ⟩}_{w}$ as ${⟨ f, g ⟩}_{w} = \sum_{I_{1}} α_{i} (a_{i} s_{i} b_{i}),$ where $(a_{i} s_{i} b_{i})$ are normal $S$ -words and $a_{i} \bar{s_{i}} b_{i} < w$ . By (XXI), we have ${⟨ f, g ⟩}_{w} = \sum_{I_{2}} β_{j} c_{j} s_{j} d_{j}$ with $c_{j} \bar{s_{j}} d_{j} < w$ . Therefore, (XXII) yields ${(f, g)}_{w} \equiv_{a s s} 0 mod (S, w) .$ Thus, $S$ is a GS basis in $k ⟨ X ⟩$ .

Conversely, assume that $S$ is a GS basis in $k ⟨ X ⟩$ . Then the CD-lemma for associative algebras implies that $\bar{{⟨ f, g ⟩}_{w}} = a \bar{s} b < w$ for some $a, b \in X^{*}$ and $s \in S$ . Then $h = {⟨ f, g ⟩}_{w} - α {[a s b]}_{\bar{s}} \in I d_{a s s} (S)$ is a Lie polynomial and $\bar{h} < \bar{{⟨ f, g ⟩}_{w}}$ . Induction on $\bar{{⟨ f, g ⟩}_{w}}$ yields ${⟨ f, g ⟩}_{w} \equiv_{L i e} 0 mod (S, w) .$ $□$

Theorem 34

(The CD-lemma for Lie algebras over a field) Consider a nonempty set $S \subset L i e (X) \subset k ⟨ X ⟩$ of monic Lie polynomials and denote by $I d (S)$ the ideal of $L i e (X)$ generated by $S$ . The following statements are equivalent:

(i)
The set $S$ is a Gröbner–Shirshov basis in $L i e (X)$ .
(ii)
If $f \in I d (S)$ then $\bar{f} = a \bar{s} b$ for some $s \in S$ and $a, b \in X^{*}$ .
(iii)
The set
$\begin{matrix} I r r (S) = {[u] \in N L S W (X) | u \neq a \bar{s} b, s \in S, a, b \in X^{*}} \end{matrix}$
is a linear basis for $L i e (X | S)$ .

Proof

(i) $\Rightarrow$ (ii). Denote by $I d_{a s s} (S)$ and $I d_{L i e} (S)$ the ideals of $k ⟨ X ⟩$ and $L i e (X)$ generated by $S$ respectively. Since $I d_{L i e} (S) \subseteq I d_{a s s} (S)$ , Theorem 33 and the CD-lemma for associative algebras imply the claim.

(ii) $\Rightarrow$ (iii). Suppose that $\sum α_{i} [u_{i}] = 0$ in $L i e (X | S)$ with $[u_{i}] \in I r r (S)$ and $u_{1} > u_{2} > \dots$ , that is, $\sum α_{i} [u_{i}] \in I d_{L i e} (S)$ . Then all $α_{i}$ must vanish. Otherwise we may assume that $α_{1} \neq 0$ . Then $\bar{\sum α_{i} [u_{i}]} = u_{1}$ and (ii) implies that $[u_{1}] \notin I r r (S)$ , which is a contradiction. On the other hand, by the next property (XXIII), $I r r (S)$ generates $L i e (X | S)$ as a linear space.

(iii) $\Rightarrow$ (i). This part follows from (XXIII). $□$

The next property is similar to Lemma 2.

(XXIII) Given $S \subset L i e (X)$ , we can express every $f \in L i e (X)$ as

\begin{matrix} f = \sum α_{i} [u_{i}] + \sum β_{j} {[a_{j} s_{j} b_{j}]}_{\bar{s_{j}}} \end{matrix}

with $α_{i}, β_{j} \in k$ , $[u_{i}] \in I r r (S)$ satisfying $\bar{[u_{i}]} \leq \bar{f}$ , and ${[a_{j} s_{j} b_{j}]}_{\bar{s_{j}}}$ are special normal $S$ -word satisfying $\bar{{[a_{j} s_{j} b_{j}]}_{\bar{s_{j}}}} \leq \bar{f}$ .

(XXIV) Given a normal $s$ -word $(a s b)$ , take $w = a \bar{s} b$ . Then $(a s b) \equiv {[a s b]}_{\bar{s}} mod (s, w)$ . It follows that $h \equiv_{L i e} 0 mod (S, w)$ is equivalent to $h = \sum_{i} α_{i} {[a_{i} s_{i} b_{i}]}_{\bar{s_{i}}}$ , where ${[a_{i} s_{i} b_{i}]}_{\bar{s_{i}}}$ are special normal $S$ -words with $a_{i} \bar{s_{i}} b_{i} < w$ .

Proof

Observe that for every monic Lie polynomial $s$ , the set ${s}$ is a GS basis in $L i e (X)$ . Then (XVIII) and the CD-lemma for Lie algebras yield $(a s b) \equiv {[a s b]}_{\bar{s}} mod (s, w)$ . $□$

Summary of the proof of Theorem 34.

Given two ALSWs $u$ and $v$ , define the ALSW- $lcm (u, v)$ (or $lcm (u, v)$ for short) as follows:

\begin{matrix} w = lcm (u, v) \in {a u c v b (an ALSW), a, b, c \in X^{*} (a trivial lcm); \\ u = a v b, a, b \in X^{*} (an inclusion lcm); \\ u b = a v, a, b \in X^{*}, d e g (u b) < d e g (u) + d e g (v) (an intersection lcm)} . \end{matrix}

Denote by ${[w]}_{u, v}$ the Shirshov double special bracketing of $w$ in the case that $w$ is a trivial $lcm (u, v)$ , by ${[w]}_{u}$ and ${[w]}_{v}$ the Shrishov special bracketings of $w$ if $w$ is an inclusion or intersection $lcm$ respectively. Then we can define a general Lie composition for monic Lie polynomials $f$ and $g$ with $\bar{f} = u$ and $\bar{g} = v$ as

\begin{matrix} {(f, g)}_{w} = {[w]}_{u, v} {|_{[u] \mapsto f} - {[w]}_{u, v} |}_{[v] \mapsto g} \end{matrix}

if $w$ is a trivial $lcm (u, v)$ (it is $0 mod ({f, g}, w)$ ), and

\begin{matrix} {(f, g)}_{w} = {[w]}_{u} {|_{[u] \mapsto f} - {[w]}_{v} |}_{[v] \mapsto g} \end{matrix}

if $w$ is an inclusion or intersection $lcm (u, v)$ .

If $S \subset L i e (X) \subset k ⟨ X ⟩$ is a Lie GS basis then $S$ is an associative GS basis. This follows from property (XVII) (iii) and justifies the claim (i) $\Rightarrow$ (ii) of Theorem 34.

Shirshov’s original proof of (i) $\Rightarrow$ (ii) in Theorem 34, (see [207, 209]), rests on an analogue of Lemma 1 for Lie algebras.

Lemma 7

([207, 209]) If $(a_{1} s_{1} b_{1}), (a_{2} s_{2} b_{2})$ are normal $S$ -words with equal leading associative words, $w = a_{1} \bar{s_{1}} b_{1} = a_{2} \bar{s_{2}} b_{2}$ , then they are equal $mod (S, w)$ , that is, $(a_{1} s_{1} b_{1}) - (a_{2} s_{2} b_{2}) \equiv 0 mod (S, w) .$

Outline of the proof. We have $w_{1} = c w d$ and $w = lcm (\bar{s_{1}}, \bar{s_{2}})$ . Shirshov’s (double) special bracketing lemma yields

\begin{matrix} {[w_{1}]}_{w} = [c [[w] d_{1}] d_{2}] = c [w] d + \sum α_{i} a_{i} [w] b_{i} \end{matrix}

with $a_{i} w b_{i} < w_{1}$ . The ALSW $w$ includes $u = \bar{s_{1}}$ and $v = \bar{s_{2}}$ as subwords, and so there is a bracketing ${w} \in {{[w]}_{u, v}, {[w]}_{u}, {[w]}_{v}}$ such that

\begin{matrix} [a_{1} s_{1} b_{1}] = [c {w} |_{[u] \mapsto s_{1}} d], [a_{2} s_{2} b_{2}] = [c {w} |_{[v] \mapsto s_{2}} d] \end{matrix}

are normal $s_{1}$ - and $s_{2}$ - words with the same leading associative word $w_{1}$ . Then

\begin{matrix} [a_{1} s_{1} b_{1}] - [a_{2} s_{2} b_{2}] = [c {(s_{1}, s_{2})}_{w} d] \equiv 0 mod (S, w_{1}) . \end{matrix}

Now it is enough to prove that two normal Lie $s$ -words with the same leading associative words, say $w_{1}$ , are equal $mod (s, w_{1})$ :

\begin{matrix} f = (a s b) - [a s b] \equiv_{L i e} 0 mod (s, w_{1}) provided that \bar{f} < w_{1} . \end{matrix}

Since $f \in I d_{a s s} (s)$ , we have $\bar{f} = c_{1} \bar{s} d_{1}$ by the CD-lemma for associative algebras with one Lie polynomial relation $s$ . Then $f - α {[c_{1} s d_{1}]}_{\bar{s}}$ is a Lie polynomial with the leading associative word smaller than $w_{1}$ . Induction on $w_{1}$ finishes the proof.

4.2.1 Gröbner–Shirshov basis for the Drinfeld–Kohno Lie algebra

In this section we give a GS basis for the Drinfeld–Kohno Lie algebra $L_{n}$ .

Definition 8

Fix an integer $n > 2$ . The Drinfeld–Kohno Lie algebra $L_{n}$ over $Z$ is defined by generators $t_{i j} = t_{j i}$ for distinct indices $1 \leq i, j \leq n - 1$ satisfying the relations $[t_{i j} t_{k l}] = 0$ and $[t_{i j} (t_{i k} + t_{j k})] = 0$ for distinct $i$ , $j$ , $k$ , and $l$ .

Therefore, we have the presentation $L_{n} = L i e_{Z} (T | S)$ , where $T = {t_{i j} | 1 \leq i < j \leq n - 1}$ and $S$ consists of the following relations:

\begin{matrix} [t_{i j} t_{k l}] = 0 if k < i < j, k < l, l \neq i, j; \end{matrix}

(18)

\begin{matrix} [t_{j k} t_{i j}] + [t_{i k} t_{i j}] = 0 if i < j < k; \end{matrix}

(19)

\begin{matrix} [t_{j k} t_{i k}] - [t_{i k} t_{i j}] = 0 if i < j < k . \end{matrix}

(20)

Order $T$ by setting $t_{i j} < t_{k l}$ if either $i < k$ or $i = k$ and $j < l$ . Let $<$ be the deg-lex ordering on $T^{*}$ .

Theorem 35

([80]) With $S =$ {(18), (19), (20)} as before and the deg-lex ordering $<$ on $T^{*}$ , the set $S$ is a Gröbner–Shirshov basis of $L_{n}$ .

Corollary 15

The Drinfeld–Kohno Lie algebra $L_{n}$ is a free $Z$ -module with $Z$ -basis $\cup_{i = 1}^{n - 2} N L S W (T_{i})$ , where $T_{i} = {t_{i j} | i < j \leq n - 1}$ for $i = 1, \dots, n - 2$ .

Corollary 16

([100]) The Drinfeld–Kohno Lie algebra $L_{n}$ is an iterated semidirect product of free Lie algebras $A_{i}$ generated by $T_{i} = {t_{i j} | i < j \leq n - 1}$ , for $i = 1, \dots, n - 2$ .

4.2.2 Kukin’s example of a Lie algebra with undecidable word problem

Markov [161], Post [182], Turing [211], Novikov [173], and Boone [60] constructed finitely presented semigroups and groups with undecidable word problem. For groups this also follows from Higman’s theorem [115] asserting that every recursively presented group embeds into a finitely presented group. A weak analogue of Higman’s theorem for Lie algebras was proved in [21], which was enough for the existence of a finitely presented Lie algebra with undecidable word problem. In this section we give Kukin’s construction [142] of a Lie algebra $A_{P}$ for every semigroup $P$ such that if $P$ has undecidable word problem then so does $A_{P}$ .

Given a semigroup $P = sgp ⟨ x, y | u_{i} = v_{i}, i \in I ⟩$ , consider the Lie algebra

\begin{matrix} A_{P} = L i e (x, \hat{x}, y, \hat{y}, z | S) \end{matrix}

with $S$ consisting of the relations

(1)
$[\hat{x} x] = 0, [\hat{x} y] = 0, [\hat{y} x] = 0, [\hat{y} y] = 0$ ;
(2)
$[\hat{x} z] = - [z x], [\hat{y} z] = - [z y]$ ;
(3)
$⌊ z u_{i} ⌋ = ⌊ z v_{i} ⌋, i \in I$ .

Here, $⌊ z u ⌋$ stands for the left normed bracketing.

Put $\hat{x} > \hat{y} > z > x > y$ and denote by $>$ the deg-lex ordering on the set ${\hat{x}, \hat{y}, x, y, z}^{*}$ . Denote by $ρ$ the congruence on ${x, y}^{*}$ generated by ${(u_{i}, v_{i}), i \in I}$ . Put

$(3^{'})$ $⌊ z u ⌋ = ⌊ z v ⌋, (u, v) \in ρ$ with $u > v$ .

Lemma 8

([80]) In this notation, the set $S_{1} = {(1), (2), (3^{'})}$ is a GS basis in $L i e (\hat{x}, \hat{y}, x, y, z)$ .

Proof

For every $u \in {x, y}^{*}$ , we can show that $\bar{⌊ z u ⌋} = z u$ by induction on $| u |$ . All possible compositions in $S_{1}$ are the intersection compositions of (2) and $(3^{'})$ , and the inclusion compositions of $(3^{'})$ and $(3^{'})$ .

For $(2) \land (3^{'})$ , we take $f = [\hat{x} z] + [z x]$ and $g = ⌊ z u ⌋ - ⌊ z v ⌋$ . Therefore, $w = \hat{x} z u$ with $(u, v) \in ρ$ and $u > v$ . We have

\begin{matrix} {⟨ [\hat{x} z] + [z x], ⌊ z u ⌋ - ⌊ z v ⌋ ⟩}_{w} = {[f u]}_{\bar{f}} - {[\hat{x} g]}_{\bar{g}} \\ \equiv ⌊ ([\hat{x} z] + [z x]) u ⌋ - [\hat{x} (⌊ z u ⌋ - ⌊ z v ⌋)] \\ \equiv ⌊ z x u ⌋ + ⌊ \hat{x} z v ⌋ \equiv ⌊ z x u ⌋ - ⌊ z x v ⌋ \equiv 0 mod (S_{1}, w) . \end{matrix}

For $(3^{'}) \land (3^{'})$ , we use $w = z u_{1} = z u_{2} e$ , where $e \in {x, y}^{*}$ and $(u_{i}, v_{i}) \in ρ$ with $u_{i} > v_{i}$ for $i = 1, 2$ . We have

\begin{matrix} {⟨ ⌊ z u_{1} ⌋ - ⌊ z v_{1} ⌋, ⌊ z u_{2} ⌋ - ⌊ z v_{2} ⌋ ⟩}_{w} \equiv (⌊ z u_{1} ⌋ - ⌊ z v_{1} ⌋) - ⌊ (⌊ z u_{2} ⌋ - ⌊ z v_{2} ⌋) e ⌋ \\ \equiv ⌊ ⌊ z v_{2} ⌋ e ⌋ - ⌊ z v_{1} ⌋ \equiv ⌊ z v_{2} e ⌋ - ⌊ z v_{1} ⌋ \equiv 0 mod (S_{1}, w) . \end{matrix}

Thus, $S_{1} = {(1), (2), (3^{'})}$ is a GS basis in $L i e (\hat{x}, \hat{y}, x, y, z)$ . $□$

Corollary 17

(Kukin [142]) For $u, v \in {x, y}^{*}$ we have

\begin{matrix} u = v i n t h e s e m i g r o u p P \Leftrightarrow ⌊ z u ⌋ = ⌊ z v ⌋ i n t h e L i e a l g e b r a A_{P} . \end{matrix}

Proof

Assume that $u = v$ in the semigroup $P$ . Without loss of generality we may assume that $u = a u_{1} b$ and $v = a v_{1} b$ for some $a, b \in {x, y}^{*}$ and $(u_{1}, v_{1}) \in ρ$ . For every $r \in {x, y}$ relations (1) yield $[\hat{x} r] = 0$ ; consequently, $⌊ z x c ⌋ = ⌊ [z \hat{x}] c ⌋ = [⌊ z c ⌋ \hat{x}]$ and $⌊ z y c ⌋ = [⌊ z c ⌋ \hat{y}]$ for every $c \in {x, y}^{*}$ . This implies that in $A_{P}$ we have

\begin{matrix} ⌊ z u ⌋ & = ⌊ z a u_{1} b ⌋ = ⌊ ⌊ z a u_{1} ⌋ b ⌋ = ⌊ ⌊ z u_{1} \hat{\overset{\leftarrow}{a}} ⌋ b ⌋ = ⌊ z u_{1} \hat{\overset{\leftarrow}{a}} b ⌋ = ⌊ z v_{1} \hat{\overset{\leftarrow}{a}} b ⌋ \\ = ⌊ z a v_{1} b ⌋ = ⌊ z v ⌋, \end{matrix}

where for every $x_{i_{1}} x_{i_{2}} \dots x_{i_{n}} \in {x, y}^{*}$ we put

\begin{matrix} \overset{\leftarrow}{x_{i_{1}} x_{i_{2}} \dots x_{i_{n}}} : = x_{i_{n}} x_{i_{n - 1}} \dots x_{i_{1}}, \hat{x_{i_{1}} x_{i_{2}} \dots x_{i_{n}}} : = \hat{x_{i_{1}}} \hat{x_{i_{2}}} \dots \hat{x_{i_{n}}} . \end{matrix}

Moreover, $(3^{'})$ holds in $A_{P}$ .

Suppose that $⌊ z u ⌋ = ⌊ z v ⌋ in the Lie algebra A_{P}$ . Then both $⌊ z u ⌋$ and $⌊ z v ⌋$ have the same normal form in $A_{P}$ . Since $S_{1}$ is a GS basis in $A_{P}$ , we can reduce both $⌊ z u ⌋$ and $⌊ z v ⌋$ to the same normal form $⌊ z c ⌋$ for some $c \in {x, y}^{*}$ using only relations $(3^{'})$ . This implies that $u = c = v$ in $P$ . $□$

By the corollary, if the semigroup $P$ has undecidable word problem then so does the Lie algebra $A_{P}$ .

4.3 Composition-Diamond lemma for Lie algebras over commutative algebras

For a well-ordered set $X = {x_{i} | i \in I}$ , consider the free Lie algebra $L i e (X) \subset k ⟨ X ⟩$ with the Lie bracket $[u, v] = u v - v u$ .

Given a well-ordered set $Y = {y_{j} | j \in J}$ , the free commutative monoid $[Y]$ generated by $Y$ is a linear basis of $k [Y]$ . Regard

\begin{matrix} L i e_{k [Y]} (X) ≅ k [Y] \otimes L i e (X) \end{matrix}

as a Lie subalgebra of the free associative algebra $k [Y] ⟨ X ⟩ ≅ k [Y] \otimes k ⟨ X ⟩$ generated by $X$ over the polynomial algebra $k [Y]$ , equipped with the Lie bracket $[u, v] = u v - v u$ . Then $N L S W (X)$ constitutes a $k [Y]$ -basis of $L i e_{k [Y]} (X)$ . Put $[Y] X^{*} = {β t | β \in [Y], t \in X^{*}}$ . For $u = β t \in [Y] X^{*}$ , put $u^{X} = t$ and $u^{Y} = β$ .

Denote the deg-lex orderings on $[Y]$ and $X^{*}$ by $>_{_{Y}}$ and $>_{_{X}}$ . Define an ordering $>$ on $[Y] X^{*}$ as follows: for $u, v \in [Y] X^{*}$ , put

\begin{matrix} u > v if (u^{X} >_{_{X}} v^{X}) or (u^{X} = v^{X} and u^{Y} >_{_{Y}} v^{Y}) . \end{matrix}

We can express every element $f \in L i e_{k [Y]} (X)$ as $f = \sum α_{i} β_{i} [u_{i}]$ , where $α_{i} \in k$ , $β_{i} \in [Y]$ , and $[u_{i}] \in N S L W (X) .$

Then $f = \sum α_{i} β_{i} [u_{i}] = \sum g_{j} (Y) [u_{j}]$ , where $g_{j} (Y) \in k [Y]$ are polynomials in the $k$ -algebra $k [Y] ⟨ X ⟩$ . The leading word $\bar{f}$ of $f$ in $k [Y] ⟨ X ⟩$ is of the form $β_{1} u_{1}$ with $β_{1} \in [Y]$ and $u_{1} \in A L S W (X)$ . The polynomial $f$ is called monic (or $k$ -monic) if the coefficient of $\bar{f}$ is equal to 1, that is, $α_{1} = 1$ . The notion of $k [Y]$ -monic polynomials is introduced similarly: $α_{1} = 1$ and $β_{1} = 1$ .

Recall that every ALSW $w$ admits a unique bracketing such that $[w]$ is a NLSW.

Consider a monic subset $S \subset L i e_{k [Y]} (X)$ . Given a non-associative word $(u)$ on $X$ with a fixed occurrence of some $x_{i}$ and $s \in S$ , call ${(u)}_{x_{i} \mapsto s}$ an $S$ -word. Define $| u |$ to be the $s$ -length of ${(u)}_{x_{i} \mapsto s}$ . Every $S$ -word is of the form $(a s b)$ with $a, b \in X^{*}$ and $s \in S$ . If $a {\bar{s}}^{X} b \in A L S W (X)$ then we have the special bracketing ${[a {\bar{s}}^{X} b]}_{{\bar{s}}^{X}}$ of $a {\bar{s}}^{X} b$ relative to ${\bar{s}}^{X}$ . Refer to ${[a s b]}_{\bar{s}} = {[a {\bar{s}}^{X} b]}_{{\bar{s}}^{X}} |_{[{\bar{s}}^{X}] \mapsto s}$ as a special normal $s$ -word (or special normal $S$ -word).

An $S$ -word $(u) = (a s b)$ is a normal $s$ -word, denoted by ${⌊ u ⌋}_{s}$ , whenever ${\bar{(a s b)}}^{X} = a {\bar{s}}^{X} b$ . The following condition is sufficient.

(i)
The $s$ -length of $(u)$ is 1, that is, $(u) = s$ ;
(ii)
if ${⌊ u ⌋}_{s}$ is a normal $S$ -word of $s$ -length $k$ and $[v] \in N L S W (X)$ satisfies $| v | = l$ then $[v] {⌊ u ⌋}_{s}$ whenever $v > {\bar{⌊ u ⌋}}_{s}^{X}$ and ${⌊ u ⌋}_{s} [v]$ whenever $v < {\bar{⌊ u ⌋}}_{s}^{X}$ are normal $S$ -words of $s$ -length $k + l$ .

Take two monic polynomials $f$ and $g$ in $L i e_{k [Y]} (X)$ and put $L = lcm ({\bar{f}}^{Y}, {\bar{g}}^{Y})$ .

There are four kinds of compositions.

$C_{1}$ : Inclusion composition. If ${\bar{f}}^{X} = a {\bar{g}}^{X} b$ for some $a, b \in X^{*}$ , then
$\begin{matrix} C_{1} {⟨ f, g ⟩}_{w} = \frac{L}{{\bar{f}}^{Y}} f - \frac{L}{{\bar{g}}^{Y}} {[a g b]}_{\bar{g}}, where w = L {\bar{f}}^{X} = L a {\bar{g}}^{X} b . \end{matrix}$
$C_{2}$ : Intersection composition. If ${\bar{f}}^{X} = a a_{0}$ and ${\bar{g}}^{X} = a_{0} b$ with $a, b, a_{0} \neq 1$ then
$\begin{matrix} C_{2} {⟨ f, g ⟩}_{w} = \frac{L}{{\bar{f}}^{Y}} {[f b]}_{\bar{f}} - \frac{L}{{\bar{g}}^{Y}} {[a g]}_{\bar{g}}, where w = L {\bar{f}}^{X} b = L a {\bar{g}}^{X} . \end{matrix}$
$C_{3}$ : External composition. If $g c d ({\bar{f}}^{Y}, {\bar{g}}^{Y}) \neq 1$ then for all $a, b, c \in X^{*}$ satisfying
$\begin{matrix} w = L a {\bar{f}}^{X} b {\bar{g}}^{X} c \in T_{A} = {β t | β \in [Y], t \in A L S W (X)} \end{matrix}$
we have
$\begin{matrix} C_{3} {⟨ f, g ⟩}_{w} = \frac{L}{{\bar{f}}^{Y}} {[a f b {\bar{g}}^{X} c]}_{\bar{f}} - \frac{L}{{\bar{g}}^{Y}} {[a {\bar{f}}^{X} b g c]}_{\bar{g}} . \end{matrix}$
$C_{4}$ : Multiplication composition. If ${\bar{f}}^{Y} \neq 1$ then for every special normal $f$ -word ${[a f b]}_{\bar{f}}$ with $a, b \in X^{*}$ we have
$\begin{matrix} C_{4} {⟨ f ⟩}_{w} = [a {\bar{f}}^{X} b] {[a f b]}_{\bar{f}}, where w = a {\bar{f}}^{X} b a \bar{f} b . \end{matrix}$

Given a $k$ -monic subset $S \subset L i e_{k [Y]} (X)$ and $w \in [Y] X^{*}$ , which is not necessarily in $T_{A}$ , an element $h \in L i e_{k [Y]} (X)$ is called trivial modulo $(S, w)$ if $h$ can be expressed as a $k [Y]$ -linear combination of normal $S$ -words with leading words smaller than $w$ . The set $S$ is a Gröbner–Shirshov basis in $L i e_{k [Y]} (X)$ if all possible compositions in $S$ are trivial.

Theorem 36

([31], the CD-lemma for Lie algebras over commutative algebras) Consider a nonempty set $S \subset L i e_{k [Y]} (X)$ of monic polynomials and denote by $I d (S)$ the ideal of $L i e_{k [Y]} (X)$ generated by $S$ . The following statements are equivalent:

(i)
The set $S$ is a Gröbner–Shirshov basis in $L i e_{k [Y]} (X)$ .
(ii)
If $f \in I d (S)$ then $\bar{f} = a \bar{s} b \in T_{A}$ for some $s \in S$ and $a, b \in [Y] X^{*}$ .
(iii)
The set $I r r (S) = {[u] | [u] \in T_{N}, u \neq a \bar{s} b, for s \in S and a, b \in [Y] X^{*}}$ is a linear basis for $L i e_{k [Y]} (X | S) = (L i e_{k [Y]} (X)) / I d (S)$ .

Here $T_{A} = {β t | β \in [Y], t \in A L S W (X)}$ and $T_{N} = {β [t] | β \in [Y], [t] \in N L S W (X)} .$

Outline of the proof.

Take $u, v \in [Y] A L S W (X)$ and write $u = u^{Y} u^{X}$ and $v = v^{Y} v^{X}$ . Define the ALSW- $lcm (u, v)$ (or $lcm (u, v)$ for short) as $w = w^{Y} w^{X} = lcm (u^{Y}, v^{Y}) lcm (u^{X}, v^{X})$ , where

\begin{matrix} lcm (u^{X}, v^{X}) \in {a u^{X} c v^{X} b (a n A L S W), a, b, c \in X^{*}; \\ u^{X} = a v^{X} b, a, b \in X^{*}; u^{X} b = a v^{X}, a, b \in X^{*}, d e g (u^{X} b) < d e g (u^{X}) + d e g (v^{X})} . \end{matrix}

Six $lcm (u, v)$ are possible:

(i)
( $Y$ -trivial, $X$ -trivial) (a trivial $lcm (u, v))$ ;
(ii)
( $Y$ -trivial, $X$ -inclusion);
(iii)
( $Y$ -trivial, $X$ -intersection);
(iv)
( $Y$ -nontrivial, $X$ -trivial);
(v)
( $Y$ -nontrivial, $X$ -inclusion);
(vi)
( $Y$ -nontrivial, $X$ -intersection).

In accordance with $lcm (u, v)$ , six general compositions are possible.

Denote by ${[w^{X}]}_{u^{X}, v^{X}}$ the Shirshov double special bracketing of $w^{X}$ whenever $w^{X}$ is a $X$ -trivial $lcm (u^{X}, v^{X})$ , by ${[w^{X}]}_{u^{X}}$ and ${[w^{X}]}_{v^{X}}$ the Shirshov special bracketings of $w^{X}$ whenever $w^{X}$ is a lcm of $X$ -inclusion or $X$ -intersection respectively.

Define general Lie compositions for $k$ -monic Lie polynomials $f$ and $g$ with $\bar{f} = u$ and $\bar{g} = v$ as

\begin{matrix} {(f, g)}_{w} & = (lcm (u^{Y}, v^{Y}) / u^{Y}) {[w^{X}]}_{u^{X}, v^{X}} {|_{[u] \mapsto f} - (lcm (u^{Y}, v^{Y}) / v^{Y}) {[w^{X}]}_{u^{X}, v^{X}} |}_{[v] \mapsto g}, \\ {(f, g)}_{w} & = (lcm (u^{Y}, v^{Y}) / u^{Y}) {[w^{X}]}_{u} {|_{[u] \mapsto f} - (lcm (u^{Y}, v^{Y}) / v^{Y}) {[w^{X}]}_{v} |}_{[v] \mapsto g} . \end{matrix}

Lemma 9

([31]) The general composition ${(f, g)}_{w}$ of $k$ -monic Lie polynomials $f$ and $g$ with $\bar{f} = u$ and $\bar{g} = v$ , where $w$ is a ( $Y$ -trivial, $X$ -trivial) $lcm (u, v)$ , is $0 mod ({f, g}, w) .$

Proof

By (XIX), we have

\begin{matrix} {(f, g)}_{w} & = v^{Y} [a f b [v^{X}] d] - u^{Y} [a [u^{X}] b g d] = [a f b [v] d] - [a u b g d] \\ = [a f b ([v] - g) d] - [a ([u] - f) b g d] \equiv 0 mod ({f, g}, w) . \end{matrix}

The proof is complete. $□$

A Lie GS basis $S \subset L i e_{k [Y]} (X) \subset k [Y] ⟨ X ⟩$ need not be an associative GS basis because the PBW-theorem is not valid for Lie algebras over a commutative algebra (Shirshov [201]). Therefore, the argument for $L i e_{k} (X)$ above (see Sect. 4.2) fails for $L i e_{k [Y]} (X)$ .

Moreover, Shirshov’s original proof of the CD-lemma fails because the singleton ${s} \in L i e_{k [Y]} (X)$ is not a GS basis in general. The reason is that there exists a nontrivial composition ${(s, s)}_{w}$ of type ( $Y$ -nontrivial, $X$ -trivial).

There is another obstacle. For $L i e_{k} (X)$ , every $s$ -word is a linear combination of normal $s$ -words. For $L i e_{k [Y]} (X)$ this is not the case. Hence, we must use a multiplication composition $[u^{X}] f$ such that $\bar{f} = u = u^{Y} u^{X}$ .

Lemma 10

([31]) If every multiplication composition $[{\bar{s}}^{X}] s$ , $s \in S$ , is trivial modulo $(S, w = [u^{X}] u)$ , where $u = \bar{s}$ , then every $S$ -word is a linear combination of normal $S$ -words.

In our paper with Yongshan Chen [31], we use the following definition of triviality of a polynomial $f$ modulo $(S, w)$ :

\begin{matrix} f \equiv 0 mod (S, w) \Leftrightarrow f = \sum α_{i} e_{i}^{Y} [a_{i}^{X} s_{i} b_{i}^{X}], \end{matrix}

where $[a_{i}^{X} [{\bar{s_{i}}}^{X}] b_{i}^{X}]$ is the Shirshov special bracketing of the ALSW $a_{i}^{X} {\bar{s_{i}}}^{X} b_{i}^{X}$ with an ALSW ${\bar{s_{i}}}^{X}$ .

The previous definition of triviality modulo $(S, w)$ is equivalent to the usual definition by Lemma 11, which is key in the proof of the CD-lemma for Lie algebras over a commutative algebra.

Lemma 11

([31]) Given a monic set $S$ with trivial multiplication compositions, take a normal $s$ -word $(a s b)$ and a special normal $s$ -word $[a s b]$ with the same leading monomial $w = a \bar{s} b$ . Then they are equal modulo $(s, w)$ .

Lemmas 10 and 11 imply

Lemma 12

([31]) Given a monic set $S$ with trivial multiplication compositions, every element of the ideal generated by $S$ is a linear combination of special normal $S$ -words.

On the other hand, (XVII) and (XIX) imply the following analogue of Lemma 1 for $L i e_{k [Y]} (X)$ .

Lemma 13

([31]) Given two $k$ -monic special normal $S$ -words $e_{1}^{Y} [{a_{1}}^{X} s_{1} {b_{1}}^{X}]$ and $e_{2}^{Y} [{a_{2}}^{X} s_{2} {b_{2}}^{X}]$ with the same leading associative word $w_{1}$ , their difference is equal to $[a {(s_{1}, s_{2})}_{w} b]$ , where $w = lcm (\bar{s_{1}}, \bar{s_{2}})$ , $w_{1} = a w b$ , and $[a {(s_{1}, s_{2})}_{w} b] = {[w_{1}]}_{w} |_{[w] \mapsto {(s_{1}, s_{2})}_{w}}$ . Hence, if $S$ is a GS basis then the previous special normal $S$ -words are equal modulo $(S, w_{1})$ .

Now the claim (i) $\Rightarrow$ (ii) of the CD-lemma for $L i e_{k [Y]} (X)$ follows.

For every Lie algebra $L = L i e_{K} (X | S)$ over the commutative algebra $K = k [Y | R]$ ,

\begin{matrix} U (L) = K ⟨ X | S^{(-)} ⟩ = k [Y] ⟨ X | S^{(-)}, R X ⟩, \end{matrix}

where $S^{(-)}$ is just $S$ with all commutators $[u v] 4$ replaced with $u v - v u$ , is the universal enveloping associative algebra of $L$ .

A Lie algebra $L$ over a commutative algebra $K$ is called special whenever it embeds into its universal enveloping associative algebra. Otherwise it is called non-special.

Shirshov (1953) and Cartier (1958) gave classical examples of non-special Lie algebras over commutative algebras over $G F (2)$ , justified using ad hoc methods. Cohn (1963) suggested another non-special Lie algebra over a commutative algebra over a field of positive characteristic.

Example 1

(Shirshov (1953)) Take $k = G F (2)$ and

\begin{matrix} K = k [y_{i}, i = 0, 1, 2, 3 | y_{0} y_{i} = y_{i} (i = 0, 1, 2, 3), y_{i} y_{j} = 0 (i, j \neq 0)] . \end{matrix}

Consider $L = L i e_{K} (x_{i}, 1 \leq i \leq 13 | S_{1}, S_{2})$ , where

\begin{matrix} S_{1} & = {[x_{2} x_{1}] = x_{11}, [x_{3} x_{1}] = x_{13}, [x_{3} x_{2}] = x_{12}, \\ [x_{5} x_{3}] = [x_{6} x_{2}] = [x_{8} x_{1}] = x_{10}, [x_{i} x_{j}] = 0 (i > j)}; \\ S_{2} & = {y_{0} x_{i} = x_{i} (i = 1, 2, \dots, 13), \\ y_{1} x_{1} = x_{4}, y_{1} x_{2} = x_{5}, y_{1} x_{3} = x_{6}, y_{1} x_{12} = x_{10}, \\ y_{2} x_{1} = x_{5}, y_{2} x_{2} = x_{7}, y_{2} x_{3} = x_{8}, y_{2} x_{13} = x_{10}, \\ y_{3} x_{1} = x_{6}, y_{3} x_{2} = x_{8}, y_{3} x_{3} = x_{9}, y_{3} x_{11} = x_{10}, \\ y_{1} x_{k} = 0 (k = 4, 5, \dots, 11, 13), \\ y_{2} x_{t} = 0 (t = 4, 5, \dots, 12), \\ y_{3} x_{l} = 0 (l = 4, 5, \dots, 10, 12, 13)} . \end{matrix}

Then $L = L i e_{K} (X | S_{1}, S_{2}) = L i e_{k [Y]} (X | S_{1}, S_{2}, R X)$ and

\begin{matrix} S = S_{1} \cup S_{2} \cup R X \cup {y_{1} x_{2} = y_{2} x_{1}, y_{1} x_{3} = y_{3} x_{1}, y_{2} x_{3} = y_{3} x_{2}} \end{matrix}

is a GS basis in $L i e_{k [Y]} (X)$ , which implies that $x_{10}$ belongs to the linear basis of $L$ by Theorem 36, that is, $x_{10} \neq 0$ in $L$ .

On the other hand, the universal enveloping algebra of $L$ has the presentation

\begin{matrix} U_{K} (L) = K ⟨ X | S_{1}^{(-)}, S_{2} ⟩ ≅ k [Y] ⟨ X | S_{1}^{(-)}, S_{2}, R X ⟩ . \end{matrix}

However, the GS completion (see Mikhalev and Zolotykh [170]) of $S_{1}^{(-)} \cup S_{2} \cup R X$ in $k [Y] ⟨ X ⟩$ is

\begin{matrix} S^{C} = S_{1}^{(-)} \cup S_{2} \cup R X \cup {y_{1} x_{2} = y_{2} x_{1}, y_{1} x_{3} = y_{3} x_{1}, y_{2} x_{3} = y_{3} x_{2}, x_{10} = 0} . \end{matrix}

Thus, $L$ is not special.

Example 2

(Cartier [70]) Take $k = G F (2)$ and

\begin{matrix} K = k [y_{1}, y_{2}, y_{3} | y_{i}^{2} = 0, i = 1, 2, 3] . \end{matrix}

Consider $L = L i e_{K} (x_{i j}, 1 \leq i \leq j \leq 3 | S)$ , where

\begin{matrix} S = {[x_{i i} x_{j j}] = x_{j i} (i > j), [x_{i j} x_{k l}] = 0, y_{3} x_{33} = y_{2} x_{22} + y_{1} x_{11}} . \end{matrix}

Then $L$ is not special over $K$ .

Proof

The set $S^{'} = S \cup {y_{i}^{2} x_{k l} = 0 (\forall i, k, l)} \cup S_{1}$ is a GS basis in $L i e_{k [Y]} (X)$ , where

\begin{matrix} S_{1} & = {y_{3} x_{23} = y_{1} x_{12}, y_{3} x_{13} = y_{2} x_{12}, y_{2} x_{23} = y_{1} x_{13}, y_{3} y_{2} x_{22} = y_{3} y_{1} x_{11}, \\ y_{3} y_{1} x_{12} = 0, y_{3} y_{2} x_{12} = 0, y_{3} y_{2} y_{1} x_{11} = 0, y_{2} y_{1} x_{13} = 0} . \end{matrix}

Then, $y_{2} y_{1} x_{12} \in I r r (S^{'})$ and so $y_{2} y_{1} x_{12} \neq 0$ in $L$ .

However, in

\begin{matrix} U_{K} (L) = K ⟨ X | S^{(-)} ⟩ ≅ k [Y] ⟨ X | S^{(-)}, y_{i}^{2} x_{k l} = 0 (\forall i, k, l) ⟩ \end{matrix}

we have

\begin{matrix} 0 = y_{3}^{2} x_{33}^{2} = {(y_{2} x_{22} + y_{1} x_{11})}^{2} = y_{2}^{2} x_{22}^{2} + y_{1}^{2} x_{11}^{2} + y_{2} y_{1} [x_{22}, x_{11}] = y_{2} y_{1} x_{12} . \end{matrix}

Thus, $L ↪̸ U_{K} (L)$ .

Conjecture (Cohn [95]) Take the algebra $K = k [y_{1}, y_{2}, y_{3} | y_{i}^{p} = 0, i = 1, 2, 3]$ of truncated polynomials over a field $k$ of characteristic $p > 0$ . The algebra

\begin{matrix} L_{p} = L i e_{K} (x_{1}, x_{2}, x_{3} | y_{3} x_{3} = y_{2} x_{2} + y_{1} x_{1}), \end{matrix}

called Cohn’s Lie algebra, is not special.

In $U_{K} (L_{p})$ we have

\begin{matrix} 0 = {(y_{3} x_{3})}^{p} = {(y_{2} x_{2})}^{p} + Λ_{p} (y_{2} x_{2}, y_{1} x_{1}) + {(y_{1} x_{1})}^{p} = Λ_{p} (y_{2} x_{2}, y_{1} x_{1}), \end{matrix}

where $Λ_{p}$ is a Jacobson–Zassenhaus Lie polynomial. Cohn conjectured that $Λ_{p} (y_{2} x_{2}, y_{1} x_{1}) \neq 0$ in $L_{p}$ . To prove this, we must know a GS basis of $L_{p}$ up to degree $p$ in $X$ . We found it for $p = 2, 3, 5$ . For example, $Λ_{2} = [y_{2} x_{2}, y_{1} x_{1}] = y_{2} y_{1} [x_{2} x_{1}]$ and a GS basis of $L_{2}$ up to degree $2$ in $X$ is

\begin{matrix} y_{3} x_{3} = y_{2} x_{2} + y_{1} x_{1}, y_{i}^{2} x_{j} = 0 (1 \leq i, j \leq 3), y_{3} y_{2} x_{2} = y_{3} y_{1} x_{1}, y_{3} y_{2} y_{1} x_{1} = 0, \\ y_{2} [x_{3} x_{2}] = y_{1} [x_{3} x_{1}], y_{3} y_{1} [x_{2} x_{1}] = 0, y_{2} y_{1} [x_{3} x_{1}] = 0 . \end{matrix}

Therefore, $y_{2} y_{1} [x_{2} x_{1}] \in I r r (S^{C})$ .

Similar though much longer computations show that $Λ_{3} \neq 0$ in $L_{3}$ and $Λ_{5} \neq 0$ in $L_{5}$ . Thus, we have

Theorem 37

([31]) Cohn’s Lie algebras $L_{2}$ , $L_{3}$ , and $L_{5}$ are non-special.

Theorem 38

([31]) Given a commutative $k$ -algebra $K = k [Y | R]$ , if $S$ is a Gröbner–Shirshov basis in $L i e_{k [Y]} (X)$ such that every $s \in S$ is $k [Y]$ -monic then $L = L i e_{K} (X | S)$ is special.

Corollary 18

([31]) Every Lie $K$ -algebra $L_{K} = L i e_{K} (X | f)$ with one monic defining relation $f = 0$ is special.

Theorem 39

([31]) Suppose that $S$ is a finite homogeneous subset of $L i e_{k} (X)$ . Then the word problem of $L i e_{K} (X | S)$ is solvable for every finitely generated commutative $k$ -algebra $K$ .

Theorem 40

([31]) Every finitely or countably generated Lie $K$ -algebra embeds into a two-generated Lie $K$ -algebra, where $K$ is an arbitrary commutative $k$ -algebra.

5 Gröbner–Shirshov bases for $Ω$ -algebras and operads

5.1 CD-lemmas for $Ω$ -algebras

Some new CD-lemmas for $Ω$ -algebras have appeared: for associative conformal algebras [45] and $n$ -conformal algebras [43], for the tensor product of free algebras [30], for metabelian Lie algebras [75], for associative $Ω$ -algebras [41], for color Lie superalgebras and Lie $p$ -superalgebras [165, 166], for Lie superalgebras [167], for associative differential algebras [76], for associative Rota–Baxter algebras [32], for $L$ -algebras [33], for dialgebras [38], for pre-Lie algebras [35], for semirings [40], for commutative integro-differential algebras [102], for difference-differential modules and difference-differential dimension polynomials [225], for $λ$ -differential associative $Ω$ -algebras [185], for commutative associative Rota–Baxter algebras [186], for algebras with differential type operators [111].

Latyshev studied general versions of GS (or standard) bases [147, 148].

Let us state the CD-lemma for pre-Lie algebras, see [35].

A non-associative algebra $A$ is called a pre-Lie (or a right-symmetric) algebra if $A$ satisfies the identity $(x, y, z) = (x, z, y)$ for the associator $(x, y, z) = (x y) z - x (y z)$ . It is a Lie admissible algebra in the sense that $A^{(-)} = (A, [x y] = x y - y x)$ is a Lie algebra.

Take a well-ordered set $X = {x_{i} | i \in I}$ . Order $X^{* *}$ by induction on the lengths of the words $(u)$ and $(v)$ :

(i)
When $| ((u) (v)) | = 2$ put $(u) = x_{i} > (v) = x_{j}$ if and only if $i > j$ .
(ii)
When $| ((u) (v)) | > 2$ put $(u) > (v)$ if and only if one of the following holds:
1. (a)
  $| (u) | > | (v) |$ ;
2. (b)
  if $| (u) | = | (v) |$ with $(u) = ((u_{1}) (u_{2}))$ and $(v) = ((v_{1}) (v_{2}))$ then $(u_{1}) > (v_{1})$ or $(u_{1}) = (v_{1})$ and $(u_{2}) > (v_{2})$ .

We now quote the definition of good words (see [198]) by induction on length:

(1)
$x$ is a good word for any $x \in X$ ;
(2)
a non-associative word $((v) (w))$ is called a good word if
1. (a)
  both $(v)$ and $(w)$ are good words and
2. (b)
  if $(v) = ((v_{1}) (v_{2}))$ then $(v_{2}) \leq (w)$ .

Denote $(u)$ by $[u]$ whenever $(u)$ is a good word.

Denote by $W$ the set of all good words in the alphabet $X$ and by $R S ⟨ X ⟩$ the free right-symmetric algebra over a field $k$ generated by $X$ . Then $W$ forms a linear basis of $R S ⟨ X ⟩$ , see [198]. Kozybaev et al. [141] proved that the deg-lex ordering on $W$ is monomial.

Given a set $S \subset R S ⟨ X ⟩$ of monic polynomials and $s \in S$ , an $S$ -word ${(u)}_{s}$ is called a normal $S$ -word whenever ${(u)}_{\bar{s}} = (a \bar{s} b)$ is a good word.

Take $f, g \in S$ , $[w] \in W$ , and $a, b \in X^{*}$ . Then there are two kinds of compositions.

(i)
If $\bar{f} = [a \bar{g} b]$ then ${(f, g)}_{\bar{f}} = f - [a g b]$ is called the inclusion composition.
(ii)
If $(\bar{f} [w])$ is not good then $f \cdot [w]$ is called the right multiplication composition.

Theorem 41

([35], the CD-lemma for pre-Lie algebras) Consider a nonempty set $S \subset R S ⟨ X ⟩$ of monic polynomials and the ordering $<$ defined above. The following statements are equivalent:

(i)
The set $S$ is a Gröbner–Shirshov basis in $R S ⟨ X ⟩$ .
(ii)
If $f \in I d (S)$ then $\bar{f} = [a \bar{s} b]$ for some $s \in S$ and $a, b \in X^{*}$ , where $[a s b]$ is a normal $S$ -word.
(iii)
The set $I r r (S) = {[u] \in W | [u] \neq [a \bar{s} b] a, b \in X^{*}, s \in S and [a s b] is a normal S -word}$ is a linear basis of the algebra $R S ⟨ X | S ⟩ = R S ⟨ X ⟩ / I d (S)$ .

As an application, we have a GS basis for the universal enveloping pre-Lie algebra of a Lie algebra.

Theorem 42

([35]) Consider a Lie algebra $(L, [])$ with a well-ordered linear basis $X = {e_{i} | i \in I}$ . Write $[e_{i} e_{j}] = \sum_{m} α_{i j}^{m} e_{m}$ with $α_{i j}^{m} \in k$ . Denote $\sum_{m} α_{i j}^{m} e_{m}$ by ${e_{i} e_{j}}$ . Denote by

\begin{matrix} U (L) = R S ⟨ {e_{i}}_{I} | e_{i} e_{j} - e_{j} e_{i} = {e_{i} e_{j}}, i, j \in I ⟩ \end{matrix}

the universal enveloping pre-Lie algebra of $L$ . The set

\begin{matrix} S & = {f_{i j} = e_{i} e_{j} - e_{j} e_{i} - {e_{i} e_{j}}, i, j \in I and i > j} \end{matrix}

is a Gröbner–Shirshov basis in $R S ⟨ X ⟩$ .

Theorems 41 and 42 directly imply the following PBW theorem for Lie algebras and pre-Lie algebras.

Corollary 19

(Segal [198]) A Lie algebra $L$ embeds into its universal enveloping pre-Lie algebra $U (L)$ as a subalgebra of $U {(L)}^{(-)}$ .

Recently the CD-lemmas mentioned above and other combinatorial methods yielded many applications: for groups of Novikov–Boone type [119–121] (see also [16, 17, 77, 118], for Coxeter groups [58, 150], for center-by-metabelian Lie algebras [214], for free metanilpotent Lie algebras, Lie algebras and associative algebras [112, 168, 215, 216], for Poisson algebras [159], for quantum Lie algebras and related problems [132, 135], for PBW-bases [131, 134, 158], for extensions of groups and associative algebras [73, 74], for (color) Lie ( $p$ )-superalgebras [9, 48, 91, 92, 105–107, 169, 227, 228], for Hecke algebras and Specht modules [125], for representations of Ariki–Koike algebras [126], for the linear algebraic approach to GS bases [127], for HNN groups [87], for certain one-relator groups [88], for embeddings of algebras [39, 83], for free partially commutative Lie algebras [84, 181], for quantum groups of type $D_{n}$ , $E_{6}$ , and $G_{2}$ [174, 189, 221, 222], for calculations of homogeneous GS bases [145], for Picard groups, Weyl groups, and Bruck–Reilly extensions of semigroups [7, 128–130, 139], for Akivis algebras and pre-Lie algebras [79], for free Sabinin algebras [93].

5.2 CD-lemma for operads

Following Dotsenko and Khoroshkin ([98], Proposition 3), linear bases for a symmetric operad and a shuffle operad are the same provided both of them are defined by the same generators and defining relations. It means that we need CD-lemma for shuffle operads only (and we define a GS basis for a symmetric operad as a GS basis of the corresponding shuffle operad).

We express the elements of the free shuffle operad using planar trees.

Put $V = ⋃_{n = 1}^{\infty} V_{n}$ , where $V_{n} = {δ_{i}^{(n)} | i \in I_{n}}$ is the set of $n$ -ary operations.

Call a planar tree with $n$ leaves decorated whenever the leaves are labeled by $[n] = {1, 2, 3, \dots, n}$ for $n \in N$ and every vertex is labeled by an element of $V$ .

For an arrow in a decorated tree, let its value be the minimal value of the leaves of the subtree grafted to its end. A decorated tree is called a tree monomial whenever for each its internal vertex the values of the arrows beginning from it increase from the left to the right.

Denote by $F_{V} (n)$ the set of all tree monomials with $n$ leaves and put $T = \cup_{n \geq 1} F_{V} (n)$ . Given $α = α (x_{1}, \dots, x_{n}) \in F_{V} (n)$ and $β \in F_{V} (m)$ , define the shuffle composition $α \circ_{i, σ} β$ as

\begin{matrix} α (x_{1}, \dots, x_{i - 1}, β (x_{i}, x_{σ (i + 1)}, \dots, x_{σ (i + m - 1)}), x_{σ (i + m)}, \dots, x_{σ (m + n - 1)}), \end{matrix}

which lies in $F_{V} (n + m - 1)$ , where $1 \leq i \leq n$ and the bijection

\begin{matrix} σ : {i + 1, \dots, m + n - 1} \to {i + 1, \dots, m + n - 1} \end{matrix}

is an $(m - 1, n - i)$ -shuffle, that is,

\begin{matrix} σ (i + 1) < σ (i + 2) < \dots < σ (i + m - 1), \\ σ (i + m) < σ (i + m + 1) < \dots < σ (n + m - 1) . \end{matrix}

The set $T$ is freely generated by $V$ with the shuffle composition.

Denote by $F_{V} = k T$ the $k$ -linear space spanned by $T$ . This space with the shuffle compositions $\circ_{i, σ}$ is called the free shuffle operad.

Take a homogeneous subset $S$ of $F_{V}$ . For $s \in S$ , define an $S$ -word ${u |}_{s}$ as before.

A well ordering $>$ on $T$ is called monomial (admissible) whenever

\begin{matrix} {α > β \Rightarrow u |}_{α} {> u |}_{β} for any u \in T . \end{matrix}

Assume that $T$ is equipped with a monomial ordering. Then each $S$ -word is a normal $S$ -word.

For example, the following ordering $>$ on $T$ is monomial, see Proposition 5 of [98].

Every $α = α (x_{1}, \dots, x_{n}) \in F_{V} (n)$ has a unique expression

\begin{matrix} α = (path (1), \dots, path (n), [i_{1} \dots i_{n}]), \end{matrix}

where $path (r) \in V^{*}$ for $1 \leq r \leq n$ is the unique path from the root to the leaf $r$ and the permutation $[i_{1} \dots i_{n}]$ lists the labels of the leaves of the underlying tree in the order determined by the planar structure, from left to right. In this case define

\begin{matrix} wt (α) = (n, path (1), \dots, path (n), [i_{1} \dots i_{n}]) . \end{matrix}

Assume that $V$ is a well-ordered set and use the deg-lex ordering on $V^{*}$ . Take the order on the permutations in reverse lexicographic order: $i > j$ if and only if $i$ is less than $j$ as numbers.

Now, given $α, β \in T$ , define

\begin{matrix} α > β \Leftrightarrow wt (α) > wt (β) lexicographically . \end{matrix}

An element of $F_{V}$ is called homogeneous whenever all tree monomials occurring in this element with nonzero coefficients have the same arity degree (but not necessarily the same operation degree).

For two tree monomials $α$ and $β$ , say that $α$ is divisible by $β$ whenever there exists a subtree of the underlying tree of $α$ for which the corresponding tree monomial $α^{'}$ is equal to $α$ .

A tree monomial $γ$ is called a common multiple of two tree monomials $α$ and $β$ whenever it is divisible by both $α$ and $β$ . A common multiple $γ$ of two tree monomials $α$ and $β$ is called a least common multiple and denoted by $γ = lcm (α, β)$ whenever $| α | + | β | > | γ |$ , where $| δ | = n$ for $δ \in F_{V} (n)$ .

Take two monic homogeneous elements $f$ and $g$ of $F_{V}$ . If $\bar{f}$ and $\bar{g}$ have a least common multiple $w$ then ${(f, g)}_{w} = w_{_{\bar{f} \mapsto f}} - w_{_{\bar{g} \mapsto g}}$ .

Theorem 43

([98], the CD-lemma for shuffle operads) In the above notation, consider a nonempty set $S \subset F_{V}$ of monic homogeneous elements and a monomial ordering $<$ on $T$ . The following statements are equivalent:

(i)
The set $S$ is a Gröbner–Shirshov basis in $F_{V}$ .
(ii)
If $f \in I d (S)$ then ${\bar{f} = u |}_{\bar{s}}$ for some $S$ -word ${u |}_{s}$ .
(iii)
The set $I r r (S) = {u \in T | u \neq v |_{\bar{s}} for all S -word v |_{s}}$ is a $k$ -linear basis of $F_{V} / I d (S)$ .

As applications, the authors of [98] calculate Gröbner–Shirshov bases for some well-known operads: the operad Lie of Lie algebras, the operad As of associative algebras, and the operad PreLie of pre-Lie algebras.

Notes

Though Shirshov [207] 1962 was the first to come up with the idea of a ‘Gröbner–Shirshov basis’ for Lie and non-commutative polynomial algebras, his paper became practically unknown outside Russia. In the meantime, Buchberger’s ‘Gröbner basis’ (Thesis 1965 [65], paper 1970 [66]) for (commutative) polynomials became very popular in science. As a result, the first author suggested the name ‘Gröbner–Shirshov basis’ for non-commutative and non-associative polynomials. For (commutative) differential polynomials an analogous, or better to say, closely related ‘basis’ is called a Ritt–Kolchin characteristic set, due to Ritt [193] 1950 and Kolchin [140] 1973, and rediscovered by Wu [219] 1978.
The name ‘Composition-Diamond lemma’ combines the Neuman Diamond Lemma [172], the Shirshov Composition Lemma [207] and the Bergman Diamond Lemma [11].
We use the standard algebraic terminology ‘the word problem’, ‘the identity problem’, see Kharlampovich, Sapir [136] for instance.
After his Ph.D. Thesis of 1950, Zhukov moved to the present Keldysh Institute of Applied Mathematics (Moscow) to do computational mathematics. Godunov in ‘Reminiscence about numerical schemes’, arxiv.org/pdf/0810.0649, 2008, mentioned his name in relation to the creation of the famous Godunov numerical method. So, Zhukov was a forerunner of two important computational methods!
It must be pointed out that Malcev (1909–1967) inspired Shirshov’s works very much. Malcev was an official opponent (referee) of his (second) Doctor of Sciences Dissertation at MSU in 1958. The first author, Bokut, remembers this event at the Science Council Meeting, chaired by Kolmogorov, and Malcev’s words “Shirshov’s dissertation is a brilliant one!”. Malcev and Shirshov worked together at the present Sobolev Institute of Mathematics in Novosibirsk since 1959 until Malcev’s sudden death at 1967, and have been friends despite the age difference. Malcev headed the Algebra and Logic Department (by the way, the first author is a member of the department since 1960) and Shirshov was the first deputy director of the institute (whose director was Sobolev). In those years, Malcev was interested in the theory of algorithms of mathematical logic and algorithmic problems of model theory. Thus, Shirshov had an additional motivation to work on algorithmic problems for Lie algebras. Both Maltsev and Kurosh were delighted with Shirshov’s results of [207]. Malcev successfully nominated the paper for an award of the Presidium of the Siberian Branch of the Academy of Sciences (Sobolev and Malcev were the only Presidium members from the Institute of Mathematics at the time).
The Lyndon–Shirshov basis for the alphabet $x_{1}, x_{2}$ is different from the above Shirshov content basis starting with monomials of degree 7.
The first definitions of the symmetric operad were given by Kurosh’s student Artamonov under the name ‘clone of multilinear operations’ in 1969, see Kurosh [144] and Artamonov [4], cf. Lambek (1969) [146] and May (1972) [162].
From [12]: “A famous theorem concerning Lyndon words asserts that any word $w$ can be factorized in a unique way as a non-increasing product of Lyndon words, i.e. written $w = x_{1} x_{2} \dots x_{n}$ with $x_{1} \geq x_{2} \geq \dots \geq x_{n}$ . This theorem has imprecise origin. It is usually credited to Chen et al., following the paper of Schützenberger [197] in which it appears as an example of factorization of free monoids. Actually, as pointed out to one of us by Knuth in 2004, the reference [72] does not contain explicitly this statement.”

Abbreviations

CD-lemma:: Composition-Diamond lemma
GS basis:: Gröbner–Shirshov basis
LS word (basis):: Lyndon–Shirshov word (basis)
ALSW(X):: The set of all associative Lyndon–Shirshov words in $X$
NLSW(X):: The set of all non-associative Lyndon–Shirshov words in $X$
PBW theorem:: The Poincare–Birkhoff–Witt theorem
$X^{*}$ :: The free monoid generated by $X$
$[X]$ :: The free commutative monoid generated by $X$
$X^{* *}$ :: The set of all non-associative words $(u)$ in $X$
$g p ⟨ X | S ⟩$ :: The group generated by $X$ with defining relations $S$
$sgp ⟨ X | S ⟩$ :: The semigroup generated by $X$ with defining relations $S$
$k$ :: A field
$K$ :: A commutative algebra over $k$ with unity
$k ⟨ X ⟩$ :: The free associative algebra over $k$ generated by $X$
$k ⟨ X | S ⟩$ :: The associative algebra over $k$ with generators $X$ and defining relations $S$
$S^{c}$ :: A Gröbner–Shirshov completion of $S$
$I d (S)$ :: The ideal generated by a set $S$
$\bar{s}$ :: The maximal word of a polynomial $s$ with respect to some ordering $<$
$I r r (S)$ :: The set of all monomials avoiding the subword $\bar{s}$ for all $s \in S$
$k [X]$ :: The polynomial algebra over $k$ generated by $X$
$L i e (X)$ :: The free Lie algebra over $k$ generated by $X$
$L i e_{K} (X)$ :: The free Lie algebra generated by $X$ over a commutative algebra $K$

References

Adjan, S.I.: Algorithmic undecidability of certain decision problems of group theory. Trudy Moscow Mat. Ob. 6, 231–298 (1957)
Google Scholar
Alahmadi, A., Alsulami, H., Jain, S.K., Zelmanov, E.: Leavitt path algebras of finite Gelfand–Kirillov dimension. J. Algebra Appl. 11(6), 1250225-1–1250225-6 (2012)
Alahmadi, A., Alsulami, H., Jain, S.K., Zelmanov, E.: Structure of Leavitt path algebras of polynomial growth. doi:10.1073/pnas.1311216110
Artamonov, V.A.: Clones of multilinear operations and multioperator for algebras. Uspekhi Mat. Nauk. 24(145), 47–59 (1969)
MathSciNet Google Scholar
Artin, E.: Theory der Zöpf. Abh. Math. Sem. Hamburg Univ. 4, 47–72 (1926)
MathSciNet Google Scholar
Artin, E.: Theory of braids. Ann. Math. 48, 101–126 (1947)
MathSciNet Google Scholar
Ates, F., Karpuz, E., Kocapinar, C., Cevik, A.S.: Gröbner–Shirshov bases of some monoids. Discret. Math. 311(12), 1064–1071 (2011)
MathSciNet Google Scholar
Bahturin, Y., Olshanskii, A.: Filtrations and distortion in infinite-dimensional algebras. J. Algebra 327, 251–291 (2011)
MathSciNet Google Scholar
Bahturin, Y.A., Mikhalev, A.A., Petrogradskij, V.M., Zajtsev, M.V.: Infinite Dimensional Lie Superalgebras, vol. x, 250 p. De Gruyter Expositions in Mathematics, vol. 7. W. de Gruyter, Berlin (1992)
Belyaev, V.Y.: Subrings of finitely presented associative rings. Algebra Log. 17, 627–638 (1978)
MathSciNet Google Scholar
Bergman, G.M.: The diamond lemma for ring theory. Adv. Math. 29, 178–218 (1978)
MathSciNet Google Scholar
Berstel, J.D., Perrin, D.: The origins of combinatorics on words. Eur. J. Comb. 28, 996–1022 (2007)
MathSciNet Google Scholar
Birman, J., Ko, K.H., Lee, S.J.: A new approach to the word and conjugacy problems for the braid groups. Adv. Math. 139, 322–353 (1998)
MathSciNet Google Scholar
Bjoner, A., Brenti, F.: Combinatorics of Coxeter Groups. Graduate Texts in Mathematics, vol. 231. Springer, Berlin (2005)
Bokut, L.A.: A base of free polynilpotent Lie algebras. Algebra Log. 2, 13–19 (1963)
MathSciNet Google Scholar
Bokut, L.A.: On one property of the Boone group. Algebra Log. 5, 5–23 (1966)
MathSciNet Google Scholar
Bokut, L.A.: On the Novikov groups. Algebra Log. 6, 25–38 (1967)
MathSciNet Google Scholar
Bokut, L.A.: Degrees of unsolvability of the conjugacy problem for finitely presented groups. Algebra Log. 5, 6, 4–70, 4–52 (1968)
Bokut, L.A.: Groups of fractions for the multiplicative semigroups of certain rings I–III. Sibirsk. Mat. Zh. 10, 246–286, 744–799, 800–819 (1969)
Bokut, L.A.: On the Malcev problem. Sibirsk. Mat. Zh. 10, 965–1005 (1969)
MathSciNet Google Scholar
Bokut, L.A.: Insolvability of the word problem for Lie algebras, and subalgebras of finitely presented Lie algebras. Izvestija AN USSR (mathem.) 36, 1173–1219 (1972)
MathSciNet Google Scholar
Bokut, L.A.: Imbeddings into simple associative algebras. Algebra Log. 15, 117–142 (1976)
MathSciNet Google Scholar
Bokut, L.A.: Gröbner–Shirshov bases for braid groups in Artin–Garside generators. J. Symb. Comput. 43, 397–405 (2008)
MathSciNet Google Scholar
Bokut, L.A.: Gröbner–Shirshov bases for the braid group in the Birman–Ko–Lee generators. J. Algebra 321, 361–379 (2009)
MathSciNet Google Scholar
Bokut, L.A., Chainikov, V.V., Shum, K.P.: Markov and Artin normal form theorem for braid groups. Commun. Algebra 35, 2105–2115 (2007)
Google Scholar
Bokut, L.A., Chainikov, V.V.: Gröbner–Shirshov bases of Adjan extension of the Novikov group. Discret. Math. 308, 4916–4930 (2008)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q.: Gröbner–Shirshov bases for Lie algebras: after A.I. Shirshov. Southeast Asian Bull. Math. 31, 1057–1076 (2007)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q.: Gröbner–Shirshov bases: some new results. In: Shum, K.P., Zelmanov, E., Zhang, J., Shangzhi, L. (eds.) Advance in Algebra and Combinatorics. Proceedings of the Second International Congress in Algebra and Combinatorics, pp. 35–56. World Scientific, Singapore (2008)
Bokut, L.A., Chen, Y.Q., Chen, W.P., Li, J.: New approaches to plactic monoid via Gröbner–Shirshov bases. arxiv.org/abs/1106.4753
Bokut, L.A., Chen, Y.Q., Chen, Y.S.: Composition-Diamond lemma for tensor product of free algebras. J. Algebra 323, 2520–2537 (2010)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Chen, Y.S.: Gröbner–Shirshov bases for Lie algebras over a commutative algebra. J. Algebra 337, 82–102 (2011)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Deng, X.M.: Gröbner–Shirshov bases for Rota–Baxter algebras. Sib. Math. J. 51, 978–988 (2010)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Huang, J.P.: Gröbner–Shirshov bases for L-algebras. Int. J. Algebra Comput. 23, 547–571 (2013)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Li, Y.: Anti-commutative Gröbner–Shirshov basis of a free Lie algebra. Sci. China Ser. A Math. 52, 244–253 (2009)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Li, Y.: Gröbner–Shirshov bases for Vinberg–Koszul–Gerstenhaber right-symmetric algebras. J. Math. Sci. 166, 603–612 (2010)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Li, Y.: Gröbner–Shirshov Bases for Categories. Nankai Series in Pure, Applied Mathematics and Theoretical Physical, Operads and Universal Algebra, vol. 9, pp. 1–23 (2012)
Bokut, L.A., Chen, Y.Q., Li, Y.: Lyndon–Shirshov words and anti-commutative algebras. J. Algebra 378, 173–183 (2013)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Liu, C.H.: Gröbner–Shirshov bases for dialgebras. Int. J. Algebra Comput. 20, 391–415 (2010)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Mo, Q.H.: Gröbner–Shirshov bases and embeddings of algebras. Int. J. Algebra Comput. 20, 875–900 (2010)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Mo, Q.H.: Gröbner–Shirshov bases for semirings. J. Algebra 385, 47–63 (2013)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Qiu, J.J.: Gröbner–Shirshov bases for associative algebras with multiple operations and free Rota–Baxter algebras. J. Pure Appl. Algebra 214, 89–100 (2010)
MathSciNet Google Scholar
Bokut, L.A., Chen, Y.Q., Shum, K.P.: Some new results on Gröbner–Shirshov bases. In: Proceedings of International Conference on Algebra 2010, Advances in Algebraic Structures, pp. 53–102 (2012)
Bokut, L.A., Chen, Y.Q., Zhang, G.L.: Composition-Diamond lemma for associative n-conformal algebras. arXiv:0903.0892
Bokut, L.A., Chen, Y.Q., Zhao, X.G.: Gröbner–Shirshov beses for free inverse semigroups. Int. J. Algebra Comput. 19, 129–143 (2009)
MathSciNet Google Scholar
Bokut, L.A., Fong, Y., Ke, W.-F.: Composition Diamond lemma for associative conformal algebras. J. Algebra 272, 739–774 (2004)
MathSciNet Google Scholar
Bokut, L.A., Fong, Y., Ke, W.-F., Kolesnikov, P.S.: Gröbner and Gröbner–Shirshov bases in algebra and conformal algebras. Fundam. Appl. Math. 6, 669–706 (2000)
MathSciNet Google Scholar
Bokut, L.A., Fong, Y., Ke, W.-F., Shiao, L.-S.: Gröbner–Shirshov bases for the braid semigroup. In: Shum, K.P. et al. (eds.) Advances in Algebra. Proceedings of the ICM Satellite Conference in Algebra and Related Topics, Hong Kong, China, August 14–17 (2002)
Bokut, L.A., Kang, S.-J., Lee, K.-H., Malcolmson, P.: Gröbner–Shirshov bases for Lie superalgebras and their universal enveloping algebras. J. Algebra 217, 461–495 (1999)
MathSciNet Google Scholar
Bokut, L.A., Klein, A.A.: Serre relations and Gröbner–Shirshov bases for simple Lie algebras I, II. Int. J. Algebra Comput. 6(389–400), 401–412 (1996)
MathSciNet Google Scholar
Bokut, L.A., Klein, A.A.: Gröbner–Shirshov bases for exceptional Lie algebras I. J. Pure Appl. Algebra 133, 51–57 (1998)
MathSciNet Google Scholar
Bokut, L.A., Klein, A.A.: Gröbner–Shirshov bases for exceptional Lie algebras E6, E7, E8. In: Algebra and Combinatorics (Hong Kong), pp. 37–46, Springer, Singapore (1999)
Bokut, L.A., Kolesnikov, P.S.: Gröbner–Shirshov bases: from their incipiency to the present. J. Math. Sci. 116, 2894–2916 (2003)
MathSciNet Google Scholar
Bokut, L.A., Kolesnikov, P.S.: Gröbner–Shirshov bases, conformal algebras and pseudo-algebras. J. Math. Sci. 131, 5962–6003 (2005)
MathSciNet Google Scholar
Bokut, L.A., Kukin, G.P.: Algorithmic and Combinatorial Algebra. Mathematics and its Applications. Kluwer Academic Publishers Group, Dordrecht (1994)
Google Scholar
Bokut, L.A., Malcolmson, P.: Gröbner–Shirshov bases for quantum enveloping algebras. Isr. J. Math. 96, 97–113 (1996)
MathSciNet Google Scholar
Bokut, L.A., Malcolmson, P.: Gröbner–Shirshov bases for Lie and associative algebras. Collection of Abstracts, ICAC’97, Hong Kong, pp. 139–142 (1997)
Bokut, L.A., Malcolmson, P.: Gröbner–Shirshov bases for relations of a Lie algebra and its enveloping algebra. In: Shum, K.-P. et al. (eds.) Algebras and Combinatorics. Papers from the International Congress, ICAC’97, Hong Kong, August 1997, pp. 47–54. Springer, Singapore (1999)
Bokut, L.A., Shiao, L.-S.: Gröbner–Shirshov bases for Coxeter groups. Commun. Algebra 29, 4305–4319 (2001)
MathSciNet Google Scholar
Bokut, L.A., Shum, K.P.: Relative Gröbner–Shirshov bases for algebras and groups. St. Petersbg. Math. J. 19, 867–881 (2008)
MathSciNet Google Scholar
Boone, W.W.: The word problem. Ann. Math. 70, 207–265 (1959)
MathSciNet Google Scholar
Borcherds, R.E.: Vertex algebras, Kac–Moody algebras, and the monster. Proc. Natl. Acad. Sci. USA 84, 3068–3071 (1986)
MathSciNet Google Scholar
Borcherds, R.E.: Generalized Kac–Moody algebras. J. Algebra 115(2), 501–512 (1988)
MathSciNet Google Scholar
Borcherds, R.E.: The monster Lie algebra. Adv. Math. 83(1), 30–47 (1990)
MathSciNet Google Scholar
Brieskorn, E., Saito, K.: Artin-Gruppen und Coxeter-Gruppen. Invent. Math. 17, 245–271 (1972)
MathSciNet Google Scholar
Buchberger, B.: An algorithm for finding a basis for the residue class ring of a zero-dimensional polynomial ideal. Ph.D. thesis, University of Innsbruck, Austria (1965)
Buchberger, B.: An algorithmical criteria for the solvability of algebraic systems of equations. Aequ. Math. 4, 374–383 (1970)
MathSciNet Google Scholar
Buchberger, B.: History and basic feature of the critical-pair/completion procedure. J. Symb. Comput. 3, 3–38 (1987)
MathSciNet Google Scholar
Buchberger, B., Collins, G.E., Loos, R., Albrecht, R.: Computer Algebra, Symbolic and Algebraic Computation. Computing Supplementum, vol. 4. Springer, New York (1982)
Google Scholar
Cain, A.J., Gray, R., Malheiro, A.: Finite Gröbner–Shirshov bases for Plactic algebras and biautomatic structures for Plactic monoids. arXiv:1205.4885v2
Cartier, P.: Remarques sur le th $\overset{´}{e}$ or $\overset{`}{e}$ me de Birkhoff–Witt, Annali della Scuola Norm. Sup. di Pisa s $\overset{´}{e}$ rie III, vol. XII, pp. 1–4 (1958)
Cassaigne, J., Espie, M., Krob, D., Novelli, J.C., Hivert, F.: The Chinese monoid. Int. J. Algebra Comput. 11, 301–334 (2001)
MathSciNet Google Scholar
Chen, K.-T., Fox, R., Lyndon, R.: Free differential calculus IV: the quotient group of the lower central series. Ann. Math. 68, 81–95 (1958)
MathSciNet Google Scholar
Chen, Y.Q.: Gröbner–Shirshov basis for Schreier extensions of groups. Commun. Algebra 36, 1609–1625 (2008)
Google Scholar
Chen, Y.Q.: Gröbner–Shirshov basis for extensions of algebras. Algebra Colloq. 16, 283–292 (2009)
MathSciNet Google Scholar
Chen, Y.S., Chen, Y.Q.: Gröbner–Shirshov bases for matabelian Lie algebras. J. Algebra 358, 143–161 (2012)
MathSciNet Google Scholar
Chen, Y.Q., Chen, Y.S., Li, Y.: Composition-Diamond lemma for differential algebras. Arab. J. Sci. Eng. 34, 135–145 (2009)
Google Scholar
Chen, Y.Q., Chen, W.S., Luo, R.I.: Word problem for Novikov’s and Boone’s group via Gröbner–Shirshov bases. Southeast Asian Bull. Math. 32, 863–877 (2008)
MathSciNet Google Scholar
Chen, Y.Q., Chen, Y.S., Zhong, C.Y.: Composition-Diamond lemma for modules. Czechoslov. Math. J. 60, 59–76 (2010)
MathSciNet Google Scholar
Chen, Y.Q., Li, Y.: Some remarks for the Akivis algebras and the Pre-Lie algebras. Czechoslov. Math. J. 61(136), 707–720 (2011)
Google Scholar
Chen, Y.Q., Li, Y., Tang, Q.Y.: Gröbner–Shirshov bases for some Lie algebras. Sib. Math. J. arXiv:1305.4546
Chen, Y.Q., Li, J., Zeng, M.J.: Composition-Diamond lemma for non-associative algebras over a commutative algebra. Southeast Asian Bull. Math. 34, 629–638 (2010)
MathSciNet Google Scholar
Chen, Y.Q., Mo, Q.H.: Artin-Markov normal form for braid group. Southeast Asian Bull. Math. 33, 403–419 (2009)
MathSciNet Google Scholar
Chen, Y.Q., Mo, Q.H.: Embedding dendriform algebra into its universal enveloping Rota–Baxter algebra. Proc. Am. Math. Soc. 139, 4207–4216 (2011)
MathSciNet Google Scholar
Chen, Y.Q., Mo, Q.H.: Gröbner–Shirshov bases for free partially commutative Lie algebras. Commun. Algebra 41, 3753–3761 (2013)
MathSciNet Google Scholar
Chen, Y.Q., Qiu, J.J.: Gröbner–Shirshov basis for the Chinese monoid. J. Algebra Appl. 7, 623–628 (2008)
MathSciNet Google Scholar
Chen, Y.Q., Shao, H.S., Shum, K.P.: On Rosso–Yamane theorem on PBW basis of $U_{q} (A_{N})$ . CUBO Math. J. 10, 171–194 (2008)
MathSciNet Google Scholar
Chen, Y.Q., Zhong, C.Y.: Gröbner–Shirshov basis for HNN extensions of groups and for the alternative group. Commun. Algebra 36, 94–103 (2008)
MathSciNet Google Scholar
Chen, Y.Q., Zhong, C.Y.: Gröbner–Shirshov basis for some one-relator groups. Algebra Colloq. 19, 99–116 (2011)
MathSciNet Google Scholar
Chen, Y.Q., Zhong, C.Y.: Gröbner–Shirshov bases for braid groups in Adjan–Thurston generators. Algebra Colloq. 20, 309–318 (2013)
MathSciNet Google Scholar
Chibrikov, E.S.: On free conformal Lie algebras. Vestn. Novosib. Gos. Univ. Ser. Mat. Mekh. Inform. 4(1), 65–83 (2004)
Google Scholar
Chibrikov, E.S.: A right normed basis for free Lie algebras and Lyndon–Shirshov words. J. Algebra 302, 593–612 (2006)
MathSciNet Google Scholar
Chibrikov, E.S.: The right-normed basis for a free Lie superalgebra and Lyndon–Shirshov words. Algebra Log. 45(4), 458–483 (2006)
MathSciNet Google Scholar
Chibrikov, E.S.: On free Sabinin algebras. Commun. Algebra 39, 4014–4035 (2011)
MathSciNet Google Scholar
Chibrikov, E.S.: On some embedding of Lie algebras. J. Algebra Appl. 11(1), 12 (2012)
Cohn, P.M.: A remark on the Birkhoff–Witt theorem. J. Lond. Math. Soc. 38, 197–203 (1963)
Google Scholar
Cohn, P.M.: Universal Algebra. Harper’s Series in Modern Mathematics. Harper and Row, New York Publishers xv, 333 p. (1965) (Second edition: Reidel, Dordrecht (1981))
Collins, D.J.: Representation of Turing reducibility by word and conjugacy problems in finitely presented groups. Acta Math. 128, 73–90 (1972)
MathSciNet Google Scholar
Dotsenko, V., Khoroshkin, A.: Gröbner bases for operads. Duke Math. J. 153, 363–396 (2010)
MathSciNet Google Scholar
Eisenbud, D., Peeva, I., Sturmfels, B.: Non-commutative Gröbner bases for commutative algebras. Proc. Am. Math. Soc. 126, 687–691 (1998)
MathSciNet Google Scholar
Etingof, P., Henriques, A., Kamnitzer, J., Rains, E.M.: The cohomology ring of the real locus of the moduli space of stable curves of genus 0 with marked points. Ann. Math. 171, 731–777 (2010)
MathSciNet Google Scholar
Farkas, D.R., Feustel, C., Green, E.I.: Synergy in the theories of Gröbner bases and path algebras. Can. J. Math. 45, 727–739 (1993)
MathSciNet Google Scholar
Gao, X., Guo, L., Zheng, S.H.: Constrction of free commutative integro-differential algebras by the method of Gröbner–Shirshov bases. J. Algebra Appl. (2014 to appear)
Garside, A.F.: The braid group and other groups. Q. J. Math. Oxf. 20, 235–254 (1969)
MathSciNet Google Scholar
Gelfand, S.I., Manin, Y.I.: Homological Algebra. Springer, Berlin (1999)
Gerdt, V.P., Kornyak, V.V.: Lie algebras and superalgebras defined by a finite number of relations: computer analysis. J. Nonlinear Math. Phys. 2(3–4), 367–373 (1995)
MathSciNet Google Scholar
Gerdt, V.P., Robuk, V.N., Sever’yanov, V.M.: The construction of finitely represented Lie algebras. Comput. Math. Math. Phys. 36(11), 1493–1505 (1996)
MathSciNet Google Scholar
Gerdt, V.P., Kornyak, V.V.: Program for constructing a complete system of relations, basis elements, and commutator table for finitely presented Lie algebras and superalgebras. Program. Comput. Softw. 23(3), 164–172 (1997)
MathSciNet Google Scholar
Golod, E.S.: Standard bases and homology. In: Algebra: Some Current Trends. Lecture Notes in Mathematics, vol. 1352, pp. 88–95 (1988)
Green, D.J.: Gröbner Bases and the Computation of Group Cohomology. Springer, Berlin (2003)
Google Scholar
Green, J.A.: Hall algebras, hereditary algebras and guantum algebras. Invent. Math. 120, 361–377 (1985)
Google Scholar
Guo, L., Sit, W., Zhang, R.: Differential type operators and Gröbner–Shirshov bases. J. Symb. Comput. 52, 97–123 (2013)
MathSciNet Google Scholar
Gupta, C.K., Umirbaev, U.U.: The occurrence problem for free metanilpotent Lie algebras. Commun. Algebra 27, 5857–5876 (1999)
MathSciNet Google Scholar
Hall, M.: A basis for free Lie rings and higher commutators in free groups. Proc. Am. Math. Soc. 3, 575–581 (1950)
Google Scholar
Hall, P.: A contribution to the theory of groups of prime power order. Proc. Lond. Math. Soc. Ser. 36, 29–95 (1933)
Google Scholar
Higman, G.: Subgroups of finitely presented groups. Proc. R. Soc. Lond. (Series A) 262, 455–475 (1961)
MathSciNet Google Scholar
Jones, V.F.R.: Hecke algebra representations of braid groups and link polynimials. Ann. Math. 128, 335–388 (1987)
Google Scholar
Kac, G.: Infinite Dimensional Lie Algebras. Cambridge University Press, Cambridge (1990)
Google Scholar
Kalorkoti, K.: Decision problems in group theory. Proc. Lond. Math. Soc. III Ser. 44, 312–332 (1982)
Kalorkoti, K.: Turing degrees and the word and conjugacy problems for finitely presented groups. Southeast Asian Bull. Math. 30, 855–887 (2006)
MathSciNet Google Scholar
Kalorkoti, K.: A finitely presented group with almost solvable conjugacy problem. Asian Eur. J. Math. 2, 611–635 (2009)
MathSciNet Google Scholar
Kalorkoti, K.: Sufficiency conditions for Bokut’ normal forms. Commun. Algebra 39, 2862–2873 (2011)
MathSciNet Google Scholar
Kandri-Rody, A., Weispfenning, V.: Non-commutative Gröbner bases in algebras of solvable type A. J. Symb. Comput. 9, 1–26 (1990)
MathSciNet Google Scholar
Kang, S.-J., Lee, K.-H.: Gröbner–Shirshov bases for representation theory. J. Korean Math. Soc. 37, 55–72 (2000)
MathSciNet Google Scholar
Kang, S.-J., Lee, K.-H.: Gröbner–Shirshov bases for irreducible $s l_{n + 1}$ -modules. J. Algebra 232, 1–20 (2000)
MathSciNet Google Scholar
Kang, S.-J., Lee, I.-S., Lee, K.-H., Oh, H.: Hecke algebras, Specht modules and Gröbner–Shirshov bases. J. Algebra 252, 258–292 (2002)
MathSciNet Google Scholar
Kang, S.-J., Lee, I.-S., Lee, K.-H., Oh, H.: Representations of Ariki–Koike algebras and Gröbner–Shirshov bases. Proc. Lond. Math. Soc. III Ser. 89, 54–70 (2004)
MathSciNet Google Scholar
Kang, S.-J., Lee, K.-H.: Linear algebraic approach to Gröbner–Shirshov basis theory. J. Algebra 313, 988–1004 (2007)
MathSciNet Google Scholar
Karpuz, E.G.: Complete rewriting system for the Chinese monoid. Appl. Math. Sci. 4, 1081–1087 (2010)
MathSciNet Google Scholar
Karpuz, E.G., Cevik, A.S.: Gröbner–Shirshov bases for extended modular, extended Hecke, and Picard groups. Math. Notes 92, 636–642 (2012)
MathSciNet Google Scholar
Karpuz, E.G., Ates, F., Cevik, A.S.: Gröbner–Shirshov bases of some Weyl groups. Rocky Mt. J. Math. (2014, to appear)
Kharchenko, V.K.: A quantum analog of the Poincar–Birkhoff–Witt theorem. Algebra Log. 38(4), 476–507 (1999)
MathSciNet Google Scholar
Kharchenko, V.K.: A combinatorial approach to the quantification of Lie algebras. Pac. J. Math. 203, 191–233 (2002)
MathSciNet Google Scholar
Kharchenko, V.K.: Braided version of Shirshov–Witt theorem. J. Algebra 294, 196–225 (2005)
MathSciNet Google Scholar
Kharchenko, V.K.: PBW-bases of coideal subalgebras and a freeness theorem. Trans. Am. Math. Soc. 360, 5121–5143 (2008)
MathSciNet Google Scholar
Kharchenko, V.K.: Triangular decomposition of right coideal subalgebras. J. Algebra 324, 3048–3089 (2010)
MathSciNet Google Scholar
Kharlampovich, O.G., Sapir, M.V.: Algorithmic problems in varieties. Int. J. Algebra Comput. 5, 379–602 (1995)
MathSciNet Google Scholar
Knuth, D.E.: Permutations, matrices, and generalized Young tableaux. Pac. J. Math. 34, 709–727 (1970)
MathSciNet Google Scholar
Knuth, D.E., Bendix, P.B.: Simple word problems in universal algebras. In: Leech, J. (ed.) Computational Problems in Abstract Algebra, pp. 263–297. Pergamon Press, Oxford (1970)
Kocapinar, C., Karpuz, E., Ates, F., Cevik, A.S.: Gröbner–Shirshov bases of generalized Bruck–Reilly *-extension. Algebra Colloq. 19, 813–820 (2012)
MathSciNet Google Scholar
Kolchin, E.R.: Differential Algebras and Algebraic Groups. Academic Press, New York (1973)
Google Scholar
Kozybaev, D., Makar-Limanov, L., Umirbaev, U.: The Freiheitssatz and autoumorphisms of free right-symmetric algebras. Asian Eur. J. Math. 1, 243–254 (2008)
MathSciNet Google Scholar
Kukin, G.P.: On the word problem for Lie algebras. Sibirsk. Mat. Zh. 18, 1194–1197 (1977)
MathSciNet Google Scholar
Kurosh, A.G.: Nonassociative free algebras and free products of algebras. Mat. Sb. 20, 239–262 (1947)
MathSciNet Google Scholar
Kurosh, A.G.: Multioperator ringpond algebras. Uspekhi Mat. Nauk 24(145), 3–15 (1969)
Google Scholar
La Scala, R., Levandovskyy, V.: Letterplace ideals and non-commutative Gröbner bases. J. Symb. Comput. 44, 1374–1393 (2009)
Google Scholar
Lambek, J.: Deductive system and categories II: standard constructions and closed categories. Lecture Notes in Mathematics, vol. 86. Springer, Berlin (1969)
Latyshev, V.N.: General version of standard bases in linear structures. In: Bahturin, Y. (ed.) Algebra, pp. 215–226. Walter de Gruyter, Berlin (2000)
Latyshev, V.N.: An improved version of standard bases. In: Krob, D. et al. (ed.) Formal Power Series and Algebraic Combinatorics, pp. 496–505. Springer, Berlin (2000)
Lazard, M.: Groupes, anneaux de Lie et problème de Burnside. Istituto Matematico dell’ Università di Roma (1960)
Lee, D.V.: Gröbner–Shirshov bases and normal forms for the Coxeter groups $E_{6}$ and $E_{7}$ . In: Shum, K.P. et al. (ed.) Advances in Algebra and Combinatorics, pp. 243–255. World Scientific, Hackensack (2008)
Lothaire, M.: Combinatorics on Words. Addison-Wesley Publishing Company, vol. xix, 238 p. (1983) (Second edition: Cambridge University Press, Cambridge (1977))
Lothaire, M.: Algebraic combinatorics on words, Encyclopedia of Mathematics and its Applications 90, Cambridge University Press (2002)
Lothaire, M.: Algebraic Combinatorics on Words. Cambridge University Press, Cambridge (2002)
Google Scholar
Lusztig, G.: Canonical bases arising from quantized enveloping algebras. J. Am. Math. Soc. 3, 447–498 (1990)
MathSciNet Google Scholar
Lusztig, G.: Hecke Algebras with Unequal Parameters. CRM Monograph Series, vol. 18. American Mathematical Society, Providence (2003)
Lyndon, R.C.: On Burnside’s problem I. Trans. Am. Math. Soc. 77, 202–215 (1954)
MathSciNet Google Scholar
Maclane, S.: Homology. Springer, Berlin (1963)
Makar-Limanov, L.: A version of the Poincaré–Birkhoff–Witt theorem. Bull. Lond. Math. Soc. 26(3), 273–276 (1994)
MathSciNet Google Scholar
Makar-Limanov, L., Umirbaev, U.U.: The Freiheitssatz for Poisson algebras. J. Algebra 328(1), 495–503 (2011)
MathSciNet Google Scholar
Markov, A.A.: An introduction to the algebraical theory of braids. In: Proceedings of the Steklov Mat. Ins. RAS, vol. 16 (1945)
Markov, A.A.: Impossibility of some algorithms in the theory of some associative system. Dokl. Akad. Nauk SSSR 55, 587–590 (1947)
Google Scholar
May, P.: The Geometry of Iterated Loop Space. Lecture Notes in Mathematics, vol. 271. Springer, Berlin (1972)
Michel, J.: Bases des algèbres de Lie et série de Hausdorff, Semin. P. Dubreil, 27e annee 1973/74, Algebre, Fasc. 1, Expose 6, 9 p. (1975)
Michel, J.: Calculs dans les algèbres de Lie libres: la série de Hausdorff et le problème de Burnside. Astérisque 38/39, 139–148 (1976)
Mikhalev, A.A.: A composition lemma and the equality problem for color Lie superalgebras. Mosc. Univ. Math. Bull. 44(5), 87–90 (1989)
MathSciNet Google Scholar
Mikhalev, A.A.: The composition lemma for color Lie superalgebras and for Lie $p$ -superalgebras. Algebra. In: Bokut, L.A., Ershov, Y.L., Kostrikin, A.I. (eds.) Proceedings of the International Conference on Memory A.I. Mal’cev, Novosibirsk/USSR 1989, Contemp. Math. 131, Pt. 2, 91–104 (1992)
Mikhalev, A.A.: Shirshov composition techniques in Lie superalgebras (noncommutative Gröbner bases). J. Math. Sci. New York 80(5), 2153–2160 (1996)
Mikhalev, A.A., Shpilrain, V., Umirbaev, U.U.: On isomorphism of Lie algebras with one defining relation. Int. J. Algebra Comput. 14(3), 389–393 (2004)
MathSciNet Google Scholar
Mikhalev, A.A., Zolotykh, A.A.: Combinatorial Aspects of Lie Superalgebras, vol. viii, 260 p. CRC Press, Boca Raton (1995)
Mikhalev, A.A., Zolotykh, A.A.: Standard Gröbner–Shirshov bases of free algebras over rings, I. Free associative algebras. Int. J. Algebra Comput. 8, 689–726 (1998)
MathSciNet Google Scholar
Mora, F.: Gröbner bases for non-commutative polynomial rings. In: Algebraic Algorithms and Error-Correcting Codes. Lecture Notes in Computer Science, vol. 229, pp. 353–362 (1986)
Newman, M.H.A.: On theories with a combinatorial definition of ‘equivalence’. Ann. Math. 43, 223–243 (1942)
Google Scholar
Novikov, P.S.: On algorithmic undecidability of the word problem in the theory of groups. Trudy Mat. Inst. Steklov. 44, 1–144 (1955)
Google Scholar
Obul, A., Yunus, G.: Gröbner–Shirshov basis of quantum group of type $E_{6}$ . J. Algebra 346, 248–265 (2011)
MathSciNet Google Scholar
Odesskii, A.: Introduction to the theory of elliptic algebras. data.imf.au.dk/conferences/FMOA05/
Poliakova, O., Schein, B.M.: A new construction for free inverse semigroups. J. Algebra 288, 20–58 (2005)
MathSciNet Google Scholar
Polishchuk, A., Positselski, L.: Quadratic Algebras. AMS, Providence (2005)
Poroshenko, E.N.: Gröbner–Shirshov bases for Kac–Moody algebras $A_{n}^{(1)}$ and $B_{n}^{(1)}$ . In: Krob, D. et al. (ed.) Formal Power Series and Algebraic Combinatorics, pp. 552–563. Springer, Berlin (2000)
Poroshenko, E.N.: Gröbner–Shirshov bases for Kac–Moody algebras of types $C_{n}^{(1)}$ and $D_{n}^{(1)}$ . Vestn. Novosib. Gos. Univ. Ser. Mat. Mekh. Inform. 2, 58–70 (2002)
Google Scholar
Poroshenko, E.N.: Gröbner–Shirshov bases for Kac–Moody algebras of type $A_{n}^{(1)}$ . Commun. Algebra 30, 2617–2637 (2002)
MathSciNet Google Scholar
Poroshenko, E.N.: Bases for partially commutative Lie algebras. Algebra Log. 50, 405–417 (2011)
MathSciNet Google Scholar
Post, E.: A variant of a recursively unsolvable problem. Bull. Am. Math. Soc. 52, 264–268 (1946)
MathSciNet Google Scholar
Post, E.: Recursive unsolvability of a problem of Thue. J. Symb. Logic 1, 1–11 (1947)
MathSciNet Google Scholar
Priddy, S.B.: Koszul resolutions. Trans. Am. Math. Soc. 152, 39–60 (1970)
MathSciNet Google Scholar
Qiu, J.J., Chen, Y.Q: Composition-Diamond lemma for $λ$ -differential associative algebras with multiple operators. J. Algebra Appl. 9, 223–239 (2010)
Qiu, J.J.: Gröbner–Shirshov bases for commutative algebras with multiple operators and free commutative Rota–Baxter algebras. Asian Eur. J. Math. (2014, to appear)
Rabin, M.: Recursice unsolvability of group thepretic problems. Ann. Math. 67(1), 172–194 (1958)
MathSciNet Google Scholar
Razmyslov, Y.P.: Identities of Algebras and their Representations, vol. xiii, 318 p. AMS, Providence (1994)
Ren, Y.H., Obul, A.: Gröbner–Shirshov basis of quantum group of type $G_{2}$ . Commun. Algebra 39, 1510–1518 (2011)
MathSciNet Google Scholar
Reutenauer, C.: Free Lie Algebras. Oxford University Press, New York (1993)
Google Scholar
Ringel, C.M.: Hall algebras and quantum groups. Invent. Math. 101, 583–592 (1990)
MathSciNet Google Scholar
Ringel, C.M.: PBW-bases of quantum groups. J. Reine Angew. Math. 170, 51–88 (1996)
MathSciNet Google Scholar
Ritt, J.F.: Differential Algebras. AMS, New York (1950)
Google Scholar
Roitman, M.: On the free conformal and vertex algebras. J. Algebra 217, 496–527 (1999)
MathSciNet Google Scholar
Rosso, M.: An analogue of the Poincare–Birkhoff–Witt theorem and the universal R-matrix of $U_{q} (s l (N + 1))$ . Commun. Math. Phys. 124, 307–318 (1989)
MathSciNet Google Scholar
Schützenberger, M.P., Sherman, S.: On a formal product over the conjugate classes in a free group. J. Math. Anal. Appl. 7, 482–488 (1963)
MathSciNet Google Scholar
Schützenberger, M.P.: On a factorization of free monoids. Proc. Am. Math. Soc. 16, 21–24 (1965)
Google Scholar
Segal, D.: Free left-symmetric algebras and an analogue of the Poincaré–Birkhoff–Witt Theorem. J. Algebra 164, 750–772 (1994)
Shirshov, A.I.: Some problems in the theory of non-associative rings and algebras. Candidate of Science Thesis, Moscow State University (1953). http://math.nsc.ru/LBRT/a1/ShirshovPhD.djvu
Shirshov, A.I.: Subalgebras of free Lie algebras. Uspekhi Mat. Nauk 8(3), 173 (1953)
Google Scholar
Shirshov, A.I.: On the representation of Lie rings in associative rings. Uspekhi Mat. Nauk N. S. 8(5)(57), 173–175 (1953)
Shirshov, A.I.: Subalgebras of free commutative and free anticommutative algebras. Mat. Sb. 4(1), 82–88 (1954)
Google Scholar
Shirshov, A.I.: On free Lie rings. Mat. Sb. 45(2), 113–122 (1958)
MathSciNet Google Scholar
Shirshov, A.I.: Some problems in the theory of rings that are nearly associative. Uspekhi Mat. Nauk 13(6)(84), 3–20 (1958)
Shirshov, A.I.: On the bases of a free Lie algebra. Algebra Log. 1(1), 14–19 (1962)
MathSciNet Google Scholar
Shirshov, A.I.: Some algorithmic problem for $ε$ -algebras. Sibirsk. Mat. Zh. 3, 132–137 (1962)
Google Scholar
Shirshov, A.I.: Some algorithmic problem for Lie algebras. Sibirsk. Mat. Zh. 3(2), 292–296 (1962) (English translation in SIGSAM Bull. 33, 3–6 (1999))
Shirshov, A.I.: On a hypothesis in the theory of Lie algebras. Sibirsk Mat. Zh. 3(2), 297–301 (1962)
Google Scholar
Selected works of A.I. Shirshov, in: Bokut, L.A., Latyshev, V., Shestakov, I., Zelmanov, E., Trs. Bremner, M., Kochetov, M., (Eds.) Birkhäuser, Basel, Boston, Berlin (2009)
Tits, J.: Le problème des mots dans les groupes de Coxeter. Symp. Math. 1, 175–185 (1968)
Turing, A.M.: The word problem in semi-groups with cancellation. Ann. Math. 52, 191–505 (1950)
MathSciNet Google Scholar
Ufnarovski, V.A.: Combinatorial and asymptotic methods in algebra. Algebra VI(57), 1–196 (1995)
Ufnarovski, V.A.: Introduction to noncommutative Gröbner bases theory. In: Buchberger, B. et al. (ed.) Gröbner Bases and Applications. London Mathematical Society Lecture Note Series, vol. 251, pp. 259–280 Cambridge University Press, Cambridge (1998)
Umirbaev, U.U.: Equality problem for center-by-metabelian Lie algebras. Algebra Log. 23, 209–219 (1984)
Google Scholar
Umirbaev, U.U.: The occurrence problem for Lie algebras. Algebra Log. 32(3), 173–181 (1993)
MathSciNet Google Scholar
Umirbaev, U.U.: Algorithmic problems in associative algebras. Algebra Log. 32(4), 244–255 (1993)
MathSciNet Google Scholar
Viennot, G.: Algebras de Lie libres et monoid libres. Lecture Notes in Mathematics, vol. 691. Springer, Berlin (1978)
Witt, E.: Die Unterringe der freien Lienge Ringe. Math. Z. 64, 195–216 (1956)
MathSciNet Google Scholar
Wu, W.-T.: On the decision problem and the mechanization of theorem proving in elementary geometry. Sci. Sin. 21, 157–179 (1978)
Google Scholar
Yamane, I.: A Poincare–Birkhoff–Witt theorem for quantized universal enveloping algebras of type $A_{N}$ . Publ. RIMS. Kyoto Univ. 25, 503–520 (1989)
MathSciNet Google Scholar
Yunus, G., Obul, A.: Gröbner–Shirshov basis of quantum group of type $D_{4}$ . Chin. Ann. Math. 32,B(4), 581–592 (2011)
Yunus, G., Gao, Z.Z., Obul, A.: Gröbner–Shirshov bases of quantum groups. Algebra Colloq. (2014 to appear)
Zhang, X., Jiang, M.: On Post’s and Markov’s examples of semigroups with unsolvable word problem. Southeast Asian Bull. Math. 37, 465–473 (2013)
MathSciNet Google Scholar
Zelmanov, E.: Nil rings and periodic groups. KMS Lecture Notes in Mathematics, vol. x, 79 p. Korean Mathematical Society, Seoul (1992)
Zhou, M., Winkler, F.: Gröbner bases in difference-differential modules and difference-differential dimension polynomials. Sci. China Ser. A Math. 51, 1732–1752 (2008)
Zhukov, A.I.: Complete systems of defining relations in noassociative algebras. Mat. Sb. 69(27), 267–280 (1950)
Google Scholar
Zolotykh, A.A., Mikhalev, A.A.: A complex of algorithms for computations in Lie superalgebras. Prog. Comput. Softw. 23(1), 8–16 (1997)
MathSciNet Google Scholar
Zolotykh, A.A., Mikhalev, A.A.: Algorithms for construction of standard Gröbner–Shirshov bases of ideals of free algebras over commutative rings. Prog. Comput. Softw. 24(6), 271–272 (1998)
MathSciNet Google Scholar

Download references

Acknowledgments

Authors thank Pavel Kolesnikov, Dima Piontkovskii, Yongshan Chen and Yu Li for valuable comments and help in writing some parts of the survey. They thank the referee for valuable comments and suggestions.

Author information

Authors and Affiliations

Sobolev Institute of Mathematics and Novosibirsk State University, Novosibirsk, 630090, Russia
L. A. Bokut
School of Mathematical Sciences, South China Normal University, Guangzhou, 510631, People’s Republic of China
L. A. Bokut & Yuqun Chen

Authors

L. A. Bokut
View author publications
You can also search for this author in PubMed Google Scholar
Yuqun Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to L. A. Bokut.

Additional information

Communicated by Efim Zelmanov.

Supported by the NNSF of China (11171118), the Research Fund for the Doctoral Program of Higher Education of China (20114407110007), the NSF of Guangdong Province (S2011010003374) and the Program on International Cooperation and Innovation, Department of Education, Guangdong Province (2012gjhz0007). Supported by RFBR 12-01-00329, LSS–3669.2010.1, SB RAS Integration Grant No. 2009.97 (Russia) and Federal Target Grant “Scientific and educational personnel of innovation Russia” for 2009–2013 (government contract No.02.740.11.5191).

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Bokut, L.A., Chen, Y. Gröbner–Shirshov bases and their calculation. Bull. Math. Sci. 4, 325–395 (2014). https://doi.org/10.1007/s13373-014-0054-6

Download citation

Received: 03 December 2013
Revised: 03 July 2014
Accepted: 13 August 2014
Published: 09 September 2014
Issue Date: December 2014
DOI: https://doi.org/10.1007/s13373-014-0054-6

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Gröbner–Shirshov bases and their calculation

Abstract

Similar content being viewed by others

Division Algebras, Clifford Algebras, Periodicity

Schur inequality for Murray–von Neumann algebras and its applications

On the generalized n-strong Drazin inverses and block matrices in Banach algebras

1 Introduction

1.1 Digression on the history of Lyndon–Shirshov bases and Lyndon–Shirshov words

2 Gröbner–Shirshov bases for associative algebras

2.1 Composition-Diamond lemma for associative algebras

Lemma 1

Proof

Lemma 2

Proof

Theorem 1

Proof

Theorem 2

Proof

Theorem 3

Proof

Remark 1

Remark 2

2.2 Gröbner bases for commutative algebras and their lifting to Gröbner–Shirshov bases

Theorem 4

Proof

Theorem 5

Theorem 6

Theorem 7

Proof

Corollary 1

2.3 Composition-Diamond lemma for modules

Definition 1

Definition 2

Theorem 8

Theorem 9

Theorem 10

Corollary 2

Proof

Lemma 3

Corollary 3

Theorem 11

Proof

2.4 Composition-Diamond lemma for categories

Theorem 12

Theorem 13

Corollary 4

Theorem 14

2.5 Composition-Diamond lemma for associative algebras over commutative algebras

Theorem 15

2.6 PBW-theorem for Lie algebras

Theorem 16

Theorem 17

2.7 Drinfeld–Jimbo algebra U q ( A ) , Kac–Moody enveloping algebra U ( A ) , and the PBW basis of U q ( A N )

Theorem 18

Corollary 5

Theorem 19

Corollary 6

Theorem 20

Corollary 7

3 Gröbner–Shirshov bases for groups and semigroups

3.1 Gröbner–Shirshov bases for braid groups

3.1.1 Braid groups in the Artin–Burau generators

Lemma 4

Theorem 21

Corollary 8

3.1.2 Braid groups in the Artin–Garside generators

Theorem 22

Corollary 9

Corollary 10

3.1.3 Braid groups in the Birman–Ko–Lee generators

Theorem 23

Corollary 11

Corollary 12

3.1.4 Braid groups in the Adjan–Thurston generators

Theorem 24

Theorem 25

Corollary 13

3.2 Gröbner–Shirshov basis for the Chinese monoid

Theorem 26

Corollary 14

2.7 Drinfeld–Jimbo algebra $U_{q} (A)$ , Kac–Moody enveloping algebra $U (A)$ , and the PBW basis of $U_{q} (A_{N})$

5 Gröbner–Shirshov bases for $Ω$ -algebras and operads

5.1 CD-lemmas for $Ω$ -algebras