1 Introduction

In this survey we review the method of Gröbner–ShirshovFootnote 1 (GS for short) bases for different classes of linear universal algebras, together with an overview of calculation of these bases in a variety of specific cases.

Shirshov (also spelled Širšov) in his pioneering work ([207], 1962) posed the following fundamental question:

How to find a linear basis of a Lie algebra defined by generators and relations?

He gave an infinite algorithm to solve this problem using a new notion of the composition (later the ‘s-polynomial’ in Buchberger’s terminology [65, 66]) of two Lie polynomials and a new notion of completion of a set of Lie polynomials (adding nontrivial compositions; the critical pair/completion (cpc-) algorithm in the later terminology of Knuth and Bendix [138] and Buchberger [67, 68]).

Shirshov’s algorithm goes as follows. Consider a set S L i e ( X ) of Lie polynomials in the free algebra k X on X over a field k (the algebra of non-commutative polynomials on X over k). Denote by S the superset of S obtained by adding all non-trivial Lie compositions (‘Lie s-polynomials’) of the elements of S. The problem of triviality of a Lie polynomial modulo a finite (or recursive) set S can be solved algorithmically using Shirshov’s Lie reduction algorithm from his previous paper [203], 1958. In general, an infinite sequence

S S S S ( n )

of Lie multi-compositions arises. The union S c of this sequence has the property that every Lie composition of elements of S c is trivial modulo S c . This is what is now called a Lie GS basis.

Then a new ‘Composition-Diamond lemmaFootnote 2 for Lie algebras’ (Lemma 3 in [207]) implies that the set I r r ( S c ) of all S c -irreducible (or S c -reduced) basic Lie monomials [ u ] in X is a linear basis of the Lie algebra L i e ( X | S ) generated by X with defining relations S. Here a basic Lie monomial means a Lie monomial in a special linear basis of the free Lie algebra L i e ( X ) k X , known as the Lyndon–Shirshov (LS for short) basis (Shirshov [207] and Chen–Fox–Lyndon [72], see below). An LS monomial [ u ] is called S c -irreducible (or S c -reduced) whenever u, the associative support of [ u ] , avoids the word s ¯ for all s S , where s ¯ is the maximal word of s as an associative polynomial (in the deg-lex ordering). To be more precise, Shirshov used his reduction algorithm at each step S, S , S , . Then we have a direct system S S S and S c = l i m S ( n ) is what is now called a minimal GS basis (a minimal GS basis is not unique, but a reduced GS basis is, see below). As a result, Shirshov’s algorithm gives a solution to the above problem for Lie algebras.

Shirshov’s algorithm, dealing with the word problem, is an infinite algorithm like the Knuth–Bendix algorithm [138], 1970 dealing with the identity problem for every variety of universal algebras.Footnote 3 The initial data for the Knuth–Bendix algorithm is the defining identities of a variety. The output of the algorithm, if any, is a ‘Knuth–Bendix basis’ of identities of the variety in the class of all universal algebras of a given signature (not a GS basis of defining relations, say, of a Lie algebra).

Shirshov’s algorithm gives linear bases and algorithmic decidability of the word problem for one-relation Lie algebras [207], (recursive) linear bases for Lie algebras with (finite) homogeneous defining relations [207], and linear bases for free products of Lie algebras with known linear bases [208]. He also proved the Freiheitssatz (freeness theorem) for Lie algebras [207] (for every one-relation Lie algebra L i e ( X | f ) , the subalgebra X \ { x i 0 } , where x i 0 appears in f, is a free Lie algebra). The Shirshov problem [207] of the decidability of the word problem for Lie algebras was solved negatively in [21]. More generally, it was proved [21] that some recursively presented Lie algebras with undecidable word problem can be embedded into finitely presented Lie algebras (with undecidable word problem). It is a weak analogue of the Higman embedding theorem for groups [115]. The problem [21] whether an analogue of the Higman embedding theorem is valid for Lie algebras is still open. For associative algebras a similar problem [21] was solved positively by Belyaev [10]. A simple example of a Lie algebra with undecidable word problem was given by Kukin [142].

Actually, a similar algorithm for associative algebras is implicit in Shirshov’s paper [207]. The reason is that he treats L i e ( X ) as the subspace of Lie polynomials in the free associative algebra k X . Then to define a Lie composition f , g w of two Lie polynomials relative to an associative word w = lcm ( f ¯ , g ¯ ) , he defines firstly the associative composition (non-commutative ‘s-polynomial’) ( f , g ) w = f b - a g , with a , b X . Then he inserts some brackets f , g w = [ f b ] f ¯ - [ a g ] g ¯ by applying his special bracketing lemma of [203]. We can obtain S c for every S k X in the same way as for Lie polynomials and in the same way as for Lie algebras (‘CD-lemma for associative algebras’) to infer that I r r ( S c ) is a linear basis of the associative algebra k X | S generated by X with defining relations S. All proofs are similar to those in [207] but much easier.

Moreover, the cases of semigroups and groups presented by generators and defining relations are just special cases of associative algebras via semigroup and group algebras. To summarize, Shirshov’s algorithm gives linear bases and normal forms of elements of every Lie algebra, associative algebra, semigroup or group presented by generators and defining relations! The algorithm works in many cases (see below).

The theory of Gröbner bases and Buchberger’s algorithm were initiated by Buchberger (Thesis [65] 1965, paper [66] 1970) for commutative associative algebras. Buchberger’s algorithm is a finite algorithm for finitely generated commutative algebras. It is one of the most useful and famous algorithms in modern computer science.

Shirshov’s paper [207] was in the spirit of the program of Kurosh (1908–1972) to study non-associative (relatively) free algebras and free products of algebras, initiated in Kurosh’s paper [143], 1947. In that paper he proved non-associative analogs of the Nielsen–Schreier and Kurosh theorems for groups. It took quite a few years to clarify the situation for Lie algebras in Shirshov’s papers [200], 1953 and [207], 1962 closely related to his theory of GS bases. It is important to note that Kurosh’s program quite unexpectedly led to Shirshov’s theory of GS bases for Lie and associative algebras [207].

A step in Kurosh’s program was made by his student Zhukov in his Ph.D. Thesis [226], 1950. He algorithmically solved the word problem for non-associative algebras. In a sense, it was the beginning of the theory of GS bases for non-associative algebras. The main difference with the future approach of Shirshov is that Zhukov did not use a linear ordering of non-associative monomials. Instead he chose an arbitrary monomial of maximal degree as the ‘leading’ monomial of a polynomial. Also, for non-associative algebras there is no ‘composition of intersection’ (‘s-polynomial’). In this sense it cannot be a model for Lie and associative algebras.Footnote 4

Shirshov, also a student of Kurosh’s, defended his Candidate of Sciences Thesis [199] at Moscow State University in 1953. His thesis together with the paper that followed [203], 1958 may be viewed as a background for his later method of GS bases. In the thesis, he proved the free subalgebra theorem for free Lie algebras (now known as Shirshov–Witt theorem, see also Witt [218], 1956) using the elimination process rediscovered by Lazard [149], 1960. He used the elimination process later [203], 1958 as a general method to prove the properties of regular (LS) words, including an algorithm of (special) bracketing of an LS word (with a fixed LS subword). The latter algorithm is of some importance in his theory of GS bases for Lie algebras (particularly in the definition of the composition of two Lie polynomials). Shirshov also proved the free subalgebra theorem for (anti-) commutative non-associative algebras [202], 1954. He used that later in [206], 1962 for the theory of GS bases of (commutative, anti-commutative) non-associative algebras. Shirshov (Thesis [199], 1953) found the (‘Hall–Shirshov’) series of bases of a free Lie algebra (see also [205] 1962, the first issue of Malcev’s Algebra and Logic).Footnote 5

The LS basis is a particular case of the Shirshov or Hall–Shirshov series of bases (cf.  Reutenauer [190], where this series is called the ‘Hall series’). In the definition of his series, Shirshov used Hall’s inductive procedure (see Ph. Hall [114], 1933, Hall [113], 1950): a non-associative monomial w = ( ( u ) ( v ) ) is a basic monomial whenever

  1. (1)

    ( u ) , ( v ) are basic;

  2. (2)

    ( u ) > ( v ) ;

  3. (3)

    if ( u ) = ( ( u 1 ) ( u 2 ) ) then ( u 2 ) ( v ) .

However, instead of ordering by the degree function (Hall words), he used an arbitrary linear ordering of non-associative monomials satisfying

( ( u ) ( v ) ) > ( v ) .

For example, in his Thesis [199], 1953 he used the ordering by the content of monomials (the content of, say, the monomial ( u ) = ( ( x 2 x 1 ) ( ( x 2 x 1 ) x 1 ) ) is the vector ( x 2 , x 2 , x 1 , x 1 , x 1 ) ). Actually, the content u ^ of ( u ) may be viewed as a commutative associative word that equals u in the free commutative semigroup. Two contents are compared lexicographically (a proper prefix of a content is greater than the content).

If we use the lexicographic ordering, ( u ) ( v ) if u v lexicographically (with the condition u u v , v 1 ), then we obtain the LS basis.Footnote 6 For example, for the alphabet x 1 , x 2 with x 2 x 1 we obtain basic Lyndon–Shirshov monomials by induction:

x 2 , x 1 , [ x 2 x 1 ] , [ x 2 [ x 2 x 1 ] ] = [ x 2 x 2 x 1 ] , [ [ x 2 x 1 ] x 1 ] = [ x 2 x 1 x 1 ] , [ x 2 [ x 2 x 2 x 1 ] ] = [ x 2 x 2 x 2 x 1 ] , [ x 2 [ x 2 x 1 x 1 ] ] = [ x 2 x 2 x 1 x 1 ] , [ [ x 2 x 1 x 1 ] x 1 ] = [ x 2 x 1 x 1 x 1 ] , [ [ x 2 x 1 ] [ x 2 x 1 x 1 ] ] = [ x 2 x 1 x 2 x 1 x 1 ] ,

and so on. They are exactly all Shirshov regular (LS) Lie monomials and their associative supports are exactly all Shirshov regular words with a one-to-one correspondence between two sets given by the Shirshov elimination (bracketing) algorithm for (associative) words.

Let us recall that an elementary step of Shirshov’s elimination algorithm is to join the minimal letter of a word to previous ones by bracketing and to continue this process with the lexicographic ordering of the new alphabet. For example, suppose that x 2 x 1 . Then we have the succession of bracketings

x 2 x 1 x 2 x 1 x 1 x 2 x 1 x 1 x 1 x 1 x 2 x 1 x 1 , [ x 2 x 1 ] [ x 2 x 1 x 1 ] [ x 2 x 1 x 1 x 1 ] [ x 2 x 1 x 1 ] , [ x 2 x 1 ] [ [ x 2 x 1 x 1 ] [ x 2 x 1 x 1 x 1 ] ] [ x 2 x 1 x 1 ] , [ [ x 2 x 1 ] [ [ x 2 x 1 x 1 ] [ x 2 x 1 x 1 x 1 ] ] ] [ x 2 x 1 x 1 ] , [ [ [ x 2 x 1 ] [ [ x 2 x 1 x 1 ] [ x 2 x 1 x 1 x 1 ] ] ] [ x 2 x 1 x 1 ] ] ; x 2 x 1 x 1 x 1 x 2 x 1 x 1 x 2 x 1 x 2 x 2 x 1 , [ x 2 x 1 x 1 x 1 ] [ x 2 x 1 x 1 ] [ x 2 x 1 ] x 2 [ x 2 x 1 ] , [ x 2 x 1 x 1 x 1 ] [ x 2 x 1 x 1 ] [ x 2 x 1 ] [ x 2 [ x 2 x 1 ] ] ; x 2 x 1 x 1 x 1 x 2 x 1 x 1 x 2 x 1 x 2 x 2 x 1 .

By the way, the second series of partial bracketings illustrates Shirshov’s factorization theorem [203] of 1958 that every word is a non-decreasing product of LS words (it is often mistakenly called Lyndon’s theorem, see [12]).

The Shirshov special bracketing [203] goes as follows. Let us give as an example the special bracketing of the LS word w = x 2 x 2 x 1 x 1 x 2 x 1 x 1 x 1 with the LS subword u = x 2 x 2 x 1 . The Shirshov standard bracketing is

[ w ] = [ x 2 [ [ [ x 2 x 1 ] x 1 ] [ x 2 x 1 x 1 x 1 ] ] ] .

The Shirshov special bracketing is

[ w ] u = [ [ [ u ] x 1 ] [ x 2 x 1 x 1 x 1 ] ] .

In general, if w = a u b then the Shirshov standard bracketing gives [ w ] = [ a [ u c ] d ] , where b = c d . Now, c = c 1 c t , each c i is an LS-word, and c 1 c t in the lex ordering (Shirshov’s factorization theorem). Then we must change the bracketing of  [ u c ] :

[ w ] u = [ a [ [ [ u ] [ c 1 ] ] [ c t ] ] d ]

The main property of [ w ] u is that [ w ] u is a monic associative polynomial with the maximal monomial w; hence, [ w ] u ¯ = w .

Actually, Shirshov [207], 1962 needed a ‘double’ relative bracketing of a regular word with two disjoint LS subwords. Then he implicitly used the following property: every LS subword of c = c 1 c t as above is a subword of some c i for 1 i t .

Shirshov defined regular (LS) monomials [203], 1958, as follows: ( w ) = ( ( u ) ( v ) ) is a regular monomial iff:

  1. (1)

    w is a regular word;

  2. (2)

    ( u ) and ( v ) are regular monomials (then automatically u v in the lex ordering);

  3. (3)

    if ( u ) = ( ( u 1 ) ( u 2 ) ) then u 2 v .

Once again, if we formally omit all Lie brackets in Shirshov’s paper [207] then essentially the same algorithm and essentially the same CD-lemma (with the same but much simpler proof) yield a linear basis for associative algebra presented by generators and defining relations. The differences are the following:

  • no need to use LS monomials and LS words, since the set X is a linear basis of the free associative algebra k X ;

  • the definition of associative composition for monic polynomials f and g,

    ( f , g ) w = f b - a g , w = f ¯ b = a g ¯ , d e g ( w ) < d e g ( f ¯ ) + d e g ( g ¯ ) ,

    or

    ( f , g ) w = f - a g b , w = f ¯ = a g ¯ b , w , a , b X ,

    are much simpler than the definition of Lie composition for monic Lie polynomials f and g,

    f , g w = [ f b ] f ¯ - [ a g ] g ¯ , w = f ¯ b = a g ¯ , d e g ( w ) < d e g ( f ¯ ) + d e g ( g ¯ ) ,

    or

    f , g w = f - [ a g b ] g ¯ , w = f ¯ = a g ¯ b , w , a , b , f ¯ , g ¯ X ,

    where [ f b ] f ¯ , [ a g ] g ¯ , and [ a g b ] g ¯ are the Shirshov special bracketings of the LS words w with fixed LS subwords f ¯ and g ¯ respectively.

  • The definition of elimination of the leading word s ¯ of an associative monic polynomial s is straightforward: a s ¯ b a ( r s ) b whenever s = s ¯ - r s and a , b X . However, for Lie polynomials, it is much more involved and uses the Shirshov special bracketing: f f - [ a g b ] g ¯ whenever f ¯ = a g ¯ b .

We can formulate the main idea of Shirshov’s proof as follows. Consider a complete set S of monic Lie polynomials (all compositions are trivial). If w = a 1 s 1 ¯ b 1 = a 2 s 2 ¯ b 2 , where w, a i , b i X and w is an LS word, while s 1 , s 2 S , then the Lie monomials [ a 1 s 1 b 1 ] s 1 ¯ and [ a 2 s 2 b 2 ] s 2 ¯ are equal modulo the smaller Lie monomials in I d ( S ) :

[ a 1 s 1 b 1 ] s 1 ¯ = [ a 2 s 2 b 2 ] s 2 ¯ + i > 2 α i [ a i s i b i ] s i ¯ ,

where α i k , s i S and [ a i s i b i ] s i ¯ ¯ = a i s i ¯ b i < w . Actually, Shirshov proved a more general result: if ( a 1 s 1 b 1 ) ¯ = a 1 s 1 ¯ b 1 and ( a 2 s 2 b 2 ) ¯ = a 2 s 2 ¯ b 2 with w = a 1 s 1 ¯ b 1 = a 2 s 2 ¯ b 2 then

( a 1 s 1 b 1 ) = ( a 2 s 2 b 2 ) + i > 2 α i ( a i s i b i ) ,

where α i k , s i S and ( a i s i b i ) ¯ = a i s i ¯ b i < w . Below we call a Lie polynomial ( a s b ) a Lie normal S-word provided that ( a s b ) ¯ = a s ¯ b .

This is precisely where he used the notion of composition and other notions and properties mentioned above.

It is much easier to prove an analogue of this property for associative algebras (as well as commutative associative algebras): given a complete monic set S in k X ( k [ X ] ), for w = a 1 s 1 ¯ b 1 = a 2 s 2 ¯ b 2 with a i , b i X and s 1 , s 2 S we have

a 1 s 1 b 1 = a 2 s 2 b 2 + i > 2 α i a i s i b i ,

where α i k , s i S and a i s i ¯ b i < w .

Summarizing, we can say with confidence that the work (Shirshov [207]) implicitly contains the CD-lemma for associative algebras as a simple exercise that requires no new ideas. The first author, Bokut, can confirm that Shirshov clearly understood this and told him that “the case of associative algebras is the same”. The lemma was formulated explicitly in Bokut [22], 1976 (with a reference to Shirshov’s paper [207]), Bergman [11], 1978, and Mora [171], 1986.

Let us emphasize once again that the CD-Lemma for associative algebras applies to every semigroup P = sgp X | S , and in particular to every group, by way of the semigroup algebra k P over a field k. The latter algebra has the same generators and defining relations as P, or k P = k X | S . Every composition of the binomials u 1 - v 1 and u 2 - v 2 is a binomial u - v . As a result, applying Shirshov’s algorithm to a set of semigroup relations S gives rise to a complete set of semigroup relations S c . The S c -irreducible words in X constitute the set of normal forms of the elements of P.

Before we go any further, let us give some well-known examples of algebra, group, and semigroup presentations by generators and defining relations together with linear bases, normal forms, and GS bases for them (if known). Consider a field k and a commutative ring or commutative k-algebra K.

  • The Grassman algebra over K is

    K X | x i 2 = 0 , x i x j + x j x i = 0 , i > j .

    The set of defining relations is a GS basis with respect to the deg-lex ordering. A K-basis is

    { x i 1 x i n | x i j X , j = 1 , , n , i 1 < < i n , n 0 } .
  • The Clifford algebra over K is

    K X | x i x j + x j x i = a i j , 1 i , j n ,

    where ( a i j ) is an n × n symmetric matrix over K. The set of defining relations is a GS basis with respect to the deg-lex ordering. A K-basis is

    { x i 1 x i n | x i j X , j = 1 , , n , n 0 , i 1 < < i n } .
  • The universal enveloping algebra of a Lie algebra L is

    U K ( L ) = K X | x i x j - x j x i = α i j k x k , i > j .

    If L is a free K-module with a well-ordered K-basis

    X = { x i | i I } , [ x i x j ] = α i j k x k , i > j , i , j I ,

    then the set of defining relations is a GS basis of U K ( L ) . The PBW theorem follows: U K ( L ) is a free K-module with a K-basis,

    { x i 1 x i n | i 1 i n , i t I , t = 1 , , n , n 0 } .
  • Kandri-Rody and Weispfenning [122] invented an important class of (noncommutative polynomial) ‘algebras of solvable type’, which includes universal enveloping algebras. An algebra of solvable type is

    R = k X | s i j = x i x j - x j x i - p i j , i > j , p i j < x i x j ,

    and the compositions ( s i j , s j k ) w = 0 modulo ( S , w ) , where w = x i x j x k with i > j > k . Here p i j is a noncommutative polynomial with all terms less than x i x j . They created a theory of GS bases for every algebra of this class; thus, they found a linear basis of every quotient of R.

  • A general presentation U k ( L ) = k X | S ( - ) of a universal enveloping algebra over a field k, where L = L i e ( X | S ) with S L i e ( X ) k X and S ( - ) is S as a set of associative polynomials. PBW theorem in a Shirshov’s form. The following conditions are equivalent:

    1. (i)

      the set S is a Lie GS basis;

    2. (ii)

      the set S ( - ) is a GS basis for k X ;

    3. (iii)

      a linear basis for U k ( L ) consists of words u 1 u 2 u n , where u i are S-irreducible LS words with u 1 u 2 u n (in the lex-ordering), see [56, 57];

    4. (iv)

      a linear basis for L consists of the S-irreducible LS Lie monomials [ u ] in X;

    5. (v)

      a linear basis for U k ( L ) consists of the polynomials u = [ u 1 ] [ u n ] , where u 1 u n in the lex ordering, n 0 , and each [ u i ] is an S-irreducible non-associative LS word in X.

  • Free Lie algebras L i e K ( X ) over K. Hall, Shirshov, and Lyndon provided different linear K-bases for a free Lie algebra (the Hall–Shirshov series of bases, in particular, the Hall basis, the Lyndon–Shirshov basis, the basis compatible with the free solvable (polynilpotent) Lie algebra) [194], see also [15]. Two anticommutative GS bases of L i e K ( X ) were found in [34, 37], which yields the Hall and Lyndon–Shirshov linear bases respectively.

  • The Lie k-algebras presented by Chevalley generators and defining relations of types A n , B n , C n , D n , G 2 , F 4 , E 6 , E 7 , and E 8 . Serre’s theorem provides linear bases and multiplication tables for these algebras (they are finite dimensional simple Lie algebras over k). Lie GS bases for these algebras are found in [4951].

  • The Coxeter group

    W = sgp S | s i 2 = 1 , m i j ( s i , s j ) = m j i ( s j , s i )

    for a given Coxeter matrix M = ( m i j ) . Tits [210] (see also [14]) algorithmically solved the word problem for Coxeter groups. Finite Coxeter groups are presented by ‘finite’ Coxeter matrices A n , B n , D n , G 2 , F 4 , E 6 , E 7 , E 8 , H 3 , and H 4 . Coxeter’s theorem provides normal forms and Cayley tables (these are finite groups generated by reflections). GS bases for finite Coxeter groups are found in [58].

  • The Iwahory–Hecke (Hecke) algebras H over K differ from the group algebras K ( W ) of Coxeter groups in that instead of s i 2 = 1 there are relations ( s i - q i 1 / 2 ) ( s i + q i 1 / 2 ) = 0 or ( s i - q i ) ( s i + 1 ) = 0 , where q i are units of K. Two K-bases for H are known; one is natural, and the other is the Kazhdan–Lusztig canonical basis [155]. The GS bases for the Iwahory–Hecke algebras are known for the finite Coxeter matrices. A deep connection of the Iwahory–Hecke algebras of type A n and braid groups (as well as link invariants) was found by Jones [116].

  • Affine Kac–Moody algebras [117]. The Kac–Gabber theorem provides linear bases for these algebras under the symmetrizability condition on the Cartan matrix. Using this result, Poroshenko found the GS bases of these algebras [178180].

  • Borcherds–Kac–Moody algebras [6163, 117]. GS bases are not known.

  • Quantum enveloping algebras (Drinfeld, Jimbo). Lusztig’s theorem [154] provides linear canonical bases of these algebras. Different approaches were developed by Ringel [191, 192], Green [110], and Kharchenko [131135]. GS bases of quantum enveloping algebras are unknown except for the case A n , see [55, 86, 195, 220].

  • Koszul algebras. The quadratic algebras with a basis of standard monomials, called PBW-algebras, are always Koszul (Priddy [184]), but not conversely. In different terminology, PBW-algebras are algebras with quadratic GS bases. See [177].

  • Elliptic algebras (Feigin, Odesskii) These are associative algebras presented by n generators and n ( n - 1 ) / 2 homogeneous quadratic relations for which the dimensions of the graded components are the same as for the polynomial algebra in n variables. The first example of this type was Sklyanin algebra (1982) generated by x 1 , x 2 , and x 3 with the defining relations [ x 3 , x 2 ] = x 1 2 , [ x 2 , x 1 ] = x 3 2 , and [ x 1 , x 3 ] = x 2 2 . See [175]. GS bases are not known.

  • Leavitt path algebras. GS bases for these algebras are found in Alahmedi et al. [2] and applied by the same authors to determine the structure of the Leavitt path algebras of polynomial growth in [3].

  • Artin braid group B r n . The Markov–Artin theorem provides the normal form and semi-direct structure of the group in the Burau generators. Other normal forms of B r n were obtained by Garside, Birman–Ko–Lee, and Adjan–Thurston. GS bases for B r n in the Artin–Burau, Artin–Garside, Birman–Ko–Lee, and Adjan–Thurston generators were found in [2325, 89] respectively.

  • Artin–Tits groups. They differ from Coxeter groups in the absence of the relations s i 2 = 1 . Normal forms are known in the spherical case, see Brieskorn, Saito [64]. GS bases are not known except for braid groups (the Artin–Tits groups of type A n ).

  • The groups of Novikov–Boon type (Novikov [173], Boon [60], Collins [97], Kalorkoti [118121]) with unsolvable word or conjugacy problem. They are groups with standard bases (standard normal forms or standard GS bases), see [1618, 77].

  • Adjan’s [1] and Rabin’s [187] constructions of groups with unsolvable isomorphism problem and Markov properties. A GS basis is known for Adjan’s construction [26].

  • Markov’s [161] and Post’s [183] semigroups with unsolvable word problem. The GS basis of Post’s semigroup is found in [223].

  • Markov’s construction of semigroups with unsolvable isomorphism problem and Markov properties. The GS basis for the construction is not known.

  • Plactic monoids. A theorem due to Richardson, Schensted, and Knuth provides a normal form of the elements of these monoids (see Lothaire [151]). New approaches to plactic monoids via GS bases in the alphabets of row and column generators are found in [29].

  • The groups of quotients of the multiplicative semigroups of power series rings with topological quadratic relations of the type k x , y , z , t | x y = z t embeddable (without the zero element) into groups but in general not embeddable into division algebras (settling a problem of Malcev). The relative standard normal forms of these groups found in [19, 20] are the reduced words for what was later called a relative GS basis [59].

To date, the method of GS bases has been adapted, in particular, to the following classes of linear universal algebras, as well as for operads, categories, and semirings. Unless stated otherwise, we consider all linear algebras over a field k. Following the terminology of Higgins and Kurosh, we mean by a ((differential) associative) Ω-algebra a linear space ((differential) associative algebra) with a set of multi-linear operations Ω:

  • Associative algebras, Shirshov [207], Bokut [22], Bergman [11];

  • Associative algebras over a commutative algebra, Mikhalev and Zolotykh [170];

  • Associative Γ-algebras, where Γ is a group, Bokut and Shum [59];

  • Lie algebras, Shirshov [207];

  • Lie algebras over a commutative algebra, Bokut et al. [31];

  • Lie p-algebras over k with char k = p , Mikhalev [166];

  • Lie superalgebras, Mikhalev [165, 167];

  • Metabelian Lie algebras, Chen and Chen [75];

  • Quiver (path) algebras, Farkas et al. [101];

  • Tensor products of associative algebras, Bokut et al. [30];

  • Associative differential algebras, Chen et al. [76];

  • Associative ( n - ) conformal algebras over k with char k = 0 , Bokut et al. [45], Bokut et al. [43];

  • Dialgebras, Bokut et al. [38];

  • Pre-Lie (Vinberg–Koszul–Gerstenhaber, right (left) symmetric) algebras, Bokut et al. [35],

  • Associative Rota–Baxter algebras over k with char k = 0 , Bokut et al. [32];

  • L-algebras, Bokut et al. [33];

  • Associative Ω-algebras, Bokut et al.  [41];

  • Associative differential Ω-algebras, Qiu and Chen [185];

  • Ω-algebras, Bokut et al. [33];

  • Differential Rota–Baxter commutative associative algebras, Guo et al. [111];

  • Semirings, Bokut et al. [40];

  • Modules over an associative algebra, Golod [108], Green [109], Kang and Lee [123, 124], Chibrikov [90];

  • Small categories, Bokut et al. [36];

  • Non-associative algebras, Shirshov [206];

  • Non-associative algebras over a commutative algebra, Chen et al. [81];

  • Commutative non-associative algebras, Shirshov [206];

  • Anti-commutative non-associative algebras, Shirshov [206];

  • Symmetric operads, Dotsenko and Khoroshkin [98].

At the heart of the GS method for a class of linear algebras lies a CD-lemma for a free object of the class. For the cases above, the free objects are the free associative algebra k X , the doubly free associative k [ Y ] -algebra k [ Y ] X , the free Lie algebra L i e ( X ) , and the doubly free Lie k [ Y ] -algebra L i e k [ Y ] ( X ) . For the tensor product of two associative algebras we need to use the tensor product of two free algebras, k X k Y . We can view every semiring as a double semigroup with two associative products · and . So, the CD-lemma for semirings is the CD-lemma for the semiring algebra of the free semiring R i g ( X ) . The CD-lemma for modules is the CD-lemma for the doubly free module Mod k Y ( X ) , a free module over a free associative algebra. The CD-lemma for small categories is the CD-lemma for the ‘free partial k-algebra’ k C X generated by an oriented graph X (a sequence z 1 z 2 z n , where z i X , is a partial word in X iff it is a path; a partial polynomial is a linear combination of partial words with the same source and target).

All CD-lemmas have essentially the same statement. Consider a class V of linear universal algebras, a free algebra V ( X ) in V, and a well-ordered k-basis of terms N ( X ) of V ( X ) . A subset S V ( X ) is called a GS basis if every composition of the elements of S is trivial (vanishes upon the elimination of the leading terms s ¯ for s S ). Then the following conditions are equivalent:

  1. (i)

    S is a GS basis.

  2. (ii)

    If f I d ( S ) then the leading term f ¯ contains the subterm s ¯ for some s S .

  3. (iii)

    The set of S-irreducible terms is a linear basis for the V-algebra V X | S generated by X with defining relations S.

In some cases ( ( n - ) conformal algebras, dialgebras), conditions (i) and (ii) are not equivalent. To be more precise, in those cases we have ( i ) ( i i ) ( i i i ) .

Typical compositions are compositions of intersection and inclusion. Shirshov [206, 207] avoided inclusion composition. He suggested instead that a GS basis must be minimal (the leading words do not contain each other as subwords). In some cases, new compositions must be defined, for example, the composition of left (right) multiplication. Also, sometimes we need to combine all these compositions. We present here a new approach to the definition of a composition, based on the concept of the least common multiple lcm ( u , v ) of two terms u and v.

In some cases (Lie algebras, (n-) conformal algebras) the ‘leading’ term f ¯ of a polynomial f V ( X ) lies outside V ( X ) . For Lie algebras, we have f ¯ k X , for (n-) conformal algebras f ¯ belongs to an ‘Ω-semigroup’.

Almost all CD-lemmas require the new notion of a ‘normal S-term’. A term ( a s b ) in { X , Ω } , where s S , with only one occurrence of s is called a normal S-term whenever ( a s b ) ¯ = ( a ( s ¯ ) b ) . Given S k X , every S-word (that is, an S-term) is a normal S-word. Given S L i e ( X ) , every Lie S-monomial (Lie S-term) is a linear combination of normal Lie S-terms (Shirshov [207]).

One of the two key lemmas asserts that if S is complete under compositions of multiplication then every element of the ideal generated by S is a linear combination of normal S-terms. Another key lemma says that if S is a GS basis and the leading words of two normal S-terms are the same then these terms are the same modulo lower normal S-terms. As we mentioned above, Shirshov proved these results [207] for L i e ( X ) (there are no compositions of multiplication for Lie and associative algebras).

This survey continues our surveys with Kolesnikov, Fong, Ke, and Shum [27, 28, 42, 46, 52, 53], Ufnarovski’s survey [213], and the book of the first named author and Kukin [54].

The paper is organized as follows. Section 2 is for associative algebras, Sect. 3 is for semigroups and groups, Sect. 4 is for Lie algebras, and the short Sect. 5 is for Ω-algebras and operads.Footnote 7

To conclude this introduction, we give some information about the work of Shirshov; for more on this, see the book [209]. Shirshov (1921–1981) was a famous Russian mathematician. His name is associated with notions and results on the Gröbner–Shirshov bases, the Composition-Diamond lemma, the Shirshov–Witt theorem, the Lazard–Shirshov elimination, the Shirshov height theorem, Lyndon–Shirshov words, Lyndon–Shirshov basis (in a free Lie algebra), the Hall–Shirshov series of bases, the Cohn–Shirshov theorem for Jordan algebras, Shirshov’s theorem on the Kurosh problem, and the Shirshov factorization theorem. Shirshov’s ideas were used by his students Efim Zelmanov to solve the restricted Burnside problem and Aleksander Kemer to solve the Specht problem.

1.1 Digression on the history of Lyndon–Shirshov bases and Lyndon–Shirshov words

Lyndon [156], 1954, defined standard words, which are the same as Shirshov’s regular words [203], 1958. Unfortunately, the papers (Lyndon [156]) and (Chen et al. [72], 1958) were practically unknown before 1983. As a result, at that time almost all authors (except four who used the names Shirshov and Chen–Fox–Lyndon, see below) refer to the basis and words as Shirshov regular basis and words, cf.  for instance [8, 9, 96, 188, 212, 224]. To the best of our knowledge, none of the authors mentioned Lyndon’s paper [156] as a source of ‘Lyndon words’ before 1983(!).

In the following papers the authors mentioned both (Chen et al. [72]) and (Shirshov [203]) as a source of ‘Lyndon–Shirshov basis’ and ‘Lyndon–Shirshov words’:

  • Schützenberger and Sherman [196], 1963;

  • Schützenberger [197], 1965;

  • Viennot [217], 1978;

  • Michel [163], 1975; [164], 1976.

The authors of [196] thank Cohn for pointing out Shirshov’s paper [203]. They also formulate Shirshov’s factorization theorem [203]. They mention [72, 203] as a source of ‘LS words’. Schützenberger also mentions [197] Shirshov’s factorization theorem, but in this case he attributes it to both Chen et al. [72] and Shirshov [203]. Actually, he cites [72] by mistake, as that result is absent from the paper, see Berstel and Perrin [12].Footnote 8

Starting with the book of Lothaire, Combinatorics on words ([151], 1983), some authors called the words and basis ‘Lyndon words’ and ‘Lyndon basis’; for instance, see Reutenauer, Free Lie algebras ([190], 1993).

2 Gröbner–Shirshov bases for associative algebras

In this section we give a proof of Shirshov’s CD-lemma for associative algebras and Buchberger’s theorem for commutative algebras. Also, we give the Eisenbud–Peeva–Sturmfels lifting theorem, the CD-lemmas for modules (following Kang and Lee [124] and Chibrikov [90]), the PBW theorem and the PBW theorem in Shirshov’s form, the CD-lemma for categories, the CD-lemma for associative algebras over commutative algebras and the Rosso–Yamane theorem for U q ( A n ) .

2.1 Composition-Diamond lemma for associative algebras

Let k be a field, k X be the free associative algebra over k generated by X and X be the free monoid generated by X, where the empty word is the identity, denoted by 1. Suppose that X is a well-ordered set. Take f k X with the leading word f ¯ and f = α f ¯ - r f , where 0 α k and r f ¯ < f ¯ . We call fmonic if α = 1 .

A well-ordering > on X is called a monomial ordering whenever it is compatible with the multiplication of words, that is, for all u , v X we have

u > v w 1 u w 2 > w 1 v w 2 , for all w 1 , w 2 X .

A standard example of monomial ordering on X is the deg-lex ordering, in which two words are compared first by the degree and then lexicographically, where X is a well-ordered set.

Fix a monomial ordering < on X and take two monic polynomials f and g in k X . There are two kinds of compositions:

  1. (i)

    If w is a word such that w = f ¯ b = a g ¯ for some a , b X with | f ¯ | + | g ¯ | > | w | then the polynomial ( f , g ) w = f b - a g is called the intersection composition of f and g with respect to w.

  2. (ii)

    If w = f ¯ = a g ¯ b for some a , b X then the polynomial ( f , g ) w = f - a g b is called the inclusion composition of f and g with respect to w.

Then ( f , g ) w ¯ < w and ( f , g ) w lies in the ideal I d { f , g } of k X generated by f and g.

In the composition ( f , g ) w , we call w an ambiguity (or the least common multiple lcm ( f ¯ , g ¯ ) , see below).

Consider S k X such that very s S is monic. Take h k X and w X . Then h is called trivial modulo ( S , w ) , denoted by

h 0 mod ( S , w ) ,

if h = α i a i s i b i , where α i k , a i , b i X , and s i S with a i s i ¯ b i < w .

The elements a s b , a , b X , and s S are called S-words.

A monic set S k X is called a GS basis in k X with respect to the monomial ordering < if every composition of polynomials in S is trivial modulo S and the corresponding w.

A set S is called a minimal GS basis in k X if S is a GS basis in k X avoiding inclusion compositions; that is, given f , g S with f g , we have f ¯ a g ¯ b for all a , b X .

Put

I r r ( S ) = { u X | u a s ¯ b , s S , a , b X } .

The elements of I r r ( S ) are called S-irreducible or S-reduced.

A GS basis S in k X is reduced provided that supp ( s ) I r r ( S \ { s } ) for every s S , where supp ( s ) = { u 1 , u 2 , , u n } whenever s = i = 1 n α i u i with 0 α i k and u i X . In other words, each u i is an S \ { s } -irreducible word.

The following lemma is key for proving the CD-lemma for associative algebras.

Lemma 1

If S is a GS basis in k X and w = a 1 s 1 ¯ b 1 = a 2 s 2 ¯ b 2 , where a 1 , b 1 , a 2 , b 2 X and s 1 , s 2 S , then a 1 s 1 b 1 a 2 s 2 b 2 mod ( S , w ) .

Proof

There are three cases to consider.

Case 1 Assume that the subwords s ¯ 1 and s ¯ 2 of w are disjoint, say, | a 2 | | a 1 | + | s ¯ 1 | . Then, a 2 = a 1 s ¯ 1 c and b 1 = c s ¯ 2 b 2 for some c X , and so w 1 = a 1 s ¯ 1 c s ¯ 2 b 2 . Now,

a 1 s 1 b 1 - a 2 s 2 b 2 = a 1 s 1 c s ¯ 2 b 2 - a 1 s ¯ 1 c s 2 b 2 = a 1 s 1 c ( s ¯ 2 - s 2 ) b 2 + a 1 ( s 1 - s ¯ 1 ) c s 2 b 2 .

Since s 2 ¯ - s 2 ¯ < s ¯ 2 and s 1 - s 1 ¯ ¯ < s ¯ 1 , we conclude that

a 1 s 1 b 1 - a 2 s 2 b 2 = i α i u i s 1 v i + j β j u j s 2 v j

with α i , β j k and S-words u i s 1 v i and u j s 2 v j satisfying u i s ¯ 1 v i , u j s ¯ 2 v j < w .

Case 2 Assume that the subword s ¯ 1 of w contains s ¯ 2 as a subword. Then s ¯ 1 = a s ¯ 2 b with a 2 = a 1 a and b 2 = b b 1 , that is, w = a 1 a s ¯ 2 b b 1 for some S-word a s 2 b . We have

a 1 s 1 b 1 - a 2 s 2 b 2 = a 1 s 1 b 1 - a 1 a s 2 b b 1 = a 1 ( s 1 - a s 2 b ) b 1 = a 1 ( s 1 , s 2 ) s 1 ¯ b 1 .

The triviality of compositions implies that a 1 s 1 b 1 a 2 s 2 b 2 mod ( S , w ) .

Case 3 Assume that the subwords s ¯ 1 and s ¯ 2 of w have a nonempty intersection. We may assume that a 2 = a 1 a and b 1 = b b 2 with w = s ¯ 1 b = a s ¯ 2 and | w | < | s ¯ 1 | + | s ¯ 2 | . Then, as in Case 2, we have a 1 s 1 b 1 a 2 s 2 b 2 mod ( S , w ) .

Lemma 2

Consider a set S k X of monic polynomials. For every f k X we have

f = u i f ¯ α i u i + a j s j ¯ b j f ¯ β j a j s j b j

where α i , β j k , u i I r r ( S ) , and a j s j b j are S-words. So, I r r ( S ) is a set of linear generators of the algebra k X | S .

Proof

Induct on f ¯ .

Theorem 1

(The CD-lemma for associative algebras) Choose a monomial ordering < on X . Consider a monic set S k X and the ideal I d ( S ) of k X generated by S. The following statements are equivalent:

  1. (i)

    S is a Gröbner–Shirshov basis in k X .

  2. (ii)

    f I d ( S ) f ¯ = a s ¯ b for some s S and a , b X .

  3. (iii)

    I r r ( S ) = { u X | u a s ¯ b , s S , a , b X } is a linear basis of the algebra k X | S .

Proof

(i)(ii). Assume that S is a GS basis and take 0 f I d ( S ) . Then, we have f = i = 1 n α i a i s i b i where α i k , a i , b i X , and s i S . Suppose that w i = a i s i ¯ b i satisfy

w 1 = w 2 = = w l > w l + 1 .

Induct on w 1 and l to show that f ¯ = a s ¯ b for some s S and a , b X . To be more precise, induct on ( w 1 , l ) with the lex ordering of the pairs.

If l = 1 then f ¯ = a 1 s 1 b 1 ¯ = a 1 s 1 ¯ b 1 and hence the claim holds. Assume that l 2 . Then w 1 = a 1 s 1 ¯ b 1 = a 2 s 2 ¯ b 2 . Lemma 1 implies that a 1 s 1 b 1 a 2 s 2 b 2 mod ( S , w 1 ) . If α 1 + α 2 0 or l > 2 then the claim follows by induction on l. For the case α 1 + α 2 = 0 and l = 2 , induct on w 1 . Thus, (ii) holds.

(ii)(iii). By Lemma 2, I r r ( S ) generates k X | S as a linear space. Suppose that i α i u i = 0 in k X | S , where 0 α i k and u i I r r ( S ) . It means that i α i u i I d ( S ) in k X . Then i α i u i ¯ = u j I r r ( S ) for some j, which contradicts (ii).

(iii)(i). Given f , g S , Lemma 2 and (iii) yield ( f , g ) w 0 mod ( S , w ) . Therefore, S is a GS basis.

A new exposition of the proof of Theorem 1 (CD-lemma for associative algebras).

Let us start with the concepts of non-unique common multiple and least common multiple of two words u , v X . A common multiple cm ( u , v ) means that cm ( u , v ) = a 1 u b 1 = a 2 v b 2 for some a i , b i X . Then lcm ( u , v ) means that some cm ( u , v ) contains some lcm ( u , v ) as a subword: cm ( u , v ) = c · lcm ( u , v ) · d with c , d X , where u and v are the same subwords in both sides. To be precise,

lcm ( u , v ) { u c v , c X (a trivial lcm ( u , v ) ) ; u = a v b , a , b X (an inclusion lcm ( u , v ) ) ; u b = a v , a , b X , | u b | < | u | + | v | (an intersection lcm ( u , v ) ) } .

Define the general composition ( f , g ) lcm ( f ¯ , g ¯ ) of monic polynomials f , g k X as

( f , g ) lcm ( f ¯ , g ¯ ) = lcm ( f ¯ , g ¯ ) | f ¯ f - lcm ( f ¯ , g ¯ ) | g ¯ g .

The only difference with the previous definition of composition is that we include the case of trivial lcm ( f ¯ , g ¯ ) . However, in this case the composition is trivial,

( f , g ) f ¯ c g ¯ 0 mod ( { f , g } , f ¯ c g ¯ ) .

It is clear that if a 1 f ¯ b 1 = a 2 g ¯ b 2 then, up to the ordering of f and g,

a 1 f b 1 - a 2 g b 2 = c · ( f , g ) lcm ( f ¯ , g ¯ ) · d .

This implies Lemma 1. The main claim (i)(ii) of Theorem 1 follows from Lemma 1.

Shirshov algorithm. If a monic subset S k X is not a GS basis then we can add to S all nontrivial compositions, making them monic. Iterating this process, we eventually obtain a GS basis S c that contains S and generates the same ideal, I d ( S c ) = I d ( S ) . This S c is called the GS completion of S. Using the reduction algorithm (elimination of the leading words of polynomials), we may obtain a minimal GS basis S c or a reduced GS basis.

The following theorem gives a linear basis for the ideal I d ( S ) provided that S k X is a GS basis.

Theorem 2

If S k X is a Gröbner–Shirshov basis then, given u X \ I r r ( S ) , by Lemma 2 there exists u ^ k I r r ( S ) with u ^ ¯ < u (if u ^ 0 ) such that u - u ^ I d ( S ) and the set { u - u ^ | u X \ I r r ( S ) } is a linear basis for the ideal I d ( S ) of k X .

Proof

Take 0 f I d ( S ) . Then by the CD-lemma for associative algebras, f ¯ = a 1 s 1 ¯ b 1 = u 1 for some s 1 S and a 1 , b 1 X , which implies that f ¯ = u 1 X \ I r r ( S ) . Put f 1 = f - α 1 ( u 1 - u 1 ^ ) , where α 1 is the coefficient of the leading term of f and u 1 ^ ¯ < u 1 or u 1 ^ = 0 . Then f 1 I d ( S ) and f 1 ¯ < f ¯ . By induction on f ¯ , the set { u - u ^ | u X \ I r r ( S ) } generates I d ( S ) as a linear space. It is clear that { u - u ^ | u X \ I r r ( S ) } is a linearly independent set.

Theorem 3

Choose a monomial ordering > on X . For every ideal I of k X there exists a unique reduced Gröbner–Shirshov basis S for I.

Proof

Clearly, a Gröbner–Shirshov basis S k X for the ideal I = I d ( S ) exists; for example, we may take S = I . By Theorem 1, we may assume that the leading terms of the elements of S are distinct. Given g S , put

Δ g = { f S | f g and f ¯ = a g ¯ b for some a , b X }

and S 1 = S \ g S Δ g .

For every f I d ( S ) we show that there exists an s 1 S 1 such that f ¯ = a s 1 ¯ b for some a , b X .

In fact, Theorem 1 implies that f ¯ = a h ¯ b for some a , b X and h S . Suppose that h S \ S 1 . Then we have h g S Δ g , say, h Δ g . Therefore, h g and h ¯ = a g ¯ b for some a , b X . We claim that h ¯ > g ¯ . Otherwise, h ¯ < g ¯ . It follows that h ¯ = a g ¯ b > a h ¯ b and so we have the infinite descending chain

h ¯ > a h ¯ b > a 2 h ¯ b 2 > a 3 h ¯ b 3 > ,

which contradicts the assumption that > is a well ordering.

Suppose that g S 1 . Then, by the argument above, there exists g 1 S such that g Δ g 1 and g ¯ > g 1 ¯ . Since > is a well ordering, there must exist s 1 S 1 such that f ¯ = a 1 s 1 ¯ b 1 for some a 1 , b 1 X .

Put f 1 = f - α 1 a 1 s 1 b 1 , where α 1 is the coefficient of the leading term of f. Then f 1 I d ( S ) and f ¯ > f 1 ¯ .

By induction on f ¯ , we know that f I d ( S 1 ) , and hence I = I d ( S 1 ) . Moreover, Theorem 1 implies that S 1 is clearly a minimal GS basis for the ideal I d ( S ) .

Assume that S is a minimal GS basis for I.

For every s S we have s = s + s , where supp ( s ) I r r ( S \ { s } ) and s I d ( S \ { s } ) . Since S is a minimal GS basis, it follows that s ¯ = s ¯ for every s S .

We claim that S 2 = { s | s S } is a reduced GS basis for I. In fact, it is clear that S 2 I d ( S ) = I . By Theorem 1, for every f I d ( S ) we have f ¯ = a 1 s 1 ¯ b 1 = a 1 s 1 ¯ b 1 for some a 1 , b 1 X .

Take two reduced GS bases S and R for the ideal I. By Theorem 1, for every s S ,

s ¯ = a r ¯ b , r ¯ = c s 1 ¯ d

for some a , b , c , d X , r R , and s 1 S , and hence s ¯ = a c s 1 ¯ d b . Since s ¯ supp ( s ) I r r ( S \ { s } ) , we have s = s 1 . It follows that a = b = c = d = 1 , and so s ¯ = r ¯ .

If s r then 0 s - r I = I d ( S ) = I d ( R ) . By Theorem 1, s - r ¯ = a 1 r 1 ¯ b 1 = c 1 s 2 ¯ d 1 for some a 1 , b 1 , c 1 , d 1 X with r 1 ¯ , s 2 ¯ < s ¯ = r ¯ . This means that s 2 S \ { s } and r 1 R \ { r } . Noting that s - r ¯ supp ( s ) supp ( r ) , we have either s - r ¯ supp ( s ) or s - r ¯ supp ( r ) . If s - r ¯ supp ( s ) then s - r ¯ I r r ( S \ { s } ) , which contradicts s - r ¯ = c 1 s 2 ¯ d 1 ; if s - r ¯ supp ( r ) then s - r ¯ I r r ( R \ { r } ) , which contradicts s - r ¯ = a 1 r 1 ¯ b 1 . This shows that s = r , and then S R . Similarly, R S .

Remark 1

In fact, a reduced GS basis is unique (up to the ordering) in all possible cases below.

Remark 2

Both associative and Lie CD-lemmas are valid when we replace the base field k by an arbitrary commutative ring K with identity because we assume that all GS bases consist of monic polynomials. For example, consider a Lie algebra L over K which is a free K-module with a well-ordered K-basis { a i | i I } . With the deg-lex ordering on { a i | i I } , the universal enveloping associative algebra U K ( L ) has a (monic) GS basis

a i a j - a j a i = α i j t a t | i > j , i , j I ,

where α i j t K and [ a i , a j ] = α i j t a t in L, and the CD-lemma for associative algebras over K implies that L U K ( L ) and

{ a i 1 a i n | i 1 i n , n 0 , i 1 , , i n I }

is a K-basis for U K ( L ) .

In fact, for the same reason, all CD-lemmas in this survey are valid if we replace the base field k by an arbitrary commutative ring K with identity. If this is the case then claim (iii) in the CD-lemma should read: K ( X | S ) is a free K-module with a K-basis I r r ( S ) . But in the general case, Shirshov’s algorithm fails: if S is a monic set then S , the set obtained by adding to S all non-trivial compositions, is not a monic set in general, and the algorithm may stop with no result.

2.2 Gröbner bases for commutative algebras and their lifting to Gröbner–Shirshov bases

Consider the free commutative associative algebra k [ X ] . Given a well ordering < on X = { x i | i I } ,

[ X ] = { x i 1 x i t | i 1 i t , i 1 , , i t I , t 0 }

is a linear basis for k [ X ] .

Choose a monomial ordering < on [ X ] . Take two monic polynomials f and g in k [ X ] such that w = lcm ( f ¯ , g ¯ ) = f ¯ a = g ¯ b for some a , b [ X ] with | f ¯ | + | g ¯ | > | w | (so, f ¯ and g ¯ are not coprime in [ X ] ). Then ( f , g ) w = f a - g b is called the s-polynomial of f and g.

A monic subset S k [ X ] is called a Gröbner basis with respect to the monomial ordering < whenever all s-polynomials of two arbitrary polynomials in S are trivial modulo S and corresponding w.

An argument similar to the proof of the CD-lemma for associative algebras justifies the following theorem due to Buchberger.

Theorem 4

(Buchberger Theorem) Choose a monomial ordering < on [ X ] . Consider a monic set S k [ X ] and the ideal I d ( S ) of k [ X ] generated by S. The following statements are equivalent:

  1. (i)

    S is a Gröbner basis in k [ X ] .

  2. (ii)

    f I d ( S ) f ¯ = s ¯ a for some s S and a [ X ] .

  3. (iii)

    I r r ( S ) = { u [ X ] | u s ¯ a , s S , a [ X ] } is a linear basis for the algebra k [ X | S ] = k [ X ] / I d ( S ) .

Proof

Denote by lcm ( u , v ) be the usual (unique) least common multiple of two commutative words u , v [ X ] :

lcm ( u , v ) { u v (the trivial lcm ( u , v ) ) ; a u = b v , a , b [ X ] , | a u | < | u | + | v | (the nontrivial lcm ( u , v ) ) } .

If cm ( u , v ) = a 1 u = a 2 v is a common multiple of u and v then cm ( u , v ) = b · lcm ( u , v ) .

The s-polynomial of two monic polynomials f and g is

( f , g ) lcm ( f ¯ , g ¯ ) = lcm ( f ¯ , g ¯ ) | f ¯ f - lcm ( f ¯ , g ¯ ) | g ¯ g .

An analogue of Lemma 1 is valid for k [ X ] because if a 1 s ¯ 1 = a 2 s ¯ 2 for two monic polynomials s 1 and s 2 then

a 1 s 1 - a 2 s 2 = b · ( s 1 , s 2 ) lcm ( s ¯ 1 , s ¯ 2 ) .

Lemma 1 implies the main claim (i)(ii) of Buchberger’s theorem.

Theorem 5

Given an ideal I of k [ X ] and a monomial ordering < on [ X ] , there exists a unique reduced Gröbner basis S for I. Moreover, if X is finite then so is S.

Eisenbud et al. [99] constructed a GS basis in k X by lifting a commutative Gröbner basis for k [ X ] and adding all commutators. Write X = { x 1 , x 2 , , x n } and put

S 1 = { h i j = x i x j - x j x i | i > j } k X .

Consider the natural map γ : k X k [ X ] carrying x i to x i and the lexicographic splitting of γ, which is defined as the k-linear map

δ : k [ X ] k X , x i 1 x i 2 x i r x i 1 x i 2 x i r if i 1 i 2 i r .

Given u [ X ] , we express it as u = x 1 l 1 x 2 l 2 x n l n , where l i 0 , using an arbitrary monomial ordering on [ X ] .

Following [99], define an ordering on X using the ordering x 1 < x 2 < < x n as follows: given u , v X , put

u > v if γ ( u ) > γ ( v ) in [ X ] or ( γ ( u ) = γ ( v ) and u > l e x v ) .

It is easy to check that this is a monomial ordering on X and δ ( s ) ¯ = δ ( s ¯ ) for every s k [ X ] . Moreover, v δ ( u ) for every v γ - 1 ( u ) .

Consider an arbitrary ideal L of k [ X ] generated by monomials. Given m = x i 1 x i 2 x i r L , i 1 i 2 i r , denote by U L ( m ) the set of all monomials u [ x i 1 + 1 , , x i r - 1 ] such that neither u x i 2 x i r nor u x i 1 x i r - 1 lie in L.

Theorem 6

([99]) Consider the orderings on [ X ] and X defined above. If S is a minimal Gröbner basis in k [ X ] then S = { δ ( u s ) | s S , u U L ( s ¯ ) } S 1 is a minimal Gröbner–Shirshov basis in k X , where L is the monomial ideal of k [ X ] generated by  S ¯ .

Jointly with Yongshan Chen [30], we generalized this result to lifting a GS basis S k [ Y ] k X , see Mikhalev and Zolotykh [170], to a GS basis of I d ( S , [ y i , y j ] for all ( i , j ) ) of k Y k X .

Recall that for a prime number p the Gauss ordering on the natural numbers is described as s p t whenever t s 0 mod p . Let 0 = be the usual ordering on the natural numbers. A monomial ideal L of k [ X ] is called p-Borel-fixed whenever it satisfies the following condition: for each monomial generator m of L, if m is divisible by x j t but no higher power of x j then ( x i / x j ) s m L for all i < j and s p t .

Thus, we have the following Eisenbud–Peeva–Sturmfels lifting theorem.

Theorem 7

([99]) Given an ideal I of k [ X ] , take L = I d ( f ¯ , f I ) and J = γ - 1 ( I ) k X .

  1. (i)

    If L is 0 -Borel-fixed then a minimal Gröbner–Shirshov basis of J is obtained by applying δ to a minimal Gröbner basis of I and adding commutators.

  2. (ii)

    If L is p-Borel-fixed for some p then J has a finite Gröbner–Shirshov basis.

Proof

Assume that L is p-Borel-fixed for some p. Take a generator m = x i 1 x i 2 x i r of L, where x i 1 x i 2 x i r , and suppose that x i r t is the highest power of x i r dividing m. Since t p t , it follows that x l t m / x i r t L for l < i r . This implies that x l t m / x i r L for l < i r , and hence, every monomial in U L ( m ) satisfies d e g x l ( u ) < t for i 1 < l < i r . Thus, U L ( m ) is a finite set, and the result follows from Theorem 6. In particular, if p = 0 then U L ( m ) = 1 .

In characteristic p 0 observe that if the field k is infinite then after a generic change of variables L is p-Borel-fixed. Then Theorems 6 and 7 imply

Corollary 1

([99]) Consider an infinite field k and an ideal I k [ X ] . After a general linear change of variables, the ideal γ - 1 ( I ) in k X has a finite Gröbner–Shirshov basis.

2.3 Composition-Diamond lemma for modules

Consider S, T k X and f, g k X . Kang and Lee define [123] the composition of f and g as follows.

Definition 1

([123, 127])

  1. (a)

    If there exist a , b X such that w = f ¯ a = b g ¯ with | w | < | f ¯ | + | g ¯ | then the intersection composition is defined as ( f , g ) w = f a - b g .

  2. (b)

    If there exist a, b X such that w = a f ¯ b = g ¯ then the inclusion composition is defined as ( f , g ) w = a f b - g .

  3. (c)

    The composition ( f , g ) w is called right-justified whenever w = f ¯ = a g ¯ for some a X .

If f - g = α i a i s i b i + β j c j t j , where α i , β j k , a i , b i , c j X , s i S , and t j T with a i s ¯ i b i < w and c j t ¯ j < w for all i and j, then we call f - g trivial with respect to S and T and write f g mod ( S , T ; w ) .

Definition 2

([123, 124]) A pair ( S , T ) of monic subsets of k X is called a GS pair if S is closed under composition, T is closed under right-justified composition with respect to S, and given f S , g T , and w X such that if ( f , g ) w is defined, we have ( f , g ) w 0 mod ( S , T ; w ) . In this case, say that ( S , T ) is a GS pair for the A-module A M = A k X / ( k X T + I d ( S ) ) , where A = k X | S .

Theorem 8

(Kang and Lee [123, 124], the CD-lemma for cyclic modules) Consider a pair ( S , T ) of monic subsets of k X , the associative algebra A = k X | S defined by S, and the left cyclic module A M = A k X / ( k X T + I d ( S ) ) defined by ( S , T ) . Suppose that ( S , T ) is a Gröbner–Shirshov pair for the A-module A M and p k X T + I d ( S ) . Then p ¯ = a s ¯ b or p ¯ = c t ¯ , where a , b , c X , s S , and t T .

Applications of Theorem 8 appeared in [125127].

Take two sets X and Y and consider the free left k X -module Mod k X Y with k X -basis Y. Then Mod k X Y = y Y k X y is called a double-free module. We now define the GS basis in Mod k X Y . Choose a monomial ordering < on X , and a well-ordering < on Y. Put X Y = { u y | u X , y Y } and define an ordering < on X Y as follows: for any w 1 = u 1 y 1 , w 2 = u 2 y 2 X Y ,

w 1 < w 2 u 1 < u 2 or u 1 = u 2 , y 1 < y 2

Given S Mod k X Y with all s S monic, define composition in S to be only inclusion composition, which means that f ¯ = a g ¯ for some a X , where f , g S . If ( f , g ) f ¯ = f - a g = α i a i s i , where α i k , a i X , s i S , and a i s ¯ i < f ¯ , then this composition is called trivial modulo ( S , f ¯ ) .

Theorem 9

(Chibrikov [90], see also [78], the CD-lemma for modules) Consider a non-empty set S m o d k X Y with all s S monic and choose an ordering < on X Y as before. The following statements are equivalent:

  1. (i)

    S is a Gröbner–Shirshov basis in Mod k X Y .

  2. (ii)

    If 0 f k X S then f ¯ = a s ¯ for some a X and s S .

  3. (iii)

    I r r ( S ) = { w X Y | w a s ¯ , a X , s S } is a linear basis for the quotient Mod k X Y | S = Mod k X Y / k X S .

Outline of the proof. Take u X Y and express it as u = u X y u with u X X and y u Y . Put

cm ( u , v ) = a X u = b X v , lcm ( u , v ) = u = d X v ,

where y u = y v . Up to the order of u and v, we have c m ( u , v ) = c · lcm ( u , v ) .

The composition of two monic elements f , g Mod k X ( Y ) is

( f , g ) | lcm ( f ¯ , g ¯ ) = lcm ( f ¯ , g ¯ ) | f ¯ f - lcm ( f ¯ , g ¯ ) | g ¯ g .

If a 1 s ¯ 1 = a 2 s ¯ 2 for monic s 1 and s 2 then a 1 s 1 - a 2 s 2 = c · ( s 1 , s 2 ) lcm ( s ¯ 1 , s ¯ 2 ) . This gives an analogue of Lemma 1 for modules and the implication (i)(ii) of Theorem 9.

Given S k X , put A = k X | S . We can regard every left A-module A M as a k X -module in a natural way: f m : = ( f + I d ( S ) ) m for f k X and m M . Observe that A M is an epimorphic image of some free A-module. Assume now that A M = Mod A Y | T = Mod A Y / A T , where T Mod A Y . Put

T 1 = f i y i Mod k X Y | ( f i + I d ( S ) ) y i T

and R = S X Y T 1 . Then A M = mod k X Y | R as k X -modules.

Theorem 10

Given a submodule I of Mod k X Y and a monomial ordering < on X Y as above, there exists a unique reduced Gröbner–Shirshov basis S for I.

Corollary 2

(Cohn) Every left ideal I of k X is a free left k X -module.

Proof

Take a reduced Gröbner–Shirshov basis S of I as a k X -submodule of the cyclic k X -module. Then I is a free left k X -module with a k X -basis S.

As an application of the CD-lemma for modules, we give GS bases for the Verma modules over the Lie algebras of coefficients of free Lie conformal algebras. We find linear bases for these modules.

Let B be a set of symbols. Take the constant locality function N : B × B Z + ; that is, N ( a , b ) N for all a , b B . Put X = { b ( n ) | b B , n Z } and consider the Lie algebra L = L i e ( X | S ) over a field k of characteristic 0 generated by X with the relations

S = s ( - 1 ) s N s [ b ( n - s ) a ( m + s ) ] = 0 | a , b B , m , n Z .

For every b B , put b ~ = n b ( n ) z - n - 1 L [ [ z , z - 1 ] ] . It is well-known that these elements generate a free Lie conformal algebra C with data ( B , N ) (see [194]). Moreover, the coefficient algebra of C is just L.

Suppose that B is linearly ordered. Define an ordering on X as

a ( m ) < b ( n ) m < n or ( m = n and a < b ) .

We use the deg-lex ordering on X . It is clear that the leading term of each polynomial in S is b ( n ) a ( m ) with

n - m > N or ( n - m = N and ( b > a or ( b = a and N is odd ) ) ) .

The following lemma is essentially from [194].

Lemma 3

([78]) With the deg-lex ordering on X , the set S is a GS basis in L i e ( X ) .

Corollary 3

([78]) A linear basis of the universal enveloping algebra U = U ( L ) of L consists of the monomials

a 1 ( n 1 ) a 2 ( n 2 ) a k ( n k )

with a i B and n i Z such that for every 1 i < k we have

n i - n i + 1 N - 1 if a i > a i + 1 or ( a i = a i + 1 and N is odd ) N otherwise .

An L-module M is called restricted if for all a C and v M there is some integer T such that a ( n ) v = 0 for n T .

An L-module M is called a highest weight module whenever it is generated over L by a single element m M satisfying L + m = 0 , where L + is the subspace of L generated by { a ( n ) | a C , n 0 } . In this case m is called a highest weight vector.

Let us now construct a universal highest weight module V over L, which is often called the Verma module. Take the trivial 1 -dimensional L + -module k I v generated by I v ; hence, a ( n ) I v = 0 for all a B , n 0 . Clearly,

V = I n d L + L k I v = U ( L ) U ( L + ) k I v U ( L ) / U ( L ) L + .

Then V has the structure of the highest weight module over L with the action given by multiplication on U ( L ) / U ( L ) L + and a highest weight vector I U ( L ) . In addition, V = U ( L ) / U ( L ) L + is the universal enveloping vertex algebra of C and the embedding φ : C V is given by a a ( - 1 ) I (see also [194]).

Theorem 11

([78]) With the above notions, a linear basis of V consists of the elements

a 1 ( n 1 ) a 2 ( n 2 ) a k ( n k ) , a i B , n i Z

satisfying the condition in Corollary 3 and n k < 0 .

Proof

Clearly, as k X -modules, we have

U V = U ( U ( L ) / U ( L ) L + ) = Mod k X I | S ( - ) X I , a ( n ) I , n 0 = k X I | S ,

where S = { S ( - ) X I , a ( n ) I , n 0 } . In order to show that S is a Gröbner–Shirshov basis, we only need to verify that w = b ( n ) a ( m ) I , where m 0 . Take

f = s ( - 1 ) s n s ( b ( n - s ) a ( m + s ) - a ( m + s ) b ( n - s ) ) I and g = a ( m ) I .

Then ( f , g ) w = f - b ( n ) a ( m ) I 0 mod ( S , w ) since n - m N , m + s 0 , n - s 0 , and 0 s N . It follows that S is a Gröbner–Shirshov basis. Now, the result follows from the CD-lemma for modules.

2.4 Composition-Diamond lemma for categories

Denote by X an oriented multi-graph. A path

a n a n - 1 a 1 a 0 , n 0 ,

in X with edges x n , , x 2 , x 1 is a partial word u = x 1 x 2 x n on X with source a n and target a 0 . Denote by C ( X ) the free category generated by X (the set of all partial words (paths) on X with partial multiplication, the free ‘partial path monoid’ on X). A well-ordering on C ( X ) is called monomial whenever it is compatible with partial multiplication.

A polynomial f k C ( X ) is a linear combination of partial words with the same source and target. Then k C ( X ) is the partial path algebra on X (the free associative partial path algebra generated by X).

Given S k C ( X ) , denote by I d ( S ) the minimal subset of k C ( X ) that includes S and is closed under the partial operations of addition and multiplication. The elements of I d ( S ) are of the form α i a i s i b i with α i k , a i , b i C ( X ) , and s i S , and all S-words have the same source and target.

Both inclusion and intersection compositions are possible.

With these differences, the statement and proof of the CD-lemma are the same as for the free associative algebra.

Theorem 12

([36], the CD-lemma for categories) Consider a nonempty set S k C ( X ) of monic polynomials and a monomial ordering < on C ( X ) . Denote by I d ( S ) the ideal of k C ( X ) generated by S. The following statements are equivalent:

  1. (i)

    The set S is a Gröbner–Shirshov basis in k C ( X ) .

  2. (ii)

    f I d ( S ) f ¯ = a s ¯ b for some s S and a , b C ( X ) .

  3. (iii)

    the set I r r ( S ) = { u C ( X ) | u a s ¯ b a , b C ( X ) , s S } is a linear basis for k C ( X ) / I d ( S ) , which is denoted by k C ( X | S ) .

Outline of the proof.

Define w = lcm ( u , v ) , u , v C ( X ) and the general composition ( f , g ) w for f , g k C ( X ) and w = lcm ( f ¯ , g ¯ ) by the same formulas as above. Under the conditions of the analogue of Lemma 1, we again have a 1 s 1 b 1 - a 2 s 2 b 2 = c ( s 1 , s 2 ) w d 0 mod ( S , w ) , where w = lcm ( s ¯ 1 , s ¯ 2 ) and c , d C ( X ) . This implies the analogue of Lemma 1 and the main assertion (i)(ii) of Theorem 12.

Let us present some applications of CD-lemma for categories.

For each non-negative integer p, denote by [ p ] the set { 0 , 1 , 2 , , p } of integers in their usual ordering. A (weakly) monotonic map μ : [ q ] [ p ] is a function from [ q ] to [ p ] such that i j implies μ ( i ) μ ( j ) . The objects [ p ] with weakly monotonic maps as morphisms constitute the category Δ called the simplex category. It is convenient to use two special families of monotonic maps,

ε q i : [ q - 1 ] [ q ] , η q i : [ q + 1 ] [ q ]

defined for i = 0 , 1 , q (and for q > 0 in the case of ε i ) by

ε q i ( j ) = j if i > j , j + 1 if i j , η q i ( j ) = j if i j , j - 1 if j > i .

Take the oriented multi-graph X = ( V ( X ) , E ( X ) ) with

V ( X ) = { [ p ] | p Z + { 0 } } , E ( X ) = { ε p i : [ p - 1 ] [ p ] , η q j : [ q + 1 ] [ q ] | p > 0 , 0 i p , 0 j q } .

Consider the relation S C ( X ) × C ( X ) consisting of:

f q + 1 , q : ε q + 1 i ε q j - 1 = ε q + 1 j ε q i for j > i ; g q , q + 1 : η q j η q + 1 i = η q i η q + 1 j + 1 for j i ; h q - 1 , q : η q - 1 j ε q i = ε q - 1 i η q - 2 j - 1 for j > i , 1 q - 1 for i = j or i = j + 1 , ε q - 1 i - 1 η q - 2 j for i > j + 1 .

This yields a presentation Δ = C ( X | S ) of the simplex category Δ.

Order now C ( X ) as follows.

Firstly, for η p i , η q j { η p i | p 0 , 0 i p } put η p i > η q j iff p > q or ( p = q and i < j ).

Secondly, for

u = η p 1 i 1 η p 2 i 2 η p n i n { η p i | p 0 , 0 i p }

(these are all possible words on { η p i | p 0 , 0 i p } , including the empty word 1 v , where v O b ( X ) ), define

wt ( u ) = ( n , η p n i n , η p n - 1 i n - 1 , , η p 1 i 1 ) .

Then, for u , v { η p i | p 0 , 0 i p } put u > v iff wt ( u ) > wt ( v ) lexicographically.

Thirdly, for ε p i , ε q j { ε p i , | p Z + , 0 i p } , put ε p i > ε q j iff p > q or ( p = q and i < j ).

Finally, for u = v 0 ε p 1 i 1 v 1 ε p 2 i 2 ε p n i n v n C ( X ) , where n 0 , and v j { η p i | p 0 , 0 i p } put wt ( u ) = ( n , v 0 , v 1 , , v n , ε p 1 i 1 , , ε p n i n ) . Then for every u , v C ( X ) ,

u 1 v wt ( u ) > wt ( v ) lexicographically .

It is easy to check that 1 is a monomial ordering on C ( X ) . Then we have

Theorem 13

([36]) For X and S defined above, with the ordering 1 on C ( X ) , the set S is a Gröbner–Shirshov basis for the simplex partial path algebra k C ( X | S ) .

Corollary 4

([157]) Every morphism μ : [ q ] [ p ] of the simplex category has a unique expression of the form

ε p i 1 ε p - m + 1 i m η q - n j 1 η q - 1 j n

with p i 1 > > i m 0 , 0 j 1 < < j n < q , and q - n + m = p .

The cyclic category is defined by generators and relations as follows, see [104]. Take the oriented (multi) graph Y = ( V ( Y ) , E ( Y ) ) with V ( Y ) = { [ p ] | p Z + { 0 } } and

E ( Y ) = { ε p i : [ p - 1 ] [ p ] , η q j : [ q + 1 ] [ q ] , t q : [ q ] [ q ] | p > 0 , 0 i p , 0 j q } .

Consider the relation S C ( Y ) × C ( Y ) consisting of:

f q + 1 , q : ε q + 1 i ε q j - 1 = ε q + 1 j ε q i for j > i ; g q , q + 1 : η q j η q + 1 i = η q i η q + 1 j + 1 for j i ; h q - 1 , q : η q - 1 j ε q i = ε q - 1 i η q - 2 j - 1 for j > i , 1 q - 1 for i = j or i = j + 1 , ε q - 1 i - 1 η q - 2 j for i > j + 1 , ρ 1 : t q ε q i = ε q i - 1 t q - 1 for i = 1 , , q ; ρ 2 : t q η q i = η q i - 1 t q + 1 for i = 1 , , q ; ρ 3 : t q q + 1 = 1 q .

The category C ( Y | S ) is called the cyclic category and denoted by Λ.

Define an ordering on C ( Y ) as follows.

Firstly, for t p i , t q j { t q | q 0 } put ( t p ) i > ( t q ) j iff i > j or ( i = j and p > q ).

Secondly, for η p i , η q j { η p i | p 0 , 0 i p } put η p i > η q j iff p > q or ( p = q and i < j ).

Thirdly, for

u = w 0 η p 1 i 1 w 1 η p 2 i 2 w n - 1 η p n i n w n { t q , η p i | q , p 0 , 0 i p } ,

where w i { t q | q 0 } , put

wt ( u ) = ( n , w 0 , w 1 , , w n , η p n i n , η p n - 1 i n - 1 , , η p 1 i 1 ) .

Then for every u , v { t q , η p i | q , p 0 , 0 i p } put u > v iff wt ( u ) > wt ( v ) lexicographically.

Fourthly, for ε p i , ε q j { ε p i , | p Z + , 0 i p } , ε p i > ε q j iff p > q or ( p = q and i < j ).

Finally, for u = v 0 ε p 1 i 1 v 1 ε p 2 i 2 ε p n i n v n C ( Y ) and v j { t q , η p i | q , p 0 , 0 i p } define wt ( u ) = ( n , v 0 , v 1 , , v n , ε p 1 i 1 , , ε p n i n ) .

Then for every u , v C ( Y ) put u 2 v wt ( u ) < wt ( v ) lexicographically .

It is also easy to verify that 2 is a monomial ordering on C ( Y ) which extends 1 . Then we have

Theorem 14

([36]) Consider Y and S defined as the above. Put ρ 4 : t q ε q 0 = ε q q and ρ 5 : t q η q 0 = η q q t q + 1 2 . Then

  1. (1)

    With the ordering 2 on C ( Y ) , the set S { ρ 4 , ρ 5 } is a Gröbner–Shirshov basis for the cyclic category C ( Y | S ) .

  2. (2)

    Every morphism μ : [ q ] [ p ] of the cyclic category Λ = C ( Y | S ) has a unique expression of the form

    ε p i 1 ε p - m + 1 i m η q - n j 1 η q - 1 j n t q k

    with p i 1 > > i m 0 , 0 j 1 < < j n < q , 0 k q , and q - n + m = p .

2.5 Composition-Diamond lemma for associative algebras over commutative algebras

Given two well-ordered sets X and Y, put

N = [ X ] Y = { u = u X u Y | u X [ X ] a n d u Y Y }

and denote by k N the k-space spanned by N. Define the multiplication of words as

u = u X u Y , v = v X v Y N u v = u X v X u Y v Y N .

This makes k N an algebra isomorphic to the tensor product k [ X ] k Y , called a‘double free associative algebra’. It is a free object in the category of all associative algebras over all commutative algebras (over k): every associative algebra K A over a commutative algebra K is isomorphic to k [ X ] k Y / I d ( S ) as a k-algebra and a k [ X ] -algebra.

Choose a monomial ordering > on N. The following definitions of compositions and the GS basis are taken from [170].

Take two monic polynomials f and g in k [ X ] k Y and denote by L the least common multiple of f ¯ X and g ¯ X .

  1. 1.

    Inclusion. Assume that g ¯ Y is a subword of f ¯ Y , say, f ¯ Y = c g ¯ Y d for some c , d Y . If f ¯ Y = g ¯ Y then f ¯ X g ¯ X and if g ¯ Y = 1 then we set c = 1 . Put w = L f ¯ Y = L c g ¯ Y d . Define the composition C 1 ( f , g , c ) w = L f ¯ X f - L g ¯ X c g d .

  2. 2.

    Overlap. Assume that a non-empty beginning of g ¯ Y is a non-empty ending of f ¯ Y , say, f ¯ Y = c c 0 , g ¯ Y = c 0 d , and f ¯ Y d = c g ¯ Y for some c , d , c 0 Y and c 0 1 . Put w = L f ¯ Y d = L c g ¯ Y . Define the composition C 2 ( f , g , c 0 ) w = L f ¯ X f d - L g ¯ X c g .

  3. 3.

    External. Take a (possibly empty) associative word c 0 Y . In the case that the greatest common divisor of f ¯ X and g ¯ X is non-empty and both f ¯ Y and g ¯ Y are non-empty, put w = L f ¯ Y c 0 g ¯ Y and define the composition C 3 ( f , g , c 0 ) w = L f ¯ X f c 0 g ¯ Y - L g ¯ X f ¯ Y c 0 g .

A monic subset S of k [ X ] k Y is called a GS basis whenever all compositions of elements of S, say ( f , g ) w , are trivial modulo ( S , w ) :

( f , g ) w = i α i a i s i b i ,

where a i , b i N , s i S , α i k , and a i s i ¯ b i < w for all i.

Theorem 15

(Mikhalev and Zolotykh [170, 228], the CD-lemma for associative algebras over commutative algebras) Consider a monic subset S k [ X ] k Y and a monomial ordering < on N. The following statements are equivalent:

  1. (i)

    The set S is a Gröbner–Shirshov basis in k [ X ] k Y .

  2. (ii)

    For every element f I d ( S ) , the monomial f ¯ contains s ¯ as its subword for some s S .

  3. (iii)

    The set I r r ( S ) = { w N | w a s ¯ b , a , b N , s S } is a linear basis for the quotient k [ X ] k Y .

Outline of the proof. For

w = lcm ( u , v ) = lcm ( u X , v X ) lcm ( u Y , v Y )

the general composition is

( s 1 , s 2 ) w = ( lcm ( u X , v X ) / u X ) w | u s 1 - ( lcm ( u X , v X ) / v X ) w | v s 2 ,

where s 1 , s 2 k [ X ] Y are k-monic with u = s ¯ 1 and v = s ¯ 2 . Moreover, ( s 1 , s 2 ) w 0 mod ( { s 1 , s 2 } , w ) whenever w = u X v X u Y c Y v Y with c Y Y , that is, w is a trivial least common multiple relative to both X-words and Y-words. This implies the analog of Lemma 1 and the claim (i)(ii) in Theorem 15.

We apply this lemma in Sect. 4.3.

2.6 PBW-theorem for Lie algebras

Consider a Lie algebra ( L , [ ] ) over a field k with a well-ordered linear basis X = { x i | i I } and multiplication table S = { [ x i x j ] = [ | x i x j | ] | i > j , i , j I } , where for every i , j I we write [ | x i x j | ] = Σ t α i j t x t with α i j t k . Then U ( L ) = k X | S ( - ) is called the universal enveloping associative algebra of L, where S ( - ) = { x i x j - x j x i = [ | x i x j | ] | i > j , i , j I } .

Theorem 16

(PBW Theorem) In the above notation and with the deg-lex ordering on X , the set S ( - ) is a Gröbner–Shirshov basis of k X . Then by the CD-lemma for associative algebras, the set I r r ( S ( - ) ) consists of the elements

x i 1 x i n with i 1 i n , i 1 , , i n I , n 0 ,

and constitutes a linear basis of U ( L ) .

Theorem 17

(The PBW Theorem in Shirshov’s form) Consider L = L i e ( X | S ) with S L i e ( X ) k X and U ( L ) = k X | S ( - ) . The following statements are equivalent.

  1. (i)

    For the deg-lex ordering, S is a GS basis of L i e ( X ) .

  2. (ii)

    For the deg-lex ordering, S ( - ) is a GS basis of k X .

  3. (iii)

    A linear basis of U ( L ) consists of the words u = u 1 u n , where u 1 u n in the lex ordering, n 0 , and every u i is an S ( - ) -irreducible associative Lyndon–Shirshov word in X.

  4. (iv)

    A linear basis of L is the set of all S-irreducible Lyndon–Shirshov Lie monomials [ u ] in X.

  5. (v)

    A linear basis of U ( L ) consists of the polynomials u = [ u 1 ] [ u n ] , where u 1 u n in the lex ordering, n 0 , and every [ u i ] is an S-irreducible non-associative Lyndon–Shirshov word in X.

The PBW theorem, Theorem 33, the CD-lemmas for associative and Lie algebras, Shirshov’s factorization theorem, and property (VIII) of Sect. 4.2 imply that every LS-subword of u is a subword of some u i .

Makar–Limanov gave [158] an interesting form of the PBW theorem for a finite dimensional Lie algebra.

2.7 Drinfeld–Jimbo algebra U q ( A ) , Kac–Moody enveloping algebra U ( A ) , and the PBW basis of U q ( A N )

Take an integral symmetrizable N × N Cartan matrix A = ( a i j ) . Hence, a i i = 2 , a i j 0 for i j , and there exists a diagonal matrix D with diagonal entries d i , which are nonzero integers, such that the product D A is symmetric. Fix a nonzero element q of k with q 4 d i 1 for all i. Then the Drinfeld–Jimbo quantum enveloping algebra is

U q ( A ) = k X H Y | S + K T S - ,

where

X = { x i } , H = { h i ± 1 } , Y = { y i } , S + = ν = 0 1 - a i j ( - 1 ) ν 1 - a i j ν t x i 1 - a i j - ν x j x i ν , where i j , t = q 2 d i , S - = ν = 0 1 - a i j ( - 1 ) ν 1 - a i j ν t y i 1 - a i j - ν y j y i ν , where i j , t = q 2 d i , K = { h i h j - h j h i , h i h i - 1 - 1 , h i - 1 h i - 1 , x j h i ± 1 - q 1 d i a i j h ± 1 x j , h i ± 1 y j - q 1 y j h ± 1 } , T = x i y j - y j x i - δ i j h i 2 - h i - 2 q 2 d i - q - 2 d i ,

and

m n t = i = 1 n t m - i + 1 - t i - m - 1 t i - t - i ( for m > n > 0 ) , 1 ( for n = 0 or m = n ) .

Theorem 18

([55]) For every symmetrizable Cartan matrix A, with the deg-lex ordering on { X H Y } , the set S + c T K S - c is a Gröbner–Shirshov basis of the Drinfeld–Jimbo algebra U q ( A ) , where S + c and S - c are the Shirshov completions of S + and S - .

Corollary 5

(Rosso [195], Yamane [220]) For every symmetrizable Cartan matrix A we have the triangular decomposition

U q ( A ) = U q + ( A ) k [ H ] U q - ( A )

with U q + ( A ) = k X | S + and U q - ( A ) = k Y | S - .

Similar results are valid for the Kac–Moody Lie algebras g ( A ) and their universal enveloping algebras

U ( A ) = k X H Y | S + H K S - ,

where S + , S - are the same as for U q ( A ) ,

K = { h i h j - h j h i , x j h i - h i x j + d i a i j x i , h i y i - y i h i + d i a i j y j } ,

and T = { x i y j - y j x i - δ i j h i } .

Theorem 19

([55]) For every symmetrizable Cartan matrix A, the set S + c T K S - c is a Gröbner–Shirshov basis of the universal enveloping algebra U ( A ) of the Kac–Moody Lie algebra g ( A ) .

The PBW theorem in Shirshov’s form implies

Corollary 6

(Kac [117]) For every symmetrizable Cartan matrix A, we have the triangular decomposition

U ( A ) = U + ( A ) k [ H ] U - ( A ) , g ( A ) = g + ( A ) k [ H ] g - ( A ) .

Poroshenko [179, 180] found GS bases for the Kac–Moody algebras of types A n ~ , B n ~ , C n ~ , and D n ~ . He used the available linear bases of the algebras [117].

Consider now

A = A N = 2 - 1 0 0 - 1 2 - 1 0 0 - 1 2 0 · · · · · 0 0 0 2

and assume that q 8 1 . Introduce new variables, defined by Jimbo (see [220]), which generate U q ( A N ) :

X ~ = { x i j , 1 i < j N + 1 } ,

where

x i j = x i j = i + 1 , q x i , j - 1 x j - 1 , j - q - 1 x j - 1 , j x i , j - 1 j > i + 1 .

Order the set X ~ as follows: x m n > x i j ( m , n ) > l e x ( i , j ) . Recall from Yamane [220] the notation

C 1 = { ( ( i , j ) , ( m , n ) ) | i = m < j < n } , C 2 = { ( ( i , j ) , ( m , n ) ) | i < m < n < j } , C 3 = { ( ( i , j ) , ( m , n ) ) | i < m < j = n } , C 4 = { ( ( i , j ) , ( m , n ) ) | i < m < j < n } , C 5 = { ( ( i , j ) , ( m , n ) ) | i < j = m < n } , C 6 = { ( ( i , j ) , ( m , n ) ) | i < j < m < n } .

Consider the set S ~ + consisting of Jimbo’s relations:

x m n x i j - q - 2 x i j x m n ( ( i , j ) , ( m , n ) ) C 1 C 3 , x m n x i j - x i j x m n ( ( i , j ) , ( m , n ) ) C 2 C 6 , x m n x i j - x i j x m n + ( q 2 - q - 2 ) x i n x m j ( ( i , j ) , ( m , n ) ) C 4 , x m n x i j - q 2 x i j x m n + q x i n ( ( i , j ) , ( m , n ) ) C 5 .

It is easy to see that U q + ( A N ) = k X ~ | S + ~ .

A direct proof [86] shows that S ~ + is a GS basis for k X ~ | S + ~ = U q + ( A N ) [55]. The proof is different from the argument of Bokut and Malcolmson [55]. This yields

Theorem 20

([55]) In the above notation and with the deg-lex ordering on { X ~ H Y ~ } , the set S ~ + T K S ~ - is a Gröbner–Shirshov basis of

U q ( A N ) = k X ~ H Y ~ | S ~ + T K S ~ - .

Corollary 7

([195, 220]) For q 8 1 , a linear basis of U q ( A n ) consists of

y m 1 n 1 y m l n l h 1 s 1 h N s N x i 1 j 1 x i k j k

with ( m 1 , n 1 ) ( m l , n l ) , ( i 1 , j 1 ) ( i k , j k ) , k , l 0 and s t Z .

3 Gröbner–Shirshov bases for groups and semigroups

In this section we apply the method of GS bases for braid groups in different sets of generators, Chinese monoids, free inverse semigroups, and plactic monoids in two sets of generators (row words and column words).

Given a set X consider S X × X the congruence ρ ( S ) on X generated by S, the quotient semigroup

A = sgp X | S = X / ρ ( S ) ,

and the semigroup algebra k ( X / ρ ( S ) ) . Identifying the set { u = v | ( u , v ) S } with S, it is easy to see that

σ : k X | S k ( X / ρ ( S ) ) , α i u i + I d ( S ) α i u i ¯

is an algebra isomorphism.

The Shirshov completion S c of S consists of semigroup relations, S c = { u i - v i , i I } . Then I r r ( S c ) is a linear basis of k X | S , and so σ ( I r r ( S c ) ) is a linear basis of k ( X / ρ ( S ) ) . This shows that I r r ( S c ) consists precisely of the normal forms of the elements of the semigroup sgp X | S .

Therefore, in order to find the normal forms of the semigroup sgp X | S , it suffices to find a GS basis S c in k X | S . In particular, consider a group G = g p X | S , where S = { ( u i , v i ) F ( X ) × F ( X ) | i I } and F ( X ) is the free group on a set X. Then G has a presentation

G = sgp X X - 1 | S , x ε x - ε = 1 , ε = ± 1 , x X , X X - 1 =

as a semigroup.

3.1 Gröbner–Shirshov bases for braid groups

Consider the Artin braid group B n of type A n - 1 (Artin [5]). We have

B n = g p σ 1 , , σ n | σ j σ i = σ i σ j ( j - 1 > i ) , σ i + 1 σ i σ i + 1 = σ i σ i + 1 σ i , 1 i n - 1 .

3.1.1 Braid groups in the Artin–Burau generators

Assume that X = Y ˙ Z with Y and Z well-ordered and that the ordering on Y is monomial. Then every word in X has the form u = u 0 z 1 u 1 z k u k , where k 0 , u i Y , and z i Z . Define the inverse weight of the word u X as

inwt ( u ) = ( k , u k , z k , , u 1 , z 1 , u 0 )

and the inverse weight lexicographic ordering as

u > v inwt ( u ) > inwt ( v ) .

Call this ordering the inverse tower ordering for short. Clearly, it is a monomial ordering on X .

When X = Y ˙ Z , Y = T ˙ U , and Y is endowed with the inverse tower ordering, define the inverse tower ordering on X with respect to the presentation X = ( T ˙ U ) ˙ Z . In general, for

X = ( ( X ( n ) ˙ X ( n - 1 ) ) ˙ ) ˙ X ( 0 )

with X ( n ) -words equipped with a monomial ordering we can define the inverse tower ordering of X-words.

Introduce a new set of generators for the braid group B n , called the Artin–Burau generators. Put

s i , i + 1 = σ i 2 , s i , j + 1 = σ j σ i + 1 σ i 2 σ i + 1 - 1 σ j - 1 , 1 i < j n - 1 ; σ i , j + 1 = σ i - 1 σ j - 1 , 1 i j n - 1 ; σ i i = 1 , { a , b } = b - 1 a b .

Form the sets

S j = { s i , j , s i , j - 1 , 1 i , j < n } and Σ - 1 = { σ 1 - 1 , σ n - 1 - 1 } .

Then the set

S = S n S n - 1 S 2 Σ - 1

generates B n as a semigroup.

Order now the alphabet S as

S n < S n - 1 < < S 2 < Σ - 1 ,

and

s 1 , j - 1 < s 1 , j < s 2 , j - 1 < < s j - 1 , j , σ 1 - 1 < σ 2 - 1 < σ n - 1 - 1 .

Order S n -words by the deg-inlex ordering; that is, first compare words by length and then by the inverse lexicographic ordering starting from their last letters. Then we use the inverse tower ordering of S-words.

Lemma 4

(Artin [6], Markov [160]) The following Artin–Markov relations hold in the braid group B n :

σ k - 1 s i , j δ = s i , j δ σ k - 1 for k i - 1 , i , j - 1 , j ,
(1)
σ i - 1 s i , i + 1 δ = s i , i + 1 δ σ 1 - 1 ,
(2)
σ i - 1 - 1 s i , j δ = s i - 1 , j δ σ i - 1 - 1 ,
(3)
σ i - 1 s i , j δ = { s i + 1 , j δ , s i , i + 1 } σ i - 1 ,
(4)
σ j - 1 - 1 s i , j δ = s i , j - 1 δ σ j - 1 - 1 ,
(5)
σ j - 1 s i , j δ = { s i , j + 1 δ , s j , j + 1 } σ j - 1 ,
(6)

where δ = ± 1 ;

s j , k - 1 s k , l ε = { s k , l ε , s j , l - 1 } s j , k - 1 ,
(7)
s j , k s k , l ε = { s k , l ε , s j , l s k , l } s j , k ,
(8)
s j , k - 1 s j , l ε = { s j , l ε , s k , l - 1 s j , l - 1 } s j , k - 1 ,
(9)
s j , k s j , l ε = { s j , l ε , s k , l } s j , k ,
(10)
s i , k - 1 s j , l ε = { s j , l ε , s k , l s i , l s k , l - 1 s i , l - 1 } s i , k - 1 ,
(11)
s i , k s j , l ε = { s j , l ε , s i , l - 1 s k , l - 1 s i , l s k , l } s i , k ,
(12)

where i < j < k < l and ε = ± 1 ;

s i , k δ s j , l ε = s j , l ε s i , k δ ,
(13)
σ j - 1 σ k - 1 = σ k - 1 σ j - 1 for j < k - 1
(14)
σ j , j + 1 σ k , j + 1 = σ k , j + 1 σ j - 1 , j for j < k ,
(15)
σ i - 2 = s i , i + l - 1 ,
(16)
s i , j ± 1 s i , j 1 = 1 ,
(17)

where j < i < k < l or i < k < j < l , and ε, δ = ± 1 .

Theorem 21

([25]) The Artin–Markov relations (1)–(13) form a Gröbner–Shirshov basis of the braid group B n in terms of the Artin–Burau generators with respect to the inverse tower ordering of words.

It is claimed in [25] that some compositions are trivial. Processing all compositions explicitly, [82] supported the claim.

Corollary 8

(Markov–Ivanovskii [6]) The following words are normal forms of the braid group B n :

f n f n - 1 f 2 σ i n n σ i n - 1 n - 1 σ i 2 2 ,

where all f j for 2 j n are free irreducible words in { s i j , i < j } .

3.1.2 Braid groups in the Artin–Garside generators

The Artin–Garside generators of the braid group B n + 1 are σ i , 1 i n , , - 1 (Garside [103] 1969), where = Λ 1 Λ n with Λ i = σ 1 σ i .

Putting - 1 < < σ 1 < < σ n , order { - 1 , , σ 1 , , σ n } by the deg-lex ordering.

Denote by V ( j , i ) , W ( j , i ) , for j i positive words in the letters σ j , σ j + 1 , , σ i , assuming that V ( i + 1 , i ) = 1 , W ( i + 1 , i ) = 1 , .

Given V = V ( 1 , i ) , for 1 k n - i denote by V ( k ) the result of shifting the indices of all letters in V by k: σ 1 σ k + 1 , , σ i σ k + i , and put V = V ( 1 ) . Define σ i j = σ i σ i - 1 σ j for j i - 1 , while σ i i = σ i and σ i i + 1 = 1 .

Theorem 22

([23, 47]) A Gröbner–Shirshov basis S of B n + 1 in the Artin–Garside generators consists of the following relations:

σ i + 1 σ i V ( 1 , i - 1 ) W ( j , i ) σ i + 1 j = σ i σ i + 1 σ i V ( 1 , i - 1 ) σ i j W ( j , i ) , σ s σ k = σ k σ s for s - k 2 , σ 1 V 1 σ 2 σ 1 V 2 V n - 1 σ n σ 1 = V 1 ( n - 1 ) V 2 ( n - 2 ) V ( n - 1 ) , σ l = σ n - l + 1 for 1 l n , σ l - 1 = - 1 σ n - l + 1 for 1 l n , - 1 = 1 , - 1 = 1 ,

where 1 i n - 1 and 1 j i + 1 ; moreover, W ( j , i ) begins with σ i unless it is empty, and V i = V i ( 1 , i ) .

There are corollaries.

Corollary 9

The S-irreducible normal form of each word of B n + 1 coincides with its Garside normal form [103].

Corollary 10

(Garside [103]) The semigroup B n + 1 + of positive braids can be embedded into a group.

3.1.3 Braid groups in the Birman–Ko–Lee generators

Recall that the Birman–Ko–Lee generators σ t s of the braid group B n are

σ t s = ( σ t - 1 σ t - 2 σ s + 1 ) σ s ( σ s + 1 - 1 σ t - 2 - 1 σ t - 1 - 1 )

and we have the presentation

B n = g p σ t s , n t > s 1 | σ t s σ r q = σ r q σ t s , ( t - r ) ( t - q ) ( s - r ) ( s - q ) > 0 , σ t s σ s r = σ t r σ t s = σ s r σ t r , n t > s > r 1 .

Denote by δ the Garside word, δ = σ n n - 1 σ n - 1 n - 2 σ 21 .

Define the order as δ - 1 < δ < σ t s < σ r q iff ( t , s ) < ( r , q ) lexicographically. Use the deg-lex ordering on { δ - 1 , δ , σ t s , n t > s 1 } .

Instead of σ i j , we write simply ( i , j ) or ( j , i ) . We also set

( t m , t m - 1 , , t 1 ) = ( t m , t m - 1 ) ( t m - 1 , t m - 2 ) ( t 2 , t 1 ) ,

where t j t j + 1 , 1 j m - 1 . In this notation, we can write the defining relations of B n as

( t 3 , t 2 , t 1 ) = ( t 2 , t 1 , t 3 ) = ( t 1 , t 3 , t 2 ) for t 3 > t 2 > t 1 , ( k , l ) ( i , j ) = ( i , j ) ( k , l ) for k > l , i > j , k > i ,

where either k > i > j > l or k > l > i > j .

Denote by V [ t 2 , t 1 ] , where n t 2 > t 1 1 , a positive word in ( k , l ) satisfying t 2 k > l t 1 . We can use any capital Latin letter with indices instead of V, and appropriate indices (for instance, t 3 and t 0 with t 3 > t 0 ) instead of t 2 and t 1 . Use also the following equalities in B n :

V [ t 2 - 1 , t 1 ] ( t 2 , t 1 ) = ( t 2 , t 1 ) V [ t 2 - 1 , t 1 ]

for t 2 > t 1 , where V [ t 2 - 1 , t 1 ] = ( V [ t 2 - 1 , t 1 ] ) | ( k , l ) ( k , l ) , if l t 1 ; ( k , t 1 ) ( t 2 , k ) ;

W [ t 2 - 1 , t 1 ] ( t 1 , t 0 ) = ( t 1 , t 0 ) W [ t 2 - 1 , t 1 ]

for t 2 > t 1 > t 0 , where W [ t 2 - 1 , t 1 ] = ( W [ t 2 - 1 , t 1 ] ) | ( k , l ) ( k , l ) , if l t 1 ; ( k , t 1 ) ( k , t 0 ) .

Theorem 23

([24]) A Gröbner–Shirshov basis of the braid group B n in the Birman–Ko–Lee generators consists of the following relations:

( k , l ) ( i , j ) = ( i , j ) ( k , l ) for k > l > i > j , ( k , l ) V [ j - 1 , 1 ] ( i , j ) = ( i , j ) ( k , l ) V [ j - 1 , 1 ] for k > i > j > l , ( t 3 , t 2 ) ( t 2 , t 1 ) = ( t 2 , t 1 ) ( t 3 , t 1 ) , ( t 3 , t 1 ) V [ t 2 - 1 , 1 ] ( t 3 , t 2 ) = ( t 2 , t 1 ) ( t 3 , t 1 ) V [ t 2 - 1 , 1 ] , ( t , s ) V [ t 2 - 1 , 1 ] ( t 2 , t 1 ) W [ t 3 - 1 , t 1 ] ( t 3 , t 1 ) = ( t 3 , t 2 ) ( t , s ) V [ t 2 - 1 , 1 ] ( t 2 , t 1 ) W [ t 3 - 1 , t 1 ] , ( t 3 , s ) V [ t 2 - 1 , 1 ] ( t 2 , t 1 ) W [ t 3 - 1 , t 1 ] ( t 3 , t 1 ) = ( t 2 , s ) ( t 3 , s ) V [ t 2 - 1 , 1 ] ( t 2 , t 1 ) W [ t 3 - 1 , t 1 ] , ( 2 , 1 ) V 2 [ 2 , 1 ] ( 3 , 1 ) V n - 1 [ n - 1 , 1 ] ( n , 1 ) = δ V 2 [ 2 , 1 ] V n - 1 [ n - 1 , 1 ] , ( t , s ) δ = δ ( t + 1 , s + 1 ) , ( t , s ) δ - 1 = δ - 1 ( t - 1 , s - 1 ) with t ± 1 , s ± 1 ( mod n ) , δ δ - 1 = 1 , δ - 1 δ = 1 ,

where V [ k , l ] means, as above, a word in ( i , j ) satisfying k i > j l , t > t 3 , and t 2 > s .

There are two corollaries.

Corollary 11

(Birman et al. [13]) The semigroup B n + of positive braids in the Birman–Ko–Lee generators embeds into a group.

Corollary 12

(Birman et al. [13]) The S-irreducible normal form of a word in B n in the Birman–Ko–Lee generators coincides with the Birman–Ko–Lee–Garside normal form δ k A , where A B n + .

3.1.4 Braid groups in the Adjan–Thurston generators

The symmetric group S n + 1 has the presentation

S n + 1 = g p s 1 , , s n | s i 2 = 1 , s j s i = s i s j ( j - 1 > i ) , s i + 1 s i s i + 1 = s i s i + 1 s i .

Bokut and Shiao [58] found the normal form for S n + 1 in the following statement: the set N = { s 1 i 1 s 2 i 2 s n i n | i j j + 1 } is a Gröbner–Shirshov normal form for S n + 1 in the generators s i = ( i , i + 1 ) relative to the deg-lex ordering, where s j i = s j s j - 1 s i for j i and s j j + 1 = 1 .

Take α S n + 1 with the normal form α ¯ = s 1 i 1 s 2 i 2 s n i n N . Define the length of α as | α ¯ | = l ( s 1 i 1 s 2 i 2 s n i n ) and write α β whenever | α β ¯ | = | α ¯ | + | β ¯ | . Moreover, every α ¯ N has a unique expression α ¯ = s l 1 i l 1 s l 2 i l 2 s l t i l t with all s l j i l j 1 . The number t is called the breadth of α.

Now put

B n + 1 = g p r ( α ¯ ) , α S n + 1 \ { 1 } | r ( α ¯ ) r ( β ¯ ) = r ( α β ¯ ) , α β ,

where r ( α ¯ ) stands for a letter with index α ¯ .

Then for the braid group with n generators we have B n + 1 B n + 1 . Indeed, define

θ : B n + 1 B n + 1 , σ i r ( s i ) , θ : B n + 1 B n + 1 , r ( α ¯ ) α ¯ | s i σ i .

These mappings are homomorphism satisfying θ θ = l B n + 1 and θ θ = l B n + 1 . Hence,

B n + 1 = g p r ( α ¯ ) , α S n + 1 \ { 1 } | r ( α ¯ ) r ( β ¯ ) = r ( α β ¯ ) , α β .

Put X = { r ( α ¯ ) , α S n + 1 \ { 1 } } . These generators of B n + 1 are called the Adjan–Thurston generators.

Then the positive braid semigroup generated by X is

B n + 1 + = sgp X | r ( α ¯ ) r ( β ¯ ) = r ( α β ¯ ) , α β .

Assume that s 1 < s 2 < < s n . Define r ( α ¯ ) < r ( β ¯ ) if and only if | α ¯ | > | β ¯ | or | α ¯ | = | β ¯ | and α ¯ < l e x β ¯ . Clearly, this is a well-ordering on X. We will use the deg-lex ordering on X .

Theorem 24

([89]) The Gröbner–Shirshov basis of B n + 1 + in the Adjan–Thurston generator X relative to the deg-lex ordering on X consists of the relations

r ( α ¯ ) r ( β ¯ ) = r ( α β ¯ ) for α β ; r ( α ¯ ) r ( β γ ¯ ) = r ( α β ¯ ) r ( γ ¯ ) for α β γ .

Theorem 25

([89]) The Gröbner–Shirshov basis of B n + 1 in the Adjan–Thurston generator X with respect to the deg-lex ordering on X consists of the relations

  1. (1)

    r ( α ¯ ) r ( β ¯ ) = r ( α β ¯ ) for α β ,

  2. (2)

    r ( α ¯ ) r ( β γ ¯ ) = r ( α β ¯ ) r ( γ ¯ ) for α β γ ,

  3. (3)

    r ( α ¯ ) Δ ε = Δ ε r ( α ¯ ) for α ¯ = α ¯ | s i s n + 1 - i ,

  4. (4)

    r ( α β ¯ ) r ( γ μ ¯ ) = Δ r ( α ¯ ) r ( μ ¯ ) for α β γ μ with r ( β γ ¯ ) = Δ ,

  5. (5)

    Δ ε Δ - ε =1.

Corollary 13

(Adjan–Thurston) The normal forms for B n + 1 are Δ k r ( α 1 ¯ ) r ( α s ¯ ) for k Z , where r ( α 1 ¯ ) r ( α s ¯ ) is minimal in the deg-lex ordering.

3.2 Gröbner–Shirshov basis for the Chinese monoid

The Chinese monoid C H ( X , < ) over a well-ordered set ( X , < ) has the presentation C H ( X ) = sgp X | S , where X = { x i | i I } and S consists of the relations

x i x j x k = x i x k x j = x j x i x k for i > j > k , x i x j x j = x j x i x j , x i x i x j = x i x j x i for i > j .

Theorem 26

([85]) With the deg-lex ordering on X , the following relations (1)–(5) constitute a Gröbner–Shirshov basis of the Chinese monoid C H ( X ) :

  1. (1)

    x i x j x k - x j x i x k ,

  2. (2)

    x i x k x j - x j x i x k ,

  3. (3)

    x i x j x j - x j x i x j ,

  4. (4)

    x i x i x j - x i x j x i ,

  5. (5)

    x i x j x i x k - x i x k x i x j ,

where x i , x j , x k X and i > j > k .

Denote by Λ the set consistsing of the words on X of the form u n = w 1 w 2 w n with n 0 , where

w 1 = x 1 t 11 w 2 = ( x 2 x 1 ) t 21 x 2 t 22 w 3 = ( x 3 x 1 ) t 31 ( x 3 x 2 ) t 32 x 3 t 33 w n = ( x n x 1 ) t n 1 ( x n x 2 ) t n 2 ( x n x n - 1 ) t n ( n - 1 ) x n t n n

for x i X with x 1 < x 2 < < x n , and all exponents are non-negative.

Corollary 14

([71]) This Λ is a set of normal forms of elements of the Chinese monoid C H ( X ) .

3.3 Gröbner–Shirshov basis for free inverse semigroup

Consider a semigroup S. An element s S is called an inverse of t S whenever s t s = s and t s t = t . An inverse semigroup is a semigroup in which every element t has a unique inverse, denoted by t - 1 .

Given a set X, put X - 1 = { x - 1 | x X } . On assuming that X X - 1 = , denote X X - 1 by Y. Define the formal inverses of the elements of Y as

1 - 1 = 1 , ( x - 1 ) - 1 = x ( x X ) , ( y 1 y 2 y n ) - 1 = y n - 1 y 2 - 1 y 1 - 1 ( y 1 , y 2 , , y n Y ) .

It is well known that

FI ( X ) = sgp Y | a a - 1 a = a , a a - 1 b b - 1 = b b - 1 a a - 1 , a , b Y

is the free inverse semigroup (with identity) generated by X.

Introduce the notions of a formal idempotent, a (prime) canonical idempotent, and an ordered (prime) canonical idempotent in Y . Assume that < is a well-ordering on Y.

  1. (i)

    The empty word 1 is an idempotent.

  2. (ii)

    If h is an idempotent and x Y then x - 1 h x is both an idempotent and a prime idempotent.

  3. (iii)

    If e 1 , e 2 , , e m , where m > 1 , are prime idempotents then e = e 1 e 2 e m is an idempotent.

  4. (iv)

    An idempotent w Y is called canonical whenever w avoids subwords of the form x - 1 e x f x - 1 , where x Y , both e and f are idempotents.

  5. (v)

    A canonical idempotent w Y is called ordered if every subword e = e 1 e 2 e m of w with m > 2 and e i being idempotents satisfies fir ( e 1 ) < fir ( e 2 ) < < fir ( e m ) , where fir ( u ) is the first letter of u Y .

Theorem 27

([44]) Denote by S the subset of k Y consisting two kinds of polynomials:

  • e f - f e , where e and f are ordered prime canonical idempotents with e f > f e ;

  • x - 1 e x f x - 1 - f x - 1 e , where x Y , x - 1 e x , and x f x - 1 are ordered prime canonical idempotents.

Then, with the deg-lex ordering on Y , the set S is a Gröber–Shirshov basis of the free inverse semigroup sgp Y | S .

Theorem 28

([44]) The normal forms of elements of the free inverse semigroup sgp Y | S are

u 0 e 1 u 1 e m u m Y ,

where m 0 , u 1 , , u m - 1 1 and u 0 u 1 u m avoids subwords of the form y y - 1 for y Y , while e 1 , , e m are ordered canonical idempotents such that the first (respectively last) letter of e i , for 1 i m is not equal to the first (respectively last) letter of u i (respectively u i - 1 ).

The above normal form is analogous to the semi-normal forms of Poliakova and Schein [176], 2005.

3.4 Approaches to plactic monoids via Gröbner–Shirshov bases in row and column generators

Consider the set X = { x 1 , , x n } of n elements with the ordering x 1 < < x n . Schützenberger called P n = sgp X | T a plactic monoid (see also Lothaire [153], Chapter 5), where T consists of the Knuth relations

x i x k x j = x k x i x j for x i x j < x k , x j x i x k = x j x k x i for x i < x j x k .

A nondecreasing word R X is called a row and a strictly decreasing word C X is called a column; for example, x 1 x 1 x 3 x 5 x 5 x 5 x 6 is a row and x 6 x 4 x 2 x 1 is a column.

For two rows R , S A say that R dominates S whenever | R | | S | and every letter of R is greater than the corresponding letter of S, where | R | is the length of R.

A (semistandard) Young tableau on A (see [152]) is a word w = R 1 R 2 R t in U such that R i dominates R i + 1 for all i = 1 , , t - 1 . For example,

x 4 x 5 x 5 x 6 · x 2 x 2 x 3 x 3 x 5 x 7 · x 1 x 1 x 1 x 2 x 4 x 4 x 4

is a Young tableau.

Cain et al. [69] use the Schensted–Knuth normal form (the set of (semistandard) Young tableaux) to prove that the multiplication table of column words, u v = u v , forms a finite GS basis of the finitely generated plactic monoid. Here the Young tableaux u v is the output of the column Schensted algorithm applied to u v , but u v is not made explicit.

In this section we give new explicit formulas for the multiplication tables of row and column words. In addition, we give independent proofs that the resulting sets of relations are GS bases in row and column generators respectively. This yields two new approaches to plactic monoids via their GS bases.

3.4.1 Plactic monoids in the row generators

Consider the plactic monoid P n = sgp X | T , where X = { 1 , 2 , , n } with 1 < 2 < < n . Denote by N the set of non-negative integers. It is convenient to express the rows R X as R = ( r 1 , r 2 , , r n ) , where r i for i = 1 , 2 , , n is the number of occurrences of the letter i. For example, R = 111225 = ( 3 , 2 , 0 , 0 , 1 , 0 , , 0 ) .

Denote by U the set of all rows in X and order U as follows. Given R = ( r 1 , r 2 , , r n ) U , define the length | R | = r 1 + + r n of R in X .

Firstly, order U: for every R , S U , put R < S if and only if | R | < | S | or | R | = | S | and ( r 1 , r 2 , , r n ) > ( s 1 , s 2 , , s n ) lexicographically. Clearly, this is a well-ordering on U. Then, use the deg-lex ordering on U .

Lemma 5

([29]) Take Φ = ( ϕ 1 , , ϕ n ) U . For 1 p n put

Φ p = i = 1 p ϕ i ,

where ϕ i ( w i , z i , w i , and z i , see below) stands for a lowercase symbol, and Φ p ( W p , Z p , W p , and Z p , see below) for the corresponding uppercase symbol. Take W = ( w 1 , w 2 , , w n ) and Z = ( z 1 , z 2 , , z n ) in U. Put W = ( w 1 , w 2 , , w n ) and Z = ( z 1 , z 2 , , z n ) , where

w 1 = 0 , w p = min ( Z p - 1 - W p - 1 , w p ) , z q = w q + z q - w q
(*)

for n p 2 and n q 1 .

Then W · Z = W · Z in P n = sgp X | T and W · Z is a Young tableau on X, which could have only one row, that is, Z = ( 0 , 0 , , 0 ) . Moreover,

P n = sgp X | T sgp U | Γ ,

where Γ = { W · Z = W · Z | W , Z V } .

We should emphasize that ( ) gives explicitly the product of two rows obtained by the Schensted row algorithm.

Jointly with our students Weiping Chen and Jing Li we proved [29], independently of Knuth’s normal form theorem [137], that Γ is a GS basis of the plactic monoid algebra in row generators with respect to the deg-lex ordering. In particular, this yields a new proof of Knuth’s theorem.

3.4.2 Plactic monoids in the column generators

Consider the plactic monoid P n = sgp X | T , where X = { 1 , 2 , , n } with 1 < 2 < < n . Every Young tableaux is a product of columns. For example,

4 , 556 · 223 , 357 · 1 , 112 , 444 = ( 421 ) ( 521 ) ( 531 ) ( 632 ) ( 54 ) ( 74 ) ( 4 )

is a Young tableau.

Given a column C X , denote by c i the number of occurrences of the letter i in C. Then c i { 0 , 1 } for i = 1 , 2 , , n . We write C = ( c 1 ; c 2 ; ; c n ) . For example, C = 6 , 421 = ( 1 ; 1 ; 0 ; 1 ; 0 ; 1 ; 0 ; ; 0 ) .

Put V = { C | C is a column in X } . For R = ( r 1 ; r 2 ; ; r n ) V define wt ( R ) = ( | R | , r 1 , , r n ) . Order V as follows: for R , S V , put R < S if and only if wt ( R ) > wt ( S ) lexicographically. Then, use the deg-lex ordering on V .

For Φ = ( ϕ 1 ; ; ϕ n ) V , put Φ p = i = 1 p ϕ i , 1 p n , where ϕ stands for some lowercase symbol defined above and Φ stands for the corresponding uppercase symbol.

Lemma 6

([29]) Take W = ( w 1 ; w 2 ; ; w n ) , Z = ( z 1 ; z 2 ; ; z n ) V . Define W = ( w 1 ; w 2 ; ; w n ) and Z = ( z 1 ; z 2 ; ; z n ) , where

z 1 = min ( w 1 , z 1 ) , z p = min ( W p - Z p - 1 , z p ) , w q = w q + z q - z q
(**)

for n p 2 and n q 1 . Then W , Z V and W · Z = W · Z in P n = sgp X | T , and W · Z is a Young tableau on X. Moreover,

P n = sgp X | T sgp V | Λ ,

where Λ = { W · Z = W · Z | W , Z V } .

Equation ( ) gives explicitly the product of two columns obtained by the Schensted column algorithm.

Jointly with our students Weiping Chen and Jing Li we proved [29], independently of Knuth’s normal form theorem [137], that Λ is a GS basis of the plactic monoid algebra in column generators with respect to the deg-lex ordering. In particular, this yields another new proof of Knuth’s theorem. Previously Cain, Gray, and Malheiro [69] established the same result using Knuth’s theorem, and they did not find Λ explicitly.

Remark All results of [29] are valid for every plactic monoid, not necessarily finitely generated.

4 Gröbner–Shirshov bases for Lie algebras

In this section we first give a different approach to the LS basis and the Hall basis of a free Lie algebra by using Shirshov’s CD-lemma for anti-commutative algebras. Then, using the LS basis, we construct the classical theory of GS bases for Lie algebras over a field. Finally, we mention GS bases for Lie algebras over a commutative algebra and give some applications.

4.1 Lyndon–Shirshov basis and Lyndon–Shirshov words in anti-commutative algebras

A linear space A equipped with a bilinear product x · y is called an anti-commutative algebra if it satisfies the identity x 2 = 0 , and so x · y = - y · x for every x , y A .

Take a well-ordered set X and denote by X the set of all non-associative words. Define three orderings l e x , > d e g - l e x , and > n - d e g - l e x (non-associative deg-lex) on X . For ( u ) , ( v ) X put

  • ( u ) = ( ( u 1 ) ( u 2 ) ) l e x ( v ) = ( ( v 1 ) ( v 2 ) ) (here ( u 2 ) or ( v 2 ) is empty when | ( u ) | = 1 or | ( v ) | = 1 ) iff one of the following holds:

    1. (a)

      u 1 u 2 > v 1 v 2 in the lex ordering;

    2. (b)

      u 1 u 2 = v 1 v 2 and ( u 1 ) l e x ( v 1 ) ;

    3. (c)

      u 1 u 2 = v 1 v 2 , ( u 1 ) = ( v 1 ) , and ( u 2 ) l e x ( v 2 ) ;

  • ( u ) = ( ( u 1 ) ( u 2 ) ) > d e g - l e x ( v ) = ( ( v 1 ) ( v 2 ) ) iff one of the following holds:

    1. (a)

      u 1 u 2 > v 1 v 2 in the deg-lex ordering;

    2. (b)

      u 1 u 2 = v 1 v 2 and ( u 1 ) > d e g - l e x ( v 1 ) ;

    3. (c)

      u 1 u 2 = v 1 v 2 , ( u 1 ) = ( v 1 ) , and ( u 2 ) > d e g - l e x ( v 2 ) ;

  • ( u ) > n - d e g - l e x ( v ) iff one of the following holds:

    1. (a)

      | ( u ) | > | ( v ) | ;

    2. (b)

      if | ( u ) | = | ( v ) | , ( u ) = ( ( u 1 ) ( u 2 ) ) , and ( v ) = ( ( v 1 ) ( v 2 ) ) then ( u 1 ) > n - d e g - l e x ( v 1 ) or ( ( u 1 ) = ( v 1 ) and ( u 2 ) > n - d e g - l e x ( v 2 ) ).

Define regular words ( u ) X by induction on | ( u ) | :

  1. (i)

    x i X is a regular word.

  2. (ii)

    ( u ) = ( ( u 1 ) ( u 2 ) ) is regular if both ( u 1 ) and ( u 2 ) are regular and ( u 1 ) l e x ( u 2 ) .

Denote ( u ) by [ u ] whenever ( u ) is regular.

The set N ( X ) of all regular words on X constitutes a linear basis of the free anti-commutative algebra A C ( X ) on X.

The following result gives an alternative approach to the definition of LS words as the radicals of associative supports u of the normal words [ u ] .

Theorem 29

([37]) Suppose that [ u ] is a regular word of the anti-commutative algebra A C ( X ) . Then u = v m , where v is a Lyndon–Shirshov word in X and m 1 . Moreover, the set of associative supports of the words in N ( X ) includes the set of all Lyndon–Shirshov words in X.

Fix an ordering > d e g - l e x on X and choose monic polynomials f and g in A C ( X ) . If there exist a , b X such that [ w ] = [ f ¯ ] = [ a [ g ¯ ] b ] then the inclusion composition of f and g is defined as ( f , g ) [ w ] = f - [ a [ g ] b ] .

A monic subset S of A C ( X ) is called a GS basis in A C ( X ) if every inclusion composition ( f , g ) [ w ] in S is trivial modulo ( S , [ w ] ) .

Theorem 30

(Shirshov’s CD-lemma for anti-commutative algebras, cf. [206]) Consider a nonempty set S A C ( X ) of monic polynomials with the ordering > d e g - l e x on X . The following statements are equivalent:

  1. (i)

    The set S is a Gröbner–Shirshov basis in A C ( X ) .

  2. (ii)

    If f I d ( S ) then [ f ¯ ] = [ a [ s ¯ ] b ] for some s S a n d a , b X , where [ a s b ] is a normal S-word.

  3. (iii)

    The set

    I r r ( S ) = { [ u ] N ( X ) | [ u ] [ a [ s ¯ ] b ] a , b X , s S a n d [ a s b ] i s a n o r m a l S - w o r d }

    is a linear basis of the algebra A C ( X | S ) = A C ( X ) / I d ( S ) .

Define the subset S 1 the free anti-commutative algebra A C ( X ) as

S 1 = { ( [ u ] [ v ] ) [ w ] - ( [ u ] [ w ] ) [ v ] - [ u ] ( [ v ] [ w ] ) | [ u ] , [ v ] , [ w ] N ( X ) and [ u ] l e x [ v ] l e x [ w ] } .

It is easy to prove that the free Lie algebra admits a presentation as an anti-commutative algebra: L i e ( X ) = A C ( X ) / I d ( S 1 ) .

The next result gives an alternating approach to the definition of the LS basis of a free Lie algebra L i e ( X ) as a set of irreducible non-associative words for an anti-commutative GS basis in A C ( X ) .

Theorem 31

([37]) Under the ordering > d e g - l e x , the subset S 1 of A C ( X ) is an anti-commutative Gröbner–Shirshov basis in A C ( X ) . Then I r r ( S 1 ) is the set of all non-associative LS words in X. So, the LS monomials constitute a linear basis of the free Lie algebra L i e ( X ) .

Theorem 32

([34]) Define S 2 by analogy with S 1 , but using > n - d e g - l e x instead of l e x . Then with the ordering > n - d e g - l e x the subset S 2 of A C ( X ) is also an anti-commutative GS basis. The set I r r ( S 2 ) amounts to the set of all Hall words in X and forms a linear basis of a free Lie algebra L i e ( X ) .

4.2 Composition-Diamond lemma for Lie algebras over a field

We start with some concepts and results from the literature concerning the theory of GS bases for the free Lie algebra L i e ( X ) generated by X over a field k.

Take a well-ordered set X = { x i | i I } with x i > x t whenever i > t , for all i , t I . Given u = x i 1 x i 2 x i m X , define the length (or degree) of u to be m and denote it by | u | = m or d e g ( u ) = m , put fir ( u ) = x i 1 , and introduce

x β = min ( u ) = min { x i 1 , x i 2 , , x i m } , X ( u ) = { x i j = x i x β x β j | i > β , j 0 } .

Order the new alphabet X ( u ) as follows:

x i 1 j 1 > x i 2 j 2 i 1 > i 2 or i 1 = i 2 and j 2 > j 1 .

Assuming that

u = x r 1 x β x β m 1 x r t x β x β m t ,

where r i > β , define the Shirshov elimination

u = x r 1 m 1 x r t m t ( X ( u ) ) .

We use two linear orderings on X :

  1. (i)

    the lex ordering (or lex-antideg ordering): 1 v if v 1 and, by induction, if u = x i u 1 and v = x j v 1 then u v if and only if x i > x j or x i = x j and u 1 v 1 ;

  2. (ii)

    the deg-lex ordering: u > v if | u | > | v | or | u | = | v | and u v .

Remark In commutative algebras, the lex ordering is understood to be the lex-deg ordering with the condition v > 1 for v 1 .

We cite some useful properties of ALSWs and NLSWs (see below) following Shirshov [203, 204, 207], see also [209]. Property (X) was given by Shirshov [204] and Chen et al. [72]. Property (VIII) was implicitly used in Shirshov [207], see also Chibrikov [94].

We regard L i e ( X ) as the Lie subalgebra of the free associative algebra k X generated by X with the Lie bracket [ u , v ] = u v - v u . Below we prove that L i e ( X ) is the free Lie algebra generated by X for every commutative ring k (Shirshov [203]). For a field, this follows from the PBW theorem because the free Lie algebra L i e ( X ) = L i e ( X | ) has the universal enveloping associative algebra k X = k X | .

Given f k X , denote by f ¯ the leading word of f with respect to the deg-lex ordering and write f = α f ¯ f ¯ - r f with α f ¯ k .

Definition 3

([156, 203]) Refer to w X \ { 1 } as an associative Lyndon–Shirshov word, or ALSW for short, whenever

( u , v X , u , v 1 ) w = u v w > v u .

Denote the set of all ALSWs on X by A L S W ( X ) .

Associative Lyndon–Shirshov words enjoy the following properties (Lyndon [156], Chen et al. [72], Shirshov [203, 204]).

(I) Put x β = min ( u v ) . If fir ( u ) x β and fir ( v ) x β then

u v ( in the lex ordering on X ) u v ( in the lex ordering on ( X u v ) ) .

(II) (Shirshov’s key property of ALSWs) A word u is an ALSW in X if and only if u is an ALSW in ( X ( u ) ) .

Properties (I) and (II) enable us to prove the properties of ALSWs and NLSWs (see below) by induction on length.

(III) (down-to-up bracketing) u A L S W ( X ) ( k ) | u ( k ) | ( X ( u ) ) ( k ) = 1 , where u ( k ) = ( u ) ( k - 1 ) and ( X ( u ) ) ( k ) = ( X ( u ) ) ( k - 1 ) . In the process u u u we use the algorithm of joining the minimal letters of u, u to the previous words.

(IV) If u , v A L S W ( X ) then u v A L S W ( X ) u v .

(V) w A L S W ( X ) (for every u , v X \ { 1 } and w = u v w v ).

(VI) If w A L S W ( X ) then an arbitrary proper prefix of w cannot be a suffix of w and w x β A L S W ( X ) if x β = min ( w ) .

(VII) (Shirshov’s factorization theorem) Every associative word w can be uniquely represented as w = c 1 c 2 c n , where c 1 , , c n A L S W ( X ) and c 1 c 2 c n .

Actually, if we apply to w the algorithm of joining the minimal letter to the previous word using the Lie product, w w w , then after finitely many steps we obtain w ( k ) = [ c 1 ] [ c 2 ] [ c n ] , with c 1 c 2 c n , and w = c 1 c 2 c n would be the required factorization (see an example in the Introduction).

(VIII) If an associative word w is represented as in (VII) and v is a LS subword of w then v is a subword of one of the words c 1 , c 2 , , c n .

(IX) If u 1 u 2 and u 2 u 3 are ALSWs then so is u 1 u 2 u 3 provided that u 2 1 .

(X) If w = u v is an ALSW and v is its longest proper ALSW ending, then u is an ALSW as well (Chen et al. [72], Shirshov [204]).

Definition 4

(down-to-up bracketing of ALSW, Shirshov [203]) For an ALSW w, there is the down-to-up bracketing w w w w ( k ) = [ w ] , where each time we join the minimal letter of the previous word using Lie multiplication. To be more precise, we use the induction [ w ] = [ w ] x i j [ [ x i x β ] x β ] .

Definition 5

(up-to-down bracketing of ALSW, Shirshov [204], Chen et al. [72]) For an ALSW w, we define the up-to-down Lie bracketing [ [ w ] ] by the induction [ [ w ] ] = [ [ [ u ] ] [ [ v ] ] ] , where w = u v as in (X).

(XI) If w A L S W ( X ) then [ w ] = [ [ w ] ] .

(XII) Shirshov’s definition of a NLSW (non-associative LS word) ( w ) below is the same as [ w ] and [ [ w ] ] ; that is, ( w ) = [ w ] = [ [ w ] ] . Chen et al. [72] used [ [ w ] ] .

Definition 6

(Shirshov[203]) A non-associative word ( w ) in X is a NLSW if

  1. (i)

    w is an ALSW;

  2. (ii)

    if ( w ) = ( ( u ) ( v ) ) then both ( u ) and ( v ) are NLSWs (then (IV) implies that u v );

  3. (iii)

    if ( w ) = ( ( ( u 1 ) ( u 2 ) ) ( v ) ) then u 2 v .

Denote the set of all NLSWs on X by N L S W ( X ) .

(XIII) If u A L S W ( X ) and [ u ] N L S W ( X ) then [ u ] ¯ = u in k X .

(XIV) The set N L S W ( X ) is linearly independent in L i e ( X ) k X for every commutative ring k.

(XV) N L S W ( X ) is a set of linear generators in every Lie algebra generated by X over an arbitrary commutative ring k.

(XVI) L i e ( X ) k X is the free Lie algebra over the commutative ring k with the k-basis N L S W ( X ) .

(XVII) (Shirshov’s special bracketing [203]) Consider w = a u b with w , u A L S W ( X ) . Then

  1. (i)

    [ w ] = [ a [ u c ] d ] , where b = c d and possibly c = 1 .

  2. (ii)

    Express c in the form c = c 1 c 2 c n , where c 1 , , c n A L S W ( X ) and c 1 c 2 c n . Replacing [ u c ] by [ [ [ u ] [ c 1 ] ] [ c n ] ] , we obtain the word

    [ w ] u = [ a [ [ [ [ u ] [ c 1 ] ] [ c 2 ] ] [ c n ] ] d ]

    which is called the Shirshov special bracketing of w relative to u.

  3. (iii)

    [ w ] u = a [ u ] b + i α i a i [ u ] b i in k X with α i k and a i , b i X satisfying a i u b i < a u b , and hence [ w ] ¯ u = w .

Outline of the proof. Put x β = min ( w ) . Then w = a ( u x β m ) ( b 1 ) in ( X ( w ) ) , where b = x β m b 1 and u x β m is an ALSW. Claim (i) follows from (II) by induction on length. The same applies to claim (iii).

(XVIII) (Shirshov’s Lie elimination of the leading word) Take two monic Lie polynomials f and s with f ¯ = a s ¯ b for some a , b X . Then f 1 = f - [ a s b ] s ¯ is a Lie polynomial with smaller leading word, and so f ¯ 1 < f ¯ .

(XIX) (Shirshov’s double special bracketing) Assume that w = a u b v c with w , u , v A L S W ( X ) . Then there exists a bracketing [ w ] u , v such that [ w ] u , v = [ a [ u ] b [ v ] c ] u , v and [ w ] u , v ¯ = w .

More precisely, [ w ] u , v = [ a [ u p ] u q [ v r ] v s ] if [ w ] = [ a [ u p ] q [ v r ] s ] , and

[ w ] u , v = [ a [ [ [ [ u ] [ c 1 ] ] [ c i ] v ] [ c n ] ] p ]

if [ w ] = [ a [ u c ] p ] , where c = c 1 c n is the Shirshov factorization of c and v is a subword of c i . In both cases [ w ] u , v = a [ u ] b [ v ] d + α i a i [ u ] b i [ v ] d i in k X , where a i u b i v d i < w .

(XX) (Shirshov’s algorithm for recognizing Lie polynomials, cf. the Dynkin–Specht–Wever and Friedrich algorithms). Take s L i e ( X ) k X . Then s ¯ is an ALSW and s 1 = s - α s ¯ [ s ¯ ] is a Lie polynomial with a smaller maximal word (in the deg-lex ordering), s ¯ 1 < s ¯ , where s = α s ¯ [ s ¯ ] + . Then s 2 = s 1 - α s ¯ 1 [ s ¯ 1 ] , s 2 ¯ < s 1 ¯ . Consequently, s L i e ( X ) if and only if after finitely many steps we obtain

s m + 1 = s - α s ¯ [ s ¯ ] - α s ¯ 1 [ s ¯ 1 ] - - α s ¯ m [ s ¯ m ] = 0 .

Here k can be an arbitrary commutative ring.

Definition 7

Consider S L i e ( X ) with all s S monic. Take a , b X and s S . If a s ¯ b is an ALSW then we call [ a s b ] s ¯ = [ a s ¯ b ] s ¯ | [ s ¯ ] s a special normal S-word (or a special normal s-word), where [ a s ¯ b ] s ¯ is defined in (XVII) (ii). A Lie S-word ( a s b ) is called a normal S-word whenever ( a s b ) ¯ = a s ¯ b . Every special normal s-word is a normal s-word by (XVII) (iii).

For f , g S there are two kinds of Lie compositions:

  1. (i)

    If w = f ¯ = a g ¯ b for some a , b X then the polynomial f , g w = f - [ a g b ] g ¯ is called the inclusion composition of f and g with respect to w.

  2. (ii)

    If w is a word satisfying w = f ¯ b = a g ¯ for some a , b X with d e g ( f ¯ ) + d e g ( g ¯ ) > d e g ( w ) then the polynomial f , g w = [ f b ] f ¯ - [ a g ] g ¯ is called the intersection composition of f and g with respect to w, and w is an ALSW by (IX).

Given a Lie polynomial h and w X , say that h is trivial modulo ( S , w ) and write h L i e 0 m o d ( S , w ) whenever h = i α i ( a i s i b i ) , where each α i k , ( a i s i b i ) is a normal S-word and a i s i ¯ b i < w .

A set S is called a GS basis in L i e ( X ) if every composition ( f , g ) w of polynomials f and g in S is trivial modulo S and w.

(XXI) If s L i e ( X ) is monic and ( a s b ) is a normal S-word then ( a s b ) = a s b + i α i a i s b i , where a i s ¯ b i < a s ¯ b .

A proof of (XXI) follows from the CD-lemma for associative algebras since { s } is an associative GS basis by (IV).

(XXII) Given two monic Lie polynomials f and g, we have

f , g w - ( f , g ) w a s s 0 mod ( { f , g } , w ) .

Proof

If f , g w and ( f , g ) w are intersection compositions, where w = f ¯ b = a g ¯ , then (XIII) and (XVII) yield

f , g w = [ f b ] f ¯ - [ a g ] g ¯ = f b + I 1 α i a i f b i - a g - I 2 β j a j g b j ,

where a i f ¯ b i , a j g ¯ b j < f ¯ b = a g ¯ = w . Hence,

f , g w - ( f , g ) w a s s 0 mod ( { f , g } , w ) .

In the case of inclusion compositions we arrive at the same conclusion.

Theorem 33

(PBW Theorem in Shirshov’s form [56, 57], see Theorem 17) A nonempty set S L i e ( X ) k X of monic Lie polynomials is a Gröbner–Shirshov basis in L i e ( X ) if and only if S is a Gröbner–Shirshov basis in k X .

Proof

Observe that, by definition, for any f , g S the composition lies in L i e ( X ) if and only if it lies k X .

Assume that S is a GS basis in L i e ( X ) . Then we can express every composition f , g w as f , g w = I 1 α i ( a i s i b i ) , where ( a i s i b i ) are normal S-words and a i s i ¯ b i < w . By (XXI), we have f , g w = I 2 β j c j s j d j with c j s j ¯ d j < w . Therefore, (XXII) yields ( f , g ) w a s s 0 mod ( S , w ) . Thus, S is a GS basis in k X .

Conversely, assume that S is a GS basis in k X . Then the CD-lemma for associative algebras implies that f , g w ¯ = a s ¯ b < w for some a , b X and s S . Then h = f , g w - α [ a s b ] s ¯ I d a s s ( S ) is a Lie polynomial and h ¯ < f , g w ¯ . Induction on f , g w ¯ yields f , g w L i e 0 mod ( S , w ) .

Theorem 34

(The CD-lemma for Lie algebras over a field) Consider a nonempty set S L i e ( X ) k X of monic Lie polynomials and denote by I d ( S ) the ideal of L i e ( X ) generated by S. The following statements are equivalent:

  1. (i)

    The set S is a Gröbner–Shirshov basis in L i e ( X ) .

  2. (ii)

    If f I d ( S ) then f ¯ = a s ¯ b for some s S and a , b X .

  3. (iii)

    The set

    I r r ( S ) = { [ u ] N L S W ( X ) | u a s ¯ b , s S , a , b X }

    is a linear basis for L i e ( X | S ) .

Proof

(i)(ii). Denote by I d a s s ( S ) and I d L i e ( S ) the ideals of k X and L i e ( X ) generated by S respectively. Since I d L i e ( S ) I d a s s ( S ) , Theorem 33 and the CD-lemma for associative algebras imply the claim.

(ii)(iii). Suppose that α i [ u i ] = 0 in L i e ( X | S ) with [ u i ] I r r ( S ) and u 1 > u 2 > , that is, α i [ u i ] I d L i e ( S ) . Then all α i must vanish. Otherwise we may assume that α 1 0 . Then α i [ u i ] ¯ = u 1 and (ii) implies that [ u 1 ] I r r ( S ) , which is a contradiction. On the other hand, by the next property (XXIII), I r r ( S ) generates L i e ( X | S ) as a linear space.

(iii)(i). This part follows from (XXIII).

The next property is similar to Lemma 2.

(XXIII) Given S L i e ( X ) , we can express every f L i e ( X ) as

f = α i [ u i ] + β j [ a j s j b j ] s j ¯

with α i , β j k , [ u i ] I r r ( S ) satisfying [ u i ] ¯ f ¯ , and [ a j s j b j ] s j ¯ are special normal S-word satisfying [ a j s j b j ] s j ¯ ¯ f ¯ .

(XXIV) Given a normal s-word ( a s b ) , take w = a s ¯ b . Then ( a s b ) [ a s b ] s ¯ mod ( s , w ) . It follows that h L i e 0 mod ( S , w ) is equivalent to h = i α i [ a i s i b i ] s i ¯ , where [ a i s i b i ] s i ¯ are special normal S-words with a i s i ¯ b i < w .

Proof

Observe that for every monic Lie polynomial s, the set { s } is a GS basis in L i e ( X ) . Then (XVIII) and the CD-lemma for Lie algebras yield ( a s b ) [ a s b ] s ¯ mod ( s , w ) .

Summary of the proof of Theorem 34.

Given two ALSWs u and v, define the ALSW- lcm ( u , v ) (or lcm ( u , v ) for short) as follows:

w = lcm ( u , v ) { a u c v b (an ALSW) , a , b , c X (a trivial lcm ) ; u = a v b , a , b X (an inclusion lcm ) ; u b = a v , a , b X , d e g ( u b ) < d e g ( u ) + d e g ( v ) (an intersection lcm ) } .

Denote by [ w ] u , v the Shirshov double special bracketing of w in the case that w is a trivial lcm ( u , v ) , by [ w ] u and [ w ] v the Shrishov special bracketings of w if w is an inclusion or intersection lcm respectively. Then we can define a general Lie composition for monic Lie polynomials f and g with f ¯ = u and g ¯ = v as

( f , g ) w = [ w ] u , v | [ u ] f - [ w ] u , v | [ v ] g

if w is a trivial lcm ( u , v ) (it is 0 mod ( { f , g } , w ) ), and

( f , g ) w = [ w ] u | [ u ] f - [ w ] v | [ v ] g

if w is an inclusion or intersection lcm ( u , v ) .

If S L i e ( X ) k X is a Lie GS basis then S is an associative GS basis. This follows from property (XVII) (iii) and justifies the claim (i)(ii) of Theorem 34.

Shirshov’s original proof of (i)(ii) in Theorem 34, (see [207, 209]), rests on an analogue of Lemma 1 for Lie algebras.

Lemma 7

([207, 209]) If ( a 1 s 1 b 1 ) , ( a 2 s 2 b 2 ) are normal S-words with equal leading associative words, w = a 1 s 1 ¯ b 1 = a 2 s 2 ¯ b 2 , then they are equal mod ( S , w ) , that is, ( a 1 s 1 b 1 ) - ( a 2 s 2 b 2 ) 0 mod ( S , w ) .

Outline of the proof. We have w 1 = c w d and w = lcm ( s 1 ¯ , s 2 ¯ ) . Shirshov’s (double) special bracketing lemma yields

[ w 1 ] w = [ c [ [ w ] d 1 ] d 2 ] = c [ w ] d + α i a i [ w ] b i

with a i w b i < w 1 . The ALSW w includes u = s 1 ¯ and v = s 2 ¯ as subwords, and so there is a bracketing { w } { [ w ] u , v , [ w ] u , [ w ] v } such that

[ a 1 s 1 b 1 ] = [ c { w } | [ u ] s 1 d ] , [ a 2 s 2 b 2 ] = [ c { w } | [ v ] s 2 d ]

are normal s 1 - and s 2 - words with the same leading associative word w 1 . Then

[ a 1 s 1 b 1 ] - [ a 2 s 2 b 2 ] = [ c ( s 1 , s 2 ) w d ] 0 mod ( S , w 1 ) .

Now it is enough to prove that two normal Lie s-words with the same leading associative words, say w 1 , are equal mod ( s , w 1 ) :

f = ( a s b ) - [ a s b ] L i e 0 mod ( s , w 1 ) provided that f ¯ < w 1 .

Since f I d a s s ( s ) , we have f ¯ = c 1 s ¯ d 1 by the CD-lemma for associative algebras with one Lie polynomial relation s. Then f - α [ c 1 s d 1 ] s ¯ is a Lie polynomial with the leading associative word smaller than w 1 . Induction on w 1 finishes the proof.

4.2.1 Gröbner–Shirshov basis for the Drinfeld–Kohno Lie algebra

In this section we give a GS basis for the Drinfeld–Kohno Lie algebra L n .

Definition 8

Fix an integer n > 2 . The Drinfeld–Kohno Lie algebra L n over Z is defined by generators t i j = t j i for distinct indices 1 i , j n - 1 satisfying the relations [ t i j t k l ] = 0 and [ t i j ( t i k + t j k ) ] = 0 for distinct i, j, k, and l.

Therefore, we have the presentation L n = L i e Z ( T | S ) , where T = { t i j | 1 i < j n - 1 } and S consists of the following relations:

[ t i j t k l ] = 0 if k < i < j , k < l , l i , j ;
(18)
[ t j k t i j ] + [ t i k t i j ] = 0 if i < j < k ;
(19)
[ t j k t i k ] - [ t i k t i j ] = 0 if i < j < k .
(20)

Order T by setting t i j < t k l if either i < k or i = k and j < l . Let < be the deg-lex ordering on T .

Theorem 35

([80]) With S = {(18), (19), (20)} as before and the deg-lex ordering < on T , the set S is a Gröbner–Shirshov basis of L n .

Corollary 15

The Drinfeld–Kohno Lie algebra L n is a free Z-module with Z-basis i = 1 n - 2 N L S W ( T i ) , where T i = { t i j | i < j n - 1 } for i = 1 , , n - 2 .

Corollary 16

([100]) The Drinfeld–Kohno Lie algebra L n is an iterated semidirect product of free Lie algebras A i generated by T i = { t i j | i < j n - 1 } , for i = 1 , , n - 2 .

4.2.2 Kukin’s example of a Lie algebra with undecidable word problem

Markov [161], Post [182], Turing [211], Novikov [173], and Boone [60] constructed finitely presented semigroups and groups with undecidable word problem. For groups this also follows from Higman’s theorem [115] asserting that every recursively presented group embeds into a finitely presented group. A weak analogue of Higman’s theorem for Lie algebras was proved in [21], which was enough for the existence of a finitely presented Lie algebra with undecidable word problem. In this section we give Kukin’s construction [142] of a Lie algebra A P for every semigroup P such that if P has undecidable word problem then so does A P .

Given a semigroup P = sgp x , y | u i = v i , i I , consider the Lie algebra

A P = L i e ( x , x ^ , y , y ^ , z | S )

with S consisting of the relations

  1. (1)

    [ x ^ x ] = 0 , [ x ^ y ] = 0 , [ y ^ x ] = 0 , [ y ^ y ] = 0 ;

  2. (2)

    [ x ^ z ] = - [ z x ] , [ y ^ z ] = - [ z y ] ;

  3. (3)

    z u i = z v i , i I .

Here, z u stands for the left normed bracketing.

Put x ^ > y ^ > z > x > y and denote by > the deg-lex ordering on the set { x ^ , y ^ , x , y , z } . Denote by ρ the congruence on { x , y } generated by { ( u i , v i ) , i I } . Put

  • ( 3 ) z u = z v , ( u , v ) ρ with u > v .

Lemma 8

([80]) In this notation, the set S 1 = { ( 1 ) , ( 2 ) , ( 3 ) } is a GS basis in L i e ( x ^ , y ^ , x , y , z ) .

Proof

For every u { x , y } , we can show that z u ¯ = z u by induction on | u | . All possible compositions in S 1 are the intersection compositions of (2) and ( 3 ) , and the inclusion compositions of ( 3 ) and ( 3 ) .

For ( 2 ) ( 3 ) , we take f = [ x ^ z ] + [ z x ] and g = z u - z v . Therefore, w = x ^ z u with ( u , v ) ρ and u > v . We have

[ x ^ z ] + [ z x ] , z u - z v w = [ f u ] f ¯ - [ x ^ g ] g ¯ ( [ x ^ z ] + [ z x ] ) u - [ x ^ ( z u - z v ) ] z x u + x ^ z v z x u - z x v 0 mod ( S 1 , w ) .

For ( 3 ) ( 3 ) , we use w = z u 1 = z u 2 e , where e { x , y } and ( u i , v i ) ρ with u i > v i for i = 1 , 2 . We have

z u 1 - z v 1 , z u 2 - z v 2 w ( z u 1 - z v 1 ) - ( z u 2 - z v 2 ) e z v 2 e - z v 1 z v 2 e - z v 1 0 mod ( S 1 , w ) .

Thus, S 1 = { ( 1 ) , ( 2 ) , ( 3 ) } is a GS basis in L i e ( x ^ , y ^ , x , y , z ) .

Corollary 17

(Kukin [142]) For u , v { x , y } we have

u = v i n t h e s e m i g r o u p P z u = z v i n t h e L i e a l g e b r a A P .

Proof

Assume that u = v in the semigroup P. Without loss of generality we may assume that u = a u 1 b and v = a v 1 b for some a , b { x , y } and ( u 1 , v 1 ) ρ . For every r { x , y } relations (1) yield [ x ^ r ] = 0 ; consequently, z x c = [ z x ^ ] c = [ z c x ^ ] and z y c = [ z c y ^ ] for every c { x , y } . This implies that in A P we have

z u = z a u 1 b = z a u 1 b = z u 1 a ^ b = z u 1 a ^ b = z v 1 a ^ b = z a v 1 b = z v ,

where for every x i 1 x i 2 x i n { x , y } we put

x i 1 x i 2 x i n : = x i n x i n - 1 x i 1 , x i 1 x i 2 x i n ^ : = x i 1 ^ x i 2 ^ x i n ^ .

Moreover, ( 3 ) holds in A P .

Suppose that z u = z v in the Lie algebra A P . Then both z u and z v have the same normal form in A P . Since S 1 is a GS basis in A P , we can reduce both z u and z v to the same normal form z c for some c { x , y } using only relations ( 3 ) . This implies that u = c = v in P.

By the corollary, if the semigroup P has undecidable word problem then so does the Lie algebra A P .

4.3 Composition-Diamond lemma for Lie algebras over commutative algebras

For a well-ordered set X = { x i | i I } , consider the free Lie algebra L i e ( X ) k X with the Lie bracket [ u , v ] = u v - v u .

Given a well-ordered set Y = { y j | j J } , the free commutative monoid [ Y ] generated by Y is a linear basis of k [ Y ] . Regard

L i e k [ Y ] ( X ) k [ Y ] L i e ( X )

as a Lie subalgebra of the free associative algebra k [ Y ] X k [ Y ] k X generated by X over the polynomial algebra k [ Y ] , equipped with the Lie bracket [ u , v ] = u v - v u . Then N L S W ( X ) constitutes a k [ Y ] -basis of L i e k [ Y ] ( X ) . Put [ Y ] X = { β t | β [ Y ] , t X } . For u = β t [ Y ] X , put u X = t and u Y = β .

Denote the deg-lex orderings on [ Y ] and X by > Y and > X . Define an ordering > on [ Y ] X as follows: for u , v [ Y ] X , put

u > v if ( u X > X v X ) or ( u X = v X and u Y > Y v Y ) .

We can express every element f L i e k [ Y ] ( X ) as f = α i β i [ u i ] , where α i k , β i [ Y ] , and [ u i ] N S L W ( X ) .

Then f = α i β i [ u i ] = g j ( Y ) [ u j ] , where g j ( Y ) k [ Y ] are polynomials in the k-algebra k [ Y ] X . The leading word f ¯ of f in k [ Y ] X is of the form β 1 u 1 with β 1 [ Y ] and u 1 A L S W ( X ) . The polynomial f is called monic (or k-monic) if the coefficient of f ¯ is equal to 1, that is, α 1 = 1 . The notion of k [ Y ] -monic polynomials is introduced similarly: α 1 = 1 and β 1 = 1 .

Recall that every ALSW w admits a unique bracketing such that [ w ] is a NLSW.

Consider a monic subset S L i e k [ Y ] ( X ) . Given a non-associative word ( u ) on X with a fixed occurrence of some x i and s S , call ( u ) x i s an S-word. Define | u | to be the s-length of ( u ) x i s . Every S-word is of the form ( a s b ) with a , b X and s S . If a s ¯ X b A L S W ( X ) then we have the special bracketing [ a s ¯ X b ] s ¯ X of a s ¯ X b relative to s ¯ X . Refer to [ a s b ] s ¯ = [ a s ¯ X b ] s ¯ X | [ s ¯ X ] s as a special normal s-word (or special normal S-word).

An S-word ( u ) = ( a s b ) is a normal s-word, denoted by u s , whenever ( a s b ) ¯ X = a s ¯ X b . The following condition is sufficient.

  1. (i)

    The s-length of ( u ) is 1, that is, ( u ) = s ;

  2. (ii)

    if u s is a normal S-word of s-length k and [ v ] N L S W ( X ) satisfies | v | = l then [ v ] u s whenever v > u ¯ s X and u s [ v ] whenever v < u ¯ s X are normal S-words of s-length k + l .

Take two monic polynomials f and g in L i e k [ Y ] ( X ) and put L = lcm ( f ¯ Y , g ¯ Y ) .

There are four kinds of compositions.

  • C 1 : Inclusion composition. If f ¯ X = a g ¯ X b for some a , b X , then

    C 1 f , g w = L f ¯ Y f - L g ¯ Y [ a g b ] g ¯ , where w = L f ¯ X = L a g ¯ X b .
  • C 2 : Intersection composition. If f ¯ X = a a 0 and g ¯ X = a 0 b with a , b , a 0 1 then

    C 2 f , g w = L f ¯ Y [ f b ] f ¯ - L g ¯ Y [ a g ] g ¯ , where w = L f ¯ X b = L a g ¯ X .
  • C 3 : External composition. If g c d ( f ¯ Y , g ¯ Y ) 1 then for all a , b , c X satisfying

    w = L a f ¯ X b g ¯ X c T A = { β t | β [ Y ] , t A L S W ( X ) }

    we have

    C 3 f , g w = L f ¯ Y [ a f b g ¯ X c ] f ¯ - L g ¯ Y [ a f ¯ X b g c ] g ¯ .
  • C 4 : Multiplication composition. If f ¯ Y 1 then for every special normal f-word [ a f b ] f ¯ with a , b X we have

    C 4 f w = [ a f ¯ X b ] [ a f b ] f ¯ , where w = a f ¯ X b a f ¯ b .

Given a k-monic subset S L i e k [ Y ] ( X ) and w [ Y ] X , which is not necessarily in T A , an element h L i e k [ Y ] ( X ) is called trivial modulo ( S , w ) if h can be expressed as a k [ Y ] -linear combination of normal S-words with leading words smaller than w. The set S is a Gröbner–Shirshov basis in L i e k [ Y ] ( X ) if all possible compositions in S are trivial.

Theorem 36

([31], the CD-lemma for Lie algebras over commutative algebras) Consider a nonempty set S L i e k [ Y ] ( X ) of monic polynomials and denote by I d ( S ) the ideal of L i e k [ Y ] ( X ) generated by S. The following statements are equivalent:

  1. (i)

    The set S is a Gröbner–Shirshov basis in L i e k [ Y ] ( X ) .

  2. (ii)

    If f I d ( S ) then f ¯ = a s ¯ b T A for some s S and a , b [ Y ] X .

  3. (iii)

    The set I r r ( S ) = { [ u ] | [ u ] T N , u a s ¯ b , for s S and a , b [ Y ] X } is a linear basis for L i e k [ Y ] ( X | S ) = ( L i e k [ Y ] ( X ) ) / I d ( S ) .

Here T A = { β t | β [ Y ] , t A L S W ( X ) } and T N = { β [ t ] | β [ Y ] , [ t ] N L S W ( X ) } .

Outline of the proof.

Take u , v [ Y ] A L S W ( X ) and write u = u Y u X and v = v Y v X . Define the ALSW- lcm ( u , v ) (or lcm ( u , v ) for short) as w = w Y w X = lcm ( u Y , v Y ) lcm ( u X , v X ) , where

lcm ( u X , v X ) { a u X c v X b ( a n A L S W ) , a , b , c X ; u X = a v X b , a , b X ; u X b = a v X , a , b X , d e g ( u X b ) < d e g ( u X ) + d e g ( v X ) } .

Six lcm ( u , v ) are possible:

  1. (i)

    (Y-trivial, X-trivial) (a trivial lcm ( u , v ) ) ;

  2. (ii)

    (Y-trivial, X-inclusion);

  3. (iii)

    (Y-trivial, X-intersection);

  4. (iv)

    (Y-nontrivial, X-trivial);

  5. (v)

    (Y-nontrivial, X-inclusion);

  6. (vi)

    (Y-nontrivial, X-intersection).

In accordance with lcm ( u , v ) , six general compositions are possible.

Denote by [ w X ] u X , v X the Shirshov double special bracketing of w X whenever w X is a X-trivial lcm ( u X , v X ) , by [ w X ] u X and [ w X ] v X the Shirshov special bracketings of w X whenever w X is a lcm of X-inclusion or X-intersection respectively.

Define general Lie compositions for k-monic Lie polynomials f and g with f ¯ = u and g ¯ = v as

( f , g ) w = ( lcm ( u Y , v Y ) / u Y ) [ w X ] u X , v X | [ u ] f - ( lcm ( u Y , v Y ) / v Y ) [ w X ] u X , v X | [ v ] g , ( f , g ) w = ( lcm ( u Y , v Y ) / u Y ) [ w X ] u | [ u ] f - ( lcm ( u Y , v Y ) / v Y ) [ w X ] v | [ v ] g .

Lemma 9

([31]) The general composition ( f , g ) w of k-monic Lie polynomials f and g with f ¯ = u and g ¯ = v , where w is a (Y-trivial, X-trivial) lcm ( u , v ) , is 0 mod ( { f , g } , w ) .

Proof

By (XIX), we have

( f , g ) w = v Y [ a f b [ v X ] d ] - u Y [ a [ u X ] b g d ] = [ a f b [ v ] d ] - [ a u b g d ] = [ a f b ( [ v ] - g ) d ] - [ a ( [ u ] - f ) b g d ] 0 mod ( { f , g } , w ) .

The proof is complete.

A Lie GS basis S L i e k [ Y ] ( X ) k [ Y ] X need not be an associative GS basis because the PBW-theorem is not valid for Lie algebras over a commutative algebra (Shirshov [201]). Therefore, the argument for L i e k ( X ) above (see Sect. 4.2) fails for L i e k [ Y ] ( X ) .

Moreover, Shirshov’s original proof of the CD-lemma fails because the singleton { s } L i e k [ Y ] ( X ) is not a GS basis in general. The reason is that there exists a nontrivial composition ( s , s ) w of type (Y-nontrivial, X-trivial).

There is another obstacle. For L i e k ( X ) , every s-word is a linear combination of normal s-words. For L i e k [ Y ] ( X ) this is not the case. Hence, we must use a multiplication composition [ u X ] f such that f ¯ = u = u Y u X .

Lemma 10

([31]) If every multiplication composition [ s ¯ X ] s , s S , is trivial modulo ( S , w = [ u X ] u ) , where u = s ¯ , then every S-word is a linear combination of normal S-words.

In our paper with Yongshan Chen [31], we use the following definition of triviality of a polynomial f modulo ( S , w ) :

f 0 mod ( S , w ) f = α i e i Y [ a i X s i b i X ] ,

where [ a i X [ s i ¯ X ] b i X ] is the Shirshov special bracketing of the ALSW a i X s i ¯ X b i X with an ALSW s i ¯ X .

The previous definition of triviality modulo ( S , w ) is equivalent to the usual definition by Lemma 11, which is key in the proof of the CD-lemma for Lie algebras over a commutative algebra.

Lemma 11

([31]) Given a monic set S with trivial multiplication compositions, take a normal s-word ( a s b ) and a special normal s-word [ a s b ] with the same leading monomial w = a s ¯ b . Then they are equal modulo ( s , w ) .

Lemmas 10 and 11 imply

Lemma 12

([31]) Given a monic set S with trivial multiplication compositions, every element of the ideal generated by S is a linear combination of special normal S-words.

On the other hand, (XVII) and (XIX) imply the following analogue of Lemma 1 for L i e k [ Y ] ( X ) .

Lemma 13

([31]) Given two k-monic special normal S-words e 1 Y [ a 1 X s 1 b 1 X ] and e 2 Y [ a 2 X s 2 b 2 X ] with the same leading associative word w 1 , their difference is equal to [ a ( s 1 , s 2 ) w b ] , where w = lcm ( s 1 ¯ , s 2 ¯ ) , w 1 = a w b , and [ a ( s 1 , s 2 ) w b ] = [ w 1 ] w | [ w ] ( s 1 , s 2 ) w . Hence, if S is a GS basis then the previous special normal S-words are equal modulo ( S , w 1 ) .

Now the claim (i)(ii) of the CD-lemma for L i e k [ Y ] ( X ) follows.

For every Lie algebra L = L i e K ( X | S ) over the commutative algebra K = k [ Y | R ] ,

U ( L ) = K X | S ( - ) = k [ Y ] X | S ( - ) , R X ,

where S ( - ) is just S with all commutators [ u v ] 4 replaced with u v - v u , is the universal enveloping associative algebra of L.

A Lie algebra L over a commutative algebra K is called special whenever it embeds into its universal enveloping associative algebra. Otherwise it is called non-special.

Shirshov (1953) and Cartier (1958) gave classical examples of non-special Lie algebras over commutative algebras over G F ( 2 ) , justified using ad hoc methods. Cohn (1963) suggested another non-special Lie algebra over a commutative algebra over a field of positive characteristic.

Example 1

(Shirshov (1953)) Take k = G F ( 2 ) and

K = k [ y i , i = 0 , 1 , 2 , 3 | y 0 y i = y i ( i = 0 , 1 , 2 , 3 ) , y i y j = 0 ( i , j 0 ) ] .

Consider L = L i e K ( x i , 1 i 13 | S 1 , S 2 ) , where

S 1 = { [ x 2 x 1 ] = x 11 , [ x 3 x 1 ] = x 13 , [ x 3 x 2 ] = x 12 , [ x 5 x 3 ] = [ x 6 x 2 ] = [ x 8 x 1 ] = x 10 , [ x i x j ] = 0 ( i > j ) } ; S 2 = { y 0 x i = x i ( i = 1 , 2 , , 13 ) , y 1 x 1 = x 4 , y 1 x 2 = x 5 , y 1 x 3 = x 6 , y 1 x 12 = x 10 , y 2 x 1 = x 5 , y 2 x 2 = x 7 , y 2 x 3 = x 8 , y 2 x 13 = x 10 , y 3 x 1 = x 6 , y 3 x 2 = x 8 , y 3 x 3 = x 9 , y 3 x 11 = x 10 , y 1 x k = 0 ( k = 4 , 5 , , 11 , 13 ) , y 2 x t = 0 ( t = 4 , 5 , , 12 ) , y 3 x l = 0 ( l = 4 , 5 , , 10 , 12 , 13 ) } .

Then L = L i e K ( X | S 1 , S 2 ) = L i e k [ Y ] ( X | S 1 , S 2 , R X ) and

S = S 1 S 2 R X { y 1 x 2 = y 2 x 1 , y 1 x 3 = y 3 x 1 , y 2 x 3 = y 3 x 2 }

is a GS basis in L i e k [ Y ] ( X ) , which implies that x 10 belongs to the linear basis of L by Theorem 36, that is, x 10 0 in L.

On the other hand, the universal enveloping algebra of L has the presentation

U K ( L ) = K X | S 1 ( - ) , S 2 k [ Y ] X | S 1 ( - ) , S 2 , R X .

However, the GS completion (see Mikhalev and Zolotykh [170]) of S 1 ( - ) S 2 R X in k [ Y ] X is

S C = S 1 ( - ) S 2 R X { y 1 x 2 = y 2 x 1 , y 1 x 3 = y 3 x 1 , y 2 x 3 = y 3 x 2 , x 10 = 0 } .

Thus, L is not special.

Example 2

(Cartier [70]) Take k = G F ( 2 ) and

K = k [ y 1 , y 2 , y 3 | y i 2 = 0 , i = 1 , 2 , 3 ] .

Consider L = L i e K ( x i j , 1 i j 3 | S ) , where

S = { [ x i i x j j ] = x j i ( i > j ) , [ x i j x k l ] = 0 , y 3 x 33 = y 2 x 22 + y 1 x 11 } .

Then L is not special over K.

Proof

The set S = S { y i 2 x k l = 0 ( i , k , l ) } S 1 is a GS basis in L i e k [ Y ] ( X ) , where

S 1 = { y 3 x 23 = y 1 x 12 , y 3 x 13 = y 2 x 12 , y 2 x 23 = y 1 x 13 , y 3 y 2 x 22 = y 3 y 1 x 11 , y 3 y 1 x 12 = 0 , y 3 y 2 x 12 = 0 , y 3 y 2 y 1 x 11 = 0 , y 2 y 1 x 13 = 0 } .

Then, y 2 y 1 x 12 I r r ( S ) and so y 2 y 1 x 12 0 in L.

However, in

U K ( L ) = K X | S ( - ) k [ Y ] X | S ( - ) , y i 2 x k l = 0 ( i , k , l )

we have

0 = y 3 2 x 33 2 = ( y 2 x 22 + y 1 x 11 ) 2 = y 2 2 x 22 2 + y 1 2 x 11 2 + y 2 y 1 [ x 22 , x 11 ] = y 2 y 1 x 12 .

Thus, L ↪̸ U K ( L ) .

Conjecture (Cohn [95]) Take the algebra K = k [ y 1 , y 2 , y 3 | y i p = 0 , i = 1 , 2 , 3 ] of truncated polynomials over a field k of characteristic p > 0 . The algebra

L p = L i e K ( x 1 , x 2 , x 3 | y 3 x 3 = y 2 x 2 + y 1 x 1 ) ,

called Cohn’s Lie algebra, is not special.

In U K ( L p ) we have

0 = ( y 3 x 3 ) p = ( y 2 x 2 ) p + Λ p ( y 2 x 2 , y 1 x 1 ) + ( y 1 x 1 ) p = Λ p ( y 2 x 2 , y 1 x 1 ) ,

where Λ p is a Jacobson–Zassenhaus Lie polynomial. Cohn conjectured that Λ p ( y 2 x 2 , y 1 x 1 ) 0 in L p . To prove this, we must know a GS basis of L p up to degree p in X. We found it for p = 2 , 3 , 5 . For example, Λ 2 = [ y 2 x 2 , y 1 x 1 ] = y 2 y 1 [ x 2 x 1 ] and a GS basis of L 2 up to degree 2 in X is

y 3 x 3 = y 2 x 2 + y 1 x 1 , y i 2 x j = 0 ( 1 i , j 3 ) , y 3 y 2 x 2 = y 3 y 1 x 1 , y 3 y 2 y 1 x 1 = 0 , y 2 [ x 3 x 2 ] = y 1 [ x 3 x 1 ] , y 3 y 1 [ x 2 x 1 ] = 0 , y 2 y 1 [ x 3 x 1 ] = 0 .

Therefore, y 2 y 1 [ x 2 x 1 ] I r r ( S C ) .

Similar though much longer computations show that Λ 3 0 in L 3 and Λ 5 0 in L 5 . Thus, we have

Theorem 37

([31]) Cohn’s Lie algebras L 2 , L 3 , and L 5 are non-special.

Theorem 38

([31]) Given a commutative k-algebra K = k [ Y | R ] , if S is a Gröbner–Shirshov basis in L i e k [ Y ] ( X ) such that every s S is k [ Y ] -monic then L = L i e K ( X | S ) is special.

Corollary 18

([31]) Every Lie K-algebra L K = L i e K ( X | f ) with one monic defining relation f = 0 is special.

Theorem 39

([31]) Suppose that S is a finite homogeneous subset of L i e k ( X ) . Then the word problem of L i e K ( X | S ) is solvable for every finitely generated commutative k-algebra K.

Theorem 40

([31]) Every finitely or countably generated Lie K-algebra embeds into a two-generated Lie K-algebra, where K is an arbitrary commutative k-algebra.

5 Gröbner–Shirshov bases for Ω-algebras and operads

5.1 CD-lemmas for Ω-algebras

Some new CD-lemmas for Ω-algebras have appeared: for associative conformal algebras [45] and n-conformal algebras [43], for the tensor product of free algebras [30], for metabelian Lie algebras [75], for associative Ω-algebras [41], for color Lie superalgebras and Lie p-superalgebras [165, 166], for Lie superalgebras [167], for associative differential algebras [76], for associative Rota–Baxter algebras [32], for L-algebras [33], for dialgebras [38], for pre-Lie algebras [35], for semirings [40], for commutative integro-differential algebras [102], for difference-differential modules and difference-differential dimension polynomials [225], for λ-differential associative Ω-algebras [185], for commutative associative Rota–Baxter algebras [186], for algebras with differential type operators [111].

Latyshev studied general versions of GS (or standard) bases [147, 148].

Let us state the CD-lemma for pre-Lie algebras, see [35].

A non-associative algebra A is called a pre-Lie (or a right-symmetric) algebra if A satisfies the identity ( x , y , z ) = ( x , z , y ) for the associator ( x , y , z ) = ( x y ) z - x ( y z ) . It is a Lie admissible algebra in the sense that A ( - ) = ( A , [ x y ] = x y - y x ) is a Lie algebra.

Take a well-ordered set X = { x i | i I } . Order X by induction on the lengths of the words ( u ) and ( v ) :

  1. (i)

    When | ( ( u ) ( v ) ) | = 2 put ( u ) = x i > ( v ) = x j if and only if i > j .

  2. (ii)

    When | ( ( u ) ( v ) ) | > 2 put ( u ) > ( v ) if and only if one of the following holds:

    1. (a)

      | ( u ) | > | ( v ) | ;

    2. (b)

      if | ( u ) | = | ( v ) | with ( u ) = ( ( u 1 ) ( u 2 ) ) and ( v ) = ( ( v 1 ) ( v 2 ) ) then ( u 1 ) > ( v 1 ) or ( u 1 ) = ( v 1 ) and ( u 2 ) > ( v 2 ) .

We now quote the definition of good words (see [198]) by induction on length:

  1. (1)

    x is a good word for any x X ;

  2. (2)

    a non-associative word ( ( v ) ( w ) ) is called a good word if

    1. (a)

      both ( v ) and ( w ) are good words and

    2. (b)

      if ( v ) = ( ( v 1 ) ( v 2 ) ) then ( v 2 ) ( w ) .

Denote ( u ) by [ u ] whenever ( u ) is a good word.

Denote by W the set of all good words in the alphabet X and by R S X the free right-symmetric algebra over a field k generated by X. Then W forms a linear basis of R S X , see [198]. Kozybaev et al. [141] proved that the deg-lex ordering on W is monomial.

Given a set S R S X of monic polynomials and s S , an S-word ( u ) s is called a normal S-word whenever ( u ) s ¯ = ( a s ¯ b ) is a good word.

Take f , g S , [ w ] W , and a , b X . Then there are two kinds of compositions.

  1. (i)

    If f ¯ = [ a g ¯ b ] then ( f , g ) f ¯ = f - [ a g b ] is called the inclusion composition.

  2. (ii)

    If ( f ¯ [ w ] ) is not good then f · [ w ] is called the right multiplication composition.

Theorem 41

([35], the CD-lemma for pre-Lie algebras) Consider a nonempty set S R S X of monic polynomials and the ordering < defined above. The following statements are equivalent:

  1. (i)

    The set S is a Gröbner–Shirshov basis in R S X .

  2. (ii)

    If f I d ( S ) then f ¯ = [ a s ¯ b ] for some s S and a , b X , where [ a s b ] is a normal S-word.

  3. (iii)

    The set I r r ( S ) = { [ u ] W | [ u ] [ a s ¯ b ] a , b X , s S and [ a s b ] is a normal S -word } is a linear basis of the algebra R S X | S = R S X / I d ( S ) .

As an application, we have a GS basis for the universal enveloping pre-Lie algebra of a Lie algebra.

Theorem 42

([35]) Consider a Lie algebra ( L , [ ] ) with a well-ordered linear basis X = { e i | i I } . Write [ e i e j ] = m α i j m e m with α i j m k . Denote m α i j m e m by { e i e j } . Denote by

U ( L ) = R S { e i } I | e i e j - e j e i = { e i e j } , i , j I

the universal enveloping pre-Lie algebra of L. The set

S = { f i j = e i e j - e j e i - { e i e j } , i , j I and i > j }

is a Gröbner–Shirshov basis in R S X .

Theorems 41 and 42 directly imply the following PBW theorem for Lie algebras and pre-Lie algebras.

Corollary 19

(Segal [198]) A Lie algebra L embeds into its universal enveloping pre-Lie algebra U ( L ) as a subalgebra of U ( L ) ( - ) .

Recently the CD-lemmas mentioned above and other combinatorial methods yielded many applications: for groups of Novikov–Boone type [119121] (see also [16, 17, 77, 118], for Coxeter groups [58, 150], for center-by-metabelian Lie algebras [214], for free metanilpotent Lie algebras, Lie algebras and associative algebras [112, 168, 215, 216], for Poisson algebras [159], for quantum Lie algebras and related problems [132, 135], for PBW-bases [131, 134, 158], for extensions of groups and associative algebras [73, 74], for (color) Lie (p)-superalgebras [9, 48, 91, 92, 105107, 169, 227, 228], for Hecke algebras and Specht modules [125], for representations of Ariki–Koike algebras [126], for the linear algebraic approach to GS bases [127], for HNN groups [87], for certain one-relator groups [88], for embeddings of algebras [39, 83], for free partially commutative Lie algebras [84, 181], for quantum groups of type D n , E 6 , and G 2 [174, 189, 221, 222], for calculations of homogeneous GS bases [145], for Picard groups, Weyl groups, and Bruck–Reilly extensions of semigroups [7, 128130, 139], for Akivis algebras and pre-Lie algebras [79], for free Sabinin algebras [93].

5.2 CD-lemma for operads

Following Dotsenko and Khoroshkin ([98], Proposition 3), linear bases for a symmetric operad and a shuffle operad are the same provided both of them are defined by the same generators and defining relations. It means that we need CD-lemma for shuffle operads only (and we define a GS basis for a symmetric operad as a GS basis of the corresponding shuffle operad).

We express the elements of the free shuffle operad using planar trees.

Put V = n = 1 V n , where V n = { δ i ( n ) | i I n } is the set of n-ary operations.

Call a planar tree with n leaves decorated whenever the leaves are labeled by [ n ] = { 1 , 2 , 3 , , n } for n N and every vertex is labeled by an element of V.

For an arrow in a decorated tree, let its value be the minimal value of the leaves of the subtree grafted to its end. A decorated tree is called a tree monomial whenever for each its internal vertex the values of the arrows beginning from it increase from the left to the right.

Denote by F V ( n ) the set of all tree monomials with n leaves and put T = n 1 F V ( n ) . Given α = α ( x 1 , , x n ) F V ( n ) and β F V ( m ) , define the shuffle composition α i , σ β as

α ( x 1 , , x i - 1 , β ( x i , x σ ( i + 1 ) , , x σ ( i + m - 1 ) ) , x σ ( i + m ) , , x σ ( m + n - 1 ) ) ,

which lies in F V ( n + m - 1 ) , where 1 i n and the bijection

σ : { i + 1 , , m + n - 1 } { i + 1 , , m + n - 1 }

is an ( m - 1 , n - i ) -shuffle, that is,

σ ( i + 1 ) < σ ( i + 2 ) < < σ ( i + m - 1 ) , σ ( i + m ) < σ ( i + m + 1 ) < < σ ( n + m - 1 ) .

The set T is freely generated by V with the shuffle composition.

Denote by F V = k T the k-linear space spanned by T. This space with the shuffle compositions i , σ is called the free shuffle operad.

Take a homogeneous subset S of F V . For s S , define an S-word u | s as before.

A well ordering > on T is called monomial (admissible) whenever

α > β u | α > u | β for any u T .

Assume that T is equipped with a monomial ordering. Then each S-word is a normal S-word.

For example, the following ordering > on T is monomial, see Proposition 5 of [98].

Every α = α ( x 1 , , x n ) F V ( n ) has a unique expression

α = ( path ( 1 ) , , path ( n ) , [ i 1 i n ] ) ,

where path ( r ) V for 1 r n is the unique path from the root to the leaf r and the permutation [ i 1 i n ] lists the labels of the leaves of the underlying tree in the order determined by the planar structure, from left to right. In this case define

wt ( α ) = ( n , path ( 1 ) , , path ( n ) , [ i 1 i n ] ) .

Assume that V is a well-ordered set and use the deg-lex ordering on V . Take the order on the permutations in reverse lexicographic order: i > j if and only if i is less than j as numbers.

Now, given α , β T , define

α > β wt ( α ) > wt ( β ) lexicographically .

An element of F V is called homogeneous whenever all tree monomials occurring in this element with nonzero coefficients have the same arity degree (but not necessarily the same operation degree).

For two tree monomials α and β, say that α is divisible by β whenever there exists a subtree of the underlying tree of α for which the corresponding tree monomial α is equal to α.

A tree monomial γ is called a common multiple of two tree monomials α and β whenever it is divisible by both α and β. A common multiple γ of two tree monomials α and β is called a least common multiple and denoted by γ = lcm ( α , β ) whenever | α | + | β | > | γ | , where | δ | = n for δ F V ( n ) .

Take two monic homogeneous elements f and g of F V . If f ¯ and g ¯ have a least common multiple w then ( f , g ) w = w f ¯ f - w g ¯ g .

Theorem 43

([98], the CD-lemma for shuffle operads) In the above notation, consider a nonempty set S F V of monic homogeneous elements and a monomial ordering < on T. The following statements are equivalent:

  1. (i)

    The set S is a Gröbner–Shirshov basis in F V .

  2. (ii)

    If f I d ( S ) then f ¯ = u | s ¯ for some S-word u | s .

  3. (iii)

    The set I r r ( S ) = { u T | u v | s ¯ for all S -word v | s } is a k-linear basis of F V / I d ( S ) .

As applications, the authors of [98] calculate Gröbner–Shirshov bases for some well-known operads: the operad Lie of Lie algebras, the operad As of associative algebras, and the operad PreLie of pre-Lie algebras.