Circular codes, symmetries and transformations

Journal of Mathematical Biology

Abstract

Circular codes, putative remnants of primeval comma-free codes, have gained considerable attention in recent years. They represent a second kind of genetic code potentially involved in detecting and maintaining the normal reading frame in protein coding sequences. The discovery of a universal code across species has raised many theoretical and experimental questions. However, a key aspect that relates circular codes to symmetries and transformations remains to a large extent unexplored. In this article we address the issue by studying the symmetries and transformations that connect different circular codes. The main result is that the class of 216 \(C^3\) maximal self-complementary codes can be partitioned into 27 equivalence classes defined by a particular set of transformations. We show that such transformations can be put in a group theoretic framework with an intuitive geometric interpretation. More general mathematical results about symmetry transformations, valid for any kind of circular code, are also presented. Our results pave the way to the study of the biological consequences of the mathematical structure behind circular codes and help shed light on the evolutionary steps that led to the observed symmetries of present codes.

Notes

  1. Of course, excluding the identity.

References

  • Arquès DG, Michel CJ (1996) A complementary circular code in the protein coding genes. J Theor Biol 182:45–58

  • Benard E, Michel CJ (2013) Transition and transversion on the common trinucleotide circular code. Comput Biol J ID 795418:10

  • Bussoli L, Michel CJ, Pirillo G (2012) On conjugation partitions of sets of trinucleotides. Appl Math 3:107–112

  • Crick FHC, Griffith JS, Orgel LE (1957) Codes without commas. Proc Natl Acad Sci USA 43(5):416–421

  • Fimmel E, Danielli A, Strüngmann L (2013) On dichotomic classes and bijections of the genetic code. J Theor Biol 336:221–230

  • Frey G, Michel CJ (2006) Identification of circular codes in bacterial genomes and their use in a factorization method for retrieving the reading frames of genes. Comput Biol Chem 30:87–101

  • Giannerini S, Gonzalez DL, Rosa R (2012) DNA, dichotomic classes and frame synchronization: a quasi-crystal framework. Philos Trans R Soc A Math Phys Eng Sci 370(1969):2987–3006

  • Golomb SW, Gordon B, Welch LR (1958) Comma-free codes. Can J Math 10:202–209

  • Gonzalez DL (2004) Can the genetic code be mathematically described? Med Sci Monit 10(4):11–17

  • Gonzalez DL (2008) The mathematical structure of the genetic code. In: Barbieri M, Hoffmeyer J (eds) The codes of life: the rules of macroevolution. Biosemiotics, vol 1, chap 8. Springer, Netherlands, pp 111–152

  • Gonzalez DL, Giannerini S, Rosa R (2008) Strong short-range correlations and dichotomic codon classes in coding DNA sequences. Phys Rev E 78(5, ID 051918)

  • Gonzalez DL, Giannerini S, Rosa R (2011) Circular codes revisited: a statistical approach. J Theor Biol 275(1):21–28

  • Gonzalez DL, Giannerini S, Rosa R (2012) On the origin of the mitochondrial genetic code: towards a unified mathematical framework for the management of genetic information. Nat Preced. doi:10.1038/npre.2012.7136.1

  • Hayes B (1998) The invention of the genetic code. Am Sci 86(1):8–14

  • Koch AJ, Lehmann J (1997) About a symmetry of the genetic code. J Theor Biol 189:171–174

  • Lacan J, Michel CJ (2001) Analysis of a circular code model. J Theor Biol 213:159–170

  • Lewin B (2004) Genes 8. Pearson Prentice Hall, Upper Saddle River

  • Michel CJ (2008) A 2006 review of circular codes in genes. Comput Math Appl 55:984–988

  • Michel CJ (2013) Private communication

  • Michel CJ, Pirillo G (2010) Identification of all trinucleotide circular codes. Comput Biol Chem 34(2):122–125

  • Michel CJ, Pirillo G (2011) Strong trinucleotide circular codes. Int J Comb 2011(ID 659567)

  • Michel CJ, Pirillo G, Pirillo MA (2008) A relation between trinucleotide comma-free codes and trinucleotide circular codes. Theor Comput Sci 401(1–3):17–26

  • Michel CJ, Pirillo G, Pirillo MA (2012) A classification of 20-trinucleotide circular codes. Inf Comput 212:55–63

  • Rotman JJ (1995) An introduction to the theory of groups. Springer, Berlin

Acknowledgments

We would like to thank Alberto Danielli for useful discussions.

Author information

Corresponding author

Correspondence to Simone Giannerini.

Appendix A: Proofs

1.1 Proof of Theorem 1

Proof

For a codon \(x_i\in X\) we write \(x_i=B_1^i B_2^i B_3^i\) with \(B_j^i\in \mathcal {B}\), \(j=1,2,3\).

  1.

    Let us show first that \(\overleftarrow{{X}}\) is a trinucleotide circular code. The reverse codon to \(x_i\) has the form \(\overleftarrow{{x_i}}=B_3^iB_2^iB_1^i\). Assume that \(\overleftarrow{{X}}\) is not circular and the word

    $$\begin{aligned} w=\overleftarrow{{x_1}}\cdots \overleftarrow{{x_k}}=B_3^1B_2^1B_1^1\cdots B_3^kB_2^kB_1^k,\quad x_i\in X \end{aligned}$$

    has at least two decompositions into words from \(\overleftarrow{{X}}\) when written on a circle. Without loss of generality, assume that the second decomposition occurs with a shift by 1. This means that for all \(1\le i< k\)

    $$\begin{aligned} B_2^iB_1^iB_3^{i+1}\in \overleftarrow{{X}}\quad \text{ and }\quad B_2^kB_1^kB_3^{1}\in \overleftarrow{{X}}. \end{aligned}$$

    Reversing each of these codons, we obtain that for all \(1\le i < k\)

    $$\begin{aligned} B_3^{i+1}B_1^iB_2^i\in X\quad \text{ and }\quad B_3^1 B_1^k B_2^{k}\in X. \end{aligned}$$

    So the word

    $$\begin{aligned} w'=x_kx_{k-1}\cdots x_1=B_1^k B_2^k B_3^k B_1^{k-1} B_2^{k-1} B_3^{k-1}\cdots B_1^1 B_2^1 B_3^1 \end{aligned}$$

    has at least two decompositions into words from \(X\) on a circle, the second one with a shift by 2. This contradicts the circularity of \(X\). A similar argument works when the second decomposition of \(w\) is obtained by a shift of \(2\) positions.

    Let us now show with a counterexample that the remaining four permutations of the base positions \(\alpha \in S_3\setminus \{id, \overleftarrow{{ }}\}\) do not guarantee the circularity of \(\alpha (X)\). Denote these permutations by

    $$\begin{aligned} p_1=(21)(3),\quad p_2=(1)(32),\quad \alpha _1=(213),\quad \alpha _2=(312) \end{aligned}$$

    and consider for example \(X=\{TAA,ATT \}\). \(X\) and \(Y:=\alpha _1(X)=\{AAT,TTA \}\) are both circular. But \(\alpha _2(X)=\alpha _1(Y)=p_1(X)=p_2(Y)=\{ATA,TAT \}\) is not circular, since the word \(w=ATATAT \) has two decompositions into words of \(\alpha _2(X)\) on a circle: \(ATA,TAT \) and, after a shift by one position, \(TAT,ATA \).

  2.

    Assume that \(\pi (X)\) is not circular and the word

    $$\begin{aligned} w=\pi (x_1)\cdots \pi (x_k)=\pi (B_1^1)\pi (B_2^1)\pi (B_3^1)\cdots \pi (B_1^k)\pi (B_2^k)\pi (B_3^k),\quad x_i\in X \end{aligned}$$

    has at least two decompositions into words from \(\pi (X)\) when written on a circle. Without loss of generality, assume that the second decomposition occurs with a shift by 1. This means that for all \(1\le i < k\)

    $$\begin{aligned} \pi (B_2^{i})\pi (B_3^i)\pi (B_1^{i+1})\in \pi (X)\quad \text{ and }\quad \pi (B_2^k)\pi (B_3^k)\pi (B_1^{1})\in \pi (X). \end{aligned}$$

    Since \(\pi \) is a bijection, this implies that for all \(1\le i < k\)

    $$\begin{aligned} B_2^{i}B_3^iB_1^{i+1}\in X\quad \text{ and }\quad B_2^kB_3^kB_1^{1}\in X. \end{aligned}$$

    In this case the word \(w'=\pi ^{-1}(w)\) has at least two decompositions into words from \(X\) on a circle, which contradicts the circularity of \(X\). A similar argument works when the second decomposition is obtained by a shift of \(2\) positions. For all \(\alpha \in S_3\) and \(\pi \in S_{\mathcal {B}}\) the property

    $$\begin{aligned} \alpha (\pi (X))=\pi (\alpha (X)) \end{aligned}$$

    is true. By the definition of a \(C^3\)-code \(X_1:=\alpha _1(X)\) and \(X_2:=\alpha _2(X)\) are trinucleotide circular codes. The arguments above show that \(\pi (X), \pi (X_1)=\alpha _1 (\pi (X))\) and \(\pi (X_2)=\alpha _2 (\pi (X))\) are circular codes. That means that \(\pi (X)\) is a \(C^3\)-code.
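The two statements above can be checked numerically on the counterexample \(X=\{TAA,ATT\}\). The following is a minimal sketch, not the authors' method: it searches only words of at most `max_codons` codons, so a negative answer is evidence within that bound rather than a proof. The helper names `has_second_frame`, `permute_positions` and `permute_bases` are hypothetical and introduced only for illustration.

```python
from itertools import permutations, product

BASES = "ACGT"

def has_second_frame(code, max_codons=4):
    """Bounded search (assumption: words of at most max_codons codons).

    Returns True if some concatenation of codons, read on a circle,
    also decomposes into codons of `code` after a shift by 1 or 2.
    """
    code = set(code)
    for k in range(1, max_codons + 1):
        for codons in product(sorted(code), repeat=k):
            w = "".join(codons)
            for shift in (1, 2):
                rotated = w[shift:] + w[:shift]
                frames = [rotated[i:i + 3] for i in range(0, len(rotated), 3)]
                if all(f in code for f in frames):
                    return True
    return False

def permute_positions(code, order):
    """Rearrange the three positions of every codon (0-based order)."""
    return {"".join(x[i] for i in order) for x in code}

def permute_bases(code, mapping):
    """Apply a bijection of the four bases letter by letter."""
    return {"".join(mapping[b] for b in x) for x in code}

X = {"TAA", "ATT"}
reverse_X = permute_positions(X, (2, 1, 0))  # B1B2B3 -> B3B2B1
alpha1_X = permute_positions(X, (1, 2, 0))   # B1B2B3 -> B2B3B1
alpha2_X = permute_positions(X, (2, 0, 1))   # B1B2B3 -> B3B1B2

for name, code in [("X", X), ("reverse(X)", reverse_X),
                   ("alpha_1(X)", alpha1_X), ("alpha_2(X)", alpha2_X)]:
    print(name, sorted(code), "second frame found:", has_second_frame(code))

# Statement 2: every bijection of the bases should keep X circular
# (checked here only within the search bound).
base_images_ok = all(
    not has_second_frame(permute_bases(X, dict(zip(BASES, p))))
    for p in permutations(BASES)
)
print("all 24 base permutations pass the bounded check:", base_images_ok)
```

The search finds the ambiguous word \(ATATAT\) for \(\alpha_2(X)\) already with two codons, matching the counterexample in the proof.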

1.2 Proof of Theorem 2

Proof

According to the theorem above \(\pi (X)\) is circular. We prove that \(\pi (X)\) is self-complementary:

$$\begin{aligned} \overleftarrow{{c (\pi (X))}}=\overleftarrow{{\pi (c(X))}}=\pi (\overleftarrow{{c(X)}})=\pi (X) \end{aligned}$$

because of the self-complementarity of \(X\), the property \(\pi \circ c=c\circ \pi \) and the fact that for all \(\alpha \in S_3\) and \(\pi \in S_{\mathcal {B}}\) the property

$$\begin{aligned} \alpha (\pi (X))=\pi (\alpha (X)) \end{aligned}$$

is true.

Let us list all \(\pi \in S_{\mathcal {B}}\) satisfying \(\pi \circ c=c\circ \pi \). It is easy to prove that such maps form a subgroup of \((S_{\mathcal {B}},\circ )\). Consequently, by Lagrange's theorem, the number of such maps must be a divisor of 24. The following 8 bijective transformations have this property and form a subgroup of \((S_{\mathcal {B}},\circ )\) (easy to check):

$$\begin{aligned} L:=\{&id, c, p, r, \pi _{CG }: (A,C,G,T) \mapsto (A,G,C,T), \pi _{AT }: (A,C,G,T) \mapsto (T,C,G,A),\\&\pi _{ACTG }: (A,C,G,T) \mapsto (C,T,A,G), \pi _{AGTC }: (A,C,G,T) \mapsto (G,A,T,C)\}. \end{aligned}$$

To show that these are all \(\pi \in S_{\mathcal {B}}\) satisfying \(\pi \circ c=c\circ \pi \), note that the subgroup of all such maps contains \(L\), so its order is a multiple of 8 and a divisor of 24, that is, either 8 or 24. The order 24 is excluded because, for example, for

$$\begin{aligned} \pi : A,C,G,T \mapsto C,A,G,T\; \text{ we } \text{ have }\; c\circ \pi (A)=G\ne T=\pi \circ c(A), \end{aligned}$$

and the order 12 is impossible since 8 does not divide 12.
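The count of 8 can also be confirmed by brute force over all 24 bijections of the bases. The sketch below assumes only that \(c\) is the usual complement map \(A\leftrightarrow T\), \(C\leftrightarrow G\); the function name `commutes_with_complement` is hypothetical. Each map is printed as the image of \((A,C,G,T)\), following the notation used for \(L\).

```python
from itertools import permutations

BASES = "ACGT"
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def commutes_with_complement(pi):
    """True if pi(c(b)) == c(pi(b)) for every base b."""
    return all(pi[COMPLEMENT[b]] == COMPLEMENT[pi[b]] for b in BASES)

all_maps = [dict(zip(BASES, image)) for image in permutations(BASES)]
commuting = [pi for pi in all_maps if commutes_with_complement(pi)]

# Image strings, e.g. "AGCT" stands for (A,C,G,T) -> (A,G,C,T).
commuting_images = {"".join(pi[b] for b in BASES) for pi in commuting}

print(len(all_maps), "bijections,", len(commuting), "commute with c")
for image in sorted(commuting_images):
    print("(A,C,G,T) ->", tuple(image))
```

The transposition of \(A\) and \(C\) used in the argument above corresponds to the image string `CAGT` and does not appear in the output.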

According to the theorem above, each \(\pi \in S_{\mathcal {B}}\) preserves the circularity of \(X\). Let us now show with a counterexample that self-complementarity is not preserved if \(\pi \in S_{\mathcal {B}}\setminus L\), that is, if \(\pi \) does not commute with \(c\). Consider for example the circular self-complementary code \(X:= \{CTG, CAG \}\). For

$$\begin{aligned}&\pi _{AC }: A,C,G,T \mapsto C,A,G,T \quad \text {we get} \quad \pi _{AC }(X) =\{ATG,ACG \},\\&\pi _{AG }: A,C,G,T \mapsto G,C,A,T \quad \text {we get} \quad \pi _{AG }(X) =\{CTA,CGA \},\\&\pi _{TG } : A,C,G,T \mapsto A,C,T,G \quad \text {we get} \quad \pi _{TG }(X) =\{CGT,CAT \},\\&\pi _{TC }: A,C,G,T \mapsto A,T,G,C \quad \text {we get} \quad \pi _{TC }(X) =\{TCG,TAG \},\\&\pi _{ATCG } : A,C,G,T \mapsto T,G,A,C \quad \text {we get} \quad \pi _{ATCG }(X)=\{GCA,GTA \},\\&\pi _{ATGC } : A,C,G,T \mapsto T,A,C,G \quad \text {we get} \quad \pi _{ATGC }(X)=\{AGC,ATC \},\\&\pi _{TACG } : A,C,G,T \mapsto C,G,T,A \quad \text {we get} \quad \pi _{TACG }(X)=\{GAT,GCT \},\\&\pi _{TAGC } : A,C,G,T \mapsto G,T,C,A \quad \text {we get} \quad \pi _{TAGC }(X)=\{TAC,TGC \},\\&\pi _{ATC } : A,C,G,T \mapsto T,A,G,C \quad \text {we get} \quad \pi _{ATC }(X) =\{ACG,ATG \},\\&\pi _{TAC } : A,C,G,T \mapsto C,T,G,A \quad \text {we get} \quad \pi _{TAC }(X) =\{TAG,TCG \},\\&\pi _{ATG } : A,C,G,T \mapsto T,C,A,G \quad \text {we get} \quad \pi _{ATG }(X) =\{CGA,CTA \},\\&\pi _{TAG }: A,C,G,T \mapsto G,C,T,A \quad \text {we get} \quad \pi _{TAG }(X) =\{CAT,CGT \},\\&\pi _{GTC } : A,C,G,T \mapsto A,G,T,C \quad \text {we get} \quad \pi _{GTC }(X) =\{GCT,GAT \},\\&\pi _{TGC } : A,C,G,T \mapsto A,T,C,G \quad \text {we get} \quad \pi _{TGC }(X) =\{TGC,TAC \},\\&\pi _{AGC } : A,C,G,T \mapsto G,A,C,T \quad \text {we get} \quad \pi _{AGC }(X) =\{ATC,AGC \},\\&\pi _{GAC } : A,C,G,T \mapsto C,G,A,T \quad \text {we get} \quad \pi _{GAC }(X) =\{GTA,GCA \}. \end{aligned}$$

In each case we get a non-self-complementary code.
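The sixteen images can be recomputed mechanically. The sketch below is an illustration under the assumption that self-complementarity means the code equals the set of reversed complements of its codons; the names `reverse_complement`, `is_self_complementary` and `image_of` are hypothetical. It applies every bijection of the bases to \(X=\{CTG,CAG\}\) and records whether the image is self-complementary.

```python
from itertools import permutations

BASES = "ACGT"
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def reverse_complement(codon):
    """Anticodon: complement every base, then reverse the codon."""
    return "".join(COMPLEMENT[b] for b in reversed(codon))

def is_self_complementary(code):
    return {reverse_complement(x) for x in code} == set(code)

def commutes_with_complement(pi):
    return all(pi[COMPLEMENT[b]] == COMPLEMENT[pi[b]] for b in BASES)

def image_of(code, pi):
    """Apply the base bijection pi to every codon of the code."""
    return {"".join(pi[b] for b in x) for x in code}

X = {"CTG", "CAG"}
assert is_self_complementary(X)

results = {}  # image string of (A,C,G,T) -> (commutes, image, self-compl.)
for image in permutations(BASES):
    pi = dict(zip(BASES, image))
    pi_X = image_of(X, pi)
    results["".join(image)] = (
        commutes_with_complement(pi), pi_X, is_self_complementary(pi_X)
    )

for key, (commutes, pi_X, self_compl) in sorted(results.items()):
    print(key, sorted(pi_X),
          "commutes with c:", commutes,
          "self-complementary:", self_compl)
```

For this particular \(X\), the image is self-complementary exactly when the bijection commutes with \(c\), which agrees with the list above.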

1.3 Proof of Theorem 3

Proof

Let \(\pi \) be any permutation of the set of vertices of the cuboid and take a self-complementary code \(X\). Let \(x \in X\). Then the anticodon \(\overleftarrow{{c(\pi (x))}}\) of the image of \(x\) under \(\pi \) must be contained in \(\pi (X)\), hence is of the form \(\pi (x')\) for some \(x'\in X\). Now choose a self-complementary code \(Y\) with \(X\cap Y=\{x,\overleftarrow{{c(x)}} \}\). Then again \(\overleftarrow{{c(\pi (x))}}\) must be in \(\pi (Y)\) but by assumption this can only be the case if \(\overleftarrow{{c(\pi (x))}}=\pi (\overleftarrow{{c(x)}})\), hence \(\pi \) commutes with forming the anticodon.

Finally, assume that a permutation \(\pi \) of the set of vertices of the cuboid commutes with forming the anticodon, hence preserves self-complementarity, but is not an automorphism. It is easy to see that \(\pi \) must preserve degrees of vertices since it commutes with \({{\mathrm{ref}}}\). Thus \(\pi \) induces a permutation on the middle square, which therefore has to be an automorphism of the middle square because it is assumed to commute with \({{\mathrm{rot_{180}}}}\). Again, commuting with \({{\mathrm{ref}}}\) shows that the outer squares must also either be invariant or be reflected onto each other followed by an automorphism of the square.

Cite this article

Fimmel, E., Giannerini, S., Gonzalez, D.L. et al. Circular codes, symmetries and transformations. J. Math. Biol. 70, 1623–1644 (2015). https://doi.org/10.1007/s00285-014-0806-7
