Algebraic double cut and join

Bhatia, Sangeeta; Egri-Nagy, Attila; Francis, Andrew R.

doi:10.1007/s00285-014-0852-1

Algebraic double cut and join

A group-theoretic approach to the operator on multichromosomal genomes

Published: 11 December 2014

Volume 71, pages 1149–1178, (2015)
Cite this article

Journal of Mathematical Biology Aims and scope Submit manuscript

Sangeeta Bhatia¹,
Attila Egri-Nagy¹ &
Andrew R. Francis¹

443 Accesses
6 Citations
1 Altmetric
Explore all metrics

Abstract

Establishing a distance between genomes is a significant problem in computational genomics, because its solution can be used to establish evolutionary relationships including phylogeny. The “double cut and join” (DCJ) model of chromosomal rearrangement proposed by Yancopoulos et al. (Bioinformatics 21:3340–3346, 2005) has received attention as it can model inversions, translocations, fusion and fission on a multichromosomal genome that may contain both linear and circular chromosomes. In this paper, we realize the DCJ operator as a group action on the space of multichromosomal genomes. We study this group action, deriving some properties of the group and finding group-theoretic analogues for the key results in the DCJ theory.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

What is the difference between the breakpoint graph and the de Bruijn graph?

Article Open access 17 October 2014

Genome Rearrangement Analysis: Cut and Join Genome Rearrangements and Gene Cluster Preserving Approaches

On the Family-Free DCJ Distance

References

Bafna V, Pevzner PA (1993) Genome rearrangements and sorting by reversals. In: Proceedings of 34th annual symposium on foundations of computer science. IEEE Press, New York, pp 148–157
Bergeron A, Mixtacki J, Stoye J (2006) A unifying view of genome rearrangements. Algorithms Bioinform 163–173
Braga MDV, Stoye J (2009) Counting all DCJ sorting scenarios. In: Comparative genomics. Springer, Berlin, pp 36–47
Braga MDV, Stoye J (2010) The solution space of sorting by DCJ. J Comput Biol 17(9):1145–1165
Darling AE, Miklós I, Ragan MA (2008) Dynamics of genome rearrangement in bacterial populations. PLoS Genet 4(7)
Dénes J (1959) The representation of a permutation as the product of a minimal number of transpositions and its connection with the theory of graphs. Publ Math Inst Hungar Acad Sci 4:63–70
MATH Google Scholar
Egri-Nagy A, Gebhardt V, Tanaka MM, Francis AR (2014) Group-theoretic models of the inversion process in bacterial genomes. J Math Biol 69(1):243–265
Article MATH MathSciNet Google Scholar
Feijão P, Meidanis J (2013) Extending the algebraic formalism for genome rearrangements to include linear chromosomes. IEEE/ACM Trans Comput Biol Bioinform 10(4):819–831
Article Google Scholar
Fraleigh JB (2003) A first course in abstract algebra. Pearson Education India, Delhi
Francis AR (2014) An algebraic view of bacterial genome evolution. J Math Biol 69(6):1693–1718
Fulton W (1997) Young tableaux: with applications to representation theory and geometry. London Mathematical Society Student Texts. cambridge University Press, Cambridge
Google Scholar
Hannenhalli S, Pevzner PA (1995) Transforming men into mice (polynomial algorithm for genomic distance problem). In: Proceedings of 36th annual symposium on foundations of computer science, pp 581–592
Herstein IN (2006) Topics in algebra. Wiley, New York
Google Scholar
Lin YC, Tang CY (2006) Exposing phylogenetic relationships by genome rearrangement. Adv Comput 68:1–57
Article Google Scholar
Lu CL, Huang YL, Wang TC, Chiu HT (2006) Analysis of circular genome rearrangement by fusions, fissions and block-interchanges. BMC Bioinform 7(1):295
Meidanis J, Dias Z (2000) An alternative algebraic formalism for genome rearrangements. Comparative genomics: empirical and analyitical approaches to gene order dynamics, map alignment and evolution of gene families, pp 213–223
Miklós I, Darling AE (2009) Efficient sampling of parsimonious inversion histories with application to genome rearrangement in yersinia. Genome Biol Evol
Ouangraoua A, Bergeron A (2010) Combinatorial structure of genome rearrangements scenarios. J Comput Biol 17(9):1129–1144
Article MathSciNet Google Scholar
Petersen TK, Tenner BE (2013) How to write a permutation as a product of involutions. Integers 13
Siepel AC (2002) An algorithm to enumerate all sorting reversals. In: Proceedings of the sixth annual international conference on computational biology
Stanley RP (1999) Enumerative combinatorics, vol 2. Cambridge University Press, Cambridge
Book Google Scholar
Watterson GA, Ewens WJ, Hall TE, Morgan A (1982) The chromosome inversion problem. J Theor Biol 99(1):1–7. doi:10.1016/0022-5193(82)90384-8
Yancopoulos S, Attie O, Friedberg R (2005) Efficient sorting of genomic permutations by translocation, inversion and block interchange. Bioinformatics 21(16):3340–3346
Article Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Research in Mathematics, University of Western Sydney, Penrith, NSW, 2751, Australia
Sangeeta Bhatia, Attila Egri-Nagy & Andrew R. Francis

Authors

Sangeeta Bhatia
View author publications
You can also search for this author in PubMed Google Scholar
Attila Egri-Nagy
View author publications
You can also search for this author in PubMed Google Scholar
Andrew R. Francis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sangeeta Bhatia.

Additional information

This research was supported by Australian Research Council grants DP130100248 and FT100100898.

Appendix: Some results about symmetric groups

This paper uses some standard results on symmetric groups that we collect here for ease of reference. More details on these results can be found in many undergraduate group theory textbooks, for example Fraleigh (2003).

A permutation is a bijection from a set $S$ to itself. $S$ is usually taken to be a set of natural numbers $\mathbf {n}=\{1,2,\ldots ,n\}$. A permutation can be written by specifying the value of the map on all the points.

Example

$$\begin{aligned} \pi = \Big (\begin{matrix} 1 &{}\quad 2 &{}\quad 3 &{}\quad 4 &{}\quad 5 &{}\quad 6 \\ 3 &{}\quad 4 &{}\quad 1 &{}\quad 6 &{}\quad 2 &{}\quad 5 \end{matrix}\Big ) \end{aligned}$$

is a permutation on the set $\{1,2,3,4,5,6\}$ that sends 1 to 3, 2 to 4, etc. The set of all permutations on the set ${\mathbf {n}}$ forms a group called the symmetric group and denoted by $S_{n}$.

1.1 Permutation multiplication

Since permutations are simply bijective functions, permutation multiplication is function composition. That is, to find the image of $i$ in the product $\pi _2\pi _1$, we do $\pi _2 \left( \pi _1(i)\right) $.

Example

Let $\pi _1 = \Big (\begin{matrix} 1 &{} 2 &{} 3 &{} 4 &{} 5 \\ 3 &{} 4 &{} 1 &{} 5 &{} 2 \end{matrix}\Big )$ and $\pi _2= \Big (\begin{matrix} 1 &{} 2 &{} 3 &{} 4 &{} 5 \\ 2 &{} 1 &{} 4 &{} 3 &{} 5 \end{matrix}\Big )$. The image of $1$ in the product $\pi _2\pi _1$ is $\pi _2 \left( \pi _1(1)\right) =\pi _2(3)=4$. So for each $i$, we have to “follow the string” – $\pi _1$ send $i$ to $j$, $\pi _2$ sends $j$ to $k$, so $i$ gets sent to $k$ by $\pi _2\pi _1$.

$$\begin{aligned} \Big (\begin{matrix} 1 &{} 2 &{} 3 &{} 4 &{} 5 \\ 2 &{} 1 &{} 4 &{} 3 &{} 5 \end{matrix}\Big ) \Big (\begin{matrix} 1 &{} 2 &{} 3 &{} 4 &{} 5 \\ 3 &{} 4 &{} 1 &{} 5 &{} 2 \end{matrix}\Big )= \Big (\begin{matrix} 1 &{} 2 &{} 3 &{} 4 &{} 5 \\ 4 &{} 3 &{} 2 &{} 5 &{} 1 \end{matrix}\Big ). \end{aligned}$$

1.2 Inverse of a permutation

Informally, a permutation $\pi \in S_{n}$ scrambles the elements of ${\mathbf {n}}$. The inverse of $\pi $ is the permutation that “undoes” the scrambling. Formally we define the identity permutation $\iota $ to be the permutation that maps $i$ to $i$ for all $i \in {\mathbf {n}}$.

Definition 8.1

(Inverse) Let $\pi \in S_{n}$. Then the inverse of $\pi $ is the permutation $\pi ^{-1}$ such that

$$\begin{aligned} \pi \pi ^{-1}=\iota \quad \text {and}\quad \pi \pi ^{-1}=\iota . \end{aligned}$$

If $\pi ^{-1}$ is the inverse of $\pi $ then $\pi $ is the inverse of $\pi ^{-1}$. That is, $(\pi ^{-1})^{-1}=\pi $. In general, $(\pi _1\pi _2)^{-1}=\pi _2^{-1}\pi _1^{-1}.$

1.3 Cycles and cycle decomposition

For a permutation $\pi \in S_{n}$, if we repeatedly apply $\pi $ to any $i \in {\mathbf {n}}$,

$$\begin{aligned} i \mathop {\rightarrow }\limits ^{\pi } \pi (i) \mathop {\rightarrow }\limits ^{\pi } \pi ^2(i) \ldots , \end{aligned}$$

we must eventually (say after $k$ steps) reach $i$ again since ${\mathbf {n}}$ is a finite set. If there is some $j \in {\mathbf {n}}$ which does not occur in this sequence, then we can form a similar sequence for $j$, and keep doing this until every element of ${\mathbf {n}}$ occurs in some sequence.

Definition 8.2

(Cycle) Let $i_1,i_2,\ldots i_k$ be $k$ distinct integers in ${\mathbf {n}}$. A cycle $\pi _c$ written as $(i_1,i_2,\ldots ,i_k)$ is a permutation in $S_n$ defined as

$$\begin{aligned} \pi _c(i_s):= {\left\{ \begin{array}{ll} i_{s+1} &{} \text { if } i_s \in \{i_1,i_2,\ldots i_{k-1}\}\\ i_1 &{} \text { if } i_s=i_k \\ i_s &{} \text { otherwise.} \end{array}\right. } \end{aligned}$$

A 2-cycle is a cycle of length 2. That is, $\pi =(i,j)$ means that $\pi (i)=j,\pi (j)=i$ and $\pi (k)=k$ if $k \ne i,j$. A cycle of length 2 is also called a transposition.

Two cycles are said to be disjoint if they have no elements in common.

Theorem 8.3

Any permutation $\pi \in S_{n}$ can be written as a product of disjoint cycles.

Example

Let $\pi = \Big (\begin{matrix} 1 &{} 2 &{} 3 &{} 4 &{} 5 &{} 6 \\ 3 &{} 4 &{} 1 &{} 6 &{} 2 &{} 5 \end{matrix}\Big )$. $\pi $ can be written as

$$\begin{aligned} \pi =(1,3)(2,4,6,5). \end{aligned}$$

This way of writing a permutation is referred to as cycle notation. There is a unique way of writing a permutation as a product of disjoint cycles, up to the ordering of the cycles (they commute) and cyclic equivalence of each cycle (e.g. $(1,2,3)=(2,3,1)=(3,1,2)$). Since the sizes of the disjoint cycles will always add to $n$ (including if necessary some 1-cycles), we can define the cycle type as follows.

Definition 8.4

(Cycle type) The cycle type of a permutation $\pi $ is the partition $\lambda \vdash n$ whose components are the sizes of the cycles in the disjoint cycle decomposition of $\pi $.

Example

The cycle type of $\pi =(1,3)(2,4,6,5)$ is (4, 2) since it has one cycle of length 2 and one cycle of length 4.

1.4 Conjugation

Definition 8.5

Let $\pi ,g \in S_{n}$. The conjugate of $\pi $ by $g$ is defined to be the permutation $g \pi g^{-1}$, and we say that $\pi $ and $g\pi g^{-1}$ are conjugate permutations.

Theorem 8.6

Let $\pi _1$ and $\pi _2$ be permutations on the set $\mathbf {n}$, then $\pi _1$ and $\pi _2$ are conjugate in $S_{n}$ if and only if they have the same cycle type.

1.5 Permutation as product of transpositions

Theorem 8.7

Any permutation $\pi \in S_{n}$ can be written as a product of transpositions.

Example

The permutation $\pi =(1,3)(2,4,6,5)$ can be written as

$$\begin{aligned} \pi =(1,3)(2,4,6,5)=(1,3)(2,5)(2,6)(2,4). \end{aligned}$$

While the decomposition of a permutation into a product of disjoint cycles is unique, the decomposition of a permutation into a product of transpositions is not unique. However the number of transpositions used must be either always be even, or always be odd.

Theorem 8.8

A permutation $\pi \in S_n$ can be expressed as a product of either an even number of transpositions or an odd number of transpositions, but not both.

Definition 8.9

A permutation is said to be even if it can be written as a product of an even number of transpositions. Otherwise it is said to be an odd permutation.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bhatia, S., Egri-Nagy, A. & Francis, A.R. Algebraic double cut and join. J. Math. Biol. 71, 1149–1178 (2015). https://doi.org/10.1007/s00285-014-0852-1

Download citation

Received: 09 April 2014
Revised: 25 November 2014
Published: 11 December 2014
Issue Date: November 2015
DOI: https://doi.org/10.1007/s00285-014-0852-1

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Algebraic double cut and join

Abstract

Access this article

Similar content being viewed by others

What is the difference between the breakpoint graph and the de Bruijn graph?

Genome Rearrangement Analysis: Cut and Join Genome Rearrangements and Gene Cluster Preserving Approaches

On the Family-Free DCJ Distance

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix: Some results about symmetric groups

Example

1.1 Permutation multiplication

Example

1.2 Inverse of a permutation

Definition 8.1

1.3 Cycles and cycle decomposition

Definition 8.2

Theorem 8.3

Example

Definition 8.4

Example

1.4 Conjugation

Definition 8.5

Theorem 8.6

1.5 Permutation as product of transpositions

Theorem 8.7

Example

Theorem 8.8

Definition 8.9

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Algebraic double cut and join

Abstract

Access this article

Similar content being viewed by others

What is the difference between the breakpoint graph and the de Bruijn graph?

Genome Rearrangement Analysis: Cut and Join Genome Rearrangements and Gene Cluster Preserving Approaches

On the Family-Free DCJ Distance

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix: Some results about symmetric groups

Appendix: Some results about symmetric groups

Example

1.1 Permutation multiplication

Example

1.2 Inverse of a permutation

Definition 8.1

1.3 Cycles and cycle decomposition

Definition 8.2

Theorem 8.3

Example

Definition 8.4

Example

1.4 Conjugation

Definition 8.5

Theorem 8.6

1.5 Permutation as product of transpositions

Theorem 8.7

Example

Theorem 8.8

Definition 8.9

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation