Abstract
The distribution of the lengths of genomic segments inverted during the evolutionary divergence of two species cannot be inferred directly from the output of genome rearrangement algorithms, due to the rapid loss of signal from all but the shortest inversions. The number of short inversions produced by these algorithms, however, particularly those involving a single gene, is relatively reliable. To gain some insight into the shape of the inversion-length distribution we first apply a genome rearrangement algorithm to each of 32 pairs of bacterial genomes. For each pair we then simulate their divergence using a test distribution to generate the inversions and use the simulated genomes as input to the reconstruction algorithm. It is the comparison between the algorithm output for the real pair of genomes and the simulated pair which is used to assess the test distribution. We find that simulations based on the exponential distribution cannot provide a good fit, but that simulations based on a gamma distribution can account for both single-gene inversions and short inversions involving at most 20 genes, and we conclude that the shape of latter distribution corresponds well to the true distribution at least for small inversion lengths.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ajana, Y., Lefebvre, J.F., Tillier, E., El-Mabrouk, N.: Exploring the set of all minimal sequences of reversals - An application to test the replicationdirected reversal hypothesis. In: Guigó, R., Gusfield, D. (eds.) WABI 2002. LNCS, vol. 2452, pp. 300–315. Springer, Heidelberg (2002)
Bennetzen, J.L., Ramakrishna, W.: Numerous small rearrangements of gene content, order and orientation differentiate grass genomes. Plant Molecular Biology 48, 821–827 (2002)
Boore, J.L.: The duplication/random loss model for gene rearrangement exemplified by mitochondrial genomes of deuterostome animals. In: Sankoff, D., Nadeau, J.H. (eds.) Comparative Genomics, pp. 133–147. Kluwer Academic Press, Dordrecht (2000)
Burgetz, I.J., Shariff, S., Pang, A., Tillier, E.R.M.: Positional homology in bacterial genomes (2004) (Submitted ms)
Coghlan, A., Wolfe, K.H.: Fourfold faster rate of genome rearrangement in nematodes than in Drosophila. Genome Research 12, 857–867 (2002)
El-Mabrouk, N.: Sorting signed permutations by reversals and insertions/ deletions of contiguous segments. Journal of Discrete Algorithms 1, 105–122 (2001)
Hannenhalli, S., Pevzner, P.A.: Transforming cabbage into turnip (polynomial algorithm for sorting signed permutations by reversals). Journal of the ACM 48, 1–27 (1999)
Kececioglu, J., Sankoff, D.: Efficient bounds for oriented chromosomeinversion distance. Combinatorial Pattern Matching. In: Crochemore, M., Gusfield, D. (eds.) CPM 1994. LNCS, vol. 807, pp. 307–325. Springer, Heidelberg (1994)
Kececioglu, J., Sankoff, D.: Exact and approximation algorithms for sorting by reversals, with application to genome rearrangement. Algorithmica 13, 180–210 (1995)
Kent, W.J., Baertsch, R., Hinrichs, A., Miller, W., Haussler, D.: Evolution’s cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes. Proceedings of the National Academy of Sciences, USA 100, 11484–11489 (2003)
Lefebvre, J.-F., El-Mabrouk, N., Tillier, E., Sankoff, D.: Detection and validation of single-gene inversions. Bioinformatics 196, i190–i196 (2003)
Mahan, M.J., Roth, J.R.: Ability of a bacterial chromosome segment to invert is dictated by included material rather than flanking sequence. Genetics 129, 1021–1032 (1991)
McLysaght, A., Seoighe, C., Wolfe, K.H.: High frequency of inversions during eukaryote gene order evolution. In: Sankoff, D., Nadeau, J.H. (eds.) Comparative Genomics, pp. 47–58. Kluwer Academic Press, Dordrecht (2000)
Rogozin, I.B., Makarova, K.S., Murvai, J., Czabarka, E., Wolf, Y.I., Tatusov, R.L., Szekely, L.A., Koonin, E.V.: Connected gene neighborhoods in prokaryotic genomes. Nucleic Acids Research 30, 2212–2223 (2002)
Sankoff, D.: Short inversions and conserved gene clusters. Bioinformatics 18, 1305–1308 (2002)
Thomas, J.W., Green, E.D.: Comparative sequence analysis of a singlegene conserved segment in mouse and human. Mammalian Genome 14, 673–678 (2003)
Tillier, E.R.M., Collins, R.: Genome rearrangement by replicationdirected translocation. Nature Genetics 26, 195–197 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sankoff, D., Lefebvre, JF., Tillier, E., Maler, A., El-Mabrouk, N. (2005). The Distribution of Inversion Lengths in Bacteria. In: Lagergren, J. (eds) Comparative Genomics. RCG 2004. Lecture Notes in Computer Science(), vol 3388. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-32290-0_8
Download citation
DOI: https://doi.org/10.1007/978-3-540-32290-0_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24455-4
Online ISBN: 978-3-540-32290-0
eBook Packages: Computer ScienceComputer Science (R0)