Efficient Sampling of Transpositions and Inverted Transpositions for Bayesian MCMC
The evolutionary distance between two organisms can be determined by comparing the order of appearance of orthologous genes in their genomes. Above the numerous parsimony approaches that try to obtain the shortest sequence of rearrangement operations sorting one genome into the other, Bayesian Markov chain Monte Carlo methods have been introduced a few years ago. The computational time for convergence in the Markov chain is the product of the number of needed steps in the Markov chain and the computational time needed to perform one MCMC step. Therefore faster methods for making one MCMC step can reduce the mixing time of an MCMC in terms of computer running time.
We introduce two efficient algorithms for characterizing and sampling transpositions and inverted transpositions for Bayesian MCMC. The first algorithm characterizes the transpositions and inverted transpositions by the number of breakpoints the mutations change in the breakpoint graph, the second algorithm characterizes the mutations by the change in the number of cycles. Both algorithms run in O(n) time, where n is the size of the genome. This is a significant improvement compared with the so far available brute force method with O(n 3) running time and memory usage.
KeywordsMarkov Chain Monte Carlo Genome Rearrangement Markov Chain Monte Carlo Method Proposal Distribution Signed Permutation
Unable to display preview. Download preview PDF.
- 1.Sturtevant, A.H., Novitski, E.: The homologies of chromosome elements in the genus Drosophila. Genetics 26, 517–541 (1941)Google Scholar
- 8.Siepel, A.: An algorithm to find all sorting reversals. In: Proc. RECOMB 2002, pp. 281–290 (2002)Google Scholar
- 10.Hannenhalli, S.: Polynomial algorithm for computing translocation distance between genomes. In: Hirschberg, D.S., Meyers, G. (eds.) CPM 1996. LNCS, vol. 1075, pp. 168–185. Springer, Heidelberg (1996)Google Scholar
- 18.Larget, B., Simon, D.L., Kadane, B.J.: Bayesian phylogenetic inference from animal mitochondrial genome arrangements. J. Royal Stat. Soc. B 64(4), 681–695Google Scholar
- 19.York, T.L., Durrett, R., Nielsen, R.: Bayesian estimation of inversions in the history of two chromosomes. J. Comp. Biol. 9, 808–818 (2002)Google Scholar
- 26.Liu, J.S.: Monte Carlo strategies in scientific computing. Springer Series in Statistics, New-York (2001)Google Scholar
- 28.von Neumann, J.: Various techniques used in connection with random digits. National Bureau of Standards Applied Mathematics Series 12, 36–38 (1951)Google Scholar