Abstract
Multiple sequence alignment is an important tool in bioinformatics. Although efficient heuristic algorithms exist for this problem, the exponential growth of biological data demands an even higher throughput. The recent emergence of accelerator technologies has made it possible to achieve a highly improved execution time for many bioinformatics applications compared to general-purpose platforms. In this paper, we demonstrate how the PlayStation®3, powered by the Cell Broadband Engine, can be used as a computational platform to accelerate the distance matrix computation utilized in multiple sequence alignment algorithms.
Chapter PDF
Similar content being viewed by others
References
Lipman, D.J., Altschul, S.F., Kececioglu, J.D.: A tool for multiple sequence alignment. Proceedings of the National Academy of Sciences of the United States of America 86(12), 4412–4415 (1989)
Thompson, J.D., Higgins, D.G., Gibson, T.J.: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucl. Acids Res. 22(22), 4673–4680 (1994)
Notredame, C., Higgins, D.G., Heringa, J.: T-coffee: A novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302(1), 205–217 (2000)
Katoh, K., Misawa, K., Kuma, K.I., Miyata, T.: MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research 30(14), 3059–3066 (2002)
Schmollinger, M., Nieselt, K., Kaufmann, M., Morgenstern, B.: DIALIGN P: Fast pair-wise and multiple sequence alignment using parallel processors. BMC Bioinformatics, 5 (2004)
Simossis, V.A., Heringa, J.: PRALINE: A multiple sequence alignment toolbox that integrates homology-extended and secondary structure information. Nucleic Acids Research 33(suppl. 2), W289–W294 (2005)
Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F., Higgins, D.G.: The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucl. Acids Res. 25(24), 4876–4882 (1997)
Catalyurek, U., Stahlberg, E., Ferreira, R., Saltzt, J.: Improving Performance of Multiple Sequence Alignment Analysis in Multi-client Environments. In: Proceedings of the First International Workshop on High Performance Computational Biology 2002 (HiCOMB 2002, IPDPS 2002) (2002)
Catalyurek, U., Gray, M., Kurc, T., Saltzt, J., Stahlberg, E., Ferreira, R.: A component-based implementation of multiple sequence alignment. In: Proceedings of the ACM Symposium on Applied Computing: 2003, pp. 122–126 (2003)
Li, K.-B.: ClustalW-MPI: ClustalW analysis using distributed and parallel computing. Bioinformatics 19(12), 1585–1586 (2003)
Chaichoompu, K., Kittitornkun, S., Tongsima, S.: MT-ClustalW: Multithreading multiple sequence alignment. In: 20th International Parallel and Distributed Processing Symposium, IPDPS 2006 (2006)
Luo, J., Ahmad, I., Ahmed, M., Paul, R.: Parallel multiple sequence alignment with dynamic scheduling. In: International Conference on Information Technology: Coding and Computing, ITCC 2005, pp. 8–13 (2005)
Oliver, T., Schmidt, B., Nathan, D., Clemens, R., Maskell, D.: Multiple sequence alignment on an FPGA. In: Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS 2005, pp. 326–330 (2005)
Liu, W., Schmidt, B., Voss, G., Muller-Wittig, W.: Streaming Algorithms for Biological Sequence Alignment on GPUs. IEEE Transactions on Parallel and Distributed Systems (2007)
Kahle, J.A., Day, M.N., Hofstee, H.P., Johns, C.R., Maeurer, T.R., Shippy, D.: Introduction to the Cell multiprocessor. IBM Journal of Research and Development 49(4-5), 589–604 (2005)
Pande, V.: Folding@Home: Using Worldwide distributed computing to break fundamental barriers in molecular simulation. In: Proceedings of the IEEE International Symposium on High Performance Distributed Computing 2006, p. 4 (2006)
Sachdeva, V., Kistler, M., Speight, E., Tzeng, T.-H.K.: Exploring the viability of the Cell Broadband Engine for bioinformatics applications. In: IEEE International Parallel and Distributed Processing Symposium 2007, 8 p. IEEE, Long Beach (2007)
Wirawan, A., Kwoh, C.K., Hieu, N.T., Schmidt, B.: CBESW: Sequence alignment on the playstation 3. BMC Bioinformatics, 9 (2008)
Stamatakis, A., Ludwig, T., Meier, H.: RAxML-II: A program for sequential, parallel and distributed inference of large phylogenetic trees. Concurrency Computation Practice and Experience 17(14), 1705–1723 (2005)
Pham, D., Behnen, E., Bolliger, M., Hofstee, H.P., Johns, C., Kahle, J., Kameyama, A., Keaty, J., Le, B., Masubuchi, Y., et al.: The design methodology and implementation of a first-generation CELL processor: a multi-core SoC. In: Proceedings of the IEEE 2005 Custom Integrated Circuits Conference 2005, San Jose, CA, USA, pp. 45–49. IEEE, Los Alamitos (2005)
International Business Machines: Software Development Kit 2.1 Accelerated Library Framework Programmer’s Guide and API Reference, Version 1.1. In: IBM developerWorks (2007)
Feng, D.F., Doolittle, R.F.: Progressive sequence alignment as a prerequisitetto correct phylogenetic trees. Journal of Molecular Evolution 25(4), 351–360 (1987)
Saitou, N., Nei, M.: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular biology and evolution 4(4), 406–425 (1987)
IBM: C/C++ Language Extensions for Cell Broadband Engine Architecture v.2.5. In. IBM developerWorks (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wirawan, A., Schmidt, B., Kwoh, C.K. (2009). Pairwise Distance Matrix Computation for Multiple Sequence Alignment on the Cell Broadband Engine. In: Allen, G., Nabrzyski, J., Seidel, E., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds) Computational Science – ICCS 2009. Lecture Notes in Computer Science, vol 5544. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01970-8_96
Download citation
DOI: https://doi.org/10.1007/978-3-642-01970-8_96
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01969-2
Online ISBN: 978-3-642-01970-8
eBook Packages: Computer ScienceComputer Science (R0)