Protein Structure Comparison and Visualization Tools on Cloud Platform

  • Yaw-Ling Lin
  • Chen-En Hsieh
  • Guan-Jie Hua
  • Che-Lun Hung
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 20)


The biological function of a protein molecule is determined by its 3D shape, which governs how the molecule interacts with other molecules in living cells. Identifying similar structures between proteins provides the opportunity to recognize homology that is undetectable by sequence comparison. Comparison and alignment of protein structures are therefore a powerful means of discovering function, yielding direct insight into molecular mechanisms.

This paper proposes visualization tools for pairwise 3D protein structure alignment. Our web service uses the MapReduce paradigm to manage and parallelize these tools over the massive number of protein pairs examined in the experiments. The results show that our previously proposed sequential combinatorial algorithms parallelize well on the MapReduce platform. The methods are tested on real-world data obtained from the RCSB PDB data set; computational efficiency improves in proportion to the number of processors used.
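As an illustration of how pairwise structure comparison fits the map/reduce pattern, the following is a minimal single-process sketch in Python. It is not the authors' implementation. The scoring function is a stand-in (distance-matrix RMSD over equal-length coordinate lists) rather than the paper's combinatorial alignment algorithms, and the names `drmsd`, `map_pair`, `reduce_best`, and `run`, as well as the toy coordinates, are hypothetical. On a real Hadoop cluster the map calls would be distributed across workers instead of executed in a loop.

```python
from collections import defaultdict
from itertools import combinations
from math import dist, sqrt


def drmsd(a, b):
    """Stand-in score: distance-matrix RMSD of two equal-length structures.

    It compares internal point-to-point distances, so it needs no
    superposition. It is NOT the alignment method used in the paper.
    """
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("structures must have the same length (>= 2)")
    squared = [
        (dist(a[i], a[j]) - dist(b[i], b[j])) ** 2
        for i, j in combinations(range(len(a)), 2)
    ]
    return sqrt(sum(squared) / len(squared))


def map_pair(pair, structures):
    """Map step: score one protein pair, emit a record keyed by each member."""
    id_a, id_b = pair
    score = drmsd(structures[id_a], structures[id_b])
    yield id_a, (id_b, score)
    yield id_b, (id_a, score)


def reduce_best(key, values):
    """Reduce step: keep the most similar partner (lowest score) for a key."""
    return key, min(values, key=lambda v: v[1])


def run(structures):
    """Simulate map, shuffle (group by key), and reduce in one process."""
    groups = defaultdict(list)
    for pair in combinations(sorted(structures), 2):
        for key, value in map_pair(pair, structures):
            groups[key].append(value)
    return dict(reduce_best(k, v) for k, v in groups.items())


# Hypothetical toy data: three 3-point "structures".
toy = {
    "A": [(0, 0, 0), (1, 0, 0), (2, 0, 0)],
    "B": [(0, 0, 0), (0, 1, 0), (0, 2, 0)],  # A rotated by 90 degrees
    "C": [(0, 0, 0), (1, 0, 0), (1, 1, 0)],
}
best = run(toy)
```

Because each pair is scored independently, the map step has no shared state, which is the property that lets running time drop as workers are added.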


protein structures comparisons, bioinformatics, visualization, VRML, MapReduce, Hadoop, cloud computing





Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Yaw-Ling Lin (1)
  • Chen-En Hsieh (1)
  • Guan-Jie Hua (1)
  • Che-Lun Hung (2)
  1. Department of Computer Science and Information Engineering, Providence University, Taichung, Taiwan
  2. Department of Computer Science and Communication Engineering, Providence University, Taichung, Taiwan
