Grid and Distributed Public Computing Schemes for Structural Proteomics: A Short Overview

  • Azhar Ali Shah
  • Daniel Barthel
  • Natalio Krasnogor
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4743)

Abstract

Grid and distributed public computing schemes has become an essential tool for many scientific fields including bioinformatics, computational biology and systems biology. The adoption of these technologies has given rise to a wide range of projects and contributions that provide various ways of setting up these environments and exploiting their potential resources and services for different domains of applications. This paper aims to provide a distilled overview of some of the major projects, technologies and resources employed in the area of structural proteomics. The major emphasis would be to briefly comment on various approaches related to the gridification and parallelization of some flagship legacy applications, tools and data resources related to key structural proteomics problems such as protein structure prediction, folding and comparison. The comments are based on theoretical analysis of some interesting parameters such as performance gain after gridification, user level interaction environments, workload distribution and the choice of deployment infrastructure and technologies. The study of these parameters would provide a basis for some motivating justification needed for further research and development in this domain.

Keywords

Protein Data Bank Protein Structure Prediction Protein Structure Comparison Structural Proteomics World Community Grid 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
    Rice, P., Longden, I., Bleasby, A.: EMBOSS: The European Molecular Biology Open Software Suite. Trends in Genetics 16, 276–277 (2000)CrossRefGoogle Scholar
  3. 3.
    Stevens, R.D., Robinson, A.J., Goble, C.A.: MyGrid: Personalized bioinformatics on the information grid. Bioinformatics 19, 302–304 (2003)CrossRefGoogle Scholar
  4. 4.
    Gagliardi, F., Jones, B., Grey, F., Begin, M.E., Heikkurinen, M.: Building an infrastructure for scientific Grid computing: status and goals of the EGEE project. Philosophical Transactions: Mathematical, Physical and Engineering Sciences 363, 1729–1742 (2005)CrossRefGoogle Scholar
  5. 5.
    Michael, P., Sergio, M., Peter, K.: The SwissBioGrid Project: Objectives, Preliminary Results and Lessons Learned. In: 2nd IEEE International Conference on e-Science and Grid Computing (e-Science 2006) - Workshop on Production Grids, IEEE Computer Society Press, Amsterdam, Netherlands (2006)Google Scholar
  6. 6.
    World Community Grid: Technology Solving Problems, http://www.worldcommunitygrid.org/
  7. 7.
    Pande, V.S., Baker, I., Chapman, J., et al.: Atomistic Protein Folding Simulations on the Submillisecond Time Scale Using Worldwide Distributed Computing. Biopolymers 68, 91–109 (2003)CrossRefGoogle Scholar
  8. 8.
    Taufer, M., An, C., Kerstens, A., Brooks, C.L.: Predictor@Home: a protein structure prediction supercomputer based on public-resource computing. In: Proceedings of 19th IEEE International Parallel and Distributed Processing Symposium, IEEE Computer Society Press, Los Alamitos (2005)Google Scholar
  9. 9.
    Rosetta@Home: Protein Folding, Design and Docking, http://boinc.bakerlab.org/rosetta/
  10. 10.
    Shah, A.A., Barthel, D., Lukasiak, P., Blacewicz, J., Krasnogor, N.: Web & Grid Technologies in Bioinformatics, Computational Biology and Systems Biology. Current Bioinformatics (submitted)Google Scholar
  11. 11.
    BioSimGrid: A distributed database for biomolecular simulations, http://www.biosimgrid.org/#about
  12. 12.
    Zhang, Y., Skolnick, J.: The protein structure prediction problem could be solved using the current PDB library. Proceedings of US National Academy of Science 102, 1029–1034 (2005)CrossRefGoogle Scholar
  13. 13.
    Susumu, D., Kazutoshi, F., Hideo, M., Haruki, N., Shinji, S.: An Empirical Study of Grid Applications in Life Science Lesson learnt from Biogrid project in Japan. International Journal of Information Technology 11, 16–28 (2005)Google Scholar
  14. 14.
    Chikenji, G., Fujitsuka, Y., Takada, S.: A reversible fragment assembly method for De Novo protein structure prediction. Journal of Chemical Physics 119, 6895–6903 (2003)CrossRefGoogle Scholar
  15. 15.
    Birnbaum, A., Hayes, J., Wilfred, W., et al.: Grid Workflow Software for a High-Throughput Proteome Annotation Pipeline. In: Konagaya, A., Satou, K. (eds.) LSGRID 2004. LNCS (LNBI), vol. 3370, pp. 68–81. Springer, Heidelberg (2005)Google Scholar
  16. 16.
    Herrera, J., Huedo, E., Montero, R.S., Lorente, I.M.: Benchmarking of a joint IRISGrid/EGEE Testbed with a Bioinformatics Application. Scalable Computing: Practice and Experience 7, 25–32 (2006)Google Scholar
  17. 17.
    Huedo, E., Ruben, S.M., Ignacio, M.L.: Adaptive Grid Scheduling of a High-Throughput Bioinformatics Application. In: Wyrzykowski, R., Dongarra, J.J., Paprzycki, M., Waśniewski, J. (eds.) Parallel Processing and Applied Mathematics. LNCS, vol. 3019, pp. 840–847. Springer, Heidelberg (2004)Google Scholar
  18. 18.
    Tanimura, Y., Aoi, K., Hiroyasu, T., Miki, M., Okamoto, Y., Dongarra, J.: Implementation of Protein Tertiary Structure Prediction System with NetSolve. In: Proceedings of the High Performance Computing and Grid in Asia Pacific Region, Seventh International Conference on (HPCAsia 2004) (2004)Google Scholar
  19. 19.
    Natrajan, A., Crowley, M., Wilkins, D.N., Humphrey, M.A., Fox, A.D., Grimshaw, A.S., Brooks, C.L.: Studying Protein Folding on the Grid: Experiences using CHARMM on NPACI Resources under Legion. In: Proceeding of the HPDC Conference, San Francisco, CA, USA (2001)Google Scholar
  20. 20.
    UK, B., Taufer, M., Stricker, T., Settanni, G., Cavalli, A.: Implementation and characterization of protein folding on a desktop computational grid - is CHARMM a suitable candidate for the united devices metaprocessor? In: Technical Report 385, ETH Zurich, Institute for Computer systems (2002)Google Scholar
  21. 21.
    Ferrari, C., Guerra, C., Zanotti, G.: A grid-aware approach to protein structure comparison. Journal of Parallel Distributed Computing 63, 728–737 (2003)CrossRefGoogle Scholar
  22. 22.
    Barthel, D., Hirst, J.D., Blacewicz, J., Krasnogor, N.: Procksi: a metaserver for protein comparison using kolmogorov and other similarity measures. In: Late Breaking Papers. European Conference on Computational Biology, Eilat, Israel (2006)Google Scholar
  23. 23.
    Park, S.J., Yamamura, M.: FROG (Fitted Rotation and Orientation of protein structure by means of real-coded Genetic algorithm): Asynchronous Parallelizing for Protein Structure-Based Comparison on the Basis of Geometrical Similarity. Genome Informatics 13, 344–345 (2002)Google Scholar
  24. 24.
    Park, S., Yamamura, J.: Two-layer Protein Structure Comparison. In: Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2003) (2003)Google Scholar
  25. 25.
    Cannataro, M., Comin, M., Ferrari, C., Guerra, C., Guzzo, A., Veltri, P.: Modeling a Protein Structure Comparison Application on the Grid Using PROTEUS. In: Herrero, P., Pérez, M.S., Robles, V. (eds.) SAG 2004. LNCS, vol. 3458, pp. 75–85. Springer, Heidelberg (2005)Google Scholar
  26. 26.
    Zamboulis, L., Fan, H., Belhajjame, K., Siepen, J., et al.: Data Access and Integration in the ISPIDER Proteomics Grid. In: Leser, U., Naumann, F., Eckman, B. (eds.) DILS 2006. LNCS (LNBI), vol. 4075, pp. 3–18. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  27. 27.
    Allan, R., Keegan, R., Meredith, D., Winn, M., Winter, G.: e-HTPX– HPC, Grid and Web-Portal Technologies in High Throughput Protein Crystallography. In: Proceedings of the UK e-Science All Hands Meeting, Nottingham, UK (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Azhar Ali Shah
    • 1
  • Daniel Barthel
    • 1
  • Natalio Krasnogor
    • 1
  1. 1.Automated Scheduling optimization And Planning (ASAP) Group, School of Computer Science and Information Technology, University of Nottingham, NG8 1BBUK

Personalised recommendations