The encyclopedia of life project: Grid software and deployment
- 97 Downloads
The ongoing global effort of genome sequencing is making large scale comparative proteomic analysis an intriguing task. The Encyclopedia of Life (EOL; http://eol.sdsc.edu) project aims to provide current functional and structural annotations for all available proteomes, a computational challenge never seen before in biology. Using an integrative genome annotation pipeline (iGAP), we have produced 3D models and functional annotations for more than 100 proteomes thus far. This process is greatly facilitated by grid compute resources, and especially by the development of grid application execution environment. AppLeS (Application-Level Scheduling) Parameter Sweep Template (APST) has been adopted by the EOL project as a mediator to grid middleware. APST has made the annotation process much more efficient, highly automated and scalable. Currently we are building a domain-specific bioinformatics workflow management system (BWMS) on top of APST, which further streamlines grid deployment of life science applications. With these developments in mind, we discuss some common problems and expectations of grid computing for high throughput proteomics.
KeywordsBiology on the Grid Integrative Genome Annotation Pipeline Encyclopedia of Life AppLes Parameter Sweep Template Bioinformatics
Unable to display preview. Download preview PDF.
- 1).Berman, F., Fox, G. and Hey, T., “The Grid: Past, Present, Future,” inGrid Computing: Making the Global Infrastructure a Reality (Berman, F., Fox, G. and Hey, T. eds.), Wiley, West Sussex, 2003.Google Scholar
- 2).Baldridge, K. and Bourne, P. E., “The New Biology and the Grid,” inGrid Computing: Making the Global Infrastructure a Reality (Berman, F., Fox, G. C. and Hey, A. J. G. eds.), pp. 907–922, Wiley, West Sussex, 2003.Google Scholar
- 4).Casanova, H. and Berman, F., “Parameter Sweeps on the Grid with APST,” inGrid Computing: Making he Global Infrastructure a Reality (Berman, F., Fox, G. C. and Hey, A. J. G. eds.), Wiley, West Sussex, 2003.Google Scholar
- 11).Berman, F., Wolski, R. and Casanova, H. et al., “Adaptive Computing on the Grid Using AppLeS,” inIEEE Transactions on Parallel and Distributed Systems (TPDS), pp. 369–382, 2003.Google Scholar
- 12).Berman, F., Wolski, R., Figueira, S., Schopf, J. and Shao, G., “Application Level Scheduling on Distributed Heterogenous Networks,” inProc. of Supercomputing 2000 (SC ’00), 1996.Google Scholar
- 13).Czajkowski, K. et al., “A Resource Management Architecture for Metacomputing Systems,” inProc. of IPPS/SPDP’98 Workshop on Job Scheduling Strategies for Parallel Processing, pp. 62–82, 1998.Google Scholar
- 14).Foster, I., Kesselman, C., Tedesco, J. and Tuecke, S., “GASS: A Data Movement and Access Service for Wide Area Computing Systems,” inProc. of the Sixth workshop on I/O in Parallel and Distributed Systems, May, 1999.Google Scholar
- 15).Allcock, W. et al., “GridFTP: Protocol Extension to FTP for the Grid, Grid Forum Internet-Draft,” March, 2001.Google Scholar
- 16).Casanova, H., Legrand, A., Zagorodnov, D. and Berman, F., “Heuristics for Scheduling Parameter Sweep Applications in Grid environments,” inProc. of the 9th Heterogeneous Computing Workshop (HCW’00), May 2000.Google Scholar
- 17).Czajkowski, K., Fitzgerald, S., Foster, I. and Kesselman, C., “Grid Information Services for Distributed Resource Sharing,” inProc. of the 10th IEEE Symposium on High-Performance Distributed Computing, 2001.Google Scholar
- 18).Wolski, R., Spring, N. and Hayes, J., “The Network Weather Service: A Distributed Resource Performance Forecasting Service for Metacomputing,” inFuture Generation Computer Systems, pp. 757–768, 1999.Google Scholar