Abstract
The Data Grid enables the sharing, selection, and connection of a wide variety of geographically distributed computational and storage resources for solving large-scale data intensive scientific applications. Such technology efficiently manage and transfer terabytes or even petabytes of data for data-intensive, high-performance computing applications in wide-area, distributed computing environments. Replica selection process allows an application to choose a replica from replica catalog, based on its performance and data access features. In this paper, we build a Grid environment based on three existing PC Cluster environments and perform performance analysis of data transfers using GridFTP protocol over these systems. In addition, based on experimental results, it is proposed a cost model to pick the best replica, in real and dynamic network situations.
This paper is supported in part by NSC Taiwan (National Science Council), under grants no. NSC92-2213-E-029-025, NSC92-2119-M-002-024, NSC 93-2119-M-002-004 and NSC93-2213-E-029-026.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allcock, B., Bester, J., Bresnahan, J., Chervenak, A., Foster, I., Kesselman, C., Meder, S., Nefedova, V., Quesnal, D., Tuecke, S.: Data Management and Transfer in High Performance Computational Grid Environments. Parallel Computing 28(5), 749–771 (2002)
Allcock, B., Bester, J., Bresnahan, J., Chervenak, A., Foster, I., Kesselman, C., Meder, S., Nefedova, V., Quesnel, D., Tuecke, S.: Secure, Efficient Data Transport and Replica Management for High-Performance Data-Intensive Computing. In: IEEE Mass Storage Conference (2001)
Allcock, B., Tuecke, S., Foster, I., Chervenak, A., Kesselman, C.: Protocols and Services for Distributed Data-Intensive Science. In: ACAT 2000 Proceedings, pp. 161–163 (2000)
Czajkowski, K., Fitzgerald, S., Foster, I., Kesselman, C.: Grid Information Services for Distributed Resource Sharing. In: Proceedings of the Tenth IEEE International Symposium on High-Performance Distributed Computing (HPDC-10), August 2001. IEEE CS Press, Los Alamitos (2001)
Czajkowski, K., Foster, I., Karonis, N., Kesselman, C., Martin, S., Smith, W., Tuecke, S.: A Resource Management Architecture for Metacomputing Systems. In: Proc. IPPS/SPDP 1998 Workshop on Job Scheduling Strategies for Parallel Processing, pp. 62–82 (1998)
De, R.L., Costa, C., Lifschitz, S.: Database Allocation Strategies for Parallel BLAST Evaluation on Clusters. Proceedings of the Distributed and Parallel Databases 13(1), 99–127 (2003)
Foster, I.: The Grid: A New Infrastructure for 21st Century Science. Physics Today 55(2), 42–47 (2002)
Foster, I., Kesselman, C.: Globus: A Metacomputing Infrastructure Toolkit. Intl J. Supercomputer Applications 11(2), 115–128 (1997)
Foster, I., Kesselman, C.: The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, San Francisco (1999)
Foster, I., Kesselman, C., Tuecke, S.: The Anatomy of the Grid: Enabling Scalable Virtual Organizations. Intl J. Supercomputer Applications 15(3) (2001)
Global Grid Forum, http://www.ggf.org/
The Globus Project, http://www.globus.org/
Introduction to Grid Computing with Globus, http://www.ibm.com/redbooks/
SETI@home: Search for Extraterrestrial Intelligence at home, http://setiathome.ssl.berkeley.edu/
SYSSTAT utilities home page, http://perso.wanadoo.fr/sebastien.godard/
Wolski, R., Spring, N., Hayes, J.: The Network Weather Service: A Distributed Resource Performance Forecasting Service for Metacomputing. Journal of Future Generation Computing Systems 15(5-6), 757–768 (1999)
Zhang, X., Freschl, J., Schopf, J.: A Performance Study of Monitoring and Information Services for Distributed Systems. In: Proceedings of HPDC (August 2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, CT., Chen, CH., Li, KC., Hsu, CH. (2005). Performance Analysis of Applying Replica Selection Technology for Data Grid Environments. In: Malyshkin, V. (eds) Parallel Computing Technologies. PaCT 2005. Lecture Notes in Computer Science, vol 3606. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11535294_24
Download citation
DOI: https://doi.org/10.1007/11535294_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28126-9
Online ISBN: 978-3-540-31826-2
eBook Packages: Computer ScienceComputer Science (R0)