Cluster Computing

, Volume 6, Issue 3, pp 267–278 | Cite as

The Rutgers Computational Grid: A Distributed Linux PC Cluster

  • B. Chernyavsky
  • E. Gallicchio
  • D. Knight
  • R. Levy
  • A. Page
Article

Abstract

The Rutgers Computational Grid (RCG) project is aimed at providing high throughput performance to Rutgers university faculty and students. The RCG employs dual processor PCs, with Pentium II and III processors, as computational nodes, running the Linux RedHat operating system. The Load Sharing Facility (LSF) scheduling system from Platform Computing is used for job control and monitoring. The nodes are grouped into subclusters physically located in several departments and controlled by a single master node through LSF. The hardware and software used in RCG are described. Utilization and performance issues, including parallel performance, are discussed based on the experience of the first two years of RCG operation.

grid computing scheduling system parallel computing 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    R.J. Allan, Survey of Computation Grids, Meta-Computers and Network Information Tools ed.2, CLRS Daresbury Laboratory, Daresbury, January 2000.Google Scholar
  2. [2]
    S. Anasiadis and K. Sevcik, Parallel application scheduler on network of workstations, Journal of Parallel and Distributing Computing 43 (1997) 109-124.Google Scholar
  3. [3]
    T.E. Anderson, D.E. Culler, D.A. Patterson and the NOWTeam, A case for networks of workstations: NOW, IEEE Micro (February 1995).Google Scholar
  4. [4]
    R.H. Arpachi, A.C. Dusseau, A.M. Vahdat, L.T. Liu, T.E. Anderson and D.A. Patterson, The interaction of parallel and sequential workloads on a network of workstations, in: Proceedings of ACM SIGMETRICS'95/PERFORMANCE'95 Joint International Conference on Measurement and Modeling of Computer Systems, May 1995, pp. 267-278.Google Scholar
  5. [5]
    A.C. Arpaci-Dusseau and D.E. Culler, Extending proportional-share scheduling to a network of workstations, in: International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'97) (1997).Google Scholar
  6. [6]
    P. Boulet, J. Dongarra, F. Rastello, Y. Pobert and F. Vivien, Algorithmic issues on heterogeneous computer platforms, Parallel Processing Letters 9(2) (1999) 197-213.Google Scholar
  7. [7]
    R. Buyya, High Performance Computer Clusters: Architecture and Systems, Vol. 1 (Prentice-Hall, NJ, 1999).Google Scholar
  8. [8]
    J. Dongarra, An overview of computational grids and survey of a few research projects, in: Symposium on Global Information Processing Technology, Japan, 1999.Google Scholar
  9. [9]
    X. Du and X. Zhang, Coordinating parallel processing on network of workstations, Journal of Parallel and Distributing Computing 46 (1997) 125-135.Google Scholar
  10. [10]
    I. Foster and C. Kesselman, The Grid: Blueprint for a New Computing Infrastructure (Morgan Kauffman, San Francisco, CA, 1999).Google Scholar
  11. [11]
    V. Hamscher, U. Schiegelshohn, A. Streit and R. Yahyapour, Evaluation of job-scheduling strategies for grid computing, in: 1st IEEE/ACM International Workshop on Grid Computing, Bangalore, India, 17–;20 December 2000.Google Scholar
  12. [12]
    G.J. Henry, The fair share scheduler, AT&T Bell Laboratories Technical Journal 63(8) (October 1984) 1845-1857.Google Scholar
  13. [13]
    http://www.dhpc.adelaide.edu.au/education/honours/campusgrid.htmlGoogle Scholar
  14. [14]
    http://www-isd.fnal.gov/fbatchGoogle Scholar
  15. [15]
    J. Kay and P. Lauder, A fair share scheduler, Communications of ACM 31(1) (1988) 44-55.Google Scholar
  16. [16]
    D. Knight, G. Zhou, N. Okong'o and V. Shukla, Compressible large eddy simulation using unstructured grids, AIAA Paper No. 98-0535, 1998.Google Scholar
  17. [17]
    Load Sharing Facility, http://www.platform.comGoogle Scholar
  18. [18]
    MPICH: A Portable Implementation of MPI, Mathematics and Computer Science Division, Argonne National Lab, http://www-unix.mcs. anl.gov/mpi/mpich/index.htmlGoogle Scholar
  19. [19]
    Rutgers Computational Grid, http://coewww.rutgers.edu/rcg/Google Scholar

Copyright information

© Kluwer Academic Publishers 2003

Authors and Affiliations

  • B. Chernyavsky
    • 1
  • E. Gallicchio
    • 2
  • D. Knight
    • 1
  • R. Levy
    • 2
  • A. Page
    • 3
  1. 1.Department Mechanical and Aerospace EngineeringRutgers –; The State University of New JerseyPiscatawayUSA
  2. 2.Department of ChemistryRutgers –; The State University of New JerseyPiscatawayUSA
  3. 3.Telecommunication DivisionRutgers –; The State University of New JerseyPiscatawayUSA

Personalised recommendations