Abstract
With continuously growing numbers of nodes and CPU cores cluster scalability is becoming a more and more significant problem in high performance computing and several approaches are taken to improve it. On the hardware level, operating system level and in the communication model new approaches have been developed. Specialization of cluster nodes, introduction of light-weight kernels and new communication abstraction are all steps to increase the efficiency of compute clusters. Extending the light-weight kernel (LWK), Kitten, with RDMA capable Infiniband network interface support and developing Portals on top of that interface brings improvements to the current compute model. Furthermore, in preparation for running parallel jobs on the light-weight kernel a new Open MPI component was added as an alternative to the currently available OOB/TCP component. This component eliminates the need to have a TCP/IP software stack available on the compute nodes. It is based on the Sandia Portals 3.3 network abstraction and message passing interface.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
S. Kelly and R. Brightwell: Software architecture of the lightweight kernel, Catamount. In 2005 Cray Users Group, May Annual Technical Conference. 2005.
J. Liedtke: Toward real microkernels. Published in Communication of the ACM (CACM), vol 39/9, pp. 70–77, September 1996.
J. Lange, K. Pedretti, T. Hudson, P. Dinda, Z. Cui, L. Xia, P. Bridges, A. Gocke, S. Jaconette, M. Levenhagen, and R. Brightwell: Palacios and Kitten: New High Performance Operating Systems for Scalable Virtualized and Native Supercomputing. In IPDPS ’10: Proceedings of the 24th IEEE International Parallel and Distributed Processing Symposium (19–23 April 2010).
Erich Focht, Thomas Großmann, and Danny Sternkopf: I/O Forwarding on NEC SX-9. High Performance Computing on Vector Systems 2010.
Kitten light weight kernel project page, http://code.google.com/p/kitten.
E. Focht, J. Močnik, F. Unger, D. Sternkopf, M. Novak, and T. Grossmann: The SX-Linux Project: A Progress Report. High Performance Computing on Vector Systems 2009, pp. 79–96. DOI: 10.1007/978-3-642-03913-3_8.
Sandia Portals, http://www.cs.sandia.gov/Portals/.
Rolf Riesen, Ron Brightwell, Kevin Pedretti, Brian Barrett, Keith Underwood, Arthur B. Maccabe, and Trammell Hudson: The Portals 4.0 Message Passing Interface. Technical Report, Sandia National Laboratories, April 2008.
Rolf Riesen, Ron Brightwell, Kevin Pedretti, Arthur B. Maccabe, and Trammell Hudson: The Portals 3.3 Message Passing Interface. Technical Report, Sandia National Laboratories, April 2006.
D. Bonachea, P. Hargrove, M. Welcome, and K. Yelick: Porting GASNet to Portals: Partitioned Global Address Space (PGAS) Language Support for the Cray XT, CUG 2009.
Parallel Virtual File System, Version 2, http://www.pvfs.org/.
Open MPI, http://www.open-mpi.org/.
M. Koop, T. Jones, and D. K. Panda: MVAPICH-Aptus: Scalable High-Performance Multi-Transport MPI over InfiniBand. IEEE International Parallel and Distributed Processing Symposium (IPDPS ’08), Miami, Florida, April 2008.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Focht, E., Močnik, J., Unger, F., Jeutter, A., Novak, M. (2010). Light-Weight Kernel with Portals. In: Resch, M., et al. High Performance Computing on Vector Systems 2010. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11851-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-11851-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11850-0
Online ISBN: 978-3-642-11851-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)