Abstract
We present tools for recording and analysing kernel events on Linux clusters. The tools provide cluster-wide event logging at system clock accuracy. We demonstrate the usefulness of our tools by verifying an implementation of a simple remote scheduling feature and present interesting results regarding schedule responsiveness. With our GUI-based Java application, data recorded on multiple hosts is integrated for visualization. These tools can be used for analysis of cluster schedulers such as gang schedulers, cooperative cacheing, network RAM drivers, or parallel file systems.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Harold W. Cain, Barton P. Miller, and Brian J.N. Wylie. A callgraph-based search strategy for automated performance diagnosis. In Proc. of Euro-Par 2000 Parallel Processing, pages 108–122, 2000.
Jeremy Elson, Lewis Girod, and Deborah Estrin. Fine-grained network time synchronization using reference broadcasts. In Proc. of the Fifth Symposium on Operating Systems Design and Implementation (OSDI 2002), Boston, MA., December 2002.
Thomas Fahringer and Clóvis Seragiotto Júnior. Modeling and detecting performance problems for distributed and parallel programs with JavaPSL. In Proceedings of the 2001 ACM/IEEE conference on Supercomputing, November 2001.
W. Feng, M. Gardner, and J. Hay. The magnet toolkit: Design, evaluation, and implementation. Journal of Supercomputing, 23(1):67–79, August 2002.
Message Passing Interface Forum. MPI: A message-passing interface standard, June 1995. http://www.mpi-forum.org/docs/mpi-11.ps.
M. Gerndt, A. Schmidt, M. Schulz, and R. Wismüller. Performance analysis for teraflop computers — a distributed automatic approach. In Euromicro Workshop on Parallel, Distributed, and Network-based Processing, pages 23–30, Gran Canaria, January 2002.
Susan L. Graham, Peter B. Kessler, and Marshall K. McKusick. Gprof: A call graph execution profiler. In Proceedings of the SIGPLAN’ 82 symposium on Compiler construction, pages 120–126, Boston, Massachusetts, United States, June 1982.
Stuart Halloway. Using HPROF to tune performance, January 2000. http://developer.java.sun.com/developer/TechTips/2000/tt0124.html#tip2.
Jesús Labarta, Sergi Girona, Vincent Pillet, Toni Cortes, and Luis Gregoris. DiP: A parallel program development environment. In Proc. of the 2nd International Euro-Par Conference, 1996.
Oak Ridge National Laboratory. PVM: Parallel virtual machine. http://www.epm.ornl.gov/pvm/pvm_home.html.
Bernd Mohr, Allen D. Malony, Sameer Shende, and Felix Wolf. Towards a performance tool interface for OpenMP: An approach based on directive rewriting. In Proc. of Third European Workshop on OpenMPI (EWOMP 2001), Barcelona, September 2001.
John Postel. Request for comments RFC 792: Internet control message protocol, September 1981. http://www.ietf.org/rfc/rfc0792.txt.
Ariel Tamches and Barton P. Miller. Fine-grained dynamic instrumentation of commodity operating system kernels. In Proc. of Operating Systems Design and Implementation (OSDI 1999), 1999.
Hong-Linh Truong and Thomas Fahringer. SCALEA: A performance analysis tool for distributed and parallel programs. In Proc. of the 8th International Euro-Par Conference, LNCS 2400, Paderborn, Germany, August 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Reuter, J., Tichy, W.F. (2003). Logging Kernel Events on Clusters. In: Sloot, P.M.A., Abramson, D., Bogdanov, A.V., Gorbachev, Y.E., Dongarra, J.J., Zomaya, A.Y. (eds) Computational Science — ICCS 2003. ICCS 2003. Lecture Notes in Computer Science, vol 2660. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44864-0_7
Download citation
DOI: https://doi.org/10.1007/3-540-44864-0_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40197-1
Online ISBN: 978-3-540-44864-8
eBook Packages: Springer Book Archive