
HASSE: a tool for analyzing causal relationships in parallel and distributed systems

  • Peter Dauphin
  • Richard Hofmann
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 977)

Abstract

Because the causal relationships induced by communication between cooperating processes are crucial for understanding the dynamic behavior of parallel and distributed systems, tool support is needed to analyze them. This paper presents HASSE, a tool that graphically represents the event traces of cooperating processes according to a formal description of their causal interdependences. HASSE lets the user interactively zoom into and further analyze the displayed system activities. In particular, its ability to automatically determine and display the pre-area and post-area of selected events is an important aid in finding the cause of observed undesired behavior.
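The idea of a pre-area and a post-area can be illustrated with a small sketch. It is not taken from the paper and does not reflect the tool's actual implementation. It assumes that the pre-area of an event is the set of events that causally precede it, and the post-area the set of events that causally follow it, under a happened-before style ordering built from local process order plus send/receive pairs. All names (`build_causal_graph`, `pre_area`, `post_area`, the event labels) are hypothetical.

```python
from collections import defaultdict


def build_causal_graph(traces, messages):
    """Build successor and predecessor maps for a set of event traces.

    traces:   dict mapping a process name to its events in local order
    messages: list of (send_event, receive_event) pairs
    Both input formats are assumptions made for this sketch.
    """
    succ = defaultdict(set)
    pred = defaultdict(set)
    # Events of one process are ordered by their position in the trace.
    for events in traces.values():
        for earlier, later in zip(events, events[1:]):
            succ[earlier].add(later)
            pred[later].add(earlier)
    # A send event causally precedes the matching receive event.
    for send, receive in messages:
        succ[send].add(receive)
        pred[receive].add(send)
    return succ, pred


def reachable(start, edges):
    """Return every event reachable from start by following edges."""
    seen = set()
    stack = [start]
    while stack:
        node = stack.pop()
        for nxt in edges.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def pre_area(event, pred):
    """Events that may have influenced the given event (assumed meaning)."""
    return reachable(event, pred)


def post_area(event, succ):
    """Events that the given event may have influenced (assumed meaning)."""
    return reachable(event, succ)


# Two processes exchanging two messages: a1 -> b2 and b2 -> a3.
traces = {"P1": ["a1", "a2", "a3"], "P2": ["b1", "b2", "b3"]}
messages = [("a1", "b2"), ("b2", "a3")]
succ, pred = build_causal_graph(traces, messages)

before_b2 = pre_area("b2", pred)
after_b2 = post_area("b2", succ)
```

In this example, `a2` lies in neither area of `b2`: the two events are causally unrelated, so `a2` can be ruled out when searching for the cause of a problem observed at `b2`.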

Keywords

causal relationship, monitoring, parallel and distributed system, evaluation tool



Copyright information

© Springer-Verlag Berlin Heidelberg 1995

Authors and Affiliations

  • Peter Dauphin (1)
  • Richard Hofmann (1)
  1. University of Erlangen-Nürnberg, IMMD VII, Erlangen
