TDP_SHELL: An Interoperability Framework for Resource Management Systems and Run-Time Monitoring Tools

  • Vicente Ivars
  • Ana Cortes
  • Miquel A. Senar
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4128)


Resource management systems and tool support are two important factors for efficiently developing applications in large clusters. On the one hand, management systems (in the form of batch queue systems) are responsible for all issues related to executing jobs on the existing machines. On the other hand, run-time tools (in the form of debuggers, tracers, performance analyzers, etc.) are used to guarantee the correctness and the efficiency of execution. Executing an application under the control of both a resource management system and a run-time tool is still a challenging problem in most cases. Using run-time tools might be difficult or even impossible in usual environments due to the restrictions imposed by resource managers. We propose TDP-Shell as a framework for providing the necessary mechanisms to enable and simplify using run-time tools under a specific resource management system. We have analyzed the essential interactions between common run-time tools and resource management systems and implemented a pilot TDP-Shell. The paper describes the main components of TDP-Shell and its use with some illustrative examples.


Resource Management Attribute Space Resource Management System Tuple Space Remote Machine 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Sterling, T., Messina, P., Pool, J.: Findings of the second Pasadena Workshop on system software and tools for high performance computing environments. Tech. Report 95-162, Center of Exc. in Space Data and Inform. Sciences, NASA (1995E)Google Scholar
  2. 2.
    Johnsen, S., Anshus, O.J., Bjørndalen, J.M., Bongo, L.A.: Survey of execution monitoring tools for computer clusters, Tech. Report, Univ. of Tromso (September 2003)Google Scholar
  3. 3.
    Mutka, M.J., Livny, M., Litzkow, M.W.: Condor – A Hunter of Idle Workstations. In: 8th Int’l Conf. on Distributed Systems, June 1988, San Francisco (1988)Google Scholar
  4. 4.
  5. 5.
    Wismuller, R., Trinitis, J., Ludwig, T.: OCM-A Monitoring System for Interiperable Tools. In: Proc. 2nd SIGMETRICS Symposium on Parallel and Distrubuted Tools, August 1998, Welches, USA (1998)Google Scholar
  6. 6.
    Ludwig, T., Wismüller, R.: OMIS 2.0 – A Universal Interface for Monitoring Systems. In: Bubak, M., Waśniewski, J., Dongarra, J. (eds.) PVM/MPI 1997. LNCS, vol. 1332, pp. 267–276. Springer, Heidelberg (1997)CrossRefGoogle Scholar
  7. 7.
    Rackl, G., Lindermeier, M., Rudorfer, M., Süss, B.: MIMO-An Infraestructure for Monitoring and Managing Distributed Middleware Environments. In: Proc. Middleware 2000, pp. 71–87 (2000)Google Scholar
  8. 8.
    Prodan, R., Kewley, J.M.: A Framework for an Interoperable Tool Environment. In: Bode, A., Ludwig, T., Karl, W.C., Wismüller, R. (eds.) Euro-Par 2000. LNCS, vol. 1900, pp. 65–69. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  9. 9.
    Miller, B., Cortes, A., Senar, M.A., Livny, M.: The Tool Daemon Protocol (TDP). In: Proc. SuperComputing (November 2003)Google Scholar
  10. 10.
    Etnus, L.L.C.: TotalView User’s Guide. Document version 6.0.0-1 (January 2003),
  11. 11.
    Carriero, N., Gelernter, D.: Linda in Context. Comm. of the ACM 32(4), 444–458 (1989)CrossRefGoogle Scholar
  12. 12.
    Miller, B.P., et al.: The Paradyn Parallel Performance Measurement Tools. IEEE Computer 28 11 (1995)Google Scholar
  13. 13.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Vicente Ivars
    • 1
  • Ana Cortes
    • 1
  • Miquel A. Senar
    • 1
  1. 1.Departament d’Arquitectura d’Ordinadors i Sistemes OperatiusUniversitat Autònoma de BarcelonaBarcelonaSpain

Personalised recommendations