A Monitoring Sensor Management System for Grid Environments

Cluster Computing

Abstract

Large distributed systems such as Computational Grids require that a large amount of monitoring data be collected for a variety of tasks, such as fault detection, performance analysis, performance tuning, performance prediction, and scheduling. Ensuring that all necessary monitoring is turned on and that data is being collected can be a tedious and error-prone task. We have developed an agent-based system to automate the execution of monitoring sensors and the collection of event data.

Cite this article

Tierney, B., Crowley, B., Gunter, D. et al. A Monitoring Sensor Management System for Grid Environments. Cluster Computing 4, 19–28 (2001). https://doi.org/10.1023/A:1011408108941

  • DOI: https://doi.org/10.1023/A:1011408108941
