Software Architecture for Processing Clusters Based on I2O

Cluster Computing

Abstract

Mainstream computing equipment and the advent of affordable multi-Gigabit communication technology permit us to address data acquisition and processing problems with clusters of commercial off-the-shelf (COTS) machinery. Such networks typically contain heterogeneous platforms, real-time partitions and even custom devices. Vital overall system requirements are high efficiency and flexibility. In preceding projects we found it difficult to meet both requirements at once. Intelligent I/O (I2O) is an industry specification that defines a uniform messaging format and execution environment for hardware- and operating-system-independent device drivers in systems with processor-based communication equipment. Mapping this concept to a distributed computing environment and encapsulating the details of the specification in an application-programming framework allow us to provide architectural support for (i) efficient and (ii) extensible cluster operation. This paper presents our view of applying I2O to high-performance clusters. We demonstrate the feasibility of this approach and report on the efficiency of our XDAQ software framework for distributed data acquisition systems.

Cite this article

Gutleber, J., Orsini, L. Software Architecture for Processing Clusters Based on I2O. Cluster Computing 5, 55–64 (2002). https://doi.org/10.1023/A:1012744721976

  • DOI: https://doi.org/10.1023/A:1012744721976
