Skip to main content
Log in

Efficient Multiple Multicast on Heterogeneous Network of Workstations

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

In recent years, network of workstations/PCs (so called NOW) are becoming appealing vehicles for cost-effective parallel computing. Due to the commodity nature of workstations and networking equipment, LAN environments are gradually becoming heterogeneous. The diverse sources of heterogeneity in NOW systems pose a challenge on the design of efficient communication algorithms for this class of systems. In this paper, we propose efficient algorithms for multiple multicast on heterogeneous NOW systems, focusing on heterogeneity in processing speeds of workstations/PCs. Multiple multicast is an important operation in many scientific and industrial applications. Multicast on heterogeneous systems has not been investigated until recently. Our work distinguishes itself from others in two aspects: (1) In contrast to the blocking communication model used in prior works, we model communication in a heterogeneous cluster more accurately by a non-blocking communication model, and design multicast algorithms that can fully take advantage of non-blocking communication. (2) While prior works focus on single multicast problem, we propose efficient algorithms for general, multiple multicast (in which single multicast is a special case) on heterogeneous NOW systems. To our knowledge, our work is the earliest effort that addresses multiple multicast for heterogeneous NOW systems. These algorithms are evaluated using a network simulator for heterogeneous NOW systems. Our experimental results on a system of up to 64 nodes show that some of the algorithms outperform others in many cases. The best algorithm achieves completion time that is within 2.5 times of the lower bound.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. H. R. Arabnia and S. M. Bhandarkar. Parallel stereocorelation on a reconfigurable multi-ring network. The Journal of Supercomputing, 10(3):243–270, 1996.

    Google Scholar 

  2. J. V. Bala, J. Bruck, R. Cypher, P. Elustando, A. Ho, C.-T. Ho, S. Kipnis, and M. Snir. CCL: A portable and tunable collective communication library for scalable parallel computers. Journal of Parallel and Distributed Computing, 6(2):154–164, 1995.

    Google Scholar 

  3. M. Banikazemi, V. Moorthy, and D. K. Panda. Efficient collective communication on heterogeneous networks of workstations. In Proceedings of International Conference on Parallel Processing, pp. 460–467, 1998.

  4. M. Banikazemi, J. Sampathkumar, S. Prabhu, D. K. Panda, and P. Sadayappan. Communication modeling of heterogeneous networks of workstations for performance characterization of collective operations. In Proceedings of the Heterogenenous Computing Workshop, 1999.

  5. S. M. Bhandarkar and H. R. Arabnia. The REFINE multiprocessor—theoretical properties and algorithms. Parallel Computing, 21(11):1783–1806, 1995.

    Google Scholar 

  6. N. J. Boden, D. Cohen, R. F. Felderman, A. E. Kulawik, C. L. Seitz, J. Seizovic, and W. Su. Myrinet—a gigabit per second local area network. IEEE Micro, pp. 29–36, February 1995.

  7. J. Bruck, R. Cypher, P. Elustando, A. Ho, C.-T. Ho, V. Bala, S. Kipnis, and M. Snir. Efficient message passing interface (MPI) for parallel computing on clusters of workstations. Journal of Parallel and Distributed Computing, 40(1):19–34, January 1997.

    Google Scholar 

  8. A. Chien, S. Pakin, M. Lauria, M. Badanan, K. Hane, and L. Giannini. High performance virtual machines (HPVM): Clusters with supercomputing and performance. In Proceedings of 8th SIAM Conference on Parallel Processing for Scientific Computing, 1996.

  9. D. Culler, A. Arpaci-Dusseau, R. Arpaci-Dusseau, B. Chun, S. Lumetta, A. Mainwaring, R. Martin, C. Yoshikawa, and F. Wang. Parallel computing on the berkeley now. In 9th Joint Symposium on Parallel Processing, 1997.

  10. M. D. Schroeder, A. D. Birrell, M. Burrows, H. Murray, R. M. Needham, T. L. Rodeheffer, E. H. Satterthwaite, and C. P. Thacker. Autonet: A high-speed, self-configuring local area network using point-to-point links. Technical Report SRC research report 59, DEC, April 1990.

  11. K.-P. Fan and C.-T. King. Efficient multicast on wormhole switch-based irregular networks of workstations and processor clusters. In Proceedings of Parallel and Distributed Computing Symposium (PDCS), 1997.

  12. W. Gropp and E. Lusk. User's guide for mpich, a portable implementation of mpi. Technical Report ANL/MCS-TM-ANL-96/6, Mathematics and Computer Science Division, Argonne National Laboratory, 1996.

  13. S. Hambrusch, F. Hameed, and A. Khokhar. Communication operations on coarse-grained mesh architectures. Parallel Computing, 21:731–751, 1995.

    Google Scholar 

  14. R. Horst. Servernet deadlock avoidance and fractahedral topologies. In Proceedings of the International Parallel Processing Symposium, pp. 274–280, April 1996.

  15. C. Huang, Y. Huang, and P. K. McKinley. A thread-based interface for collective communication on ATM networks. In Proceedings of ICDCS, pp. 254–261, 1995.

  16. Y. Hwang and P. K. McKinley. Efficient collective operations with atm network interface support. In Proceedings of ICPP, 1996.

  17. M. Jacunski, P. Sadayappan, and D. K. Panda. All-to-all broadcast on switch-based clusters of workstations. In Proceedings of International Parallel and Distributed Processing Symposium (IPDPS), 1999.

  18. R. Kesavan, K. Bondalapati, and D. K. Panda. Multicast on irregular switch-based networks with wormhole routing. In International Symposium on High Performance Computer Architecture, February 1997.

  19. R. Kesavan and D. K. Panda. Multiple multicast with minimized node contention on wormhole k-ary n-cube networks. IEEE Transacations on Parallel and Distributed Systems, 10(4):371–393, 1999.

    Google Scholar 

  20. R. Kielmann, R. F. H. Hofman, H. E. Bal, A. Plaat, and R. A. F. Bhoedjang. MAGPIE: MPI's collective communication operations for clustered wide area systems. In PPoPP'99, Symposium on Principles and Practice of Parallel Programming, May 1999.

  21. M. Lauria. High performance MPI implementation on a network of workstations. Technical Report Master Thesis, Department of Computer Science, University of Illinois at Urbana-Champaign, 1996.

  22. W. Liu, C.-L. Wang, and V. K. Prasanna. Portable and scalable algorithms for irregular all-to-all communication. In Proceedings of the 16th ICDCS, pp. 428–435, 1996.

  23. B. Lowekamp and A. Beguelin. ECO: Efficient collective operations for communication on heterogeneous networks. In Proceedings of International Parallel Processing Symposium, pp. 399–405, 1996.

  24. P. K. McKinley, H. Xu, A. Esfahanian, and L. M. Ni. Unicast-based multicast communication in wormhole-routed direct networks. IEEE Transactions on Parallel and Distributed Systems, (5):1252–1265, 1994.

  25. C. S. Raghavendra, V. K. Prasanna, and P. B. Bhat. Efficient collective communication in distributed heterogeneous systems. In Proceedings of International Conference on Distributed Computing Systems (ICDCS), 1999.

  26. V. K. Prasanna, P. B. Bhat, and C. S. Raghavendra. Adaptive communication algorithms for distributed heterogeneous systems. In Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing (HPDC), 1998.

  27. J. Y. L. Park, H. A. Choi, N. Nupairoj, and L. M. Ni. Construction of optimal multicast trees based on the parameterized communication model. In Proceedings of the International Conference on Parallel Processing, 1996.

  28. S. Ranka, R. Shankar, and K. Alsabti. Many-to-many communication with bounded traffic. In Symposium on Frontiers of Massively Parallel Computation, 1995.

  29. H. Tezuka, A. Hori, Y. Ishikawa, and M. Sato. PM: An operating system coordinated highperformance communication library. In Proceedings of High Performance Computing and Networking (LNCS vol. 1225), 1997.

  30. K. Verstoep, K. Langendoen, and H. Bal. Efficient reliable multicast on myrinet. In Proceedings of International Conference Parallel Processing, volume III, pp. 156–165, August 1996.

    Google Scholar 

  31. S. Y. Wang, Y. C. Tseng, C. S. Sheu, and J. P. Sheu. Balancing traffic load for multi-node multicast in a wormhole 2d torus/mesh. In International Parallel and Distributed Processing Symposium, pp. 611–616, 2000.

  32. Z. Zhou and Z. Tang. Optimal algorithms to reduce contention in multiple multicast. Technical report, Institute of Computing Technology, Chinese Academy of Sciences, 2000.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jan-jan Wu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, Jj., Yeh, Sh. & Liu, P. Efficient Multiple Multicast on Heterogeneous Network of Workstations. The Journal of Supercomputing 29, 59–88 (2004). https://doi.org/10.1023/B:SUPE.0000022573.11074.00

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/B:SUPE.0000022573.11074.00

Navigation