Intelligent Dependability Services for Overlay Networks

  • Barry Porter
  • Geoff Coulson
  • Daniel Hughes
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4025)


Application-level overlays have emerged as a useful means of offering network services that are not supported by the underlying physical network. Most overlays employ proprietary dependability mechanisms to render them more resilient to node failure; but the use of proprietary approaches leads to duplication of effort during development and adds design complexity. In this paper we propose generic dependability services which simplify the design of overlays. Our services are fully decentralized and are configurable to take advantage of current network conditions, which can enable us to make better repairs following failures.


Overlay Network Failure Detection Node Failure Recovery Service Dependability Service 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Rowstron, A., Druschel, P.: Pastry: Scalable, decentralized object location, and routing for large-scale peer-to-peer systems. In: Guerraoui, R. (ed.) Middleware 2001. LNCS, vol. 2218, p. 329. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  2. 2.
    Castro, M., Druschel, P., Kermarrec, A.M., Rowstron, A.: SCRIBE: A large-scale and decentralized application-level multicast infrastructure. IEEE Journal on Selected Areas in communications, JSAC (2002)Google Scholar
  3. 3.
    Zhao, B.Y., Kubiatowicz, J.D., Joseph, A.D.: Tapestry: An infrastructure for fault-tolerant wide-area location and routing. Technical Report UCB/CSD-01-1141, UC Berkeley (2001)Google Scholar
  4. 4.
    Pendarakis, D., Shi, S., Verma, D., Waldvogel, M.: ALMI: An application level multicast infrastructure. In: 3rd USNIX Symposium on Internet Technologies and Systems (USITS 2001), San Francisco, CA, USA, pp. 49–60 (2001)Google Scholar
  5. 5.
    Chawathe, Y., McCanne, S., Brewer, E.A.: RMX: Reliable multicast for heterogeneous networks. In: INFOCOM, Tel Aviv, Israel, pp. 795–804. IEEE, Los Alamitos (2000)Google Scholar
  6. 6.
    Clarke, I., Sandberg, O., Wiley, B., Hong, T.W.: Freenet: A distributed anonymous information storage and retrieval system. In: Federrath, H. (ed.) Designing Privacy Enhancing Technologies. LNCS, vol. 2009, p. 46. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  7. 7.
    Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A scalable peer-to-peer lookup service for internet applications. In: Proceedings of the 2001 conference on applications, technologies, architectures, and protocols for computer communications, pp. 149–160. ACM Press, New York (2001)Google Scholar
  8. 8.
    Jannotti, J., Gifford, D.K., Johnson, K.L., Kaashoek, M.F., O’Toole Jr., J.W.: Overcast: Reliable multicasting with an overlay network. In: Proceedings of the Fourth Symposium on Operating System Design and Implementation (OSDI), pp. 197–212 (2000)Google Scholar
  9. 9.
    Touch, J.: Dynamic internet overlay deployment and management using the x-bone. In: ICNP 2000: Proceedings of the 2000 International Conference on Network Protocols, Washington, DC, USA, p. 59. IEEE Computer Society, Los Alamitos (2000)CrossRefGoogle Scholar
  10. 10.
    Grace, P., Coulson, G., Blair, G., Mathy, L., Yeung, W.K., Cai, W., Duce, D., Cooper, C.: GRIDKIT: Pluggable overlay networks for grid computing. In: DOA 2004: Proceedings of Distributed Objects and Applications, Cyprus (2004)Google Scholar
  11. 11.
    Li, B., Guo, J., Wang, M.: iOverlay: A lightweight middleware infrastructure for overlay application implementations. In: Proceedings of IFIP/ACM/USENIX Middleware, Toronto, Canada (2004)Google Scholar
  12. 12.
    Pallickara, S., Fox, G.: NaradaBrokering: A distributed middleware framework and architecture for enabling durable peer-to-peer grids. In: Endler, M., Schmidt, D.C. (eds.) Middleware 2003. LNCS, vol. 2672, pp. 41–61. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  13. 13.
    Ratnasamy, S., Francis, P., Handley, M., Karp, R., Shenker, S.: A scalable content addressable network. Technical Report TR-00-010, UC Berkeley, Berkeley, CA (2000)Google Scholar
  14. 14.
    Mathy, L., Canonico, R., Hutchison, D.: An overlay tree building control protocol. In: Crowcroft, J., Hofmann, M. (eds.) NGC 2001. LNCS, vol. 2233, p. 76. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  15. 15.
    Yang, M., Fei, Z.: A proactive approach to reconstructing overlay multicast trees. In: IEEE INFOCOM, Hong Kong (2004)Google Scholar
  16. 16.
  17. 17.
    Yang, B., Garcia-Molina, H.: Designing a super-peer network. In: Proceedings of the 19th International Conference on Data Engineering, Bangalore, India (2003)Google Scholar
  18. 18.
    Renesse, R.V., Minsky, Y., Hayden, M.: A gossip-style failure detection service. Technical Report TR98-1687, Cornell University (1998)Google Scholar
  19. 19.
    Porter, B., Taïani, F., Coulson, G.: Generalizing repair for overlay networks. Technical Report PTC–06–01, Lancaster University (2006)Google Scholar
  20. 20.
    Marzullo, K., Cooper, R., Wood, M.D., Birman, K.P.: Tools for distributed application management. IEEE Computer 24(8), 42–51 (1991)CrossRefGoogle Scholar
  21. 21.
    Bagchi, S., Whisnant, K., Kalbarczyk, Z., Iyer, R.K.: The chameleon infrastructure for adaptive, software implemented fault tolerance. In: Symposium on Reliable Distributed Systems, pp. 261–267 (1998)Google Scholar
  22. 22.
    Andersen, D.G., Balakrishnan, H., Kaashoek, M.F., Morris, R.: Resilient overlay networks. In: Symposium on Operating Systems Principles, pp. 131–145 (2001)Google Scholar
  23. 23.
    Chu, Y.H., Rao, S.G., Zhang, H.: A case for end system multicast. In: Measurement and Modeling of Computer Systems, pp. 1–12 (2000)Google Scholar
  24. 24.
    Ganek, A., Corbi, T.: The dawning of the autonomic computing era. IBM Systems Journal 42(1), 5–19 (2003)CrossRefGoogle Scholar

Copyright information

© IFIP International Federation for Information Processing 2006

Authors and Affiliations

  • Barry Porter
    • 1
  • Geoff Coulson
    • 1
  • Daniel Hughes
    • 1
  1. 1.Computing DepartmentLancaster UniversityLancasterUK

Personalised recommendations