Abstract
This paper describes and evaluates Sprinkler, a reliable high-throughput broadcast facility for geographically dispersed datacenters. For scaling cloud services, datacenters use caching throughout their infrastructure. Sprinkler can be used to broadcast update events that invalidate cache entries. The number of recipients can scale to many thousands in such scenarios. The Sprinkler infrastructure consists of two layers: one layer to disseminate events among datacenters, and a second layer to disseminate events among machines within a datacenter. A novel garbage collection interface is introduced to save storage space and network bandwidth. The first layer is evaluated using an implementation deployed on Emulab. For the second layer, involving thousands of nodes, we use a discrete event simulation. The effect of garbage collection is analyzed using simulation. The evaluation shows that Sprinkler can disseminate millions of events per second throughout a large cloud infrastructure, and garbage collection is effective in workloads like cache invalidation.
Chapter PDF
Similar content being viewed by others
References
Emulab, http://www.emulab.net
IBM WebSphere MQ, http://www.ibm.com/WebSphere-MQ
PRObE: Parallel Reconf. Observational Env., http://newmexicoconsortium.org/probe
Apache HedWig (2010), https://cwiki.apache.org/ZOOKEEPER/hedwig.html
Allavena, A., Demers, A., Hopcroft, J.E.: Correctness of a gossip based membership protocol. In: Proc. of the 24th ACM Symp. on Principles of Distributed Computing, pp. 292–301 (2005)
Altherr, M., Erzberger, M., Maffeis, S.: iBus - a software bus middleware for the Java platform. In: Proceedings of the Workshop on Reliable Middleware Systems, pp. 43–53 (1999)
Birman, K., Hayden, M., Ozkasap, O., Xiao, Z., Budiu, M., Minsky, Y.: Bimodal Multicast. ACM Transactions on Computer Systems 17(2), 41–88 (1999)
Birman, K.P., Joseph, T.A.: Exploiting virtual synchrony in distributed systems. In: Proc. of the 11th ACM Symp. on Operating Systems Principles (1987)
Bozdog, A., van Renesse, R., Dumitriu, D.: Selectcast – a scalable and self-repairing multicast overlay routing facility. In: First ACM Workshop on Survivable and Self-Regenerative Systems, Fairfax, VA (October 2003)
Castro, M., Druschel, P., Kermarrec, A., Rowstron, A.: SCRIBE: A large-scale and decentralized application-level multicast infrastructure. IEEE Journal on Selected Areas in Communications (JSAC) 20(8) (2002)
Castro, M., Druschel, P., Kermarrec, A.-M., Nandi, A., Rowstron, A., Singh, A.: SplitStream: high-bandwidth multicast in cooperative environments. In: Proc. of the 19th ACM Symp. on Operating Systems Principles, pp. 298–313 (2003)
Curry, E.: Message-Oriented Middleware. In: Mahmoud, Q.H. (ed.) Middleware for Communications, Chichester, UK. John Wiley and Sons, Ltd. (2005)
Demers, A., Greene, D., Hauser, C., Irish, W., Larson, J., Shenker, S., Sturgis, H., Swinehart, D., Terry, D.: Epidemic algorithms for replicated database maintenance. In: Proc. of the 6th ACM Symp. on Principles of Distributed Computing, pp. 1–12 (1987)
Eugster, P.T., Felber, P.A., Guerraoui, R., Kermarrec, A.-M.: The many faces of publish/subscribe. ACM Comput. Surv. 35(2), 114–131 (2003)
Eugster, P.T., Guerraoui, R., Handurukande, S.B., Kouznetsov, P., Kermarrec, A.-M.: Lightweight probabilistic broadcast. ACM Trans. Comput. Syst. 21(4), 341–374 (2003)
Ganesh, A.J., Kermarrec, A.-M., Massoulié, L.: Peer-to-peer membership management for gossip-based protocols. IEEE Trans. Comput. 52(2), 139–149 (2003)
Hunt, P., Konar, M., Junqueira, F.P., Reed, B.: Zookeeper: wait-free coordination for internet-scale systems. In: Proceedings of the 2010 USENIX Annual Technical Conference, pp. 145–158 (2010)
Jannotti, J., Gifford, D., Johnson, K., Kaashoek, M., O’Toole, J.W.: Overcast: Reliable multicasting with an overlay network. In: Proc. of the 4th Symp. on Operating Systems Design and Implementation (October 2000)
Kostić, D., Rodriguez, A., Albrecht, J., Vahdat, A.: Bullet: high bandwidth data dissemination using an overlay mesh. In: Proc. of the 19th ACM Symp. on Operating Systems Principles, pp. 282–297 (2003)
Kreps, J., Narkhede, N., Rao, J.: Kafka: a distributed messaging system for log processing. In: 6th International Workshop on Networking Meets Databases, NetDB 2011 (2011)
Nishtala, R., Fugal, H., Grimm, S., Kwiatkowski, M., Lee, H., Li, H.C., McElroy, R., Paleczny, M., Peek, D., Saab, P., Stafford, D., Tung, T., Venkataramani, V.: Scaling Memcache at Facebook. In: Proc. of the 10th Symp. on Networked Systems Design and Implementation, Lombard, IL (April 2013)
Oki, B.M., Pfluegl, M., Siegel, A., Skeen, D.: The Information Bus—an architecture for extensible distributed systems. In: Proc. of the 14th ACM Symp. on Operating Systems Principles, Asheville, NC, pp. 58–68 (December 1993)
Pai, V., Kumar, K., Tamilmani, K., Sambamurthy, V., Mohr, A.E.: Chainsaw: Eliminating trees from overlay multicast. In: van Renesse, R. (ed.) IPTPS 2005. LNCS, vol. 3640, pp. 127–140. Springer, Heidelberg (2005)
Powell, D.: Group communication. Commun. ACM 39(4), 50–53 (1996)
Shen, K.: Structure management for scalable overlay service construction. In: Proc. of the 1st Symp. on Networked Systems Design and Impl., pp. 281–294 (2004)
Van Renesse, R., Birman, K.P., Vogels, W.: Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining. ACM Trans. Comput. Syst. 21(2), 164–206 (2003)
van Renesse, R., Schneider, F.B.: Chain Replication for supporting high throughput and availability. In: Proc. of the 6th Symp. on Operating Systems Design and Implementation (December 2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 IFIP International Federation for Information Processing
About this paper
Cite this paper
Geng, H., van Renesse, R. (2013). Sprinkler — Reliable Broadcast for Geographically Dispersed Datacenters. In: Eyers, D., Schwan, K. (eds) Middleware 2013. Middleware 2013. Lecture Notes in Computer Science, vol 8275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45065-5_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-45065-5_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45064-8
Online ISBN: 978-3-642-45065-5
eBook Packages: Computer ScienceComputer Science (R0)