Abstract
More and more, users store their data in the cloud. While the content is then retrieved, the retrieval has to respect quality of service (QoS) constraints. In order to reduce transfer latency, data is replicated. The idea is make data close to users and to take advantage of providers home storage. However to minimize the cost of their platform, cloud providers need to limit the amount of storage usage. This is still more crucial for big contents.
This problem is hard, the distribution of the popularity among the stored pieces of data is highly non-uniform: several pieces of data will never be accessed while others may be retrieved thousands of times. Thus, the trade-off between storage usage and QoS of data retrieval has to take into account the data popularity.
This paper presents our architecture gathering several storage domains composed of small-sized datacenters and edge devices; and it shows the importance of adapting the replication degree to data popularity.
Our simulations, using realistic workloads, show that a simple cache mechanism provides a eight-fold decrease in the number of SLA violations, requires up to 10 times less of storage capacity for replicas, and reduces aggregate bandwidth and number of flows by half.
Chapter PDF
Similar content being viewed by others
Keywords
- Service Level Agreement
- Replication Scheme
- Storage Element
- Edge Network
- Service Level Agreement Violation
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Bonvin, N., Papaioannou, T.G., Aberer, K.: A self-organized, fault-tolerant and scalable replication scheme for cloud storage. In: ACM (ed.) ACM Symposium on Cloud Computing 2010, SOCC 2010, Indianapolis, USA (June 2010)
Cisco visual networking index: Forecast and methodology (2010-2015), http://www.cisco.com (June 2011)
Figueiredo, F., Benevenuto, F., Almeida, J.M.: The tube over time: characterizing popularity growth of youtube videos. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 745–754. ACM, New York (2011)
Ghemawat, S., Gobioff, H., Leung, S.-T.: The google file system. In: SOSP 2003: Proceedings of the 9th ACM Symposium on Operating Systems Principles, pp. 29–43. ACM Press, New York (2003)
Nygren, E., Sitaraman, R.K., Sun, J.: The akamai network: a platform for high-performance internet applications. SIGOPS Oper. Syst. Rev. 44(3), 2–19 (2010)
Pathan, M., Buyya, R.: A Taxonomy of CDNs. In: Buyya, R., Pathan, M., Vakali, A. (eds.) Content Delivery Networks. LNEE, vol. 9, pp. 33–77. Springer, Heidelberg (2008)
Shen, H.: An efficient and adaptive decentralized file replication algorithm in p2p file sharing systems. IEEE Transactions on Parallel and Distributed Systems 21, 827–840 (2010)
Enabling digital media content delivery: Emerging opportunities for network service providers (March 2010), http://www.velocix.com/formwp.php
Weil, S., Brandt, S.A., Miller, E.L., Long, D.D.E., Maltzahn, C.: Ceph: A scalable, high-performance distributed file system. In: Proceedings of the 7th Conference on Operating Systems Design and Implementation, OSDI 2006 (November 2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Silvestre, G., Monnet, S., Krishnaswamy, R., Sens, P. (2013). Caju: A Content Distribution System for Edge Networks. In: Caragiannis, I., et al. Euro-Par 2012: Parallel Processing Workshops. Euro-Par 2012. Lecture Notes in Computer Science, vol 7640. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36949-0_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-36949-0_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36948-3
Online ISBN: 978-3-642-36949-0
eBook Packages: Computer ScienceComputer Science (R0)