Optimizing Multi-deployment on Clouds by Means of Self-adaptive Prefetching

  • Bogdan Nicolae
  • Franck Cappello
  • Gabriel Antoniu
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6852)


With Infrastructure-as-a-Service (IaaS) cloud economics getting increasingly complex and dynamic, resource costs can vary greatly over short periods of time. Therefore, a critical issue is the ability to deploy, boot and terminate VMs very quickly, which enables cloud users to exploit elasticity to find the optimal trade-off between the computational needs (number of resources, usage time) and budget constraints. This paper proposes an adaptive prefetching mechanism aiming to reduce the time required to simultaneously boot a large number of VM instances on clouds from the same initial VM image (multi-deployment). Our proposal does not require any foreknowledge of the exact access pattern. It dynamically adapts to it at run time, enabling the slower instances to learn from the experience of the faster ones. Since all booting instances typically access only a small part of the virtual image along almost the same pattern, the required data can be pre-fetched in the background. Large scale experiments under concurrency on hundreds of nodes show that introducing such a prefetching mechanism can achieve a speed-up of up to 35% when compared to simple on-demand fetching.


Cloud Computing Access Pattern Local Disk IaaS Cloud Virtual Machine Image 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Armbrust, M., Fox, A., Griffith, R., Joseph, A., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., Zaharia, M.: A view of cloud computing. Commun. ACM 53, 50–58 (2010)CrossRefGoogle Scholar
  2. 2.
    Buyya, R., Yeo, C.S., Venugopal, S.: Market-oriented cloud computing: Vision, hype, and reality for delivering it services as computing utilities. In: HPCC 2008: Proceedings of the 2008 10th IEEE International Conference on High Performance Computing and Communications, pp. 5–13. IEEE Computer Society, Washington, DC, USA (2008)Google Scholar
  3. 3.
    Amazon Elastic Compute Cloud (EC2),
  4. 4.
  5. 5.
  6. 6.
    Wartel, R., Cass, T., Moreira, B., Roche, E., Manuel Guijarro, S.G., Schwickerath, U.: Image distribution mechanisms in large scale cloud providers. In: CloudCom 2010: Proc. 2nd IEEE International Conference on Cloud Computing Technology and Science, Indianapolis, USA (2010) (in press)Google Scholar
  7. 7.
    Andrzejak, A., Kondo, D., Yi, S.: Decision model for cloud computing under sla constraints. In: Proceedings of the 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, MASCOTS 2010, pp. 257–266. IEEE Computer Society, Washington, DC, USA (2010)CrossRefGoogle Scholar
  8. 8.
    Amazon elastic block storage (ebs),
  9. 9.
    Nicolae, B.: BlobSeer: Towards efficient data storage management for large-scale, distributed systems. PhD thesis, University of Rennes 1 (November 2010)Google Scholar
  10. 10.
    Nicolae, B., Antoniu, G., Bougé, L., Moise, D., Carpen-Amarie, A.: Blobseer: Next-generation data management for large scale infrastructures. J. Parallel Distrib. Comput. 71, 169–184 (2011)CrossRefGoogle Scholar
  11. 11.
    Bolze, R., Cappello, F., Caron, E., Daydé, M., Desprez, F., Jeannot, E., Jégou, Y., Lanteri, S., Leduc, J., Melab, N., Mornet, G., Namyst, R., Primet, P., Quetier, B., Richard, O., Talbi, E.G., Touche, I.: Grid’5000: A large scale and highly reconfigurable experimental grid testbed. Int. J. High Perform. Comput. Appl. 20, 481–494 (2006)CrossRefGoogle Scholar
  12. 12.
    Nicolae, B., Bresnahan, J., Keahey, K., Antoniu, G.: Going Back and Forth: Efficient Multi-Deployment and Multi-Snapshotting on Clouds. In: HPDC 2011: The 20th International ACM Symposium on High-Performance Parallel and Distributed Computing, San José, CA United States (2011)Google Scholar
  13. 13.
    Gagné, M.: Cooking with linux: still searching for the ultimate linux distro? Linux J. 2007(161), 9 (2007)Google Scholar
  14. 14.
    Carns, P.H., Ligon, W.B., Ross, R.B., Thakur, R.: Pvfs: A parallel file system for Linux clusters. In: Proceedings of the 4th Annual Linux Showcase and Conference, Atlanta, GA, pp. 317–327. USENIX Association (2000)Google Scholar
  15. 15.
    Weil, S.A., Brandt, S.A., Miller, E.L., Long, D.D.E., Maltzahn, C.: Ceph: a scalable, high-performance distributed file system. In: Proceedings of the 7th symposium on Operating systems design and implementation, OSDI 2006, pp. 307–320. USENIX Association, Berkeley (2006)Google Scholar
  16. 16.
    Schmuck, F., Haskin, R.: Gpfs: A shared-disk file system for large computing clusters. In: Proceedings of the 1st USENIX Conference on File and Storage Technologies, FAST 2002, USENIX Association, Berkeley (2002)Google Scholar
  17. 17.
    Amazon Simple Storage Service (S3),
  18. 18.
    DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P., Vogels, W.: Dynamo: Amazon’s highly available key-value store. In: SOSP 2007: Proceedings of Twenty-First ACM SIGOPS Symposium on Operating Systems Principles, pp. 205–220. ACM, New York (2007)CrossRefGoogle Scholar
  19. 19.
    Zhu, Q., Gelenbe, E., Qiao, Y.: Adaptive prefetching algorithm in disk controllers. Perform. Eval. 65, 382–395 (2008)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Bogdan Nicolae
    • 1
  • Franck Cappello
    • 1
    • 2
  • Gabriel Antoniu
    • 3
  1. 1.INRIA SaclayFrance
  2. 2.University of Illinois at Urbana ChampaignUSA
  3. 3.INRIA Rennes Bretagne AtlantiqueFrance

Personalised recommendations