Systematic Mapping Study on Performance Scalability in Big Data on Cloud Using VM and Container

  • Cansu Gokhan
  • Ziya KarakayaEmail author
  • Ali Yazici
Conference paper
Part of the IFIP Advances in Information and Communication Technology book series (IFIPAICT, volume 475)


In recent years, big data and cloud computing have gained importance in IT and business. These two technologies are becoming complementing in a way that the former requires large amount of storage and computation power, which are the key enabler technologies of Big Data; the latter, cloud computing, brings the opportunity to scale on-demand computation power and provides massive quantities of storage space. Until recently, the only technique used in computation resource utilization was based on the hypervisor, which is used to create the virtual machine. Nowadays, another technique, which claims better resource utilization, called “container” is becoming popular. This technique is otherwise known as “lightweight virtualization” since it creates completely isolated virtual environments on top of underlying operating systems. The main objective of this study is to clarify the research area concerned with performance issues using VM and container in big data on cloud, and to give a direction for future research.


  1. 1.
    Mytilinis, I., Tsoumakos, D., Kantere, V., Nanos, A., Koziris, N.: I/O performance modeling for big data applications over cloud infrastructures. In: Proceeding of 2015 IEEE International Conference on Cloud Engineering, pp. 201–206 (2015)Google Scholar
  2. 2.
    Morabito, R., Kjllman, J., Komu, M.: Hypervisors vs. lightweight virtualization: a performance comparison. In: Proceeding of 2015 IEEE International Conference on Cloud Engineering, pp. 386–393 (2015)Google Scholar
  3. 3.
    Felter, W., Ferreira, A., Rajamony, R., Rubio, J.: An updated performance comparison of virtual machines and linux containers. In: 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). IEEE (2015)Google Scholar
  4. 4.
    Intel White Paper: Linux Containers Streamline Virtualization and Complement Hypervisor-Based Virtual Machines (2014).
  5. 5.
    Abaker, I., Hashem, T., Yaqoob, I., Anuar, N.B., Mokhtar, S., Gani, A., Khan, S.U.: The rise of ‘big data’ on cloud computing: review and open research issues. J. Inf. Syst. 47, 98–115 (2015)CrossRefGoogle Scholar
  6. 6.
    He, Y., Jiang, X., Wu, Z., Ye, K., Chen, Z.: Scalability analysis and improvement of hadoop virtual cluster with cost consideration. In: 2014 IEEE 7th International Conference on Proceeding of the Cloud Computing (CLOUD), pp. 594–601 (2014)Google Scholar
  7. 7.
    Yang, Y., Long, X., Dou, X., Wen, C.: Impacts of virtualization technologies on hadoop. In: 2013 Third International Conference Proceeding of Intelligent System Design and Engineering Applications (ISDEA), pp. 846–849 (2013)Google Scholar
  8. 8.
    Vasconcelos, P.R.M., de Araújo Freitas, G.A.: Performance analysis of hadoop MapReduce on an OpenNebula cloud with KVM and OpenVZ virtualizations. In: Proceeding 9th International Conference for Internet Technology and Secured Transactions, ICITST (2014)Google Scholar
  9. 9.
    Kai, P., Vakkalanka, S., Kuzniarz, L.: Guidelines for conducting systematic mapping studies in software engineering: an update. Inf. Softw. Technol. 64, 1–18 (2015)CrossRefGoogle Scholar

Copyright information

© IFIP International Federation for Information Processing 2016

Authors and Affiliations

  1. 1.Institute of Natural and Applied SciencesAtilim UniversityAnkaraTurkey
  2. 2.Faculty of EngineeringAtilim UniversityAnkaraTurkey

Personalised recommendations