Abstract
Cloud computing consists of a set of new technologies that permit the dynamic allocation of computational resources (storage, CPU, memory) when performing high demanding data analysis. In the modern world of information data, cloud computing can provide valuable solutions for the Big Data Analytics domain. The correct allocation of resources in a Big Data analysis problem can both increase performance and decrease cost. This article proposes a system architecture for allocating computational resources according to the problem demands in a cloud infrastructure. Workflows are utilized in order to coordinate the execution of complex data analysis pipelines.
Chapter PDF
Similar content being viewed by others
References
Gantz, J., Reinsel, D.: The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the far east. IDC iView: IDC Analyze the Future, 1–16 (2012)
Ji, C., Li, Y., Qiu, W., Awada, U., Li, K.: Big data processing in cloud computing environments. In: IEEE International Symposium on Parallel Architectures, Algorithms and Networks, ISPAN (2012)
Deelman, E., Gannon, D., Shields, M., Taylor, I.: Workflows and e-Science: An overview of workflow system features and capabilities. Future Generation Computer Systems 25(5), 528–540 (2009)
Da Xu, L.: Enterprise systems: state-of-the-art and future trends. IEEE Transactions on Industrial Informatics 7(4), 630–640 (2011)
Talia, D.: Clouds for Scalable Big Data Analytics. Computer 46(5), 98–101 (2013)
Kagadis, G.C., Kloukinas, C., Moore, K., Philbin, J., Papadimitroulas, P., Alexakos, C., Nagy, P.G., Visvikis, D., Hendee, W.R.: Cloud computing in medical imaging. Medical Physics 40(7), 070901 (2013)
Chen, H., Chiang, R.H., Storey, V.C.: Business Intelligence and Analytics: From Big Data to Big Impact. MIS Quarterly 36(4), 1165–1188 (2012)
Howe, D., et al.: Big data: The future of biocuration. Nature 455(7209), 47–50 (2008)
Demchenko, Y., Grosso, P., de Laat, C., Membrey, P.: Addressing big data issues in scientific data infrastructure. In: 2013 International Conference on Collaboration Technologies and Systems (CTS), pp. 48–55. IEEE, New York (2013)
Padhy, N., Mishra, D., Panigrahi, R.: The survey of data mining applications and feature scope. International Journal of Computer Science, Engineering and Information Technology 1.2(3), 43–58 (2012)
Iosup, A., Ostermann, S., Yigitbasi, M.N., Prodan, R., Fahringer, T., Epema, D.H.: Performance analysis of cloud computing services for many-tasks scientific computing. IEEE Transactions on Parallel and Distributed Systems 22(6), 931–945 (2011)
Mell, P., Grance, T.: The NIST definition of cloud computing. NIST Special Publication 800-145 (2011)
Baset, S.A.: Open source cloud technologies. In: 3rd ACM Symposium on Cloud Computing, p. 28. ACM (2012)
Williams, D.E.: Virtualization with Xen (tm): Including XenEnterprise, XenServer, and XenExpress. Syngress (2012)
Van Der Aalst, W., Van Hee, K.M.: Workflow management: models, methods, and systems. MIT press (2004)
Curcin, V., Ghanem, M.: Scientific workflow systems-can one size fit all? In: Biomedical Engineering Conference (CIBEC 2008), pp. 1–9. IEEE (2008)
Ko, R.K., Lee, S.S., Wah Lee, E.: Business process management (BPM) standards: a survey. Business Process Management Journal 15(5), 744–791 (2009)
Chinosi, M., Trombetta, A.: BPMN: An introduction to the standard. Computer Standards & Interfaces 34(1), 124–134 (2012)
Cambronero, M.E., Dı, G., Macià , H.: A Petri net approach for the design and analysis of Web Services Choreographies. The Journal of Logic and Algebraic Programming 78(5), 359–380 (2009)
Wolstencroft, K., et al.: The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud. Nucleic Acids Research 41(W1), W557–W561 (2013)
Kambatla, K., Kollias, G., Kumar, V., Grama, A.: Trends in big data analytics. Journal of Parallel and Distributed Computing 74(7), 2561–2573 (2014)
Agrawal, D., Das, S., El Abbadi, A.: Big data and cloud computing: current state and future opportunities. In: 14th International Conference on Extending Database Technology, pp. 530–533. ACM (March 2011)
Vaquero, L.M., Rodero-Merino, L., Buyya, R.: Dynamically scaling applications in the cloud. ACM SIGCOMM Computer Communication Review 41(1), 45–52 (2011)
Mao, M., Humphrey, M.: Auto-scaling to minimize cost and meet application deadlines in cloud workflows. In: 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, p. 49. ACM (2011)
Herodotou, H., Lim, H., Luo, G., Borisov, N., Dong, L., Cetin, F.B., Babu, S.: Starfish: A Self-tuning System for Big Data Analytics. In: 5th Biennial Conference on Innovative Data Systems Research (CIDR 2011), vol. 11, pp. 261–272 (2011)
Bernstein, D., Ludvigson, E., Sankar, K., Diamond, S., Morrow, M.: Blueprint for the intercloud-protocols and formats for cloud computing interoperability. In: Fourth International Conference on Internet and Web Applications and Services (ICIW 2009), pp. 328–336. IEEE (2009)
Korfiati, A., Sfika, N., Daloukas, K., Alexakos, C., Tsompanopoulou, P., Likothanassis, S.: IRaaS: A Cloud Implementation of an Interface Relaxation Method for the Solution of PDEs. In: 2015 International Conference of Parallel and Distributed Computing, part of World Congress on Engineering 2015 (WCE 2015), IAENG, Hong Kong (2015)
Korfiati, A., Tsompanopoulou, P., Likothanassis, S.: Serial and Parallel Implementation of an Interface Relaxation Method. In: 6th International Conference on Numerical Analysis (NumAn 2014), pp. 167–173 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 IFIP International Federation for Information Processing
About this paper
Cite this paper
Sfika, N., Manos, K., Korfiati, A., Alexakos, C., Likothanassis, S. (2015). Workflow Coordinated Resources Allocation for Big Data Analytics in the Cloud. In: Chbeir, R., Manolopoulos, Y., Maglogiannis, I., Alhajj, R. (eds) Artificial Intelligence Applications and Innovations. AIAI 2015. IFIP Advances in Information and Communication Technology, vol 458. Springer, Cham. https://doi.org/10.1007/978-3-319-23868-5_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-23868-5_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23867-8
Online ISBN: 978-3-319-23868-5
eBook Packages: Computer ScienceComputer Science (R0)