Abstract
Cloud infrastructures are competent to providing massive processing capabilities of computational and data resources in virtualized environments. Introduction of big data analytics in many spheres of science, technology and business has led to the trend of employing data-parallel frameworks, like Hadoop for handling such massive data requirements. Since most Hadoop based systems make the two decisions of scheduling data and computation independently, it seems a promising prospective to map computations within cloud resources based on data blocks already distributed to them. This paper proposes a computation scheduling framework that adopts the strategy of improving computation and data co-allocation within a Hadoop cloud infrastructure based on knowledge of data blocks availability, hereafter referred to as Data Aware Scheduling (DAS) framework. The proposed DAS employs a dependency based grouping of data. Experiments have been conducted using standard map-reduce applications and results presented herein conclusively demonstrate the efficacy of the proposed framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yuan D, Yang Y, Liu X, Chen J (2010) A data placement strategy in scientific cloud workflows. Future Gener Comput Syst 26:1200–1214
www.gridgain.com. Last accessed 5 Dec 2013
http://hadoop.apache.org/. Accessed 15 Nov 2013
Acknowledgments
This work has been carried out at the Data Sciences Lab, Department of Computer Science and Engineering, National Institute of Science and Technology, Berhampur.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer India
About this paper
Cite this paper
Jaykishan, B., Hemant Kumar Reddy, K., Roy, D.S. (2014). A Data-Aware Scheduling Framework for Parallel Applications in a Cloud Environment. In: Sengupta, S., Das, K., Khan, G. (eds) Emerging Trends in Computing and Communication. Lecture Notes in Electrical Engineering, vol 298. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1817-3_49
Download citation
DOI: https://doi.org/10.1007/978-81-322-1817-3_49
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1816-6
Online ISBN: 978-81-322-1817-3
eBook Packages: EngineeringEngineering (R0)