An Efficient Data Integration Framework in Cloud Using MapReduce
In big data applications, securing massive data sets is an important challenge, because working with such data requires large-scale resources that must be provided by a cloud service provider. This paper demonstrates a cloud implementation built on big data technologies and discusses how such data can be protected using hashing and how users can be authenticated. In particular, it discusses big data technologies such as the Apache Hadoop project, which provides parallelized and distributed analysis and processing of petabytes of data, along with a summarized view of the monitoring and usage of a Hadoop cluster. An algorithm called FNV hashing is introduced to provide integrity for data that the user has outsourced to the cloud; data within the Hadoop cluster can be accessed and verified using the hash. This approach addresses many of the new security challenges that arise in a cloud environment using the Hadoop Distributed File System. The performance of the cluster can be monitored using the Ganglia monitoring tool. The paper also designs a cloud evaluation model that provides quantitative results for regularly checking accuracy and cost. The experimental results show that this model is more accurate, cheaper, and able to respond in real time.
Keywords: Big data, Hadoop, MapReduce, Cloud computing, Accuracy, Consumption
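The abstract names FNV hashing as the integrity mechanism but does not say which variant or digest width is used. The following is a minimal sketch only, assuming the 64-bit FNV-1a variant with its standard published offset basis and prime. The function names `fnv1a_64` and `verify_block` are hypothetical and are not taken from the paper. The idea is that the data owner records a digest before outsourcing a block and later recomputes it over the retrieved block to detect modification.

```python
# Sketch of an FNV-based integrity check (assumption: 64-bit FNV-1a;
# the paper does not state which FNV variant it uses).

FNV64_OFFSET_BASIS = 0xCBF29CE484222325  # standard FNV 64-bit offset basis
FNV64_PRIME = 0x100000001B3              # standard FNV 64-bit prime
MASK64 = 0xFFFFFFFFFFFFFFFF              # keep the running hash within 64 bits


def fnv1a_64(data: bytes) -> int:
    """Return the 64-bit FNV-1a hash of data."""
    h = FNV64_OFFSET_BASIS
    for byte in data:
        h ^= byte                        # FNV-1a: XOR the byte first
        h = (h * FNV64_PRIME) & MASK64   # then multiply by the prime, mod 2**64
    return h


def verify_block(data: bytes, expected_digest: int) -> bool:
    """Hypothetical helper: recompute the digest and compare it with the stored one."""
    return fnv1a_64(data) == expected_digest


if __name__ == "__main__":
    original = b"record-1,alice,42\n"
    stored_digest = fnv1a_64(original)   # computed before outsourcing the data

    # Unmodified data passes the check.
    print(verify_block(original, stored_digest))

    # A changed byte produces a different digest in this example.
    tampered = b"record-1,alice,43\n"
    print(verify_block(tampered, stored_digest))
```

FNV is a fast, non-cryptographic hash, so a check of this kind is cheap to run over large files, which fits the abstract's emphasis on cost and real-time response.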