Abstract
In the recent world, one of the buzzwords in information technology is cloud computing. Cloud computing provides a novel approach of service provision by rearranging numerous resources over the Internet. Cloud computing uses a cloud database to store information. Cloud database can be accessed by the users from anywhere, and at any time with the help of Internet. Cloud database can handle a huge amount of data. But, the data duplicity can be a major problem associated to cloud computing database. This redundant data may go unnoticed but can consume a lot of storage space. In this paper, a novel algorithm is proposed to reduce data redundancy in the cloud environment by separating and analyzing the raw data and the meta-data. The output result shows that the proposed redundancy elimination process in the cloud database could be highly beneficial for the vendor.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zaffos, A.S.: Cloud storage: benefits, risks and cost considerations. Gartner (Apr 2009)
Fox, A., Griffith, R., Joseph, A., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I.: Above the clouds: a Berkeley view of cloud computing. Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Rep. UCB/EECS 28(13) (2009)
Gunawi, H.S., Agrawal, N., Arpaci-Dusseau, A.C., Schindler, J., Arpaci-Dusseau, R.H.: Deconstructing commodity storage clusters. In: 32nd International Symposium on Computer Architecture (ISCA’05), pp. 60–71. IEEE, June 2005
Douceur, J.R., Adya, A., Bolosky, W.J., Simon, P., Theimer, M.: Reclaiming space from duplicate files in a serverless distributed file system. In: Proceedings 22nd International Conference on Distributed Computing Systems, pp. 617–624. IEEE, 2002
Quinlan, S., Dorward, S.L.: Venti: a new approach to archival storage. In: FAST, vol. 2, pp. 89–101, Jan 2002
Muthitacharoen, A., Chen, B., Mazieres, D.: A low-bandwidth network file system. In: ACM SIGOPS Operating Systems Review, vol. 35, no. 5, pp. 174–187. ACM, Oct 2001
You, L.L., Pollack, K.T., Long, D.D.: Deep store: an archival storage system architecture. In: 21st International Conference on Data Engineering (ICDE’05), pp. 804–815. IEEE, Apr 2005
Harnik, D., Pinkas, B., Shulman-Peleg, A.: Side channels in cloud services: deduplication in cloud storage. IEEE Secur. Priv. 8(6), 40–47 (2010)
Saritha, K., Subasree, S.: Analysis of hybrid cloud approach for private cloud in the de-duplication mechanism. In: 2015 IEEE International Conference on Engineering and Technology (ICETECH), pp. 1–3. IEEE, Mar 2015
Backialakshmi, N., Manikandan, M.: Data de-duplication using N0SQL databases in cloud. In: 2015 International Conference on Soft-Computing and Networks Security (ICSNS), pp. 1–4. IEEE, Feb 2015
Thakar, M.P.D., Harkut, D.G.: Hybrid model for authorized de-duplication in cloud. Int. J. Emerg. Trends & Technol. Comput. Sci. 4(1), 147–151 (2015)
Harish, B., Harshitha, K.: Data deduplication in cloud. Int. J. Pure Appl. Math. 115(8), 353–358 (2017)
Shieh, F., Arani, M.G., Shamsi, M.: 2015. De-duplication approaches in cloud computing environment: a survey. Int. J. Comput. Appl. 120(13), 7–10 (2015)
Hunashikatti, L., Pujar, P.M.: Review on data deduplication and secured auditing of data on cloud. IEEE Trans. Comput. 65(8), 2386–2396 (2016)
Yu, C.M., Gochhayat, S.P., Conti, M., Lu, C.S.: Privacy aware data deduplication for side channel in cloud storage. IEEE Trans. Cloud Comput. 17(1), 1 (2018)
Kaur, M., Singh, J.: Data de-duplication approach based on hashing techniques for reducing time consumption over a cloud network. Int. J. Comput. Appl. 142(5), 4–10 (2016)
Aumasson, J.P., Neves, S., Wilcox-O’Hearn, Z., Winnerlein, C.: BLAKE2: simpler, smaller, fast as MD5. In: International Conference on Applied Cryptography and Network Security, pp. 119–135. Springer, Berlin, Heidelberg, June 2013
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mohapatra, S., Bajpai, N., Swarnkar, T., Mishra, M. (2020). Raw Data Redundancy Elimination on Cloud Database. In: Das, A., Nayak, J., Naik, B., Dutta, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 1120. Springer, Singapore. https://doi.org/10.1007/978-981-15-2449-3_34
Download citation
DOI: https://doi.org/10.1007/978-981-15-2449-3_34
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2448-6
Online ISBN: 978-981-15-2449-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)