Skip to main content

Raw Data Redundancy Elimination on Cloud Database

  • Conference paper
  • First Online:
Computational Intelligence in Pattern Recognition

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1120))

Abstract

In the recent world, one of the buzzwords in information technology is cloud computing. Cloud computing provides a novel approach of service provision by rearranging numerous resources over the Internet. Cloud computing uses a cloud database to store information. Cloud database can be accessed by the users from anywhere, and at any time with the help of Internet. Cloud database can handle a huge amount of data. But, the data duplicity can be a major problem associated to cloud computing database. This redundant data may go unnoticed but can consume a lot of storage space. In this paper, a novel algorithm is proposed to reduce data redundancy in the cloud environment by separating and analyzing the raw data and the meta-data. The output result shows that the proposed redundancy elimination process in the cloud database could be highly beneficial for the vendor.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zaffos, A.S.: Cloud storage: benefits, risks and cost considerations. Gartner (Apr 2009)

    Google Scholar 

  2. Fox, A., Griffith, R., Joseph, A., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I.: Above the clouds: a Berkeley view of cloud computing. Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Rep. UCB/EECS 28(13) (2009)

    Google Scholar 

  3. Gunawi, H.S., Agrawal, N., Arpaci-Dusseau, A.C., Schindler, J., Arpaci-Dusseau, R.H.: Deconstructing commodity storage clusters. In: 32nd International Symposium on Computer Architecture (ISCA’05), pp. 60–71. IEEE, June 2005

    Google Scholar 

  4. Douceur, J.R., Adya, A., Bolosky, W.J., Simon, P., Theimer, M.: Reclaiming space from duplicate files in a serverless distributed file system. In: Proceedings 22nd International Conference on Distributed Computing Systems, pp. 617–624. IEEE, 2002

    Google Scholar 

  5. Quinlan, S., Dorward, S.L.: Venti: a new approach to archival storage. In: FAST, vol. 2, pp. 89–101, Jan 2002

    Google Scholar 

  6. Muthitacharoen, A., Chen, B., Mazieres, D.: A low-bandwidth network file system. In: ACM SIGOPS Operating Systems Review, vol. 35, no. 5, pp. 174–187. ACM, Oct 2001

    Article  Google Scholar 

  7. You, L.L., Pollack, K.T., Long, D.D.: Deep store: an archival storage system architecture. In: 21st International Conference on Data Engineering (ICDE’05), pp. 804–815. IEEE, Apr 2005

    Google Scholar 

  8. Harnik, D., Pinkas, B., Shulman-Peleg, A.: Side channels in cloud services: deduplication in cloud storage. IEEE Secur. Priv. 8(6), 40–47 (2010)

    Article  Google Scholar 

  9. Saritha, K., Subasree, S.: Analysis of hybrid cloud approach for private cloud in the de-duplication mechanism. In: 2015 IEEE International Conference on Engineering and Technology (ICETECH), pp. 1–3. IEEE, Mar 2015

    Google Scholar 

  10. Backialakshmi, N., Manikandan, M.: Data de-duplication using N0SQL databases in cloud. In: 2015 International Conference on Soft-Computing and Networks Security (ICSNS), pp. 1–4. IEEE, Feb 2015

    Google Scholar 

  11. Thakar, M.P.D., Harkut, D.G.: Hybrid model for authorized de-duplication in cloud. Int. J. Emerg. Trends & Technol. Comput. Sci. 4(1), 147–151 (2015)

    Google Scholar 

  12. Harish, B., Harshitha, K.: Data deduplication in cloud. Int. J. Pure Appl. Math. 115(8), 353–358 (2017)

    Google Scholar 

  13. Shieh, F., Arani, M.G., Shamsi, M.: 2015. De-duplication approaches in cloud computing environment: a survey. Int. J. Comput. Appl. 120(13), 7–10 (2015)

    Article  Google Scholar 

  14. Hunashikatti, L., Pujar, P.M.: Review on data deduplication and secured auditing of data on cloud. IEEE Trans. Comput. 65(8), 2386–2396 (2016)

    Article  MathSciNet  Google Scholar 

  15. Yu, C.M., Gochhayat, S.P., Conti, M., Lu, C.S.: Privacy aware data deduplication for side channel in cloud storage. IEEE Trans. Cloud Comput. 17(1), 1 (2018)

    Google Scholar 

  16. Kaur, M., Singh, J.: Data de-duplication approach based on hashing techniques for reducing time consumption over a cloud network. Int. J. Comput. Appl. 142(5), 4–10 (2016)

    Google Scholar 

  17. Aumasson, J.P., Neves, S., Wilcox-O’Hearn, Z., Winnerlein, C.: BLAKE2: simpler, smaller, fast as MD5. In: International Conference on Applied Cryptography and Network Security, pp. 119–135. Springer, Berlin, Heidelberg, June 2013

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mohapatra, S., Bajpai, N., Swarnkar, T., Mishra, M. (2020). Raw Data Redundancy Elimination on Cloud Database. In: Das, A., Nayak, J., Naik, B., Dutta, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol 1120. Springer, Singapore. https://doi.org/10.1007/978-981-15-2449-3_34

Download citation

Publish with us

Policies and ethics