Skip to main content

A Storage Model for Handling Big Data Variety

  • Conference paper
  • First Online:
  • 833 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 775))

Abstract

In present scenario of data storage world, selection of storage system is performed on the basis of performance which can be hardware compatibility, cost, response time or application features. In case of big data, data is collected in different format. At the other end, each and every data storage system is not suitable for storing any types of data. Here, the question is, how to prepare a storage system that can be able to store any types of data in the corresponding data storage system. Again, the data format conversion of heterogeneous raw data into a particular format is costly and time consuming. Therefore, it is require to have such data storage mechanism which can store the raw data in the data storage system in their originated data format. In this paper, we proposed an xml based storage model which is going to solve the above mentioned problem. The proposed model makes a decision regarding the allocation of storage resources (or databases) on the basis of data type. Our storage model can customized the storage structure of each and individual storage resource. Collection of data fragmentation and selection of the compatible database are the responsibility of our storage model. Also, the applicability of this storage model is described through an experiment in this paper.

This is a preview of subscription content, log in via an institution.

Notes

  1. 1.

    http://www.datastax.com/dev/blog/basic-rules-of-cassandra-data-modeling.

  2. 2.

    https://docs.mongodb.com/manual/core/data-modeling-introduction/.

References

  1. Cassandra. http://cassandra.apache.org/

  2. Mongodb. https://www.mongodb.org/

  3. Anderson, J.C., Lehnardt, J., Slater, N.: CouchDB: the Definitive Guide. O’Reilly Media Inc, Sebastopol (2010)

    Google Scholar 

  4. Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., Zaharia, M.: A view of cloud computing. ACM Commun. 53(4), 50–58 (2010)

    Article  Google Scholar 

  5. Chen, M., Mao, S., Liu, Y.: Big data: a survey. Mob. Netw. Appl. 19(2), 171–209 (2014)

    Article  Google Scholar 

  6. Gudivada, V.N., Baeza-Yates, R., Raghavan, V.V.: Big data: promises and problems. IEEE Comput. J. 3, 20–23 (2015)

    Article  Google Scholar 

  7. Kaisler, S., Armour, F., Espinosa, J.A., Money, W.: Big data: issues and challenges moving forward. In: 46th Hawaii International Conference on System Sciences (HICSS 2013) (2013)

    Google Scholar 

  8. Kulkarni, G., Waghmare, R., Palwe, R., Waykule, V., Bankar, H., Koli, K.: Cloud storage architecture. In: 7th International Conference on Telecommunication Systems, Services, and Applications (TSSA 2012), pp. 76–81. IEEE (2012)

    Google Scholar 

  9. Li, Y., Guo, L., Guo, Y.: Cacss: Towards a generic cloud storage service. In: CLOSER (2012)

    Google Scholar 

  10. Mathur, G., Desnoyers, P., Ganesan, D., Shenoy, P.: Capsule: an energy-optimized object storage system for memory-constrained sensor devices. ACM (2006)

    Google Scholar 

  11. Momjian, B.: PostgreSQL: Introduction and Concepts. Addison-Wesley, New York (2001)

    Google Scholar 

  12. Palankar, M.R., Iamnitchi, A., Ripeanu, M., Garfinkel, S.: Amazon s3 for science grids: a viable solution?. In: Proceedings of the 2008 International Workshop on Data-aware Distributed Computing, New York, NY, USA (2008)

    Google Scholar 

  13. Panda, P.R., Catthoor, F., Dutt, N.D., Danckaert, K., Brockmeyer, E., Kulkarni, C., Vandercappelle, A., Kjeldsberg, P.G.: Data and memory optimization techniques for embedded systems. ACM Trans. Des. Autom. Electron. 6(2), 149–206 (2001)

    Article  Google Scholar 

  14. Pepple, K.: Deploying Openstack. O’Reilly Media Inc, Sebastopol (2011)

    Google Scholar 

  15. Ruiz-Alvarez, A., Humphrey, M.: An automated approach to cloud storage service selection. In: Proceedings of the 2nd International Workshop on Scientific Cloud Computing, pp. 39–48. ACM (2011)

    Google Scholar 

  16. Ruiz-Alvarez, A., Humphrey, M.: A model and decision procedure for data storage in cloud computing. In: 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), pp. 572–579. IEEE (2012)

    Google Scholar 

  17. Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST 2010) (2010)

    Google Scholar 

  18. Sidwell, L.P.: System for modifying jcl parameters to optimize data storage allocations (2000)

    Google Scholar 

  19. Spillner, J., Müller, J., Schill, A.: Creating optimal cloud storage systems. Future Gener. Comput. Syst. 29(4), 1062–1072 (2013)

    Article  Google Scholar 

  20. Takahashi, K., Yamamoto, S., Okushi, A., Matsumoto, S., Nakamura, M.: Design and implementation of service api for large-scale house log in smart city cloud. In: IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom 2012), pp. 815–820 (2012)

    Google Scholar 

  21. Timmaraju, S., Ravi, V., Gangadharan, G.: Ranking of cloud services using opinion mining and multi-attribute decision making. In: Handbook of Research on Advanced Data Mining Techniques and Applications for Business Intelligence, p. 379 (2017)

    Google Scholar 

  22. Vrable, M., Savage, S., Voelker, G.M.: Bluesky: a cloud-backed file system for the enterprise. In: Proceedings of the 10th USENIX Conference on File and Storage Technologies, p. 19. USENIX Association (2012)

    Google Scholar 

  23. Wu, K., Vassileva, J., Zhao, Y.: Understanding users’ intention to switch personal cloud storage services: evidence from the chinese market. Comput. Hum. Behav. 68, 300–314 (2017)

    Article  Google Scholar 

  24. Yamamoto, S., Matsumoto, S., Saiki, S., Nakamura, M.: Materialized view as a service for large-scale house log in smart city. In: IEEE 5th International Conference on Cloud Computing Technology and Science (CloudCom 2013), vol. 2, pp. 311–316 (2013)

    Google Scholar 

  25. Zhang, M., Ranjan, R., Haller, A., Georgakopoulos, D., Menzel, M., Nepal, S.: An ontology-based system for cloud infrastructure services’ discovery. In: 8th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2012), pp. 524–530. IEEE (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anindita Sarkar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Sarkar, A., Chattopadhyay, S. (2017). A Storage Model for Handling Big Data Variety. In: Mandal, J., Dutta, P., Mukhopadhyay, S. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2017. Communications in Computer and Information Science, vol 775. Springer, Singapore. https://doi.org/10.1007/978-981-10-6427-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6427-2_5

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6426-5

  • Online ISBN: 978-981-10-6427-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics