A Storage Model for Handling Big Data Variety

Sarkar, Anindita; Chattopadhyay, Samiran

doi:10.1007/978-981-10-6427-2_5

A Storage Model for Handling Big Data Variety

Anindita Sarkar¹³ &
Samiran Chattopadhyay¹²

Conference paper
First Online: 24 September 2017

833 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 775))

Abstract

In present scenario of data storage world, selection of storage system is performed on the basis of performance which can be hardware compatibility, cost, response time or application features. In case of big data, data is collected in different format. At the other end, each and every data storage system is not suitable for storing any types of data. Here, the question is, how to prepare a storage system that can be able to store any types of data in the corresponding data storage system. Again, the data format conversion of heterogeneous raw data into a particular format is costly and time consuming. Therefore, it is require to have such data storage mechanism which can store the raw data in the data storage system in their originated data format. In this paper, we proposed an xml based storage model which is going to solve the above mentioned problem. The proposed model makes a decision regarding the allocation of storage resources (or databases) on the basis of data type. Our storage model can customized the storage structure of each and individual storage resource. Collection of data fragmentation and selection of the compatible database are the responsibility of our storage model. Also, the applicability of this storage model is described through an experiment in this paper.

This is a preview of subscription content, log in via an institution.

Notes

References

Cassandra. http://cassandra.apache.org/
Mongodb. https://www.mongodb.org/
Anderson, J.C., Lehnardt, J., Slater, N.: CouchDB: the Definitive Guide. O’Reilly Media Inc, Sebastopol (2010)
Google Scholar
Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., Zaharia, M.: A view of cloud computing. ACM Commun. 53(4), 50–58 (2010)
Article Google Scholar
Chen, M., Mao, S., Liu, Y.: Big data: a survey. Mob. Netw. Appl. 19(2), 171–209 (2014)
Article Google Scholar
Gudivada, V.N., Baeza-Yates, R., Raghavan, V.V.: Big data: promises and problems. IEEE Comput. J. 3, 20–23 (2015)
Article Google Scholar
Kaisler, S., Armour, F., Espinosa, J.A., Money, W.: Big data: issues and challenges moving forward. In: 46th Hawaii International Conference on System Sciences (HICSS 2013) (2013)
Google Scholar
Kulkarni, G., Waghmare, R., Palwe, R., Waykule, V., Bankar, H., Koli, K.: Cloud storage architecture. In: 7th International Conference on Telecommunication Systems, Services, and Applications (TSSA 2012), pp. 76–81. IEEE (2012)
Google Scholar
Li, Y., Guo, L., Guo, Y.: Cacss: Towards a generic cloud storage service. In: CLOSER (2012)
Google Scholar
Mathur, G., Desnoyers, P., Ganesan, D., Shenoy, P.: Capsule: an energy-optimized object storage system for memory-constrained sensor devices. ACM (2006)
Google Scholar
Momjian, B.: PostgreSQL: Introduction and Concepts. Addison-Wesley, New York (2001)
Google Scholar
Palankar, M.R., Iamnitchi, A., Ripeanu, M., Garfinkel, S.: Amazon s3 for science grids: a viable solution?. In: Proceedings of the 2008 International Workshop on Data-aware Distributed Computing, New York, NY, USA (2008)
Google Scholar
Panda, P.R., Catthoor, F., Dutt, N.D., Danckaert, K., Brockmeyer, E., Kulkarni, C., Vandercappelle, A., Kjeldsberg, P.G.: Data and memory optimization techniques for embedded systems. ACM Trans. Des. Autom. Electron. 6(2), 149–206 (2001)
Article Google Scholar
Pepple, K.: Deploying Openstack. O’Reilly Media Inc, Sebastopol (2011)
Google Scholar
Ruiz-Alvarez, A., Humphrey, M.: An automated approach to cloud storage service selection. In: Proceedings of the 2nd International Workshop on Scientific Cloud Computing, pp. 39–48. ACM (2011)
Google Scholar
Ruiz-Alvarez, A., Humphrey, M.: A model and decision procedure for data storage in cloud computing. In: 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), pp. 572–579. IEEE (2012)
Google Scholar
Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST 2010) (2010)
Google Scholar
Sidwell, L.P.: System for modifying jcl parameters to optimize data storage allocations (2000)
Google Scholar
Spillner, J., Müller, J., Schill, A.: Creating optimal cloud storage systems. Future Gener. Comput. Syst. 29(4), 1062–1072 (2013)
Article Google Scholar
Takahashi, K., Yamamoto, S., Okushi, A., Matsumoto, S., Nakamura, M.: Design and implementation of service api for large-scale house log in smart city cloud. In: IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom 2012), pp. 815–820 (2012)
Google Scholar
Timmaraju, S., Ravi, V., Gangadharan, G.: Ranking of cloud services using opinion mining and multi-attribute decision making. In: Handbook of Research on Advanced Data Mining Techniques and Applications for Business Intelligence, p. 379 (2017)
Google Scholar
Vrable, M., Savage, S., Voelker, G.M.: Bluesky: a cloud-backed file system for the enterprise. In: Proceedings of the 10th USENIX Conference on File and Storage Technologies, p. 19. USENIX Association (2012)
Google Scholar
Wu, K., Vassileva, J., Zhao, Y.: Understanding users’ intention to switch personal cloud storage services: evidence from the chinese market. Comput. Hum. Behav. 68, 300–314 (2017)
Article Google Scholar
Yamamoto, S., Matsumoto, S., Saiki, S., Nakamura, M.: Materialized view as a service for large-scale house log in smart city. In: IEEE 5th International Conference on Cloud Computing Technology and Science (CloudCom 2013), vol. 2, pp. 311–316 (2013)
Google Scholar
Zhang, M., Ranjan, R., Haller, A., Georgakopoulos, D., Menzel, M., Nepal, S.: An ontology-based system for cloud infrastructure services’ discovery. In: 8th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2012), pp. 524–530. IEEE (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Technology, Jadavpur University, Kolkata, India
Samiran Chattopadhyay
School of Mobile Computing, Jadavpur University, Kolkata, India
Anindita Sarkar

Authors

Anindita Sarkar
View author publications
You can also search for this author in PubMed Google Scholar
Samiran Chattopadhyay
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anindita Sarkar .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, University of Kalyani, Kalyani, West Bengal, India
J. K. Mandal
Department of Computer and System Sciences, Visva Bharati University, Bolpur Santiniketan, West Bengal, India
Paramartha Dutta
Department of Information Technology, Calcutta Business School, Kolkata, India
Somnath Mukhopadhyay

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sarkar, A., Chattopadhyay, S. (2017). A Storage Model for Handling Big Data Variety. In: Mandal, J., Dutta, P., Mukhopadhyay, S. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2017. Communications in Computer and Information Science, vol 775. Springer, Singapore. https://doi.org/10.1007/978-981-10-6427-2_5

Download citation

DOI: https://doi.org/10.1007/978-981-10-6427-2_5
Published: 24 September 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6426-5
Online ISBN: 978-981-10-6427-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics