Abstract
In cloud era as the data stored is enormous, efficient retrieval of data with reduced latency plays a major role. In cloud, owing to the size of the stored data and lack of locality information among the stored files, metadata is a suitable method of keeping track of the storage. This paper describes a novel framework for efficient retrieval of data from the cloud data servers using metadata with less amount of time. Performance of queries due to availability of files for query processing can be greatly improved by the efficient use of metadata and its analysis thereof. Hence this paper proposes a generic approach of using metadata in cloud, named ‘MaaS—Metadata as a Service.’ The proposed approach has exploited various methodologies in reducing the latency during data retrieval. This paper investigates the issues on creation of metadata, metadata management and analysis of metadata in a cloud environment for fast retrieval of data. Cloud bloom filter, a probabilistic data structure used for efficient retrieval of metadata is stored across various metadata servers dispersed geographically. We have implemented the model in a cloud environment, and the experimental results show that methodology used is efficient in increasing the throughput and also by handling large number of queries efficiently with reduced latency. The efficacy of the approach is tested through experimental studies using KDD Cup 2003 dataset. In the experimental results, proposed ‘MaaS’ has outperformed other existing methods.
Similar content being viewed by others
References
Mostafa A.M., Youssef A.E.: Improving resource utilization, scalability, and availability in replication systems using object ownership distribution. Arab. J. Sci. Eng. 39(12), 8731–8741 (2014)
Li Q., Wang C., Wu J., Li J., Wang Z.-Y.: Towards the business-information technology alignment in cloud computing environment: an approach based on collaboration points and agents. Int. J. Comput. Integr. Manuf. 24, 1038–1057 (2011)
Elgedawy I.: NASEEB: an escrow-based approach for ensuring data correctness over global clouds. Arab. J. Sci. Eng. 39(12), 8743–8764 (2014)
Ahmad A., Maynard S.B., Park S.: Information security strategies: towards an organizational multi-strategy perspective. J. Intell. Manuf. 25(2), 357–370 (2014)
Tameem E., Cho G.: Providing privacy and access control in cloud storage services using a KPABE system with secret attributes. Arab. J. Sci. Eng. 39(11), 7877–7884 (2014)
Wu, J.-J.; Liu, P.; Chung, Y.-C.: Metadata partitioning for large-scale distributed storage systems. In: Proceedings of the IEEE International Conference on Cloud Computing (2010)
Bice, T.; Chiu, D.; Gagan, A.: Time and cost sensitive data-intensive computing on hybrid clouds. In: Proceedings of the IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (2012)
Pierson, J.-M.; Seitz, L.; Duque, H.; Montagnat, J.: Metadata for efficient, secure and extensible access to data in medical grid. In: Proceedings of the 15th International Workshop on Database and Expert Systems Applications (2004)
Anitha R., Mukherjee S.: A dynamic metadata model in cloud computing. Proc. Springer CCIS 2, 13–21 (2011)
Wang S., Liu Z., Sun Q., Zou H., Yang F.: Towards an accurate evaluation of quality of cloud service in service-oriented cloud computing. J. Intell. Manuf. 25(2), 283–291 (2014)
Li Q., Wang Z., Li W., Li J., Wang C., Du R.: Applications integration in a hybrid cloud computing environment: modelling and platform. Enterp. Inf. Syst. 7(3), 237–271 (2013)
Li Q., Wang Z., Li W., Cao Z., Du R., Lu H.: Model based services convergence and multi-clouds integration. Comput. Ind. 64, 813–832 (2013)
Leung A.W., Shao M., Bisson T., Pasupathy S., Miller E.L.: High-performance metadata indexing and search in petascale data storage systems. J. Phys. Conf. Ser. 125(1), 1–6 (2008)
Leung A.W., Shao M., Bisson T., Pasupathy S., Miller E.L.: Spyglass: fast, scalable metadata search for large-scale storage systems. Proc. Int. Conf. File Storage Technol. 9, 153–166 (2009)
Xu Q., Arumugam R.V., Yong K.L., Mahadevan Z.: Efficient and scalable metadata management in EB-scale file system. IEEE Trans. Parallel Distrib. Syst. 6(1), 1–10 (2013)
Wu Q., Zhu Q., Zhou M.: A correlation-driven optimal service selection approach for virtual enterprise establishment. J. Intell. Manuf. 25(6), 1441–1453 (2014)
Li, W.; Xue, W.; Shu, J.; Zheng, W.: Dynamic hashing: adaptive metadata management for petabyte-scale file systems. In: Proceedings of IEEE NASA Goddard Conference on Mass Storage Systems and Technologies, pp. 1–6 (2006)
Zhu, Y.; Jiang, H.: Efficient update control of bloom filter replicas in distributed systems, pp. 1–24 (2010)
Broder A.Z., Mitzenmacher M.: Network applications of bloom filters: a survey. J. Internet Math. 1(4), 485–509 (2003)
Zhu Y., Jiang H., Wang J., Xian F.: HBA: Distributed metadata management for large cluster-based storage systems. IEEE Trans. Parallel Distrib. Syst. 19(6), 750–763 (2008)
Li Q., Zhou J., Peng Q.R., Li C.Q., Wang C., Wu J., Shao B.-E.: Business processes oriented heterogeneous systems integration platform for networked enterprises. Comput. Ind. 61, 127–144 (2010)
Chen, S.; Huang, X.; Xu, P.; Zheng, W.: Distributed metadata management based on hierarchical bloom filters in data grid. In: Proceedings of the IEEE ChinaGrid Conference (2009)
Hua Y., Jiang H., Zhu Y., Feng D., Tian L.: Semantic-aware metadata organization paradigm in next-generation file systems. IEEE Trans. Parallel Distrib. Syst. 23(2), 337–344 (2012)
Gray J., Liu D.T., Nieto-Santisteban M., Szalay A., DeWitt D.J., Heber G.: Scientific data management in the coming decade. ACM SIGMOD 34(4), 34–41 (2005)
Maria H., Batistakis Y., Vazirgiannis M.: On clustering validation techniques. J. Intell. Inf. Syst. 17(2), 107–145 (2001)
Lei P.-R., Li S.-C., Peng W.C.: QS-STT: QuadSection clustering and spatial-temporal trajectory model for location prediction. J. Distrib. Parallel Databases 31, 231–258 (2013)
Guha S., Meyerson A., Mishra N., Motwani R., O’Callaghan L.: Clustering data streams: theory and practice. IEEE Trans. Knowl. Data Eng. 15(3), 515–528 (2003)
Guha S., Rastogi T., Shim K.: CURE: an efficient clustering algorithm for large databases. ACM SIGMOD Record 27(2), 73–84 (1998)
Weil, S.A., Pollack, K.T.; Brandt, S.A.; Miller, E.L.: Dynamic metadata management for petabyte-scale file systems. In: Proceedings IEEE Computer Society conference on Supercomputing, pp. 172–180 (2004)
Xiong, M.; Jin, H.; Wu, S.: FDSSS: an efficient metadata management scheme in large scale data environment. In: International Conference on Grid and Cooperative Computing Workshops, pp. 71–77 (2006)
Ravimaran S, Mohamed MAM: Integrated Obj_FedRep: Evaluation of Surrogate Object based mobile cloud system for federation, replica and data management. Arab. J. Sci. Eng. 39(6), 4577–4592 (2014)
Liu, C.; An, J.: Fast mining and updating frequent itemsets. In: Proceedings of the International Colloquium on Computing, Communication. Control and Management, vol. 1, pp. 365–368 (2008)
Chan B.Y., Si A., Hong V.L.: Framework for cache management for mobile databases: design and evaluation. J. Distrib. Parallel Databases 10, 23–57 (2001)
Choudhary A., Harding J.A., Tiwari M.K.: Data mining in manufacturing: a review based on the kind of knowledge. J. Intell. Manuf. 20(5), 501–521 (2009)
Al-Haidari F., Sqalli M., Salah K.: Evaluation of the impact of EDoS attacks against cloud computing services. Arab. J. Sci. Eng. 40(3), 773–785 (2014)
Dublin Core: Dublin core metadata element set. Version 1.1: Reference description. http://dublincore.org/documents/docs (2004)
Liu X., Zhang Y., Wang B., Yan J.: Mona: secure multi-owner data sharing for dynamic groups in the cloud. IEEE Trans. Parallel Distrib. Syst. 24(6), 1182–1191 (2013)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Anitha, R., Mukherjee, S. ‘MaaS’: Fast Retrieval of Data in Cloud Using Metadata as a Service. Arab J Sci Eng 40, 2323–2343 (2015). https://doi.org/10.1007/s13369-015-1652-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13369-015-1652-7