Skip to main content
Log in

‘MaaS’: Fast Retrieval of Data in Cloud Using Metadata as a Service

  • Research Article - Computer Engineering and Computer Science
  • Published:
Arabian Journal for Science and Engineering Aims and scope Submit manuscript

Abstract

In cloud era as the data stored is enormous, efficient retrieval of data with reduced latency plays a major role. In cloud, owing to the size of the stored data and lack of locality information among the stored files, metadata is a suitable method of keeping track of the storage. This paper describes a novel framework for efficient retrieval of data from the cloud data servers using metadata with less amount of time. Performance of queries due to availability of files for query processing can be greatly improved by the efficient use of metadata and its analysis thereof. Hence this paper proposes a generic approach of using metadata in cloud, named ‘MaaS—Metadata as a Service.’ The proposed approach has exploited various methodologies in reducing the latency during data retrieval. This paper investigates the issues on creation of metadata, metadata management and analysis of metadata in a cloud environment for fast retrieval of data. Cloud bloom filter, a probabilistic data structure used for efficient retrieval of metadata is stored across various metadata servers dispersed geographically. We have implemented the model in a cloud environment, and the experimental results show that methodology used is efficient in increasing the throughput and also by handling large number of queries efficiently with reduced latency. The efficacy of the approach is tested through experimental studies using KDD Cup 2003 dataset. In the experimental results, proposed ‘MaaS’ has outperformed other existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Mostafa A.M., Youssef A.E.: Improving resource utilization, scalability, and availability in replication systems using object ownership distribution. Arab. J. Sci. Eng. 39(12), 8731–8741 (2014)

    Article  Google Scholar 

  2. Li Q., Wang C., Wu J., Li J., Wang Z.-Y.: Towards the business-information technology alignment in cloud computing environment: an approach based on collaboration points and agents. Int. J. Comput. Integr. Manuf. 24, 1038–1057 (2011)

    Article  Google Scholar 

  3. Elgedawy I.: NASEEB: an escrow-based approach for ensuring data correctness over global clouds. Arab. J. Sci. Eng. 39(12), 8743–8764 (2014)

    Article  Google Scholar 

  4. Ahmad A., Maynard S.B., Park S.: Information security strategies: towards an organizational multi-strategy perspective. J. Intell. Manuf. 25(2), 357–370 (2014)

    Article  Google Scholar 

  5. Tameem E., Cho G.: Providing privacy and access control in cloud storage services using a KPABE system with secret attributes. Arab. J. Sci. Eng. 39(11), 7877–7884 (2014)

    Article  Google Scholar 

  6. Wu, J.-J.; Liu, P.; Chung, Y.-C.: Metadata partitioning for large-scale distributed storage systems. In: Proceedings of the IEEE International Conference on Cloud Computing (2010)

  7. Bice, T.; Chiu, D.; Gagan, A.: Time and cost sensitive data-intensive computing on hybrid clouds. In: Proceedings of the IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (2012)

  8. Pierson, J.-M.; Seitz, L.; Duque, H.; Montagnat, J.: Metadata for efficient, secure and extensible access to data in medical grid. In: Proceedings of the 15th International Workshop on Database and Expert Systems Applications (2004)

  9. Anitha R., Mukherjee S.: A dynamic metadata model in cloud computing. Proc. Springer CCIS 2, 13–21 (2011)

    Google Scholar 

  10. Wang S., Liu Z., Sun Q., Zou H., Yang F.: Towards an accurate evaluation of quality of cloud service in service-oriented cloud computing. J. Intell. Manuf. 25(2), 283–291 (2014)

    Article  Google Scholar 

  11. Li Q., Wang Z., Li W., Li J., Wang C., Du R.: Applications integration in a hybrid cloud computing environment: modelling and platform. Enterp. Inf. Syst. 7(3), 237–271 (2013)

    Article  Google Scholar 

  12. Li Q., Wang Z., Li W., Cao Z., Du R., Lu H.: Model based services convergence and multi-clouds integration. Comput. Ind. 64, 813–832 (2013)

    Article  Google Scholar 

  13. Leung A.W., Shao M., Bisson T., Pasupathy S., Miller E.L.: High-performance metadata indexing and search in petascale data storage systems. J. Phys. Conf. Ser. 125(1), 1–6 (2008)

    Google Scholar 

  14. Leung A.W., Shao M., Bisson T., Pasupathy S., Miller E.L.: Spyglass: fast, scalable metadata search for large-scale storage systems. Proc. Int. Conf. File Storage Technol. 9, 153–166 (2009)

    Google Scholar 

  15. Xu Q., Arumugam R.V., Yong K.L., Mahadevan Z.: Efficient and scalable metadata management in EB-scale file system. IEEE Trans. Parallel Distrib. Syst. 6(1), 1–10 (2013)

    Google Scholar 

  16. Wu Q., Zhu Q., Zhou M.: A correlation-driven optimal service selection approach for virtual enterprise establishment. J. Intell. Manuf. 25(6), 1441–1453 (2014)

    Article  Google Scholar 

  17. Li, W.; Xue, W.; Shu, J.; Zheng, W.: Dynamic hashing: adaptive metadata management for petabyte-scale file systems. In: Proceedings of IEEE NASA Goddard Conference on Mass Storage Systems and Technologies, pp. 1–6 (2006)

  18. Zhu, Y.; Jiang, H.: Efficient update control of bloom filter replicas in distributed systems, pp. 1–24 (2010)

  19. Broder A.Z., Mitzenmacher M.: Network applications of bloom filters: a survey. J. Internet Math. 1(4), 485–509 (2003)

    Article  MathSciNet  Google Scholar 

  20. Zhu Y., Jiang H., Wang J., Xian F.: HBA: Distributed metadata management for large cluster-based storage systems. IEEE Trans. Parallel Distrib. Syst. 19(6), 750–763 (2008)

    Article  Google Scholar 

  21. Li Q., Zhou J., Peng Q.R., Li C.Q., Wang C., Wu J., Shao B.-E.: Business processes oriented heterogeneous systems integration platform for networked enterprises. Comput. Ind. 61, 127–144 (2010)

    Article  Google Scholar 

  22. Chen, S.; Huang, X.; Xu, P.; Zheng, W.: Distributed metadata management based on hierarchical bloom filters in data grid. In: Proceedings of the IEEE ChinaGrid Conference (2009)

  23. Hua Y., Jiang H., Zhu Y., Feng D., Tian L.: Semantic-aware metadata organization paradigm in next-generation file systems. IEEE Trans. Parallel Distrib. Syst. 23(2), 337–344 (2012)

    Article  Google Scholar 

  24. Gray J., Liu D.T., Nieto-Santisteban M., Szalay A., DeWitt D.J., Heber G.: Scientific data management in the coming decade. ACM SIGMOD 34(4), 34–41 (2005)

    Article  Google Scholar 

  25. Maria H., Batistakis Y., Vazirgiannis M.: On clustering validation techniques. J. Intell. Inf. Syst. 17(2), 107–145 (2001)

    Google Scholar 

  26. Lei P.-R., Li S.-C., Peng W.C.: QS-STT: QuadSection clustering and spatial-temporal trajectory model for location prediction. J. Distrib. Parallel Databases 31, 231–258 (2013)

    Article  Google Scholar 

  27. Guha S., Meyerson A., Mishra N., Motwani R., O’Callaghan L.: Clustering data streams: theory and practice. IEEE Trans. Knowl. Data Eng. 15(3), 515–528 (2003)

    Article  Google Scholar 

  28. Guha S., Rastogi T., Shim K.: CURE: an efficient clustering algorithm for large databases. ACM SIGMOD Record 27(2), 73–84 (1998)

    Article  Google Scholar 

  29. Weil, S.A., Pollack, K.T.; Brandt, S.A.; Miller, E.L.: Dynamic metadata management for petabyte-scale file systems. In: Proceedings IEEE Computer Society conference on Supercomputing, pp. 172–180 (2004)

  30. Xiong, M.; Jin, H.; Wu, S.: FDSSS: an efficient metadata management scheme in large scale data environment. In: International Conference on Grid and Cooperative Computing Workshops, pp. 71–77 (2006)

  31. Ravimaran S, Mohamed MAM: Integrated Obj_FedRep: Evaluation of Surrogate Object based mobile cloud system for federation, replica and data management. Arab. J. Sci. Eng. 39(6), 4577–4592 (2014)

    Article  Google Scholar 

  32. Liu, C.; An, J.: Fast mining and updating frequent itemsets. In: Proceedings of the International Colloquium on Computing, Communication. Control and Management, vol. 1, pp. 365–368 (2008)

  33. Chan B.Y., Si A., Hong V.L.: Framework for cache management for mobile databases: design and evaluation. J. Distrib. Parallel Databases 10, 23–57 (2001)

    Article  MATH  Google Scholar 

  34. Choudhary A., Harding J.A., Tiwari M.K.: Data mining in manufacturing: a review based on the kind of knowledge. J. Intell. Manuf. 20(5), 501–521 (2009)

    Article  Google Scholar 

  35. Al-Haidari F., Sqalli M., Salah K.: Evaluation of the impact of EDoS attacks against cloud computing services. Arab. J. Sci. Eng. 40(3), 773–785 (2014)

    Article  Google Scholar 

  36. Dublin Core: Dublin core metadata element set. Version 1.1: Reference description. http://dublincore.org/documents/docs (2004)

  37. Liu X., Zhang Y., Wang B., Yan J.: Mona: secure multi-owner data sharing for dynamic groups in the cloud. IEEE Trans. Parallel Distrib. Syst. 24(6), 1182–1191 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Anitha.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Anitha, R., Mukherjee, S. ‘MaaS’: Fast Retrieval of Data in Cloud Using Metadata as a Service. Arab J Sci Eng 40, 2323–2343 (2015). https://doi.org/10.1007/s13369-015-1652-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13369-015-1652-7

Keywords

Navigation