Advertisement

Dr. Hadoop: In Search of a Needle in a Haystack

  • Sabuzima Nayak
  • Ripon Patgiri
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11319)

Abstract

Dr. Hadoop is a framework to achieve the infinite scalability of metadata, hot-standby, high availability, automatic fault-tolerance, least hiccup time upon failure. The design is based on the Doubly Circular Linked List. However, the Dr. Hadoop considers the conventional hashing scheme of metadata. In this paper, Bloom Filter is integrated with Dr. Hadoop to boost up the metadata service performance. The conventional hashing is replaced by Bloom Filter in Dr. Hadoop to reduce extra space consumption. We propose to implement an existing Bloom Filter on the platform of the Dr. Hadoop. The rFilter is a variant of Bloom Filter, that is considered for the Dr Hadoop framework. We deploy rFilter in Dr. Hadoop platform and rFilter exhibits very good performance as compared to other variants of Bloom Filter. We conduct a series of rigorous experiments using Microsoft Traces to investigate the behavior of the rFilter on the Dr. Hadoop framework.

Keywords

Dr. Hadoop In-memory replication Bloom Filter Cuckoo filter rFilter Big data Metadata server Microsoft Traces Distributed system 

References

  1. 1.
    Dev, D., Patgiri, R.: Dr. Hadoop: an infinite scalable metadata management for Hadoop-how the baby elephant becomes immortal. Front. Inf. Technol. Electron. Eng. 17(1), 15–31 (2016)CrossRefGoogle Scholar
  2. 2.
    Fan, B., Andersen, D.G., Kaminsky, M., Mitzenmacher, M.D.: Cuckoo filter: practically better than bloom. In: Proceedings of the 10th ACM International on Conference on emerging Networking Experiments and Technologies, pp. 75–88. ACM (2014)Google Scholar
  3. 3.
    Grandi, F.: On the analysis of bloom filters. Inf. Process. Lett. 129, 35–39 (2018)MathSciNetCrossRefGoogle Scholar
  4. 4.
    SNIA: IOTTA: Storage networking industry’s association. Accessed 20 Apr 2018. http://iotta.snia.org/tracetypes/6
  5. 5.
    Pagh, R., Rodler, F.F.: Cuckoo hashing. J. Algorithms 51(2), 122–144 (2004)MathSciNetCrossRefGoogle Scholar
  6. 6.
    Patgiri, R.: MDS: in-depth insight. In: 2016 International Conference on Information Technology (ICIT), pp. 193–199, December 2016Google Scholar
  7. 7.
    Patgiri, R., Borgohain, S.K., Bhattacharjee, A.: rFilter: a scalable and space-efficient membership filter. In: 2018 5th International Conference on Signal Processing and Integrated Networks (SPIN), pp. 478–485 (2018)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.National Institute of Technology SilcharAssamIndia

Personalised recommendations