Inline Block Level Data De-duplication Technique for EXT4 File System

  • Rahul Shinde
  • Vinay Patil
  • Akshay Bhargava
  • Atul Phatak
  • Amar More
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 249)

Abstract

Data centers, and the data they hold, grow day by day. Data is a key asset of an organization and is therefore backed up at regular intervals. Because the data volumes are huge, data de-duplication techniques are used to improve the utilization and life span of the disks. Data de-duplication stores a single copy of the data on disk by finding and eliminating redundant copies. EXT4 has become a popular file system because it supports larger file system sizes and offers improved performance. An EXT4 file system can therefore be used to store backups, and data de-duplication can further increase the effective disk capacity and reduce the number of disk writes. In this paper we present a data de-duplication algorithm for the EXT4 file system. With this algorithm, duplicate data is eliminated before it is written to disk, and the extents in the EXT4 file system are arranged accordingly.
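The general idea of inline block-level de-duplication can be illustrated with a small in-memory sketch. This is not the algorithm from the paper, whose details are not given in the abstract. It assumes fixed 4 KiB blocks, SHA-256 fingerprints, and a Python list standing in for the disk. The names `DedupStore`, `write_file`, and `to_extents` are hypothetical. Each block is fingerprinted before it is written, a block whose fingerprint is already known is mapped to the existing copy, and the resulting block mapping is collapsed into extent-like runs.

```python
import hashlib

BLOCK_SIZE = 4096  # assumption: 4 KiB blocks, a common ext4 block size


class DedupStore:
    """Toy in-memory block store that writes each distinct block only once."""

    def __init__(self):
        self.blocks = []   # simulated disk: physical block number -> bytes
        self.index = {}    # SHA-256 digest -> physical block number
        self.writes = 0    # number of blocks actually written

    def write_file(self, data):
        """Return the physical block number used for each logical block."""
        mapping = []
        for off in range(0, len(data), BLOCK_SIZE):
            block = data[off:off + BLOCK_SIZE]
            digest = hashlib.sha256(block).digest()
            pbn = self.index.get(digest)
            if pbn is None:
                # First time this content is seen: write it and index it.
                # (A real system might also compare bytes to rule out collisions.)
                pbn = len(self.blocks)
                self.blocks.append(block)
                self.index[digest] = pbn
                self.writes += 1
            mapping.append(pbn)
        return mapping


def to_extents(mapping):
    """Collapse a block mapping into (logical_start, physical_start, length) runs."""
    extents = []
    for lbn, pbn in enumerate(mapping):
        if extents:
            l0, p0, n = extents[-1]
            if pbn == p0 + n:  # physically contiguous with the previous run
                extents[-1] = (l0, p0, n + 1)
                continue
        extents.append((lbn, pbn, 1))
    return extents


store = DedupStore()
a, b, c = (bytes([x]) * BLOCK_SIZE for x in b"ABC")
mapping = store.write_file(a + b + a + c)  # the third block repeats the first
print(mapping)              # [0, 1, 0, 2]
print(store.writes)         # 3
print(to_extents(mapping))  # [(0, 0, 2), (2, 0, 1), (3, 2, 1)]
```

In this example four logical blocks cost only three writes. The duplicate block also splits the file into three extents, which suggests why extent arrangement has to be handled together with de-duplication.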

Keywords

File system; EXT4 file system; Data de-duplication; Data backup

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Rahul Shinde (1)
  • Vinay Patil (1)
  • Akshay Bhargava (1)
  • Atul Phatak (1)
  • Amar More (1)

  1. MIT Academy of Engineering, Pune, India