Abstract
This work aims to develop a system that effectively solves the problem of storing and analyzing files containing text data, using modern software development tools, techniques, and approaches. The main challenge identified at the problem formulation stage, storing a large number of text documents, has to be resolved together with functionality such as full-text search and clustering of documents by their contents. The main features of the system can be described in terms of a distributed multilevel architecture and the flexibility and interchangeability of components, achieved by encapsulating standard functionality in independent executable modules.
Additional information
The article is published in the original.
About this article
Cite this article
Bogdanov, A., Ivashchenko, A. System of HPC content archiving. Phys. Part. Nuclei Lett. 14, 997–1000 (2017). https://doi.org/10.1134/S1547477117070056
DOI: https://doi.org/10.1134/S1547477117070056