Simulation of Hierarchical Storage Systems for TCO and QoS
- 1.3k Downloads
Due to the variety of storage technologies deep storage hierarchies turn out to be the most feasible choice to meet performance and cost requirements when handling vast amounts of data. Long-term archives employed by scientific users are mainly reliant on tape storage, as it remains the most cost-efficient option. Archival systems are often loosely integrated into the HPC storage infrastructure. In expectation of exascale systems and in situ analysis also burst buffers will require integration with the archive. Exploring new strategies and developing open software for tape systems is a hurdle due to the lack of affordable storage silos and availability outside of large organizations and due to increased wariness requirements when dealing with ultra-durable data. Lessening these problems by providing virtual storage silos should enable community-driven innovation and enable site operators to add features where they see fit while being able to verify strategies before deploying on production systems. Different models for the individual components in tape systems are developed. The models are then implemented in a prototype simulation using discrete event simulation. The work shows that the simulations can be used to approximate the behavior of tape systems deployed in the real world and to conduct experiments without requiring a physical tape system.
KeywordsModeling Simulation Tape Long-term archive Hierarchical storage systems Performance Total cost of ownership
This work is part of the ESiWACE project which received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 675191.
- 1.Dashti, A., Shahabi, C.: Data placement techniques for serpentine tapes. In: Proceedings of the 33rd Hawaii International Conference on System Sciences, pp. 1–10 (2000). http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=927005
- 3.Fontana, R.E., Decad, G.M., Hetzler, S.R.: The impact of areal density and millions of square inches (MSI) of produced memory on petabyte shipments of TAPE, NAND flash, and HDD storage class memories. In: 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1–8. IEEE (2013). http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6558421
- 4.Hughes, J., Fisher, D., Dehart, K., Wilbanks, B., Alt, J.: HPSS RAIT Architecture. White paper of the HPSS collaboration (2009). http://www.hpss-collaboration.org/documents/HPSS_RAIT_Architecture.pdf
- 5.Koltsidas, I., Sarafijanovic, S., Petermann, M., Haustein, N., Seipp, H.: Seamlessly Integrating Disk and Tape in a Multi-tiered Distributed File System, pp. 1328–1339 (2015)Google Scholar
- 6.Lingfang Zeng, D.F.: Hybrid RAID-Tape-Library Storage System for Backup. In: Second International Conference on Embedded Software and Systems (ICESS 2005), pp. 31–36 (2005). http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1609854
- 7.Mäsker, M., Nagel, L., Süß, T., Brinkmann, A., Sorth, L.: Simulation and Performance Analysis of the ECMWF Tape Library System. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 22: 1–22: 12. SC 2016, IEEE Press, Piscataway (2016). http://dl.acm.org/citation.cfm?id=3014904.3014934
- 8.Pantazi, A., Furrer, S., Rothuizen, H.E., Cherubini, G., Jelitto, J., Lantz, M.A.: Nanoscale track-following for tape storage, pp. 2837–2843 (2015)Google Scholar
- 9.Pease, D., Amir, A., Villa Real, L., Biskeborn, B., Richmond, M., Abek, A.: The linear tape file system. In: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies, MSST2010 4 (2010)Google Scholar
- 10.Zhang, X., He, D., Du, D., Lu, Y.: Object placement in parallel tape storage systems. In: Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), pp. 0–7 (2006). http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1690610