Abstract
Data backup and archiving are important aspects of business processes, protecting against loss due to system failures and natural calamities. As the volume of data and the number of applications grow, concerns about cost-efficient data preservation force organizations to look for inexpensive storage options. Addressing these concerns, we present Tape Cloud, a novel, highly cost-effective, unified storage solution. We leverage the notably economical nature of magnetic tapes to design a cloud storage infrastructure-as-a-service that provides a centralized storage platform for unstructured data generated by many diverse applications. We propose and evaluate a middleware that manages data and I/O requests, overcomes latencies, and improves the overall response time of the storage system. We analyze traces obtained from live archiving applications to derive workload characteristics. Based on this analysis, we synthesize archiving workloads and design suitable algorithms to evaluate the performance of the middleware and storage tiers. The results show that the middleware provides close to 100% improvement in task distribution efficiency within the system, leading to a 70% reduction in the overall response time of data retrieval from storage. Because it adapts easily to state-of-the-art storage practices, the middleware contributes to reducing storage costs for data archiving in cloud and colocated infrastructures.
Keywords
- Data Storage
- Backup
- Archiving
- Cloud
- Data Centers
- Cost Efficiency
- Magnetic Tapes
- Middleware
- Read Probability Weight
- Priority Queue
Copyright information
© 2013 IFIP International Federation for Information Processing
About this paper
Cite this paper
Prakash, V.S., Zhao, X., Wen, Y., Shi, W. (2013). Back to the Future: Using Magnetic Tapes in Cloud Based Storage Infrastructures. In: Eyers, D., Schwan, K. (eds) Middleware 2013. Lecture Notes in Computer Science, vol 8275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45065-5_17
DOI: https://doi.org/10.1007/978-3-642-45065-5_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45064-8
Online ISBN: 978-3-642-45065-5
eBook Packages: Computer Science, Computer Science (R0)