Measuring Database Performance in Online Services: A Trace-Based Approach
Many large-scale online services use structured storage to persist metadata and sometimes data. The structured storage is typically provided by standard database servers such as Microsoft’s SQL Server. It is important to understand the workloads seen by these servers, both for provisioning server hardware as well as to exploit opportunities for energy savings and server consolidation. In this paper we analyze disk I/O traces from production servers in four internet services as well as servers running TPC benchmarks. We show using a range of load metrics that the services differ substantially from each other and from standard TPC benchmarks. Online services also show significant diurnal patterns in load that can be exploited for energy savings or consolidation. We argue that TPC benchmarks do not capture these important characteristics and argue for developing benchmarks that can be parameterized with workload features extracted from live production workload traces.
Keywordsonline services TPC benchmarks storage traces storage performance data centers capacity planning
Unable to display preview. Download preview PDF.
- 1.Hamilton, J.: Internet-scale service efficiency (September 2008), http://mvdirona.com/jrh/TalksAndPapers/JamesRH_Ladis2008.pdf
- 2.Narayanan, D., Donnelly, A., Rowstron, A.: Write off-loading: Practical power management for enterprise storage. In: Proc. USENIX Conference on File and Storage Technologies (FAST), San Jose, CA, February 2008, pp. 256–267 (2008)Google Scholar
- 3.Narayanan, D., Donnelly, A., Thereska, E., Elnikety, S., Rowstron, A.: Everest: Scaling down peak loads through I/O off-loading. In: Proc. Symposium on Operating Systems Design and Implementation (OSDI), San Diego, CA, December 2008, pp. 15–28 (2008)Google Scholar
- 4.Narayanan, D., Thereska, E., Donnelly, A., Elnikety, S., Rowstron, A.: Migrating enterprise storage to SSDs: analysis of tradeoffs. In: Proceedings of EuroSys 2009, Nuremberg, Germany. ACM, New York (2009)Google Scholar
- 5.Kavalanekar, S., Worthington, B., Zhang, Q., Sharda, V.: Characterization of storage workload traces from production Windows servers. In: Proc. IEEE International Symposium on Workload Characterization (IISWC), Austin, TX, October 2008, pp. 119–128 (2008)Google Scholar
- 6.Zhu, Q., Chen, Z., Tan, L., Zhou, Y., Keeton, K., Wilkes, J.: Hibernator: Helping disk arrays sleep through the winter. In: Proceedings of the ACM Symposium on Operating Systems Principles (SOSP 2005), Brighton, United Kingdom (October 2005)Google Scholar
- 7.Narayanan, D., Thereska, E., Ailamaki, A.: Continuous resource monitoring for self-predicting DBMS. In: Proceedings of IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS 2005), Atlanta, GA. IEEE, Los Alamitos (2005)Google Scholar
- 8.Sankar, S., Vaid, K.: Storage characterization for unstructured data in online services applications. In: Proc. IEEE International Symposium on Workload Characterization (IISWC) (to appear)Google Scholar
- 9.SNIA: IOTTA repository (January 2009), http://iotta.snia.org/