Abstract
In this paper we propose a benchmark, called RTDW-bench, for testing a performance of a real-time data warehouse. The benchmark is based on TPC-H. In particular, RTDW-bench permits to verify whether an already deployed RTDW is able to handle without any delays a transaction stream of a given arrival rate. The benchmark also includes an algorithm for finding the maximum stream arrival rate that can be handled by a RTDW without delays. The applicability of the proposed benchmark was verified in a RTDW implemented in Oracle11g.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Transaction processing performance council, http://www.tpc.org
Acharya, S., Gibbons, P.B., Poosala, V.: Congressional samples for approximate answering of group-by queries. In: Proc. of ACM SIGMOD Int. Conf. on Management of Data, pp. 487–498 (2000)
Acharya, S., Gibbons, P.B., Poosala, V., Ramaswamy, S.: The Aqua approximate query answering system. SIGMOD Rec. 28, 574–576 (1999)
Chakrabarti, K., Garofalakis, M., Rastogi, R., Shim, K.: Approximate query processing using wavelets. The VLDB Journal 10, 199–223 (2001)
Colby, L.S., Kawaguchi, A., Lieuwen, D.F., Mumick, I.S., Ross, K.A.: Supporting multiple view maintenance policies. In: Proc. of ACM SIGMOD Int. Conf. on Management of Data, pp. 405–416 (1997)
Condie, T., Conway, N., Alvaro, P., Hellerstein, J.M., Gerth, J., Talbot, J., Elmeleegy, K., Sears, R.: Online aggregation and continuous query support in mapreduce. In: Proc. of ACM SIGMOD Int. Conf. on Management of Data, pp. 1115–1118. ACM (2010)
Domingos, P., Hulten, G.: Catching up with the data: Research issues in mining data streams. In: Proc. of ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (2001)
Golab, L., Johnson, T., Shkapenyuk, V.: Scheduling updates in a real-time stream warehouse. In: Proc. of Int. Conf. on Data Engineering (ICDE), pp. 1207–1210. IEEE Computer Society (2009)
Graefe, G., König, A.C., Kuno, H.A., Markl, V., Sattler, K.-U.: Robust query processing. Dagstuhl Seminar Proceedings, vol. 10381. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, Germany (2011)
Karakasidis, A., Vassiliadis, P., Pitoura, E.: Etl queues for active data warehousing. In: Proc. of Int. Workshop on Information Quality in Information Systems, pp. 28–39. ACM (2005)
Krueger, J., Tinnefeld, C., Grund, M., Zeier, A., Plattner, H.: A case for online mixed workload processing. In: Proc. of Int. Workshop on Testing Database Systems (DBTest). ACM (2010)
Poess, M., Nambiar, R.O., Walrath, D.: Why you should run tpc-ds: a workload analysis. In: Proc. of Int. Conf. on Very Large Data Bases (VLDB), pp. 1138–1149. VLDB Endowment (2007)
Polyzotis, N., Skiadopoulos, S., Vassiliadis, P., Simitsis, A., Frantzell, N.: Supporting streaming updates in an active data warehouse. In: Proc. of Int. Conf. on Data Engineering (ICDE), pp. 476–485. ACM (2007)
Sharaf, M.A., Chrysanthis, P.K., Labrinidis, A., Pruhs, K.: Algorithms and metrics for processing multiple heterogeneous continuous queries. ACM Trans. Database Syst. 33, 5:1–5:44 (2008)
Simitsis, A., Vassiliadis, P., Dayal, U., Karagiannis, A., Tziovara, V.: Benchmarking ETL Workflows. In: Nambiar, R., Poess, M. (eds.) TPCTC 2009. LNCS, vol. 5895, pp. 199–220. Springer, Heidelberg (2009)
Thiele, M., Fischer, U., Lehner, W.: Partition-based workload scheduling in living data warehouse environments. Information Systems 34(4-5), 382–399 (2009)
Tziovara, V., Vassiliadis, P., Simitsis, A.: Deciding the physical implementation of etl workflows. In: Proc. of ACM Int. Workshop on Data Warehousing and OLAP (DOLAP), pp. 49–56 (2007)
Wyatt, L., Caufield, B., Pol, D.: Principles for an ETL Benchmark. In: Nambiar, R., Poess, M. (eds.) TPCTC 2009. LNCS, vol. 5895, pp. 183–198. Springer, Heidelberg (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jedrzejczak, J., Koszlajda, T., Wrembel, R. (2012). RTDW-bench: Benchmark for Testing Refreshing Performance of Real-Time Data Warehouse. In: Liddle, S.W., Schewe, KD., Tjoa, A.M., Zhou, X. (eds) Database and Expert Systems Applications. DEXA 2012. Lecture Notes in Computer Science, vol 7447. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32597-7_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-32597-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32596-0
Online ISBN: 978-3-642-32597-7
eBook Packages: Computer ScienceComputer Science (R0)