Skip to main content

Managing Dynamic Mixed Workloads for Operational Business Intelligence

  • Conference paper
Book cover Databases in Networked Information Systems (DNIS 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5999))

Included in the following conference series:

Abstract

As data warehousing technology gains a ubiquitous presence in business today, companies are becoming increasingly reliant upon the information contained in their data warehouses to inform their operational decisions. This information, known as business intelligence (BI), traditionally has taken the form of nightly or monthly reports and batched analytical queries that are run at specific times of day. However, as the time needed for data to migrate into data warehouses has decreased, and as the amount of data stored has increased, business intelligence has come to include metrics, streaming analysis, and reports with expected delivery times that are measured in hours, minutes, or seconds. The challenge is that in order to meet the necessary response times for these operational business intelligence queries, a given warehouse must be able to support at any given time multiple types of queries, possibly with different sets of performance objectives for each type. In this paper, we discuss why these dynamic mixed workloads make workload management for operational business intelligence (BI) databases so challenging, review current and proposed attempts to address these challenges, and describe our own approach. We have carried out an extensive set of experiments, and report on a few of our results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arlitt, M.F.: Characterizing Web user sessions. SIGMETRICS Performance Evaluation Review 28(2), 50–63 (2000)

    Article  Google Scholar 

  2. Bach, F.R., Jordan, M.I.: Kernel Independent Component Analysis. Journal of Machine Learning Research 3, 1–48 (2003)

    Article  MATH  MathSciNet  Google Scholar 

  3. Benoit, D.G.: Automated Diagnosis and Control of DBMS Resources. In: EDBT PhD. Workshop (2000)

    Google Scholar 

  4. Carey, M.J., Livny, M., Lu, H.: Dynamic Task Allocation In A Distributed Database System. In: ICDCS, pp. 282–291 (1985)

    Google Scholar 

  5. Chaudhuri, S., Kaushik, R., Ramamurthy, R.: When Can We Trust Progress Estimators for SQL Queries? In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 575–586 (2005)

    Google Scholar 

  6. Chaudhuri, S., Narasayya, V., Ramamurthy, R.: Estimating Progress of Execution for SQL Queries. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 803–814 (2004)

    Google Scholar 

  7. Davison, D.L., Graefe, G.: Dynamic Resource Brokering for Multi-User Query Execution. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 281–292 (1995)

    Google Scholar 

  8. Dayal, U., Kuno, H., Wiener, J.L., Wilkinson, K., Ganapathi, A., Krompass, S.: Managing operational business intelligence workloads. SIGOPS Oper. Syst. Rev. 43(1), 92–98 (2009)

    Article  Google Scholar 

  9. Eeckhout, L., Vandierendonck, H., Bosschere, K.D.: How Input Data Sets Change Program Behaviour. In: 5th Workshop on Computer Architecture Evaluation Using Commercial Workloads (2002)

    Google Scholar 

  10. Elnaffar, S., Martin, P., Horman, R.: Automatically Classifying Database Workloads. In: Proc. of ACM Conference on Information and Knowledge Management (CIKM), pp. 622–624 (2002)

    Google Scholar 

  11. Ganapathi, A., Kuno, H., Dayal, U., Wiener, J., Fox, A., Jordan, M., Patterson, D.: Predicting Multiple Metrics for Queries: Better Decisions Enabled by Machine Learning. In: Proc. of the 21st Intl. Conf. on Data Engineering, ICDE (2009)

    Google Scholar 

  12. Gillin, P.: BI @ the Speed of Business. Computer World Technology (December 2007)

    Google Scholar 

  13. Gupta, C., Mehta, A.: PQR: Predicting Query Execution Times for Autonomous Workload Management. In: Proc. Intl Conf on Autonomic Computing, ICAC (2008)

    Google Scholar 

  14. Keeton, K., Patterson, D.A., He, Y.Q., Raphael, R.C., Baker, W.E.: Performance Characterization of a Quad Pentium Pro SMP using OLTP Workloads. In: The 25th Intl. Symposium on Computer Architecture (ISCA), pp. 15–26 (1998)

    Google Scholar 

  15. Krompass, S., Gmach, D., Scholz, A., Seltzsam, S., Kemper, A.: Quality of Service Enabled Database Applications. In: Dan, A., Lamersdorf, W. (eds.) ICSOC 2006. LNCS, vol. 4294, pp. 215–226. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  16. Krompass, S., Kuno, H., Dayal, U., Kemper, A.: Dynamic Workload Management for Very Large Data Warehouses: Juggling Feathers and Bowling Balls. In: Proc. of the 33rd Intl. Conf. on Very Large Data Bases, VLDB (2007)

    Google Scholar 

  17. Krompass, S., Kuno, H., Wiener, J.L., Wilkinson, K., Dayal, U., Kemper, A.: Managing long-running queries. In: EDBT 2009, pp. 132–143. ACM, New York (2009)

    Chapter  Google Scholar 

  18. Lo, J.L., Barroso, L.A., Eggers, S.J., Gharachorloo, K., Levy, H.M., Parekh, S.S.: An Analysis of Database Workload Performance on Simultaneous Multithreaded Processors. In: The 25th Intl. Symposium on Computer Architecture (ISCA), pp. 39–50 (1998)

    Google Scholar 

  19. Luo, G., Naughton, J.F., Ellmann, C.J., Watzke, M.W.: Toward a Progress Indicator for Database Queries. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, pp. 791–802 (2004)

    Google Scholar 

  20. Luo, G., Naughton, J.F., Ellmann, C.J., Watzke, M.W.: Increasing the Accuracy and Coverage of SQL Progress Indicators. In: Proc. of the 21st Intl. Conf. on Data Engineering (ICDE), pp. 853–864 (2005)

    Google Scholar 

  21. Luo, G., Naughton, J.F., Yu, P.S.: Multi-query SQL Progress Indicators. In: Ioannidis, Y., Scholl, M.H., Schmidt, J.W., Matthes, F., Hatzopoulos, M., Böhm, K., Kemper, A., Grust, T., Böhm, C. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 921–941. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  22. Markl, V., Lohman, G.: Learning Table Access Cardinalities with LEO. In: Proc. of the ACM SIGMOD Intl. Conf. on Management of Data, p. 613 (2002)

    Google Scholar 

  23. Mehta, M., DeWitt, D.J.: Dynamic Memory Allocation for Multiple-Query Workload. In: Proc. of the 19th Intl. Conf. on Very Large Data Bases (VLDB) (August 1993)

    Google Scholar 

  24. Moore, J., Chase, J., Farkas, K., Ranganathan, P.: Data Center Workload Monitoring, Analysis, and Emulation (2005)

    Google Scholar 

  25. Schroeder, B., Harchol-Balter, M., Iyengar, A., Nahum, E.M.: Achieving Class-Based QoS for Transactional Workloads. In: Proc. of the 22nd Intl. Conf. on Data Engineering (ICDE), p. 153 (2006)

    Google Scholar 

  26. Stillger, M., Lohman, G.M., Markl, V., Kandil, M.: LEO - DB2’s LEarning Optimizer. In: Proc. of the 27th Intl. Conf. on Very Large Data Bases (VLDB), pp. 19–28 (2001)

    Google Scholar 

  27. Weikum, G., Hasse, C., Mönkeberg, A., Zabback, P.: The COMFORT Automatic Tuning Project. Information Systems 19(5), 381–432 (1994)

    Article  Google Scholar 

  28. White, C.: The Next Generation of Business Intelligence: Operational BI. DM Review Magazine (May 2005)

    Google Scholar 

  29. Yoo, R.M., Lee, H., Chow, K., Lee, H.-H.S.: Constructing a Non-Linear Model with Neural Networks for Workload Characterization. In: IISWC, pp. 150–159 (2006)

    Google Scholar 

  30. Yu, P.S., Chen, M.-S., Heiss, H.-U., Lee, S.: On Workload Characterization of Relational Database Environments. Software Engineering 18(4), 347–355 (1992)

    Article  Google Scholar 

  31. Zhang, N., Haas, P.J., Josifovski, V., Lohman, G.M., Zhang, C.: Statistical Learning Techniques for Costing XML Queries. In: Proc. of the 31st Intl. Conf. on Very Large Data Bases (VLDB), pp. 289–300 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kuno, H., Dayal, U., Wiener, J.L., Wilkinson, K., Ganapathi, A., Krompass, S. (2010). Managing Dynamic Mixed Workloads for Operational Business Intelligence. In: Kikuchi, S., Sachdeva, S., Bhalla, S. (eds) Databases in Networked Information Systems. DNIS 2010. Lecture Notes in Computer Science, vol 5999. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12038-1_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12038-1_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12037-4

  • Online ISBN: 978-3-642-12038-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics