Skip to main content

Adding a Performance-Oriented Perspective to Data Warehouse Design

Part of the Lecture Notes in Computer Science book series (LNCS,volume 2454)

Abstract

Data warehouse design is clearly dominated by the business perspective. Quite often, data warehouse administrators are lead to data models with little room for performance improvement. However, the increasing demands for interactive response time from the users make query performance one of the central problems of data warehousing today. In this paper we defend that data warehouse design must take into account both the business and the performance perspective from the beginning, and we propose the extension to typical design methodologies to include performance concerns in the early design steps. Specific analysis to predicted data warehouse usage profile and meta-data analysis are proposed as new inputs for improving the transition from logical to physical schema. The proposed approach is illustrated and discussed using the TPC-H performance benchmark and it is shown that significant performance improvement can be achieved without jeopardizing the business view required for data warehouse models.

Keywords

  • Usage Profile
  • Fact Table
  • Significant Performance Improvement
  • Performance Perspective
  • Star Scheme

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Kimball, “The Data Warehouse Toolkit”, John Willey & Sons, Inc.; 1996.

    Google Scholar 

  2. R. Kimball et. al, “The Data Warehouse Lifecycle Toolkit”, Ralph Kimbal, Ed. J. Wiley & Sons, Inc, 1998

    Google Scholar 

  3. E. F. Codd, S. B. Codd, C. T. Salley. “Beyond decision support”. ComputerWorld, 27(30), July 1993.

    Google Scholar 

  4. P. Valduriez. “Join Indices”. ACM TODS, Vol 12, N° 2, pp 218–246; June 1987.

    CrossRef  Google Scholar 

  5. P. O'Neil, D. Quass. “Improved Query Performance with Variant Indexes”. SIGMOD 1997.

    Google Scholar 

  6. T. Flanagan and E. Safdie. “Data Warehouse Technical Guide”. White Paper, Sybase 1997.

    Google Scholar 

  7. Marcus Jurgens and Hans-J. Lenz. “Tree Based Indexes vs. Bitmap Indexes: A Performance Study”. Proceedings of the Int. Workshop DMDW’99, Heidelberg, Germany, 1999.

    Google Scholar 

  8. S. Chauduri and U. Dayal. “An overview of data warehousing and OLAP technology”. SIGMOD Record, 26(1):65–74, March 1997.

    CrossRef  Google Scholar 

  9. Joseph M. Hellerstein, “Online Processing Redux”. Data Engineering Bulletin 20(3): 20–29 (1997).

    MathSciNet  Google Scholar 

  10. P. Furtado and H. Madeira. “Analysis of Accuracy of Data Reduction Techniques”. First International Conference, DaWaK’99, Florence, Italy, Springer-Verlag, pp.377–388.

    Google Scholar 

  11. H. Boral, W. Alexander, L. Clay, G. Copeland, S. Danforth, M. Franklin, B. Hart, M. Smith & P. Valduriez. “Prototyping Bubba, A highly parallel database system”. IEEE Transactions on Knowledge and Data Engineering 2 (1990), 4–24.September 1990.

    Google Scholar 

  12. D. J. DeWitt et al.. “The Gamma Database Machine Project”. IEEE Trans. Knowledge and Data Engineering, Vol. 2, N°1, March 1990, pp.44–62.

    CrossRef  Google Scholar 

  13. G. Graefe. “Query evaluation techniques for large databases”. ACM Computing Surveys, 25(2):73–170, 1993.

    CrossRef  Google Scholar 

  14. Michael Stonebraker: “The Postgres DBMS”. SIGMOD Conference 1990: 394.

    Google Scholar 

  15. Tandem Database Group. “NonStop SQL: A Distributed, High-Performance, High-Availability Implementation of SQL”. HPTS 1987: 60–104.

    Google Scholar 

  16. J. Bernardino and H. Madeira, “Experimental Evaluation of a New Distributed Partitioning Technique for Data Warehouses”, IDEAS’01, Int. Symp. on Database Engineering and Applications, Grenoble, France, July, 2001.

    Google Scholar 

  17. J. Bernardino, P. Furtado, and H. Madeira, “Approximate Query Answering Using Data Warehouse Striping”, 3rd Int. Conf. on Data Warehousing and Knowledge Discovery, Dawak’01, Munich, Germany, 2001.

    Google Scholar 

  18. Matteo Golfarelli, Dario Maio, Stefano Rizzi, “Applying Vertical Fragmentation Techniques in Logical Design of Multidimensional Databases”, 2nd International Conference on Data Warehousing and Knowledge Discovery, Dawak’00, Greenwich, United Kingdom, September 2000.

    Google Scholar 

  19. L. Cabibbo, R. Torlone: “The Design and Development of a Logical System for OLAP”. DaWaK 2000: 1–10

    Google Scholar 

  20. The Transaction Processing Council. http://www.tpc.org.

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bizarro, P., Madeira, H. (2002). Adding a Performance-Oriented Perspective to Data Warehouse Design. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_23

Download citation

  • DOI: https://doi.org/10.1007/3-540-46145-0_23

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44123-6

  • Online ISBN: 978-3-540-46145-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics