Abstract
Data warehouse design is clearly dominated by the business perspective. Quite often, data warehouse administrators are lead to data models with little room for performance improvement. However, the increasing demands for interactive response time from the users make query performance one of the central problems of data warehousing today. In this paper we defend that data warehouse design must take into account both the business and the performance perspective from the beginning, and we propose the extension to typical design methodologies to include performance concerns in the early design steps. Specific analysis to predicted data warehouse usage profile and meta-data analysis are proposed as new inputs for improving the transition from logical to physical schema. The proposed approach is illustrated and discussed using the TPC-H performance benchmark and it is shown that significant performance improvement can be achieved without jeopardizing the business view required for data warehouse models.
Keywords
- Usage Profile
- Fact Table
- Significant Performance Improvement
- Performance Perspective
- Star Scheme
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
R. Kimball, “The Data Warehouse Toolkit”, John Willey & Sons, Inc.; 1996.
R. Kimball et. al, “The Data Warehouse Lifecycle Toolkit”, Ralph Kimbal, Ed. J. Wiley & Sons, Inc, 1998
E. F. Codd, S. B. Codd, C. T. Salley. “Beyond decision support”. ComputerWorld, 27(30), July 1993.
P. Valduriez. “Join Indices”. ACM TODS, Vol 12, N° 2, pp 218–246; June 1987.
P. O'Neil, D. Quass. “Improved Query Performance with Variant Indexes”. SIGMOD 1997.
T. Flanagan and E. Safdie. “Data Warehouse Technical Guide”. White Paper, Sybase 1997.
Marcus Jurgens and Hans-J. Lenz. “Tree Based Indexes vs. Bitmap Indexes: A Performance Study”. Proceedings of the Int. Workshop DMDW’99, Heidelberg, Germany, 1999.
S. Chauduri and U. Dayal. “An overview of data warehousing and OLAP technology”. SIGMOD Record, 26(1):65–74, March 1997.
Joseph M. Hellerstein, “Online Processing Redux”. Data Engineering Bulletin 20(3): 20–29 (1997).
P. Furtado and H. Madeira. “Analysis of Accuracy of Data Reduction Techniques”. First International Conference, DaWaK’99, Florence, Italy, Springer-Verlag, pp.377–388.
H. Boral, W. Alexander, L. Clay, G. Copeland, S. Danforth, M. Franklin, B. Hart, M. Smith & P. Valduriez. “Prototyping Bubba, A highly parallel database system”. IEEE Transactions on Knowledge and Data Engineering 2 (1990), 4–24.September 1990.
D. J. DeWitt et al.. “The Gamma Database Machine Project”. IEEE Trans. Knowledge and Data Engineering, Vol. 2, N°1, March 1990, pp.44–62.
G. Graefe. “Query evaluation techniques for large databases”. ACM Computing Surveys, 25(2):73–170, 1993.
Michael Stonebraker: “The Postgres DBMS”. SIGMOD Conference 1990: 394.
Tandem Database Group. “NonStop SQL: A Distributed, High-Performance, High-Availability Implementation of SQL”. HPTS 1987: 60–104.
J. Bernardino and H. Madeira, “Experimental Evaluation of a New Distributed Partitioning Technique for Data Warehouses”, IDEAS’01, Int. Symp. on Database Engineering and Applications, Grenoble, France, July, 2001.
J. Bernardino, P. Furtado, and H. Madeira, “Approximate Query Answering Using Data Warehouse Striping”, 3rd Int. Conf. on Data Warehousing and Knowledge Discovery, Dawak’01, Munich, Germany, 2001.
Matteo Golfarelli, Dario Maio, Stefano Rizzi, “Applying Vertical Fragmentation Techniques in Logical Design of Multidimensional Databases”, 2nd International Conference on Data Warehousing and Knowledge Discovery, Dawak’00, Greenwich, United Kingdom, September 2000.
L. Cabibbo, R. Torlone: “The Design and Development of a Logical System for OLAP”. DaWaK 2000: 1–10
The Transaction Processing Council. http://www.tpc.org.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bizarro, P., Madeira, H. (2002). Adding a Performance-Oriented Perspective to Data Warehouse Design. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_23
Download citation
DOI: https://doi.org/10.1007/3-540-46145-0_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44123-6
Online ISBN: 978-3-540-46145-6
eBook Packages: Springer Book Archive
