Abstract
The ever-changing data requirements of today’s dynamic businesses are not handled well by current OLAP systems. Physical integration of data into OLAP systems is a time-consuming process, making logical federations the better choice in many cases. The increasing use of XML suggests that the required data will often be available in XML format. Thus, federations of OLAP and XML databases will be very attractive in many situations. In an efficient implementation of OLAP-XML federations, cost-based optimization is a must, creating a need for an effective cost model for OLAP-XML federations.
In this paper we present a cost model for OLAP-XML federations, and outline techniques for estimating the cost model parameters in a federated OLAP-XML environment. The paper also outlines the cost models for the OLAP and XML components in the federation on which the federation cost model is built. The cost model has been used as the basis for effective cost-based query optimization in OLAP-XML federations. Experiments show that the cost model is precise enough to make a substantial difference in the query optimization process.
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
W. Du, R. Krishnamurthy, and M.-C. Shan. Query Optimization in a Heterogeneous DBMS. In Proceedings of the 18th VLDB Conference, pp. 277–291, 1992.
R. Elmasri and S. B. Navathe. Fundamentals of Database Systems. Addison-Wesley, 2000.
G. Gardarin, F. Sha, and Z.-H. Tang. Calibrating the Query Optimizer Cost Model of IRODB, an Object-Oriented Federated Database System. In Proceedings of the 22nd VLDB Conference, pp. 378–389, 1996.
H. Lu, K.-L. Tan, and S. Dao. The Fittest Survives: An Adaptive Approach to Query Optimization. In Proceedings of 21st VLDB Conference, pp. 251–262, 1995.
J. McHugh and J. Widom. Query Optimization For XML. In Proceedings of 25th VLDB Conference, pp. 315–326, 1999.
H. Naacke, G. Gardarin, A. Tomasic. Leveraging Mediator Cost Models with Heterogeneous Data Sources. In Proceedings of the 14th ICDE Conference, pp. 351–360, 1998.
C. Ozkan, A. Dogac, and M. Altinel. A Cost Model for Path Expressions in Object-Oriented Queries. Journal of Database Management 7(3), 1996.
D. Pedersen, K. Riis, and T. B. Pedersen. XML-Extended OLAP Querying. To appear in Proceedings of SSDBM, 2002.
D. Pedersen, K. Riis, and T. B. Pedersen. Query Processing and Optimization for OLAP-XML Federations. Submitted for publication, 2002.
D. Pedersen, K. Riis, and T. B. Pedersen. Cost Modeling and Estimation for OLAP-XML Federations. TR R02-5003, Department of Computer Science, Aalborg University, 2002.
T. B. Pedersen, A. Shoshani, J. Gu, and C. S. Jensen. Extending OLAP Querying To External Object Databases. In Proceedings of the 9th CIKM Conference, pp. 405–413, 2000.
M. T. Roth et al. Cost models do matter: Providing cost information for diverse data sources in a federated system. In Proceedings of 25th VLDB Conference, pp. 599–610, 1999.
A. P. Sheth and J. A. Larson. Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases. ACM Computing Surveys, 22(3):183–236, 1990.
A. Shukla et al. Storage Estimation for Multidimensional Aggregates in the Presence of Hierarchies. In Proceedings of 22nd VLDB Conference, pp. 522–531, 1996.
E. Thomsen. OLAP Solutions: Building Multidimensional Information Systems. Wiley, 1997.
Q. Zhu and P.-Å. Larson. Global Query Processing and Optimization in the CORDS Multidatabase System. In Proceedings of the 9th PDCS Conference, pp. 640–646, 1996.
Q. Zhu, Y. Sun, and S. Motheramgari. Developing Cost Models with Qualitative Variables for Dynamic Multidatabase Environments. In Proceedings of the 16th ICDE Conference, pp. 413–424, 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pedersen, D., Riis, K., Pedersen, T.B. (2002). Cost Modeling and Estimation for OLAP-XML Federations. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_24
Download citation
DOI: https://doi.org/10.1007/3-540-46145-0_24
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44123-6
Online ISBN: 978-3-540-46145-6
eBook Packages: Springer Book Archive
