
Cost Modeling and Estimation for OLAP-XML Federations

Part of the Lecture Notes in Computer Science book series (LNCS, volume 2454)

Abstract

The ever-changing data requirements of today’s dynamic businesses are not handled well by current OLAP systems. Physical integration of data into OLAP systems is a time-consuming process, making logical federations the better choice in many cases. The increasing use of XML suggests that the required data will often be available in XML format. Thus, federations of OLAP and XML databases will be very attractive in many situations. In an efficient implementation of OLAP-XML federations, cost-based optimization is a must, creating a need for an effective cost model for OLAP-XML federations.

In this paper, we present a cost model for OLAP-XML federations and outline techniques for estimating the cost model parameters in a federated OLAP-XML environment. The paper also outlines the cost models for the OLAP and XML components on which the federation cost model is built. The cost model has been used as the basis for effective cost-based query optimization in OLAP-XML federations. Experiments show that the cost model is precise enough to make a substantial difference in the query optimization process.




Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pedersen, D., Riis, K., Pedersen, T.B. (2002). Cost Modeling and Estimation for OLAP-XML Federations. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_24


  • DOI: https://doi.org/10.1007/3-540-46145-0_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44123-6

  • Online ISBN: 978-3-540-46145-6

  • eBook Packages: Springer Book Archive
