Skip to main content

Quality and Cost of Data Warehouse Views

  • Chapter
  • 2375 Accesses

Abstract

Query rewriting has been used as a query optimization technique for several decades to reduce the computational cost of a query. Traditional problems in query rewriting include in particular query optimization au][28, 60, 6] and rewriting queries using views [40, 7]. Most of these works deal with the problem of maintaining the exact original interface (schema) and extent of a given query while optimizing performance.

This work was in part supported by several NSF grants, namely, the NSF NYI grant #IRI 9796264, NSF CISE Instrumentation Grant #IRIS 9729878, and the NSF grant #IIS 9988776.

This work was performed while Andreas Koeller was a Research Assistant at Worcester Polytechnic Institute.

This work was performed while Amy Lee was a Research Assistant at Worcester Polytechnic Institute and a Ph.d. student at the University of Michigan, Ann Arbor.

This work was performed while Anisoara Nica was a Ph.d. student at the University of Michigan, Ann Arbor.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   429.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Abiteboul, R. Goldman, J. McHugh, V. Vassalos, and Y. Zhuge. Views for semistructured data. In Workshop on Management of Semistructured Data, Tucson, Arizona, 1997.

    Google Scholar 

  2. Serge Abiteboul and Oliver M. Duschka. Complexity of answering queries using materialized views. In ACM, editor, Proceedings of ACM Symposium on Principles of Database Systems, pages 254–263, New York, NY 10036, USA, 1998. ACM Press.

    Google Scholar 

  3. Serge Abiteboul, Jason McHugh, Michael Rys, Vasilis Vassalos, and Janet L. Wiener. Incremental maintenance for materialized views over semistructured data. In Proc. 24th Int. Conf. very Large Data Bases, VLDB, pages 38–49, 1998.

    Google Scholar 

  4. D. Agrawal, A. El Abbadi, A. Singh, and T. Yurek. Efficient View Maintenance at Data Warehouses. In Proceedings of SIGMOD, pages 417–427, 1997.

    Google Scholar 

  5. Y. Arens, C. A. Knoblock, and W-M. Shen. Query Reformulation for Dynamic Information Integration. Journal of Intelligent Information Systems, 6 (2/3):99–130, 1996.

    Article  Google Scholar 

  6. J. A. Blakeley, P.-E. Larson, and F. W. Tompa. Efficiently Updating Materialized Views. Proceedings of SIGMOD, pages 61–71,1986.

    Google Scholar 

  7. S. Chaudhuri, R. Krishnamurthy, and S. Potamianos. Optimizing Query with Materialized Views. In Proceedings of IEEE International Conference on Data Engineering, 1995.

    Google Scholar 

  8. W. W. Chu, M. A. Merzbacher, and L. Berkovich. The Design and Implementation of CoBase. SIGMOD Record, 22(2):517–522, June 1993.

    Article  Google Scholar 

  9. Wesley W. Chu, Hua Yang, Kuorong Chiang, Michael Minock, Gladys Chow, and Chris Larson. CoBase: A scalable and extensible cooperative information system. Intelligent Information Systems (JIIS), 6(2/3):223–259, 1996.

    Article  Google Scholar 

  10. W. Du, R. Krishnamurthy, and M-C. Shan. Query Optimization in Heterogeneous DBMS. International Conference on very Large Data Bases, pages 277–291, 1992.

    Google Scholar 

  11. Oliver M. Duschka. Query Planning and Optimization in Information Integration. PhD thesis, Stanford University, Stanford, California, December 1997.

    Google Scholar 

  12. Oliver M. Duschka and Michael R. Genesereth. Answering recursive queries using views. In ACM, editor, Proceedings of ACM Symposium on Principles of Database Systems, pages 109–116, New York, NY 10036, USA, 1997. ACM Press.

    Google Scholar 

  13. R. Elnlasri and S. B. Navathe. Fundamentals of Database Systems. The Benjamin/Cummings Publishing Company, Inc., 1994.

    Google Scholar 

  14. Oren Etzioni and Daniel Weld. A Softbot-based interface to the Internet. Communications of the ACM, 37(7):72–76, July 1994.

    Article  Google Scholar 

  15. Sumit Ganguly, Phillip B. Gibbons, Yossi Matias, and Avi Silberschatz. Bifocal sampling for skew-resistant join size estimation. SIGMOD Record, 25(2):271–281, June 1996.

    Article  Google Scholar 

  16. Michael R. Genesereth, Arthur M. Keller, and Oliver M. Duschka. Infomaster: An information integration system. SIGMOD Record (ACM Special Interest Group on Manayement of Data), 26(2):539ff., 1997.

    Google Scholar 

  17. G. Graefe, R. L. Cole, D. L. Davison, W J. McKenna, and R. H. Wolniewicz. Extensible query optimization and parallel execution in volcano. In J. C. Freytag, G. Vossen and D. Maier, editor, Query Processing for Advanced Database Applications, page 305. Morgan Kaufmann, San Francisco, CA, 1994.

    Google Scholar 

  18. Goetz Graefe and William J. McKenna. The volcano optimizer generator: Extensibility and efficient search. In Proceedings of IEEE International Conference on Data Engineering, pages 209–218. IEEE Computer Society, 1993.

    Google Scholar 

  19. A. Gupta, H. V. Jagadish, and I. S. Mumick. Data Integration using Self-Maintainable Views. In Proceedings of International Conference on Extending Database Technology (EDBT), pages 140–144, 1996.

    Google Scholar 

  20. A. Gupta, I. S. Mumick, and V. S. Subrahmanian. Maintaining Views Incrementally. In Proceedings of SIGMOD, pages 157–166, 1993.

    Google Scholar 

  21. A. Gupta and I. S. Mumick. Maintenance of Materialized Views: Problems, Techniques, and Applications. IEEE Data Engineering Bulletin, Special Issue on Materialized Views and Warehousing, 18(2):3–19, 1995.

    Google Scholar 

  22. Peter J. Haas, Jeffrey F. Naughton, S. Seshadri, and Lynne Stokes. Sampling-based estimation of the number of distinct values of an attribute. In International Conference on very Large Data Bases, pages 311–322, 1995.

    Google Scholar 

  23. Peter J. Haas, Jeffrey F. Naughton, S. Seshadri, and Arun N. Swami. Fixed-precision estimation ofjoin selectivity. In Proceedings of ACM Symposium on Principles of Database Systems, pages 190–201. ACM Press, May 1993.

    Google Scholar 

  24. Peter J. Haas and A. N. Swami. Sampling-based selectivity estimation for joins using augmented frequent value statistics. In Proceedings of IEEE International Conference on Data Engineering, pages 522–531, 1995.

    Google Scholar 

  25. J. Hammer, Héctor García-Molina, S. Nestorov, R. Yerneni, M. Breunig, and V. Vassalos. Template-Based Wrappers in the TSIMMIS System. In Proceedings of SIGMOD, pages 532–535, 1997.

    Google Scholar 

  26. Wen-Chi Hou and Gultekin Özsoyoĝlu. Statistical estimators for aggregate relational algebra queries. ACM Transactions on Database Systems, 16(4):600–654, December 1991.

    Article  Google Scholar 

  27. M. Jarke and J. Koch. Query Optimization in Database Systems. ACM Computing Surveys, pages 111–152,1984.

    Google Scholar 

  28. A. Koeller, E. A. Rundensteiner, and N. Hachem. Integrating the Rewriting and Ranking Phases of View Synchronization. In Proceedings of the ACM First International Workshop on Data Warehousing and OLAP (DOLAP’98), pages 60–65, November 1998.

    Google Scholar 

  29. Andreas Koeller and Elke A. Rundensteiner. Discovery of high-dimensional inclusion dependencies. Technical Report WPI-CS-TR-02-15, Worcester Polytechnic Institute, Dept. of Computer Science, 2002.

    Google Scholar 

  30. Donald Kossmann and Konrad Stocker. Iterative dynamic programming: a new class of query optimization algorithms. ACM Transactions on Database Systems, 25(1):43–82, March 2000.

    Article  Google Scholar 

  31. A. J. Lee, A. Koeller, A. Nica, and E. A. Rundensteiner. Data Warehouse Evolution: Trade-offs between Quality and Cost of Query Rewritings. In Proceedings of IEEE International Conference on Data Engineering, Special Poster Session, page 255, March, Sydney, Australia 1999.

    Google Scholar 

  32. A. J. Lee, A. Koeller, A. Nica, and E. A. Rundensteiner. Non-Equivalent Query Rewritings. In Proceedings of the 9th International Databases Conierence, pages 248–262. City University of Hong Kong Press, Hong Kong, July 1999.

    Google Scholar 

  33. A.J. Lee, A. Nica, and E. A. Rundensteiner. Keeping Virtual Information Resources Up and Running. In Proceedings of IBM Centrefor Advanced Studies Conference (CASCON’97), Best Paper Award, pages 1–14, November 1997.

    Google Scholar 

  34. A. Levy, I. S. Mumick, Y. Sagiv, and O. Shmueli. Equivalence, query reachability and satisfiability in datalog extensions. In Proceedings of the Twelfth ACM SICACT-SIGMOD-SICART Symposium 011 Principles of Database Systems, pages 109–122, Washington, DC, 25–28 May 1993.

    Google Scholar 

  35. A. Y. Levy, Inderpal Singh Mumick, and Y. Sagiv. Query optimization by predicate move-around. In Jorgeesh Bocca, Matthias Jarke, and Carlo Zaniolo, editors, International Conierence on very Large Data Bases, pages 96–107, Los Altos, CA 94022, USA, 1994. Morgan Kaufmann Publishers.

    Google Scholar 

  36. A. Y. Levy, A. Rajaraman, and J. D. Ullman. Answering queries using limited external processors. In pods, pages 227–237, Montreal, Canada, 3–5 June 1996.

    Google Scholar 

  37. Alon Levy and Yehoshua Sagiv. Constraints and Redundancy in Datalog. In Proceedings of the Eleventh ACM SICACT-SIGMOD-SICART Symposium on Principles of Database Systems, June 2–4, 1992, San Diego, CA, pages 67–80, 1992.

    Google Scholar 

  38. Alon Y. Levy, Divesh Srivastava, and Thomas Kirk. Data model and query evaluation in global informarion systems. Journal of Intelligent Information Systems—Special Issue on Networked Information Discovery and Retrieval, 5(2):121–143,1995.

    Google Scholar 

  39. A. Y. Levy, A. O. Mendelzon, and Y. Sagiv. Answering Queries Using Views. In Proceedings ofACM Symposium on Principles of Database Systems, pages 95–104, May 1995.

    Google Scholar 

  40. C. Li, R. Yerneni, V. Vassalos, Héctor García-Molina, Y. Papakonstantinou, J. D. Ullman, and M. Valiveti. Capability Based Mediation in TSIMMIS. In Proceedings of SIGMOD, pages 564–566, 1998.

    Google Scholar 

  41. A. Nica. View Evolution Support for Information Integration Systems over Dynamic Distributed Information Spaces. PhD thesis, University of Michigan in Ann Arbor, in progress 1999.

    Google Scholar 

  42. A. Nica, A.J. Lee, and E. A. Rundensteiner. The CVS Algorithm for View Synchronization in Evolvable Large-Scale Information Systems. In Proceedings of International Conference on Extending Database Iechnology (EDBT’98), pages 359–373, Valencia, Spain, March 1998.

    Google Scholar 

  43. A. Nica and E. A. Rundensteiner. On Translating Loosely-Specified Queries into Executable Plans in Large-Scale Information Systems. In Proceedings of Second IFCIS International Conierence on Cooperative Information Systems CoopIS’97, pages 213–222, June 1997.

    Google Scholar 

  44. A. Nica and E. A. Rundensteiner. Using Containment Information for View Evolution in Dynamic Distributed Environments. In Proceedings of International Workshop on Data Warehouse Design and OLAP Technology (DWDOT’98), Vienna, Austria, August 1998.

    Google Scholar 

  45. Gregory Piatetsky-Shapiro and Charles Connell. Accurate estimation ofthe number of tuples satisfying a condition. SIGMOD Record, 14(2):256–276, 1984.

    Article  Google Scholar 

  46. Viswanath Poosala and Yannis E. Ioannidis. Selectivity estimation without the attribute value independence assumption. In International Conference on very Laroe Data Bases, pages 486–495, 1997.

    Google Scholar 

  47. D. Quass and J. Widom. On-Line Warehouse View Maintenance. In Proceedings of SIGMOD, pages 393–400, 1997.

    Google Scholar 

  48. A. Rajaraman, Y. Sagiv, and J. D. Ullman. Answering Queries Using Templates With Binding Patterns. In Proceedinos of ACM Symposium on Principles of Database Systems, pages 105–112, May 1995.

    Google Scholar 

  49. A. Rajaraman and J. D. Ullman. Integrating Information by Outerjoins and Full Disjunctions. In Proceedings of ACM Symposium on Principles of Database Systems, pages 238–248, 1996.

    Google Scholar 

  50. E. A. Rundensteiner, A. Koeller, A. Lee, Y. Li, A. Nica, and X. Zhang. Evolvable View Environment (EVE) Project: Synchronizing Views over Dynamic Distributed Information Sources. In Demo Session Proceedings of International Conference on Extending Database Technology (EDBT’98), pages 41–42, Valencia, Spain, March 1998.

    Google Scholar 

  51. E. A. Rundensteiner, A. Koeller, X. Zhang, A. Lee, A. Nica, A. VanWyk, and Y. Li. Evolvable View Environment. In Proceedinys of SIGMOD’99 Demo Session, pages 553–555, May 1999.

    Google Scholar 

  52. E. A. Rundensteiner, A. J. Lee, and A. Nica. On Preserving Views in Evolving Environments. In Proceedinos of 4th Int. Workshop on Knowledge Representation Meets Databases (KRDB’97): Intelligent Access to Heterogeneous Information, pages 13.1–13.11, Athens, Greece, August 1997.

    Google Scholar 

  53. Elke A. Rundensteiner, Andreas Koeller, and Xin Zhang. Maintaining Data Warehouses over Changing Information Sources. Communications of the ACM, pages 57–62, June 2000.

    Google Scholar 

  54. Torsten Schlieder. Schema-driven evaluation of approximate tree-pattern queries. In Proceedings of International Conference on Extending Database Technology (EDBT), volume LNCS 2287, pages 514–532. Springer, 2002.

    Google Scholar 

  55. Patricia G. Selinger, Morton M. Astrahan, Donald D. Chamberlin, Raymond A. Lorie, and Thomas G. Price. Access path selection in a relational database management system. In Proceedings of SIGMOD, pages 23–34. ACM, 1979.

    Google Scholar 

  56. D. Srivastava, S. Dar, H. V. Jagadish, and A.Y. Levy. Answering Queries with Aggregation Using Views. In International Conference on very Large Data Bases, pages 318–329, 1996.

    Google Scholar 

  57. Anja Theobald and Gerhard Weikum. Adding relevance to XML. Lecture Notes in Computer Science, 1997:105-??, 2001.

    Google Scholar 

  58. Anja Theobald and Gerhard Weikum. The index-based XXL search engine for querying XML data with relevance ranking. In Proceedings of International Conference on Extending Database Technology (EDBT), volume LNCS 2287, pages 477–495. Springer, 2002.

    Google Scholar 

  59. C. A. van den Berg and M. L. Kersten. An Analysis of a Dynamic Query Optimization Schema for Different Data Distributions. In J. C. Freytag, D. Maier, and G. Vossen, editors, Query Processing for Advanced Database Systems, chapter 15, pages 449–473. Morgan Kaufmann Pub., 1994.

    Google Scholar 

  60. S. B. Yao. An Attribute Based Model for Database Access Cost Analysis. ACM Transactions on Database Systems (TODS), 2(1):45–67, March 1977.

    Article  Google Scholar 

  61. X. Zhang, L. Ding, and E. A. Rundensteiner. PSWEEP: Parallel View Maintenance Under Concurrent Data Updates of Distributed Sources. Technical Report WPI-CS-TR-99-14, Worcester Polytechnic Institute, Computer Science Department, May 1999.

    Google Scholar 

  62. X. Zhang and E. A. Rundensteiner. The SDCC Framework for Integrating Existing Algorithms for Diverse Data Warehouse Maintenance Tasks. In International Database Engineering and Application Symposium, pages 206–214, Montreal, Canada, August, 1999.

    Google Scholar 

  63. Xin Zhang, Elke A. Rundensteiner, and Lingli Ding. PVM: Parallel View Maintenance Under Concurrent Data Updates of Distributed Sources. In Data Warehousing and Knowledge Discovery, Proceedings, Munich, Germany, September 2001. 230–239.

    Google Scholar 

  64. Y. Zhuge, Héctor García-Molina, J. Hammer, and J. Widom. View Maintenance in a Warehousing Environment. In Proceedings of SIGMOD, pages 316–327, May 1995.

    Google Scholar 

  65. Y. Zhuge, Héctor García-Molina, and J. L. Wiener. The Strobe Algorithms for Multi-Source Warehouse Consistency. In International Conference on Parallel and Distributed Information Systems, pages 146–157, December 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Kluwer Academic Publishers

About this chapter

Cite this chapter

Koeller, A., Rundensteiner, E.A., Lee, A., Nica, A. (2005). Quality and Cost of Data Warehouse Views. In: Leondes, C.T. (eds) Intelligent Knowledge-Based Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4020-7829-3_17

Download citation

  • DOI: https://doi.org/10.1007/978-1-4020-7829-3_17

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4020-7746-3

  • Online ISBN: 978-1-4020-7829-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics