Mining Queries for Constructing Materialized Views in a Data Warehouse

  • T. V. Vijay Kumar
  • Archana Singh
  • Gaurav Dubey
Part of the Advances in Intelligent Systems and Computing book series (volume 167)

Abstract

A data warehouse stores historical information, continuously being generated over time, to support decision making. The queries posed for decision making are usually exploratory, long, complex and analytical in nature. These queries, when posed against a large and continuously growing data warehouse, consume a lot of time for processing and thereby resulting in high response times. This problem of high response time can be addressed by constructing materialized views on the data warehouse. These views, which store data along with its definition, cannot be arbitrarily constructed as they need to contain relevant and required information for answering most future queries. The approach proposed in this paper attempts to identify such information, from previously posed queries on a data warehouse, using clustering and association rule mining techniques. The information identified using the approach is likely to answer most future queries in reduced query response times. As a result, the decision making would become more efficient.

Keywords

Subject Area Hash Table Support Count Query Similarity Query Response Time 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Agrawal, S., Chaudhari, S., Narasayya, V.: Automated Selection of Materialized Views and Indexes in SQL databases. In: 26th International Conference on Very Large Data Bases (VLDB 2000), Cairo, Egypt, pp. 495–505 (2000)Google Scholar
  2. 2.
    Ankerst, Breunig, Kriegel, Sander: OPTICS: Ordering Points to Identify the Clustering Structure. ACM SIGMOD Record Archive 28(2) (June 1999)Google Scholar
  3. 3.
    Aouiche, K., Darmont, J.: Data mining-based materialized view and index selection in data warehouse. Journal of Intelligent Information Systems, 65–93 (2009)Google Scholar
  4. 4.
    Baralis, E., Paraboschi, S., Teniente, E.: Materialized View Selection in a Multidimansional Database. In: 23rd International Conference on Very Large Data Bases (VLDB 1997), Athens, Greece, pp. 156–165 (1997)Google Scholar
  5. 5.
    Chirkova, R., Halevy, A.Y., Suciu, D.: A Formal Perspective on the View Selection Problem. In: Proceedings of VLDB, pp. 59–68 (2001)Google Scholar
  6. 6.
    Gupta, H., Mumick, I.S.: Selection of Views to Materialize in a Data warehouse. IEEE Transactions on Knowledge & Data Engineering 17(1), 24–43 (2005)CrossRefGoogle Scholar
  7. 7.
    Gupta, H., Harinarayan, V., Rajaraman, V., Ullman, J.: Index Selection for OLAP. In: Proceedings of the 13th International Conference on Data Engineering, ICDE 1997, Birmingham, UK (1997)Google Scholar
  8. 8.
    Harinarayan, V., Rajaraman, A., Ullman, J.D.: Implementing Data Cubes Efficiently. In: ACM SIGMOD, Montreal, Canada, pp. 205–216 (1996)Google Scholar
  9. 9.
    Horng, J.T., Chang, Y.J., Liu, B.J., Kao C. Y.: Materialized View Selection Using Genetic Algorithms in a Data warehouse System. In: Proceedings of the 1999 Congress on Evolutionary Computation, Washington DC, USA, vol. 3 (1999)Google Scholar
  10. 10.
    Inmon, W.H.: Building the Data Warehouse, 3rd edn. Wiley Dreamtech India Pvt. Ltd. (2003)Google Scholar
  11. 11.
    Lawrence, M.: Multiobjective Genetic Algorithms for Materialized View Selection in OLAP Data Warehouses. In: GECCO 2006, Seattle, Washington, USA, July 8-12 (2006)Google Scholar
  12. 12.
    Lehner, W., Ruf, T., Teschke, M.: Improving Query Response Time in Scientific Databases Using Data Aggregation. In: Proceedings of 7th International Conference and Workshop on Database and Expert Systems Applications, DEXA 1996 (1996)Google Scholar
  13. 13.
    Park, J.S., Chen, M., Yu, P.S.: An effective hash based algorithm for mining association rules. In: ACM SIGMOD International Conference on Management of Data (May 1995)Google Scholar
  14. 14.
    Rizzi, S., Saltarelli, E.: View Materialization vs. Indexing: Balancing Space Constraints in Data Warehouse Design. In: Eder, J., Missikoff, M. (eds.) CAiSE 2003. LNCS, vol. 2681, pp. 502–519. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  15. 15.
    Teschke, M., Ulbrich, A.: Using Materialized Views to Speed Up Data Warehousing. Technical Report, IMMD 6, Universität Erlangen-Nümberg (1997)Google Scholar
  16. 16.
    Theodoratos, D., Sellis, T.: Data Warehouse Configuration. In: Proceeding of VLDB, Athens, Greece, pp. 126–135 (1997)Google Scholar
  17. 17.
    Theodoratos, D., Xu, W.: Constructing Search Spaces for Materialized View Selection. In: 7th ACM Internatioanl Workshop on Data Warehousing and OLAP (DOLAP 2004), Washington, USA (2004)Google Scholar
  18. 18.
    Clemonsa, T.E., Bradley Jr., E.L.: A nonparametric measure of the overlapping coefficient. Published in Journal “Computational Statistics & Data Analysis” 34(1) (July 28, 2000)Google Scholar
  19. 19.
    Vijay Kumar, T.V., Ghoshal, A.: A reduced lattice greedy algorithm for selecting materialized views. In: Prasad, S.K., Routray, S., Khurana, R., Sahni, S. (eds.) ICISTM 2009. Communications in Computer and Information Science, vol. 31, pp. 6–18. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  20. 20.
    Vijay Kumar, T.V., Haider, M., Kumar, S.: Proposing candidate views for materialization. In: Prasad, S.K., Vin, H.M., Sahni, S., Jaiswal, M.P., Thipakorn, B. (eds.) ICISTM 2010. Communications in Computer and Information Science, vol. 54, pp. 89–98. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  21. 21.
    Vijay Kumar, T.V., Haider, M.: A Query Answering Greedy Algorithm for Selecting Materialized Views. In: Pan, J.-S., Chen, S.-M., Nguyen, N.T. (eds.) ICCCI 2010. LNCS, vol. 6422, pp. 153–162. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  22. 22.
    Vijay Kumar, T.V., Jain, N.: Selection of Frequent Queries for Constructing Materialized Views in Data Warehouse. The IUP Journal of Systems Management 8(2), 46–64 (2010)Google Scholar
  23. 23.
    Vijay Kumar, T.V., Goel, A., Jain, N.: Mining Information for Constructing Materialised Views. International Journal of Information and Communication Technology 2(4), 386–405 (2010)CrossRefGoogle Scholar
  24. 24.
    Vijay Kumar, T.V., Haider, M.: Greedy views selection using size and query frequency. In: Unnikrishnan, S., Surve, S., Bhoir, D. (eds.) ICAC3 2011. Communications in Computer and Information Science, vol. 125, pp. 11–17. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  25. 25.
    Vijay Kumar, T.V., Haider, M., Kumar, S.: A view recommendation greedy algorithm for materialized views selection. In: Dua, S., Sahni, S., Goyal, D.P. (eds.) ICISTM 2011. Communications in Computer and Information Science, vol. 141, pp. 61–70. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  26. 26.
    Vijay Kumar, T.V., Devi, K.: Frequent Queries Identification for Constructing Materialized Views. In: The proceedings of the International Conference on Electronics Computer Technology (ICECT 2011), April 8-10, vol. 6, pp. 177–181. IEEE, Kanyakumari (2011)Google Scholar
  27. 27.
    Kumar, T.V.V., Haider, M.: Selection of views for materialization using size and query frequency. In: Das, V.V., Thomas, G., Lumban Gaol, F. (eds.) AIM 2011. Communications in Computer and Information Science, vol. 147, pp. 150–155. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  28. 28.
    Widom, J.: Research Problems in Data Warehousing. In: 4th International Conference on Information and Knowledge Management, Baltimore, Maryland, pp. 25–30 (1995)Google Scholar
  29. 29.
    Yang, J., Karlapalem, K., Li, Q.: Algorithms for Materialized View Design in Data Warehousing Environment. The Very Large Databases (VLDB) Journal, 136–145 (1997)Google Scholar
  30. 30.
    Zhou, J., Larson, P., Goldstein, J., Ding, L.: Dynamic Materialized Views. In: IEEE 23rd International Conference on Data Engineering, Istanbul, pp. 526–535 (2007)Google Scholar

Copyright information

© Springer-Verlag GmbH Berlin Heidelberg 2012

Authors and Affiliations

  • T. V. Vijay Kumar
    • 1
  • Archana Singh
    • 2
  • Gaurav Dubey
    • 3
  1. 1.School of Computer and Systems SciencesJawaharlal Nehru UniversityNew DelhiIndia
  2. 2.Amity Institute of Information TechnologyNoidaIndia
  3. 3.Amity School of Computer SciencesNoidaIndia

Personalised recommendations