Skip to main content

Data Warehouses in Grids with High QoS

  • Conference paper
Data Warehousing and Knowledge Discovery (DaWaK 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4081))

Included in the following conference series:

Abstract

Data warehouses are repositories of large amounts of historical data and are used primarily for decision support purposes. On the other hand, grids consist on the aggregation of distributed computational resources and presentation of these as a single service with a common interface. The deployment of distributed data warehouses on a grid architecture with QoS control strategies could lead to high levels of flexibility, scalability, reliability and efficiency. However, due to grids characteristics, it could also lead to great challenges. In this paper we investigate an efficient architecture to deploy large data warehouses in grids with high availability and good load balancing. We propose architecture and present experimental results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bruneo, D., Scarpa, M., Zaia, A., Puliato, A.: Communication Paradigms for Mobile Grid Users. In: Procs. of the 3rd IEEE/ACM Intl. Symposium on Cluster Computing and the Grid (CCGRID) (2003)

    Google Scholar 

  2. Bell, W.H., Cameron, D.G., Carvajal-Schiaffino, R., Millar, A.P., Stockinger, K., Zini, F.: Evaluation of an Economy-Based File Replication Strategy for a Data Grid. In: Intl. Workshop on Agent based Cluster and Grid Computing at CCGrid, IEEE Computer Society Press, Los Alamitos (2003)

    Google Scholar 

  3. Baralis, E., Paraboschi, S., Teniente, E.: Materialzed View Selection in a Multidimensional Database. In: Proc. of 23rd Intl. Conf. on Very Large Databases (VLDB) (1997)

    Google Scholar 

  4. Chaudhuri, S., Dayal, U.: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26(1) (1997)

    Google Scholar 

  5. Corte, A.L., Puliato, A., Tomarchio, O.: An agent-based framework for mobile users. In: ERSADS 1999 (1999)

    Google Scholar 

  6. Cao, J., Spooner, D.P., Jarvis, S.A., Nudd, G.R.: Grid Load Balancing Using Intelligent Agents Future Generation Computer Systems. Special issue on Intelligent Grid Environments: Principles and Applications 21(1), 135–149 (2005)

    Google Scholar 

  7. DeWitt, D., et al.: The Gamma Database Machine Project. IEEE Knowledge and Data Engineering 2(1) (March 1990)

    Google Scholar 

  8. DeWitt, D.J., Gray, J.: Parallel Database Systems: The Future of High Per-formance Database Processing. Communications of the ACM (1992)

    Google Scholar 

  9. Dobbera, M., Koolea, G., van der Mei, R.: Dynamic Load Balancing Experiments in a Grid. In: Proc. of the 5th IEEE/ACM Intl. Symposium on Cluster Computing and the Grid (CCGrid 2005) (2005)

    Google Scholar 

  10. DeWitt, D.J., Naughton, J.F., Schneider, D.A., Seshadri, S.: Pratical Skew Handling in Parallel Joins. In: Proc. of 18ht Intl. Conf. on Very Large Databases (VLDB), pp. 27–40 (1992)

    Google Scholar 

  11. Erwin, D.W., Snelling, D.F.: UNICORE: A Grid computing environment. In: Sakellariou, R., Keane, J.A., Gurd, J.R., Freeman, L. (eds.) Euro-Par 2001. LNCS, vol. 2150, Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  12. Foster, I.: The anatomy of the Grid: Enabling scalable virtual organizations. Concurrency and Computation: Practice and Experience 13 (2001)

    Google Scholar 

  13. Furtado, P.: Workload-Based Placement and Join Processing in Node-Partitioned Data Warehouses. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2004. LNCS, vol. 3181, pp. 38–47. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  14. Furtado, P.: Experimental Evidence on Partitioning in Parallel Data Warehouses. In: Procs. of the 7th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP) (2004)

    Google Scholar 

  15. Furtado, P.: Replication in Node Partitioned Data Warehouses. In: VLDB Workshop on Design, Implementation, and Deployment of Database Replication (DIDDR) (2005)

    Google Scholar 

  16. Gray, J., et al.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab and Sub-Totals. In: Data-Mining and Knowledge Discovery (1997)

    Google Scholar 

  17. Hua, K.A., Young, H.C.: Desining a Highly Parallel Database Server Using Off-the-shelf Components. In: Procs. of the Int’I Computer Symposium, pp. 47–54 (1990)

    Google Scholar 

  18. Lu, H., Tan, K.: Load-Balanced Join Processing in Shared-Nothing Systems. Journal of Parallel and Distributed Computing 23, 382–398 (1994)

    Article  Google Scholar 

  19. Märtens, H., Rahm, E., Stöhr, T.: Dynamic Query Scheduling in Parallel Data Warehouses. In: Monien, B., Feldmann, R.L. (eds.) Euro-Par 2002. LNCS, vol. 2400, pp. 321–331. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  20. Min, W.H., Wilson, W.Y., Ngi, Y.H., Donghong, W., Zhixiang, L., Hong, L.K., Y.K.L.: Dynamic Storage Resource Management Framework for the Grid. In: Procs. of the 22nd IEEE/13th NASA Goddard Conf. on Mass Storage Systems and Technologies (MSST) (2005)

    Google Scholar 

  21. Nieto-Santisteban, M.A., Gray, J., Szalay, A.S., Annis, J., Thakar, A.R., O’Mullane, W.: When Database Systems Meet the Grid. In: Second Biennial Conf. on Innovative Data Systems Research (CIDR). Online Proceedings, pp. 154–161 (2005)

    Google Scholar 

  22. O’Neil, P., Graefe, G.: Multi-Table Joins Throug Bitmapped Join Indices. ACM SIGMOD Record 23(4) (1995)

    Google Scholar 

  23. O’Neil, P., Quass, D.: Improved Query Performance With Variant Indexes. In: Proc. of ACM SIGMOD Conf. (1995)

    Google Scholar 

  24. Poess, M., Othayoth, R.K.: Large Scale Data Warehouses on Grid: Oracle Database 10g and HP ProLiant Servers. In: Proc. of the 31st Intl. Conf. on Very Large Databases (VLDB), pp. 1055–1066 (2005)

    Google Scholar 

  25. Rana, O.F., Moreau, L.: Issues in Building Agent based Computational Grids. In: Third Workshop of the UK Special Interest Group on Multi-Agent Systems (UKMAS 2000) (2000)

    Google Scholar 

  26. Watson, P.: Databases in Grid Applications: Locality and Distribution. In: Jackson, M., Nelson, D., Stirk, S. (eds.) BNCOD 2005. LNCS, vol. 3567, pp. 1–16. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  27. Stöhr, T., Märtens, H., Rahm, E.: Multi-Dimensional Database Allocation for Parallel Data Warehouses. In: Procs. of the 26th Intl. Conf. on Very Large Databases (VLDB), pp. 273–284 (2000)

    Google Scholar 

  28. Shoshani, A., Sim, A., Stockinger, K.: RRS: Replica Registration Service for Data Grids. In: VLDB Workshop on Data Management in Grids (2005)

    Google Scholar 

  29. Smith, J., Watson, P., Gounaris, A., Paton, N.W., Fernandes, A., Sakellariou, R.: Distributed Query Processing on the Grid. Intl. Journal of High Performance Computing Applications 17, 353–367 (2003)

    Article  Google Scholar 

  30. White, J., Thompson, D.R.: Load Balancing on a Grid Using Data Characteristics. In: Proc. Int’l Conf. Grid Computing and Applications (GCA), pp. 184–188 (2005)

    Google Scholar 

  31. Zhou, S., Williams, M.H.: Data placement in parallel database systems. In: Parallel Database Techniques, IEEE Computer Society Press, Los Alamitos (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

de Carvalho Costa, R.L., Furtado, P. (2006). Data Warehouses in Grids with High QoS. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2006. Lecture Notes in Computer Science, vol 4081. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11823728_20

Download citation

  • DOI: https://doi.org/10.1007/11823728_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-37736-8

  • Online ISBN: 978-3-540-37737-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics