Data Warehouses in Grids with High QoS

  • Rogério Luís de Carvalho Costa
  • Pedro Furtado
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4081)


Data warehouses are repositories of large amounts of historical data and are used primarily for decision support purposes. On the other hand, grids consist on the aggregation of distributed computational resources and presentation of these as a single service with a common interface. The deployment of distributed data warehouses on a grid architecture with QoS control strategies could lead to high levels of flexibility, scalability, reliability and efficiency. However, due to grids characteristics, it could also lead to great challenges. In this paper we investigate an efficient architecture to deploy large data warehouses in grids with high availability and good load balancing. We propose architecture and present experimental results.


Load Balance Query Processing Data Warehouse Grid Environment Processing Node 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bruneo, D., Scarpa, M., Zaia, A., Puliato, A.: Communication Paradigms for Mobile Grid Users. In: Procs. of the 3rd IEEE/ACM Intl. Symposium on Cluster Computing and the Grid (CCGRID) (2003)Google Scholar
  2. 2.
    Bell, W.H., Cameron, D.G., Carvajal-Schiaffino, R., Millar, A.P., Stockinger, K., Zini, F.: Evaluation of an Economy-Based File Replication Strategy for a Data Grid. In: Intl. Workshop on Agent based Cluster and Grid Computing at CCGrid, IEEE Computer Society Press, Los Alamitos (2003)Google Scholar
  3. 3.
    Baralis, E., Paraboschi, S., Teniente, E.: Materialzed View Selection in a Multidimensional Database. In: Proc. of 23rd Intl. Conf. on Very Large Databases (VLDB) (1997)Google Scholar
  4. 4.
    Chaudhuri, S., Dayal, U.: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26(1) (1997)Google Scholar
  5. 5.
    Corte, A.L., Puliato, A., Tomarchio, O.: An agent-based framework for mobile users. In: ERSADS 1999 (1999)Google Scholar
  6. 6.
    Cao, J., Spooner, D.P., Jarvis, S.A., Nudd, G.R.: Grid Load Balancing Using Intelligent Agents Future Generation Computer Systems. Special issue on Intelligent Grid Environments: Principles and Applications 21(1), 135–149 (2005)Google Scholar
  7. 7.
    DeWitt, D., et al.: The Gamma Database Machine Project. IEEE Knowledge and Data Engineering 2(1) (March 1990)Google Scholar
  8. 8.
    DeWitt, D.J., Gray, J.: Parallel Database Systems: The Future of High Per-formance Database Processing. Communications of the ACM (1992)Google Scholar
  9. 9.
    Dobbera, M., Koolea, G., van der Mei, R.: Dynamic Load Balancing Experiments in a Grid. In: Proc. of the 5th IEEE/ACM Intl. Symposium on Cluster Computing and the Grid (CCGrid 2005) (2005)Google Scholar
  10. 10.
    DeWitt, D.J., Naughton, J.F., Schneider, D.A., Seshadri, S.: Pratical Skew Handling in Parallel Joins. In: Proc. of 18ht Intl. Conf. on Very Large Databases (VLDB), pp. 27–40 (1992)Google Scholar
  11. 11.
    Erwin, D.W., Snelling, D.F.: UNICORE: A Grid computing environment. In: Sakellariou, R., Keane, J.A., Gurd, J.R., Freeman, L. (eds.) Euro-Par 2001. LNCS, vol. 2150, Springer, Heidelberg (2001)CrossRefGoogle Scholar
  12. 12.
    Foster, I.: The anatomy of the Grid: Enabling scalable virtual organizations. Concurrency and Computation: Practice and Experience 13 (2001)Google Scholar
  13. 13.
    Furtado, P.: Workload-Based Placement and Join Processing in Node-Partitioned Data Warehouses. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2004. LNCS, vol. 3181, pp. 38–47. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  14. 14.
    Furtado, P.: Experimental Evidence on Partitioning in Parallel Data Warehouses. In: Procs. of the 7th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP) (2004)Google Scholar
  15. 15.
    Furtado, P.: Replication in Node Partitioned Data Warehouses. In: VLDB Workshop on Design, Implementation, and Deployment of Database Replication (DIDDR) (2005)Google Scholar
  16. 16.
    Gray, J., et al.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab and Sub-Totals. In: Data-Mining and Knowledge Discovery (1997)Google Scholar
  17. 17.
    Hua, K.A., Young, H.C.: Desining a Highly Parallel Database Server Using Off-the-shelf Components. In: Procs. of the Int’I Computer Symposium, pp. 47–54 (1990)Google Scholar
  18. 18.
    Lu, H., Tan, K.: Load-Balanced Join Processing in Shared-Nothing Systems. Journal of Parallel and Distributed Computing 23, 382–398 (1994)CrossRefGoogle Scholar
  19. 19.
    Märtens, H., Rahm, E., Stöhr, T.: Dynamic Query Scheduling in Parallel Data Warehouses. In: Monien, B., Feldmann, R.L. (eds.) Euro-Par 2002. LNCS, vol. 2400, pp. 321–331. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  20. 20.
    Min, W.H., Wilson, W.Y., Ngi, Y.H., Donghong, W., Zhixiang, L., Hong, L.K., Y.K.L.: Dynamic Storage Resource Management Framework for the Grid. In: Procs. of the 22nd IEEE/13th NASA Goddard Conf. on Mass Storage Systems and Technologies (MSST) (2005)Google Scholar
  21. 21.
    Nieto-Santisteban, M.A., Gray, J., Szalay, A.S., Annis, J., Thakar, A.R., O’Mullane, W.: When Database Systems Meet the Grid. In: Second Biennial Conf. on Innovative Data Systems Research (CIDR). Online Proceedings, pp. 154–161 (2005)Google Scholar
  22. 22.
    O’Neil, P., Graefe, G.: Multi-Table Joins Throug Bitmapped Join Indices. ACM SIGMOD Record 23(4) (1995)Google Scholar
  23. 23.
    O’Neil, P., Quass, D.: Improved Query Performance With Variant Indexes. In: Proc. of ACM SIGMOD Conf. (1995)Google Scholar
  24. 24.
    Poess, M., Othayoth, R.K.: Large Scale Data Warehouses on Grid: Oracle Database 10g and HP ProLiant Servers. In: Proc. of the 31st Intl. Conf. on Very Large Databases (VLDB), pp. 1055–1066 (2005)Google Scholar
  25. 25.
    Rana, O.F., Moreau, L.: Issues in Building Agent based Computational Grids. In: Third Workshop of the UK Special Interest Group on Multi-Agent Systems (UKMAS 2000) (2000)Google Scholar
  26. 26.
    Watson, P.: Databases in Grid Applications: Locality and Distribution. In: Jackson, M., Nelson, D., Stirk, S. (eds.) BNCOD 2005. LNCS, vol. 3567, pp. 1–16. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  27. 27.
    Stöhr, T., Märtens, H., Rahm, E.: Multi-Dimensional Database Allocation for Parallel Data Warehouses. In: Procs. of the 26th Intl. Conf. on Very Large Databases (VLDB), pp. 273–284 (2000)Google Scholar
  28. 28.
    Shoshani, A., Sim, A., Stockinger, K.: RRS: Replica Registration Service for Data Grids. In: VLDB Workshop on Data Management in Grids (2005)Google Scholar
  29. 29.
    Smith, J., Watson, P., Gounaris, A., Paton, N.W., Fernandes, A., Sakellariou, R.: Distributed Query Processing on the Grid. Intl. Journal of High Performance Computing Applications 17, 353–367 (2003)CrossRefGoogle Scholar
  30. 30.
    White, J., Thompson, D.R.: Load Balancing on a Grid Using Data Characteristics. In: Proc. Int’l Conf. Grid Computing and Applications (GCA), pp. 184–188 (2005)Google Scholar
  31. 31.
    Zhou, S., Williams, M.H.: Data placement in parallel database systems. In: Parallel Database Techniques, IEEE Computer Society Press, Los Alamitos (1997)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Rogério Luís de Carvalho Costa
    • 1
  • Pedro Furtado
    • 1
  1. 1.Departamento de Engenharia InformáticaUniversity of CoimbraCoimbraPortugal

Personalised recommendations