Abstract
Data warehouses are repositories of large amounts of historical data and are used primarily for decision support purposes. On the other hand, grids consist on the aggregation of distributed computational resources and presentation of these as a single service with a common interface. The deployment of distributed data warehouses on a grid architecture with QoS control strategies could lead to high levels of flexibility, scalability, reliability and efficiency. However, due to grids characteristics, it could also lead to great challenges. In this paper we investigate an efficient architecture to deploy large data warehouses in grids with high availability and good load balancing. We propose architecture and present experimental results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bruneo, D., Scarpa, M., Zaia, A., Puliato, A.: Communication Paradigms for Mobile Grid Users. In: Procs. of the 3rd IEEE/ACM Intl. Symposium on Cluster Computing and the Grid (CCGRID) (2003)
Bell, W.H., Cameron, D.G., Carvajal-Schiaffino, R., Millar, A.P., Stockinger, K., Zini, F.: Evaluation of an Economy-Based File Replication Strategy for a Data Grid. In: Intl. Workshop on Agent based Cluster and Grid Computing at CCGrid, IEEE Computer Society Press, Los Alamitos (2003)
Baralis, E., Paraboschi, S., Teniente, E.: Materialzed View Selection in a Multidimensional Database. In: Proc. of 23rd Intl. Conf. on Very Large Databases (VLDB) (1997)
Chaudhuri, S., Dayal, U.: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26(1) (1997)
Corte, A.L., Puliato, A., Tomarchio, O.: An agent-based framework for mobile users. In: ERSADS 1999 (1999)
Cao, J., Spooner, D.P., Jarvis, S.A., Nudd, G.R.: Grid Load Balancing Using Intelligent Agents Future Generation Computer Systems. Special issue on Intelligent Grid Environments: Principles and Applications 21(1), 135–149 (2005)
DeWitt, D., et al.: The Gamma Database Machine Project. IEEE Knowledge and Data Engineering 2(1) (March 1990)
DeWitt, D.J., Gray, J.: Parallel Database Systems: The Future of High Per-formance Database Processing. Communications of the ACM (1992)
Dobbera, M., Koolea, G., van der Mei, R.: Dynamic Load Balancing Experiments in a Grid. In: Proc. of the 5th IEEE/ACM Intl. Symposium on Cluster Computing and the Grid (CCGrid 2005) (2005)
DeWitt, D.J., Naughton, J.F., Schneider, D.A., Seshadri, S.: Pratical Skew Handling in Parallel Joins. In: Proc. of 18ht Intl. Conf. on Very Large Databases (VLDB), pp. 27–40 (1992)
Erwin, D.W., Snelling, D.F.: UNICORE: A Grid computing environment. In: Sakellariou, R., Keane, J.A., Gurd, J.R., Freeman, L. (eds.) Euro-Par 2001. LNCS, vol. 2150, Springer, Heidelberg (2001)
Foster, I.: The anatomy of the Grid: Enabling scalable virtual organizations. Concurrency and Computation: Practice and Experience 13 (2001)
Furtado, P.: Workload-Based Placement and Join Processing in Node-Partitioned Data Warehouses. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2004. LNCS, vol. 3181, pp. 38–47. Springer, Heidelberg (2004)
Furtado, P.: Experimental Evidence on Partitioning in Parallel Data Warehouses. In: Procs. of the 7th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP) (2004)
Furtado, P.: Replication in Node Partitioned Data Warehouses. In: VLDB Workshop on Design, Implementation, and Deployment of Database Replication (DIDDR) (2005)
Gray, J., et al.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab and Sub-Totals. In: Data-Mining and Knowledge Discovery (1997)
Hua, K.A., Young, H.C.: Desining a Highly Parallel Database Server Using Off-the-shelf Components. In: Procs. of the Int’I Computer Symposium, pp. 47–54 (1990)
Lu, H., Tan, K.: Load-Balanced Join Processing in Shared-Nothing Systems. Journal of Parallel and Distributed Computing 23, 382–398 (1994)
Märtens, H., Rahm, E., Stöhr, T.: Dynamic Query Scheduling in Parallel Data Warehouses. In: Monien, B., Feldmann, R.L. (eds.) Euro-Par 2002. LNCS, vol. 2400, pp. 321–331. Springer, Heidelberg (2002)
Min, W.H., Wilson, W.Y., Ngi, Y.H., Donghong, W., Zhixiang, L., Hong, L.K., Y.K.L.: Dynamic Storage Resource Management Framework for the Grid. In: Procs. of the 22nd IEEE/13th NASA Goddard Conf. on Mass Storage Systems and Technologies (MSST) (2005)
Nieto-Santisteban, M.A., Gray, J., Szalay, A.S., Annis, J., Thakar, A.R., O’Mullane, W.: When Database Systems Meet the Grid. In: Second Biennial Conf. on Innovative Data Systems Research (CIDR). Online Proceedings, pp. 154–161 (2005)
O’Neil, P., Graefe, G.: Multi-Table Joins Throug Bitmapped Join Indices. ACM SIGMOD Record 23(4) (1995)
O’Neil, P., Quass, D.: Improved Query Performance With Variant Indexes. In: Proc. of ACM SIGMOD Conf. (1995)
Poess, M., Othayoth, R.K.: Large Scale Data Warehouses on Grid: Oracle Database 10g and HP ProLiant Servers. In: Proc. of the 31st Intl. Conf. on Very Large Databases (VLDB), pp. 1055–1066 (2005)
Rana, O.F., Moreau, L.: Issues in Building Agent based Computational Grids. In: Third Workshop of the UK Special Interest Group on Multi-Agent Systems (UKMAS 2000) (2000)
Watson, P.: Databases in Grid Applications: Locality and Distribution. In: Jackson, M., Nelson, D., Stirk, S. (eds.) BNCOD 2005. LNCS, vol. 3567, pp. 1–16. Springer, Heidelberg (2005)
Stöhr, T., Märtens, H., Rahm, E.: Multi-Dimensional Database Allocation for Parallel Data Warehouses. In: Procs. of the 26th Intl. Conf. on Very Large Databases (VLDB), pp. 273–284 (2000)
Shoshani, A., Sim, A., Stockinger, K.: RRS: Replica Registration Service for Data Grids. In: VLDB Workshop on Data Management in Grids (2005)
Smith, J., Watson, P., Gounaris, A., Paton, N.W., Fernandes, A., Sakellariou, R.: Distributed Query Processing on the Grid. Intl. Journal of High Performance Computing Applications 17, 353–367 (2003)
White, J., Thompson, D.R.: Load Balancing on a Grid Using Data Characteristics. In: Proc. Int’l Conf. Grid Computing and Applications (GCA), pp. 184–188 (2005)
Zhou, S., Williams, M.H.: Data placement in parallel database systems. In: Parallel Database Techniques, IEEE Computer Society Press, Los Alamitos (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de Carvalho Costa, R.L., Furtado, P. (2006). Data Warehouses in Grids with High QoS. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2006. Lecture Notes in Computer Science, vol 4081. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11823728_20
Download citation
DOI: https://doi.org/10.1007/11823728_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37736-8
Online ISBN: 978-3-540-37737-5
eBook Packages: Computer ScienceComputer Science (R0)