Dynamic Data Warehousing

Dayal, Umeshwar; Chen, Qiming; Hsu, Meichun

doi:10.1007/3-540-48298-9_14

Umeshwar Dayal⁶,
Qiming Chen⁶ &
Meichun Hsu⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1676))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

836 Accesses
1 Citations

Abstract

Data warehouses and on-line analytical processing (OLAP) tools have become essential elements of decision support systems. Traditionally, data warehouses are refreshed periodically (for example, nightly) by extracting, transforming, cleaning and consolidating data from several operational data sources. The data in the warehouse is then used to periodically generate reports, or to rebuild multidimensional (data cube) views of the data for on-line querying and analysis. Increasingly, however, we are seeing business intelligence applications in telecommunications, electronic commerce, and other industries, that are characterized by very high data volumes and data flow rates, and that require continuous analysis and mining of the data. For such applications, rather different data warehousing and on-line analysis architectures are required. In this paper, we first motivate the need for a new architecture by summarizing the requirements of these applications. Then, we describe a few approaches that are being developed, including virtual data warehouses or enterprise portals that support access through views or links directly to the operational data sources. We discuss the relative merits of these approaches. We then focus on a dynamic data warehousing and OLAP architecture that we have developed and prototyped at HP Labs. In this architecture, data flows continuously into a data warehouse, and is staged into one or more OLAP tools that are used as computation engines to continuously and incrementally build summary data cubes, which might then be stored back in the data warehouse. Analysis and data mining functions are performed continuously and incrementally over these summary cubes. Retirement policies define when to discard data from the warehouse (i.e., move data from the warehouse into off-line archival storage). Data at different levels of aggregation may have different life spans depending on how they are to be used for downstream analysis and data mining. The key features of the architecture are the following: incremental data reduction using OLAP engines to generate summaries and enable data mining; staging large volumes and flow rates of data with different life spans at different levels of aggregation; and scheduling operations on data depending on the type of processing to be performed and the age of the data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

HP Labs, Hewlett-Packard, 1501 Page Mill Road, MS 1U4, Palo Alto, CA, 94303, USA
Umeshwar Dayal, Qiming Chen & Meichun Hsu

Authors

Umeshwar Dayal
View author publications
You can also search for this author in PubMed Google Scholar
Qiming Chen
View author publications
You can also search for this author in PubMed Google Scholar
Meichun Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer and Information Science, University of South Australia, The Levels, Adelaide, Australia, 05
Mukesh Mohania
IFS, Technical University of Vienna, Resselgasse 3, A-1040, Vienna, Austria
A Min Tjoa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dayal, U., Chen, Q., Hsu, M. (1999). Dynamic Data Warehousing. In: Mohania, M., Tjoa, A.M. (eds) DataWarehousing and Knowledge Discovery. DaWaK 1999. Lecture Notes in Computer Science, vol 1676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48298-9_14

Download citation

DOI: https://doi.org/10.1007/3-540-48298-9_14
Published: 01 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66458-1
Online ISBN: 978-3-540-48298-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics