Cascaded Star: A Hyper-Dimensional Model for a Data Warehouse
A data warehouse is defined as a subject-oriented, integrated, time-variant, and nonvolatile collection of data. Often, the data representing different subjects is multi-dimensional in nature, and each dimension of each subject can itself be multi-dimensional. We refer to this as the hyper-dimensional nature of data. Traditional multi-dimensional data models (e.g., the star schema) cannot adequately model such data. This is because a star schema models a single multi-dimensional subject; hence, a complex query that crosses different subjects at different dimensional levels has to be specified as multiple queries, and the results of each query must be composed manually. In this paper, we present a novel data model, called the cascaded star model, to model hyper-dimensional data, and propose the cascaded OLAP (COLAP) operations that enable ad-hoc specification of queries that encompass multiple stars. Specifically, our COLAP operations include cascaded-roll-up, cascaded-drill-down, cascaded-slice, cascaded-dice, and MCUBE. We show that the COLAP operations can be expressed in relational algebra, which demonstrates that the cascaded star can be built on top of the traditional star schema framework.
Keywords: Relational Algebra, Complex Query, Dimension Table, Star Model, Single Star
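As a rough illustration of the abstract's claim that a query spanning several stars can be expressed with ordinary relational operators, the sketch below uses Python's standard `sqlite3` module. Everything in it is an assumption made for illustration: the tables `reading_fact`, `station_fact`, and `region_dim`, their columns, and the sample rows are hypothetical, and the query is not the paper's definition of cascaded-roll-up. It only shows one way a dimension of an outer star can itself be the subject of an inner star, so that a roll-up across both stars becomes a chain of joins followed by a grouping.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database holding two linked (hypothetical) stars."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        -- Inner star: subject "station", with its own dimension "region".
        CREATE TABLE region_dim (
            region_id INTEGER PRIMARY KEY,
            country   TEXT
        );
        CREATE TABLE station_fact (
            station_id  INTEGER PRIMARY KEY,
            region_id   INTEGER REFERENCES region_dim(region_id),
            elevation_m REAL
        );
        -- Outer star: subject "reading". Its station dimension is the
        -- fact table of the inner star, which is the cascading step.
        CREATE TABLE reading_fact (
            reading_id    INTEGER PRIMARY KEY,
            station_id    INTEGER REFERENCES station_fact(station_id),
            pollutant_ppm REAL
        );
        INSERT INTO region_dim VALUES (1, 'US'), (2, 'CA');
        INSERT INTO station_fact VALUES (10, 1, 120.0), (11, 1, 300.0), (12, 2, 50.0);
        INSERT INTO reading_fact VALUES
            (100, 10, 4.0), (101, 11, 6.0), (102, 12, 1.0), (103, 12, 3.0);
    """)
    return conn


def cross_star_rollup(conn):
    """Average readings per country: join across both stars, then group."""
    return conn.execute("""
        SELECT r.country, AVG(f.pollutant_ppm)
        FROM reading_fact AS f
        JOIN station_fact AS s ON s.station_id = f.station_id
        JOIN region_dim   AS r ON r.region_id  = s.region_id
        GROUP BY r.country
        ORDER BY r.country
    """).fetchall()


if __name__ == "__main__":
    for country, mean_ppm in cross_star_rollup(build_demo_db()):
        print(country, mean_ppm)
```

With a single star, the same question would need one query against the station data and another against the readings, combined by hand; here the link between the stars is part of the schema, so one statement suffices.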