Multivariate and multidimensional OLAP

  • Shin -Chung Shao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1377)

Abstract

The author presents a new relational approach to multivariate and multidimensional OLAP. In this approach, a multivariate aggregate view (MAV) is defined. MAV contains categorized univariate and multivariate aggregated data, which can be used to support many more advanced statistical methods not currently supported by any OLAP models. The author shows that MAV can be created, materialized, and manipulated using SQL commands. Thus, it can be implemented using commercial relational DBMS. A query rewrite algorithm is also presented to convert aggregate queries to base tables into those to MAV. Therefore, users need not to know the existence and definition of MAV in order to share materialized data. Incremental update of MAV created from single base table is also considered. The application of MAV to data mining is presented to illustrate the use of multivariate and multidimensional OLAP.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Agarwal, S., R. Agrawal, P. M. Deshpande, A. Gupta, J. F. Naughton, R. Ramakrishnan and S. Sarawagi: On the Computation of Multidimensional Aggregat.es, in Proc. Of the 22nd VLDB Conference, Mumbai, India, 1996, 506–521.Google Scholar
  2. 2.
    Agrawal, D., A. El Abbadi, A. Singh and T. Yurek: Efficient View Maintenance Warehouses, In Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 417–427.Google Scholar
  3. 3.
    Anderson, T.W.: An Introduction to Multivariate Statistical Analysis, Second Edition, New York, Wiley, 1981.Google Scholar
  4. 4.
    Surajit Chaudhuri and Umeshwar Dayal. Data Warehousing and OLAP for Decision Support. In Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 507–508.Google Scholar
  5. 5.
    U. Fayyad, G.P. Shapiro, P. Smyth and R. Uthurusamy (editors): Advances in Knowledge Discovery and Data Mining, AAAI Press/The MIT Press, 1996.Google Scholar
  6. 6.
    J. Gray, A. Bosworth, A. Layman, and H. Pirahesh. Data Cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-total, In Proc. Of the twelve IEEE International Conference on Data Engineering, New Orleans, LA, Feb. 1996, 152–159.Google Scholar
  7. 7.
    A. Gupta, V. Harinarayan and D. Quass: Aggregate-Query Processing in Data Warehousing, in Proc. Of the 21st VLDB Conference, Zurich, Switzerland, 1995, 358–369.Google Scholar
  8. 8.
    V. Harinarayan, A. Rajaraman and J. Ullman: Implementing Data Cubes Efficiently, In Proc. Of the 1996 ACM SIGMOD International Conference on Management of Data, Montreal, Canada, June 1996, 205–216.Google Scholar
  9. 9.
    C. T. Ho, R. Agrawal, N. Megiddo and R. Srikant: Range Queries in OLAP Data Cubes, In Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 73–88.Google Scholar
  10. 10.
    J. D. Jobson: Applied Multivariate Data Analysis, Vol II: Categorical and Multivariate Methods, Springer-Verlag, 1992.Google Scholar
  11. 11.
    F. Korn, H.V. Jagadish, and C. Faloutsos: Efficiently Supporting Ad Hoc Queries in Large Datasets of Time Sequences, In Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 289–300.Google Scholar
  12. 12.
    I. S. Mumick, D. Quass and B. S. Mumick: Maintenance of Data Cubes and Summary Tables in a Warehouse, In Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 100–111.Google Scholar
  13. 13.
    R. S. Pindyck and D. L. Rubinfeld: Econometric Models and Economic Forecasts, Third Edition, McGraw-Hill Inc., 1991.Google Scholar
  14. 14.
    D. Quass and J. Widom: On-Line Warehouse View Maintenance, in In Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 393–404.Google Scholar
  15. 15.
    N. Roussopoulos, Y. Kotidis and Mema Roussopoulos: Cubtree: Organization of and Bulk Incremental Updates on the Data Cube, In Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 89–99.Google Scholar
  16. 16.
    A. Segev and S. C. Shao: Statistical Computing in A Database Environment, Technical Report 35436, Lawrence Berkeley Lab., Berkeley, CA 94720, January 1994.Google Scholar
  17. 17.
    S. C. Shao: Statistical Computing in the Database Environment: A New Paradigm, Ph.D Dissertation, Haas School of Business, Univ. of California at Berkeley, 1994.Google Scholar
  18. 18.
    B.Y Sher, S.C. Shao and W.S. Hsieh: Mining Regression Rules and Regression Tree, to appear in Proc. of PAKDD-98, Lecture Notes in Artificial Intelligence, Springer-Verlag, April 1998.Google Scholar
  19. 19.
    D. Srivastava, S. Dar, H. V. Jagadish and A. Y. Levy: Answering Queries with Aggregation Using Views, in Proc. Of the 22nd VLDB Conference, Mumbai, India, 1996, 318–329.Google Scholar
  20. 20.
    Y. Zhao, P. Deshpande and J. F. Naughton: An Array-Based Algorithm for Simultaneous Multidimensional Aggregates, in Proc. Of the 1997 ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997, 159–170.Google Scholar

Copyright information

© Springer-Verlag 1998

Authors and Affiliations

  • Shin -Chung Shao
    • 1
  1. 1.Department of Information Management NanHwa Management CollegeFo-Kuan UniversityDa-Lin, Chia-YihTaiwan, ROC

Personalised recommendations