Skip to main content

NAP-SC: A Neural Approach for Prediction over Sparse Cubes

  • Conference paper
Advanced Data Mining and Applications (ADMA 2012)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7713))

Included in the following conference series:

Abstract

OLAP techniques provide efficient solutions to navigate through data cubes. However, they are not equipped with frameworks that empower user investigation of interesting information. They are restricted to exploration tasks.

Recently, various studies have been trying to extend OLAP to new capabilities by coupling it with data mining algorithms. However, most of these algorithms are not designed to deal with sparsity, which is an unavoidable consequence of the multidimensional structure of OLAP cubes.

In [1], we proposed a novel approach that embeds Multilayer Perceptrons into OLAP environment to extend it to prediction. This approach has largely met its goals with limited sparsity cubes. However, its performances have decreased progressively with the increase of cube sparsity.

In this paper, we propose a substantially modified version of our previous approach called NAP-SC (Neural Approach for Prediction over Sparse Cubes). Its main contribution consists in minimizing sparsity effect on measures prediction process through the application of a cube transformation step, based on a dedicated aggregation technique.

Carried out experiments demonstrate the effectiveness and the robustness of NAP-SC against high sparsity data cubes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abdelbaki, W., Ben Messaoud, R., Ben Yahia, S.: A Neural-Based Approach for Extending OLAP to Prediction. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 117–129. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  2. Inmon, W.H.: Building the Data Warehouse. John Wiley & Sons (1996)

    Google Scholar 

  3. Agarwal, D., Chen, B.C.: Latent OLAP: Data Cubes Over Latent Variables. In: Proceedings of the 2011 International Conference on Management of Data, SIGMOD 2011, pp. 877–888. ACM, New York (2011)

    Google Scholar 

  4. Codd, E.F., Codd, S.B., Salley, C.T.: Providing OLAP (on-line Analytical Processing) to User-analysts: An IT Mandate, vol. 32. Codd & Date, Inc. (1993)

    Google Scholar 

  5. Olson, D., Delen, D.: Advanced Data Mining Techniques. Springer (2008)

    Google Scholar 

  6. Sarawagi, S., Agrawal, R., Megiddo, N.: Discovery-Driven Exploration of OLAP Data Cubes. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 168–182. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  7. Ben Messaoud, R., Loudcher-Rabaseda, S.: OLEMAR: An On-Line Environment for Mining Association Rules in Multidimensional Data. In: Advances in Data Warehousing and Mining, vol. 2. Idea Group Publishing (2007)

    Google Scholar 

  8. Bodin-Niemczuk, A., Ben Messaoud, R., Rabaséda, S.L., Boussaid, O.: Vers l’intégration de la prédiction dans les cubes OLAP. In: EGC, pp. 203–204 (2008)

    Google Scholar 

  9. Niemi, T., Nummenmaa, J., Thanisch, P.: Normalising OLAP Cubes for Controlling Sparsity. Data & Knowledge Engineering 46(3), 317–343 (2003)

    Article  Google Scholar 

  10. Kriegel, H.P., Borgwardt, K.M., Kröger, P., Pryakhin, A., Schubert, M., Zimek, A.: Future Trends in Data Mining. Data Min. Knowl. Discov. 15(1), 87–97 (2007)

    Article  MathSciNet  Google Scholar 

  11. Juyoung, K., Hwanseung, Y., Yoshifumi, M.: Classification of Sparsity Patterns and Performance Evaluation in OLAP Systems. IPSJ SIG Notes 67, 449–455 (2002)

    Google Scholar 

  12. Goil, S., Choudhary, A.: High Performance Multidimensional Analysis and Data Mining. In: Proceedings of the High Performance Networking and Computing Conference (SC 1998), Orlando, Florida, USA (November 1998)

    Google Scholar 

  13. Palpanas, T., Koudas, N., Mendelzon, A.: Using Datacube Aggregates for Approximate Querying and Deviation Detection. IEEE Trans. on Knowl. and Data Eng. 17, 1465–1477 (2005)

    Article  Google Scholar 

  14. Chen, Y., Dong, G., Han, J., Pei, J., Wah, B.W., Wang, J.: Regression Cubes with Lossless Compression and Aggregation. IEEE Trans. on Knowl. and Data Eng. 18 (December 2006)

    Google Scholar 

  15. Chen, B.C., Chen, L., Lin, Y., Ramakrishnan, R.: Prediction Cubes. In: Proceedings of the 31st International Conference on Very large Data Bases, VLDB 2005, pp. 982–993 (2005)

    Google Scholar 

  16. Thomsen, E.: Olap Solutions: Building Multidimensional Information Systems. Wiley Computer Publishing (2002)

    Google Scholar 

  17. Hornik, K., Stinchcombe, M., White, H.: Multilayer Feedforward Networks are Universal Approximators. Neural Networks 2(5), 359–366 (1989)

    Article  Google Scholar 

  18. Haykin, S.: Neural Networks: a Comprehensive Foundation. Prentice Hall International Editions Series. Prentice Hall (1999)

    Google Scholar 

  19. Hotelling, H.: Analysis of a Complex of Statistical Variables into Principal Components. Journal of Educational Psychology 24(7), 498–520 (1933)

    Article  Google Scholar 

  20. Rubin, D.B.: Multiple Imputations in Sample Surveys: a Phenomenological Bayesian Approach to Nonresponse. In: Proceedings of the Survey Research Methods Section, pp. 20–28 (1978)

    Google Scholar 

  21. Ben Messaoud, R., Boussaid, O., Rabaséda, S.L.: Using a Factorial Approach for Efficient Representation of Relevant OLAP Facts. In: Proceedings of the 7th International Baltic Conference on Databases and Information Systems (DB&IS 2006), pp. 98–105. IEEE Communications Society, Vilnius (2006)

    Google Scholar 

  22. Rumelhart, D., McClelland, J., University of California, S.D.P.R.G: Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Foundations. Computational Models of Cognition and Perception. MIT Press (1986)

    Google Scholar 

  23. Ben Othman, L., Rioult, F., Ben Yahia, S., Crémilleux, B.: Missing Values: Proposition of a Typology and Characterization with an Association Rule-Based Model. In: Pedersen, T.B., Mohania, M.K., Tjoa, A.M. (eds.) DaWaK 2009. LNCS, vol. 5691, pp. 441–452. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Abdelbaki, W., Ben Yahia, S., Ben Messaoud, R. (2012). NAP-SC: A Neural Approach for Prediction over Sparse Cubes. In: Zhou, S., Zhang, S., Karypis, G. (eds) Advanced Data Mining and Applications. ADMA 2012. Lecture Notes in Computer Science(), vol 7713. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35527-1_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35527-1_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35526-4

  • Online ISBN: 978-3-642-35527-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics