Abstract
As the amount of textual information grows explosively in various kinds of business systems, it becomes more and more essential to analyze both structured data and unstructured textual data simultaneously. However information contained in non structured data (documents and so on) is only partially used in business intelligence (BI). Indeed On-Line Analytical Processing (OLAP) cubes which are the main support of BI analysis in decision support systems have focused on structured data. This is the reason why OLAP is being extended to unstructured textual data. In this paper we introduce the innovative “Diamond” multidimensional model that will serve as a basis for semantic OLAP on XML documents and then we describe the meta modeling, generation and implementation of a the Diamond multidimensional model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aouiche, K., Lemire, D., Godin, R.: Web 2.0 OLAP: From Data Cubes to Tag Clouds. In: Cordeiro, J., Hammoudi, S., Filipe, J. (eds.) WEBIST 2008. LNBIP, vol. 18, pp. 51–64. Springer, Heidelberg (2009)
Agrawal, R., Srikant, R.: Fast Algorithms for mining Association rules. In: Proceedings of VLDB, Santiago, Chile (September 1994)
Bautista, M., Molina, C., Tejeda, E., Vila, A.: A new multidimensional model with text dimensions: definition and implementation. In: International Conference, IPMU, Dortmund, Germany, pp. 158–167 (2013)
Ben Mefteh, S., Khrouf, K., Feki, J., Soulé-Dupuy, C.: Semantic Structure for XML Documents: Structuring and pruning. Journal of Information Organization 3(1), 36–46 (2013)
Etcheverry, L., Vaisman, A.A.: Enhancing OLAP Analysis with Web Cubes. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 469–483. Springer, Heidelberg (2012)
Hachaichi, Y., Feki, J.: An Automatic Method for the Design of Multidimensional Schemas from Object Oriented Databases. International Journal of Information Technology and Decision Making 12(6), 1223–1260 (2013)
Janet, B., Reddy, A.V.: Cube Index for Unstructured Text Analysis and Mining. In: Proceedings of the 2011 International Conference on Communication, Computing & Security, ICCC 2011, pp. 397–402 (2011)
Kimball, R., Ross, M.: The Data Warehouse Toolkit. Wiley, New York (2003)
Lin, C.X., Ding, B., Han, J., Zhu, F., Zhao, B.: Text cube: Computing in measures for multidimensional text database analysis. In: Eighth IEEE International Conference on Data Mining, vol. 54, pp. 905–910 (2008)
Oukid, L., Asfari, O., Bentayeb, F., Benblidia, N., Boussaid, O.: CXT-cube: contextual text cube model and aggregation operator for text OLAP (2013)
Yu, Y., Lin, C., Sun, Y., Chen, C., Han, J., Liao, B., Wu, T., Zhai, C., Zhang, D., Zhao, B.: iNextCube: Information network-enhanced text cube. In: VLDB 2009: Proceedings of the 35th International Conference on Very Large Data Bases, Lyon, France (2009)
Zhang, D., Zhai, C., Han, J.: Topic cube: Topic modeling for olap on multidimensional text databases. In: SDM 2009: Proceedings of the 2009 SIAM International Conference on Data Mining, Sparks, NV, USA, pp. 1124–1135 (2009)
Zhang, D., Zhai, C., Han, J.: Mitexcube: microtextcluster cube for online analysis of text cells. In: The NASA Conference on Intelligent Data Understanding (CIDU), pp. 204–218 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Azabou, M., Khrouf, K., Feki, J., Soulé-Dupuy, C., Vallès, N. (2014). A Novel Multidimensional Model for the OLAP on Documents: Modeling, Generation and Implementation. In: Ait Ameur, Y., Bellatreche, L., Papadopoulos, G.A. (eds) Model and Data Engineering. MEDI 2014. Lecture Notes in Computer Science, vol 8748. Springer, Cham. https://doi.org/10.1007/978-3-319-11587-0_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-11587-0_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11586-3
Online ISBN: 978-3-319-11587-0
eBook Packages: Computer ScienceComputer Science (R0)