Abstract
Analyzing Web log data is important in order to study the usage of a website. Even though some approaches propose data warehousing techniques for structuring the Web log data into a multidimensional model, they present two main drawbacks: (i) they are based on informal guidelines and must be manually applied; and (ii) they consider data tailored to a specific Web log format, thus being restricted to specific analysis tools. To overcome these limitations, we present a model-driven approach for obtaining a conceptual multidimensional model from Web log data in a comprehensive, integrated and automatic manner. This approach consists of the following steps: (i) obtaining a conceptual model of the Web log data based on a unified metamodel, (ii) deriving a multidimensional model from this Web log model by formally defining a set of QVT (Query/View/Transformation) transformation rules.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alves, R., Belo, O.: Mining clickstream-based data cubes. In: 6th International Conference on Enterprise Information Systems, pp. 583–586 (2004)
Alves, R., Belo, O., Cavalcanti, F., Ferreira, P.: Clickstreams, the basis to establish user navigation patterns on web sites. In: Fifth International Conference on Data Mining, Text Mining and their Business Applications, pp. 87–96. WIT Press, Southampton (2004)
Aurélio, D.M., Jorge, A.M., Soares, C., Leal, J.P., Machado, P.: A data warehouse for web intelligence. In: Neves, J., Santos, M.F., Machado, J.M. (eds.) EPIA 2007. LNCS (LNAI), vol. 4874, pp. 487–499. Springer, Heidelberg (2007)
Cooley, R., Mobasher, B., Srivastava, J.: Data preparation for mining world wide web browsing patterns. Knowl. Inf. Syst. 1, 5–32 (1999)
Eirinaki, M., Vazirgiannis, M.: Web mining for web personalization. ACM Trans. Internet Techn. 3, 1–27 (2003)
Fraternali, P., Lanzi, P.L., Matera, M., Maurino, A.: Model-driven web usage analysis for the evaluation of web application quality. J. Web Eng. 3, 124–152 (2004)
Golfarelli, M., Maio, D., Rizzi, S.: The Dimensional Fact Model: A conceptual model for data warehouses. Int. J. Cooperative Inf. Syst. 7, 215–247 (1998)
Hüsemann, B., Lechtenbörger, J., Vossen, G.: Conceptual data warehouse modeling. In: 2nd Intl. Workshop on Design and Management of Data Warehouses, pp. 6–1–6–11 (2000)
Jensen, M.R., Holmgren, T., Pedersen, T.B.: Discovering multidimensional structure in relational data. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2004. LNCS, vol. 3181, pp. 138–148. Springer, Heidelberg (2004)
Joshi, K.P., Joshi, A., Yesha, Y.: On using a warehouse to analyze web logs. Distributed and Parallel Databases 13, 161–180 (2003)
Kimball, R., Merz, R.: The data webhouse toolkit: building the web-enabled data warehouse. John Wiley & Sons, Inc., New York (2000)
Lopes, C.T., David, G.: Higher education web information system usage analysis with a data webhouse. In: Gavrilova, M.L., Gervasi, O., Kumar, V., Tan, C.J.K., Taniar, D., Laganá, A., Mun, Y., Choo, H. (eds.) ICCSA 2006. LNCS, vol. 3983, pp. 78–87. Springer, Heidelberg (2006)
Luján-Mora, S., Trujillo, J., Song, I.Y.: A uml profile for multidimensional modeling in data warehouses. Data Knowl. Eng. 59, 725–769 (2006)
Mazón, J.N., Trujillo, J.: A model driven modernization approach for automatically deriving multidimensional models in data warehouses. In: Parent, C., Schewe, K.-D., Storey, V.C., Thalheim, B. (eds.) ER 2007. LNCS, vol. 4801, pp. 56–71. Springer, Heidelberg (2007)
Mazón, J.N., Trujillo, J.: A hybrid model driven development framework for the multidimensional modeling of data warehouses. SIGMOD Record 38, 12–17 (2009)
Phipps, C., Davis, K.C.: Automating data warehouse conceptual schema design and evaluation. In: 4th Intl. Workshop on Design and Management of Data Warehouses, pp. 23–32 (2002)
Rizzi, S., Abelló, A., Lechtenbörger, J., Trujillo, J.: Research in data warehouse modeling and design: dead or alive? In: 9th International Workshop on Data Warehousing and OLAP, pp. 3–10 (2006)
The Apache Software Foundation: Log files, http://eregie.premier-ministre.gouv.fr/manual/logs.html
W3C Consortium: Extended common log file format, http://www.w3.org/TR/WD-logfile.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hernández, P., Garrigós, I., Mazón, JN. (2010). Model-Driven Development of Multidimensional Models from Web Log Files. In: Trujillo, J., et al. Advances in Conceptual Modeling – Applications and Challenges. ER 2010. Lecture Notes in Computer Science, vol 6413. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16385-2_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-16385-2_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16384-5
Online ISBN: 978-3-642-16385-2
eBook Packages: Computer ScienceComputer Science (R0)