Improving Effectiveness on Clickstream Data Mining

Wanzeller, Cristina; Belo, Orlando

doi:10.1007/11790853_13

Cristina Wanzeller¹⁹ &
Orlando Belo²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4065))

Included in the following conference series:

Industrial Conference on Data Mining

1790 Accesses

Abstract

Developing and applying data mining processes are often very complex tasks to users without deep knowledge in this domain, particularly when such tasks involve clickstream data processing. One important and known challenge arises in the selection of mining methods to apply on a specific data analysis problem, trying to get better and useful results for a particular goal. Our approach to address this challenge relies on the reuse of the acquired experience from similar problems, which had provided successful mining processes in the past. In order to accomplish such goal, we implemented a prototype mining plans selection system, based on the Case-Based Reasoning paradigm. In this paper we explain how this paradigm and the implemented system may be explored to assist decisions on the data mining or Web usage mining specific scope. Additionally, we also identify the underlying issues and the approaches that were followed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aamodt, A., Plaza, E.: Case-Based Reasoning: Foundational Issues, Methodological Variations and Systems Approaches. Artificial Intelligence Communications (AICom) 7(1), 39–59 (1994)
Google Scholar
Aamodt, A.: Knowledge Acquisition and Learning by Experience - The Role of Case Specific Knowledge. In: Machine Learning and Knowledge Acquisition, Integrated Approaches, pp. 197–245. Academic Press, London (1995)
Google Scholar
Ansari, S., Kohavi, R., Mason, L., Zheng, Z.: Integrating E-Commerce and Data Mining: Architecture and Challenges. In: Proc. 2001 IEEE International Conf. on Data Mining, pp. 27–34. IEEE Comput. Soc., Los Alamitos (2001)
Chapter Google Scholar
Apache Jakarta Tomcat (access, April 2006), http://tomcat.apache.org/
Bos, B.: W3C. Web Style Sheets – Home Page (access, April 2006), http://www.w3.org/Style/
Hilario, M., Kalousis, A.: Fusion of Meta-knowledge and Meta-data for Case-Based Model Selection. In: Siebes, A., De Raedt, L. (eds.) PKDD 2001. LNCS, vol. 2168, pp. 180–191. Springer, Heidelberg (2001)
Chapter Google Scholar
Java 2 Platform, Standard Edition (J2SE). Sun Microsystems (access, April 2006), http://java.sun.com/javase/index.jsp
Java API for XML Processing (JAXP). Sun Microsystems (access, April 2006), http://java.sun.com/webservices/jaxp/
Java Database Connectivity, JDBC Data Access API. Sun Microsystems (access, April 2006), http://www.javasoft.com/products/jdbc/index.html
Java Server Pages. Sun Microsystems (access, April 2006), http://java.sun.com/products/jsp/
Kolodner, J.: Case-Based Reasoning. Morgan Kaufman, San Francisco (1993)
Google Scholar
Koutri, M., Avouris, N., Daskalaki, S.: A Survey on Web Usage Mining Techniques for Web-Based Adaptive Hypermedia Systems. In: Chen, S.Y., Magoulas, G.D. (eds.) Adaptable and Adaptive Hypermedia Systems, Idea Publishing Inc., Hershey (2005)
Google Scholar
Lindner, G., Studer, R.: AST: Support for algorithm selection with a CBR approach. In: Żytkow, J.M., Rauch, J. (eds.) PKDD 1999. LNCS, vol. 1704, pp. 418–423. Springer, Heidelberg (1999)
Chapter Google Scholar
MetaL project (access, April 2006), http://www.metal-kdd.org/
Mobasher, B., Berendt, B., Spiliopoulou, M.: KDD for Personalization. In: PKDD 2001 Tutorial (2001)
Google Scholar
Morik, K., Scholz, M.: The MiningMart Approach to Knowledge Discovery in Databases. In: Zhong, N., Liu, J. (eds.) Intelligent Technologies for Information Analysis. Springer, Heidelberg (2004)
Google Scholar
Predictive Model Markup Language. Data Mining Group (access, April 2006), http://www.dmg.org/index.html
Richter, M.: The Knowledge Contained in Similarity Measures. In: Aamodt, A., Veloso, M.M. (eds.) ICCBR 1995. LNCS (LNAI), vol. 1010. Springer, Heidelberg (1995)
Google Scholar
Riesbeck, C.K., Schank, R.C.: Inside Case-Based Reasoning. Lawrence Erlbaum Associates, Hillsdale (1989)
Google Scholar
Srivastava, J., Cooley, R., Deshpande, M., Tan, P.-N.: Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. SIGKDD Explorations 1(2), 1–12 (2000)
Article Google Scholar
W3C HTML Working Group. HyperText Markup Language (HTML) – Home Page (access, April 2006), http://www.w3.org/MarkUp/

Download references

Author information

Authors and Affiliations

Departamento de Informática, Instituto Superior Politécnico de Viseu, Escola Superior de Tecnologia de Viseu, Campus Politécnico de Repeses, 3505-510, Viseu, Portugal
Cristina Wanzeller
Departamento de Informática, Escola de Engenharia, Universidade do Minho, Campus de Gualtar, 4710-057, Braga, Portugal
Orlando Belo

Authors

Cristina Wanzeller
View author publications
You can also search for this author in PubMed Google Scholar
Orlando Belo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Vision and applied Computer Sciences, IBaI, Germany
Petra Perner

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wanzeller, C., Belo, O. (2006). Improving Effectiveness on Clickstream Data Mining. In: Perner, P. (eds) Advances in Data Mining. Applications in Medicine, Web Mining, Marketing, Image and Signal Mining. ICDM 2006. Lecture Notes in Computer Science(), vol 4065. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11790853_13

Download citation

DOI: https://doi.org/10.1007/11790853_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36036-0
Online ISBN: 978-3-540-36037-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics