A Website Mining Model Centered on User Queries

Baeza-Yates, Ricardo; Poblete, Barbara

doi:10.1007/11908678_1

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4289)

Abstract

We present a model for mining user queries found in the access logs of a website and for relating this information to the website’s overall usage, structure and content. The aim of the model is to discover, in a simple way, information that can improve the quality of the website, making it more intuitive and better suited to the needs of its users. The model provides a methodology for analyzing and classifying the different types of queries registered in a website’s usage logs, such as queries submitted by users to the site’s internal search engine and queries on global search engines that lead to documents in the website. These queries provide useful information about the topics that interest users visiting the website, and the navigation patterns associated with them indicate whether or not the documents in the site satisfied the user’s needs at that moment.
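The two query sources named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the access log uses the Apache Combined Log Format, that the site's internal search lives at a hypothetical `/search` path, and that both internal and external engines carry the query in a `q` parameter. The engine host names are illustrative only.

```python
import re
from urllib.parse import urlsplit, parse_qs

# Assumptions (not from the paper): Combined Log Format, an internal search
# endpoint at /search, and queries carried in a "q" URL parameter.
LOG_LINE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) [^"]*" (?P<status>\d{3}) \S+ '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"$'
)
INTERNAL_SEARCH_PATH = "/search"  # hypothetical path
EXTERNAL_ENGINES = {"www.google.com", "search.yahoo.com"}  # illustrative hosts


def query_param(url, name="q"):
    """Return the normalized value of a query parameter, or None if absent."""
    values = parse_qs(urlsplit(url).query).get(name)
    return values[0].strip().lower() if values else None


def classify(line):
    """Classify one log line as an internal or external query.

    Returns (kind, query, landing_path) or None when the line carries no query.
    landing_path is the document reached from an external engine.
    """
    m = LOG_LINE.match(line)
    if m is None:
        return None
    url, referrer = m["url"], m["referrer"]
    # Internal query: the request itself goes to the site's search engine.
    if urlsplit(url).path == INTERNAL_SEARCH_PATH:
        q = query_param(url)
        if q:
            return ("internal", q, None)
    # External query: the referrer is a result page of a global search engine.
    if urlsplit(referrer).hostname in EXTERNAL_ENGINES:
        q = query_param(referrer)
        if q:
            return ("external", q, urlsplit(url).path)
    return None


internal_line = (
    '10.0.0.1 - - [10/Oct/2005:13:55:36 +0200] '
    '"GET /search?q=Admission+Dates HTTP/1.1" 200 2326 "-" "Mozilla/5.0"'
)
external_line = (
    '10.0.0.2 - - [10/Oct/2005:14:01:02 +0200] '
    '"GET /courses/cs101.html HTTP/1.1" 200 5120 '
    '"http://www.google.com/search?q=intro+programming" "Mozilla/5.0"'
)
print(classify(internal_line))
print(classify(external_line))
```

Grouping the classified records by session and following the pages visited after each query would be the next step toward the navigation patterns the abstract mentions. That step is left out here because the abstract does not specify how sessions are built.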

Author information

Authors and Affiliations

Web Research Group, Technology Department, University Pompeu Fabra, Barcelona, Spain
Ricardo Baeza-Yates & Barbara Poblete
Center for Web Research, CS Department, University of Chile, Santiago, Chile
Ricardo Baeza-Yates & Barbara Poblete
Yahoo! Research, Barcelona, Spain
Ricardo Baeza-Yates

Editor information

Editors and Affiliations

Department of Natural Language Processing, Institute for Computer Science, University of Leipzig,
Markus Ackermann
Department of Computer Science, K.U. Leuven, B-3001, Heverlee, Belgium
Bettina Berendt
Dept. of Knowledge Technologies, Jozef Stefan Institute, Jamova 39, 1000, Ljubljana, Slovenia
Marko Grobelnik
Knowledge & Data Engineering Group, University of Kassel, Wilhelmshöher Allee 73, D-34121, Kassel, Germany
Andreas Hotho
Jožef Stefan Institute, Jamova 39, 1000, Ljubljana, Slovenia
Dunja Mladenič
Dipartimento di Informatica, Università di Bari, Via E. Orabona, 4, 70125, Bari, Italia
Giovanni Semeraro
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou
Research Center L3S, Appelstr. 9a, D-30167, Hannover, Germany
Gerd Stumme
Dept. Information and Knowledge Engineering, University of Economics, Prague, Winston Churchill Sq. 4, 130 67 Praha 3, Prague, Czech Republic
Vojtěch Svátek
Human Computer Studies Lab, University of Amsterdam, Kruislaan 419, 1089, Amsterdam, VA, The Netherlands
Maarten van Someren

About this paper

Cite this paper

Baeza-Yates, R., Poblete, B. (2006). A Website Mining Model Centered on User Queries. In: Ackermann, M., et al. Semantics, Web and Mining. EWMF 2005, KDO 2005. Lecture Notes in Computer Science, vol 4289. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11908678_1

DOI: https://doi.org/10.1007/11908678_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-47697-9
Online ISBN: 978-3-540-47698-6
eBook Packages: Computer Science, Computer Science (R0)
