Clustering of Search Engine Keywords Using Access Logs

Otsuka, Shingo; Kitsuregawa, Masaru

doi:10.1007/11827405_82

Shingo Otsuka¹⁸ &
Masaru Kitsuregawa¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4080))

Included in the following conference series:

International Conference on Database and Expert Systems Applications

1412 Accesses
3 Citations

Abstract

It the becomes possible that users can get kinds of information by just inputting search keyword(s) representing the topic which users are interested in. But it is not always true that users can hit upon search keyword(s) properly. In this paper, by using Web access logs (called panel logs), which are collected URL histories of Japanese users (called panels) selected without static deviation similar to the survey on TV audience rating, we study the methods of clustering search keywords. Different from the existing systems where the related search keywords are extracted based on the set of URLs viewed by the users after input of their original search keyword(s), we propose two novel methods of clustering the search words. One is based on the Web communities (set of similar web pages); the other is based on the set of nouns obtained by morphological analysis of Web pages. According to evaluation results, our proposed methods can extract more related search keywords than that based on URL.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Eirinaki, M., Vazirgiannis, M.: Web mining for web personalization. ACM Transactions on Internet Technology (ACM TIT) 3(1), 1–27 (2003)
Article Google Scholar
Cooley, R., Mobasher, B., Srivastava, J.: Web mining: Information and pattern discovery on the world wide web. In: Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 1997) (1997)
Google Scholar
Ungar, L., Foster, D.: Clustering methods for collaborative filtering. In: AAAI Workshop on Recommendation Systems (1998)
Google Scholar
Otsuka, S., Toyoda, M., Hirai, J., Kitsuregawa, M.: Extracting User Behavior by Web Communities Technology on Global Web Logs. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds.) DEXA 2004. LNCS, vol. 3180, pp. 957–968. Springer, Heidelberg (2004)
Chapter Google Scholar
Su, Z., Yang, Q., Zhang, H., Xu, X., Hu, Y.: Correlation-based document clustering using web logs. In: 34th Hawaii International Conference on System Sciences (HICSS-34) (2001)
Google Scholar
Tan, P., Kumar, V.: Mining association patterns in web usage data. In: International Conference on Advances in Infrastructure for e-Business, e-Education, e-Science, and e-Medicine on the Internet (2002)
Google Scholar
Beeferman, D., Berger, A.: Agglomerative clustering of search engine query log. In: The 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2000) (2000)
Google Scholar
Wen, J., Nie, J., Zhang, H.: Query clustering using user logs. ACM Transactions on Information Systems (ACM TOIS) 20(1), 59–81 (2002)
Article Google Scholar
Ohkubo, M., Sugizaki, M., Inoue, T., Tanaka, K.: Extracting information demand by analyzing a www search log. IPSJ Journal 39(7), 2250–2258 (1998)
Google Scholar
Koutsoupias, N.: Exploring web access logs with correspondence analysis. Methods and Applications of Artificial Intelligence, Second Hellenic (2002)
Google Scholar
Prasetyo, B., Pramudiono, I., Takahashi, K., Kitsuregawa, M.: naviz : Website navigational behavior visualizer. In: Chen, M.-S., Yu, P.S., Liu, B. (eds.) PAKDD 2002. LNCS, vol. 2336, p. 276. Springer, Heidelberg (2002)
Chapter Google Scholar
Zeng, H., Chen, Z., Ma, W.: A unified framework for clustering heterogeneous web objects. In: The Third International Conference on Web Information Systems Engineering (WISE 2002) (2002)
Google Scholar
Catledge, L., Pitkow, J.: Characterizing browsing behaviors on the world-wide web. Computer Networks and ISDN Systems 27(6) (1995)
Google Scholar
Flake, G., Lawrence, S., Giles, C.L., Coetzee, F.: Self-organization and identification of web communities. IEEE Computer 35(3), 66–71 (2002)
Google Scholar
Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.: Trawling the web for emerging cyber-communities. In: Proc. of the 8th WWW conference, pp. 403–416 (1999)
Google Scholar
Toyoda, M., Kitsuregawa, M.: Creating a web community chart for navigating related communities. In: Conference Proceedings of Hypertext 2001, pp. 103–112 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, 153-8505, Japan
Shingo Otsuka & Masaru Kitsuregawa

Authors

Shingo Otsuka
View author publications
You can also search for this author in PubMed Google Scholar
Masaru Kitsuregawa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, National University of Singapore,
Stéphane Bressan
University of Linz, Altenbergerstraße 69, 4040, Linz, Austria
Josef Küng & Roland Wagner &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Otsuka, S., Kitsuregawa, M. (2006). Clustering of Search Engine Keywords Using Access Logs. In: Bressan, S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2006. Lecture Notes in Computer Science, vol 4080. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11827405_82

Download citation

DOI: https://doi.org/10.1007/11827405_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37871-6
Online ISBN: 978-3-540-37872-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics