Skip to main content

Clustering of Search Engine Keywords Using Access Logs

  • Conference paper
Database and Expert Systems Applications (DEXA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4080))

Included in the following conference series:

Abstract

It the becomes possible that users can get kinds of information by just inputting search keyword(s) representing the topic which users are interested in. But it is not always true that users can hit upon search keyword(s) properly. In this paper, by using Web access logs (called panel logs), which are collected URL histories of Japanese users (called panels) selected without static deviation similar to the survey on TV audience rating, we study the methods of clustering search keywords. Different from the existing systems where the related search keywords are extracted based on the set of URLs viewed by the users after input of their original search keyword(s), we propose two novel methods of clustering the search words. One is based on the Web communities (set of similar web pages); the other is based on the set of nouns obtained by morphological analysis of Web pages. According to evaluation results, our proposed methods can extract more related search keywords than that based on URL.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Eirinaki, M., Vazirgiannis, M.: Web mining for web personalization. ACM Transactions on Internet Technology (ACM TIT) 3(1), 1–27 (2003)

    Article  Google Scholar 

  2. Cooley, R., Mobasher, B., Srivastava, J.: Web mining: Information and pattern discovery on the world wide web. In: Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 1997) (1997)

    Google Scholar 

  3. Ungar, L., Foster, D.: Clustering methods for collaborative filtering. In: AAAI Workshop on Recommendation Systems (1998)

    Google Scholar 

  4. Otsuka, S., Toyoda, M., Hirai, J., Kitsuregawa, M.: Extracting User Behavior by Web Communities Technology on Global Web Logs. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds.) DEXA 2004. LNCS, vol. 3180, pp. 957–968. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  5. Su, Z., Yang, Q., Zhang, H., Xu, X., Hu, Y.: Correlation-based document clustering using web logs. In: 34th Hawaii International Conference on System Sciences (HICSS-34) (2001)

    Google Scholar 

  6. Tan, P., Kumar, V.: Mining association patterns in web usage data. In: International Conference on Advances in Infrastructure for e-Business, e-Education, e-Science, and e-Medicine on the Internet (2002)

    Google Scholar 

  7. Beeferman, D., Berger, A.: Agglomerative clustering of search engine query log. In: The 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2000) (2000)

    Google Scholar 

  8. Wen, J., Nie, J., Zhang, H.: Query clustering using user logs. ACM Transactions on Information Systems (ACM TOIS) 20(1), 59–81 (2002)

    Article  Google Scholar 

  9. Ohkubo, M., Sugizaki, M., Inoue, T., Tanaka, K.: Extracting information demand by analyzing a www search log. IPSJ Journal 39(7), 2250–2258 (1998)

    Google Scholar 

  10. Koutsoupias, N.: Exploring web access logs with correspondence analysis. Methods and Applications of Artificial Intelligence, Second Hellenic (2002)

    Google Scholar 

  11. Prasetyo, B., Pramudiono, I., Takahashi, K., Kitsuregawa, M.: naviz : Website navigational behavior visualizer. In: Chen, M.-S., Yu, P.S., Liu, B. (eds.) PAKDD 2002. LNCS, vol. 2336, p. 276. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  12. Zeng, H., Chen, Z., Ma, W.: A unified framework for clustering heterogeneous web objects. In: The Third International Conference on Web Information Systems Engineering (WISE 2002) (2002)

    Google Scholar 

  13. Catledge, L., Pitkow, J.: Characterizing browsing behaviors on the world-wide web. Computer Networks and ISDN Systems 27(6) (1995)

    Google Scholar 

  14. Flake, G., Lawrence, S., Giles, C.L., Coetzee, F.: Self-organization and identification of web communities. IEEE Computer 35(3), 66–71 (2002)

    Google Scholar 

  15. Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.: Trawling the web for emerging cyber-communities. In: Proc. of the 8th WWW conference, pp. 403–416 (1999)

    Google Scholar 

  16. Toyoda, M., Kitsuregawa, M.: Creating a web community chart for navigating related communities. In: Conference Proceedings of Hypertext 2001, pp. 103–112 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Otsuka, S., Kitsuregawa, M. (2006). Clustering of Search Engine Keywords Using Access Logs. In: Bressan, S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2006. Lecture Notes in Computer Science, vol 4080. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11827405_82

Download citation

  • DOI: https://doi.org/10.1007/11827405_82

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-37871-6

  • Online ISBN: 978-3-540-37872-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics