Mining Website Log to Improve Its Findability

  • Jiann-Cherng Shieh
Part of the Communications in Computer and Information Science book series (CCIS, volume 88)


Under the network environments with large amounts of digitalized data, websites are the information strongholds that institutions, organizations or enterprises must set up for their specific purposes. No matter how they have been built, websites should offer the capability that users can find their required information quickly and intuitively. Surfing around the library websites, the website logs always keep tracks of users’ factual behaviors of finding their required information. Thus we can apply data mining techniques possibly to explore users’ information seeking behavior. Based on these evidences, we attempt to reconstruct the websites to promote their internal findability. In this paper, we proposed a heuristic algorithm to clean the website log data, to extract user sub-sessions according to their respective the critical time of session navigation, and to calculate each sub-session’s the threshold time of target page with different weights to determine its navigating parent page. We utilized the alternate parent pages of weights to reconstruct various websites. We conduct task-oriented experiments of 4 tasks and 25 participants to measure the effects of their findability respectively. By the analysis of variance on time to complete the tasks, the result has shown that the reconstructed website has better findability performance.


Usability web log mining findability 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Information Architecture Institute (n.d.),
  2. 2.
    Morville, P.: Ambient findability: Libraries at the crossroads of ubiquitous computing and the internet. Online 29(6), 16–21 (2005)Google Scholar
  3. 3.
  4. 4.
    Deaton, M.: Sorting techniques for user-centered information design (2002),
  5. 5.
    Hawley, M.: Extending card-sorting techniques to inform the design of web site hierarchies (2008),
  6. 6.
    Paul, C.L.: A modified delphi approach to a new card sorting methodology. Information Arts and Technologies University, Baltimore (2007)Google Scholar
  7. 7.
    Upchurch, L., Rugg, G., Kitchenham, B.: Using card sorts to elicit web page quality attributes. IEEE Software 18(4), 84–89 (2001)CrossRefGoogle Scholar
  8. 8.
    Agosti, M., Nunzio, G., Niero, A.: From Web Log Analysis to Web User Profiling. In: DELOS Conference on Digital Libraries, Pisa, Italy (2007)Google Scholar
  9. 9.
    Baglioni, M., Ferrara, U., Romei, A., Ruggieri, S., Turini, F.: Preprocessing and mining web log data for web personalization. In: 8th Congress of the Italian Association for Artificial Intelligence, Pisa, Italy, pp. 237–249 (2003)Google Scholar
  10. 10.
    Morville, P., Rosenfeld, L.: Information Architecture for the World Wide Web, 3rd edn. O’Reilly Media, Sebastopol (2006)Google Scholar
  11. 11.
    International Standard Organization.: ISO 9241 Part 11: Guidance on usability (n.d.),
  12. 12.
    Battleson, B., Booth, A., Weintrop, J.: Usability Testing of an Academic Library Web Site: A Case Study. The Journal of Academic Librarianship 27(3), 188–198 (2001)CrossRefGoogle Scholar
  13. 13.
    Dickstein, R., Mills, V.: Usability testing at the university of Arizona library: How to let users in on the design. Information Technology and Libraries 19(3), 144–150 (2000)Google Scholar
  14. 14.
    Genuis, S.K.: Web site usability testing: A critical tool for libraries. Feliciter 50(4), 161–164 (2004)Google Scholar
  15. 15.
    Jeng, J.: Usability Assessment of Academic Digital Library: Effectiveness, Efficiency, Satisfaction, and Learnability. Libri 55, 96–121 (2005)CrossRefGoogle Scholar
  16. 16.
    Nielsen, J.: Why You Only Need to Test With 5 Users (2000),
  17. 17.
    Nielsen, J.: Return on Investment for usability (2003), (retrieved January 2, 2010)
  18. 18.
    Wei-Kuen, S.: Design Usability Evaluation with Data Mining Methods for Adaptive Websites (2004),
  19. 19.
    Turnbow, D., Kasianovitz, K., Snyder, L., Gilbert, D., Yamamoto, D.: Usability testing for web redesign a UCLA case study. OCLC System & Services 21(3), 226–234 (2005)CrossRefGoogle Scholar
  20. 20. (n.d.) (retrieved March 2, 2010)
  21. 21.
    Srikant, R., Yang, Y.: Mining Web Logs to Improve Website Organization. In: Proceedings of WWW 2001, Hong Kong (2001)Google Scholar
  22. 22.
    Koichiro, M., Masahiro, T., Hashimoto: A Proposal of Web Log Mining Method Considering Page Browsing Time, IPSJ(Information Processing Society of Japan) SIG Notes. ICS (67), 39–44 (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Jiann-Cherng Shieh
    • 1
  1. 1.Graduate Institute of Library and Information StudiesNational Taiwan Normal UniversityTaipeiTaiwan

Personalised recommendations