LumberJack: Intelligent Discovery and Analysis of Web User Traffic Composition

  • Ed H. Chi
  • Adam Rosien
  • Jeffrey Heer
Conference paper

DOI: 10.1007/978-3-540-39663-5_1

Part of the Lecture Notes in Computer Science book series (LNCS, volume 2703)
Cite this paper as:
Chi E.H., Rosien A., Heer J. (2003) LumberJack: Intelligent Discovery and Analysis of Web User Traffic Composition. In: Zaïane O.R., Srivastava J., Spiliopoulou M., Masand B. (eds) WEBKDD 2002 - Mining Web Data for Discovering Usage Patterns and Profiles. WebKDD 2002. Lecture Notes in Computer Science, vol 2703. Springer, Berlin, Heidelberg

Abstract

Web Usage Mining enables new understanding of user goals on the Web. This understanding has broad applications, and traditional mining techniques such as association rules have been used in business applications. We have developed an automated method to directly infer the major groupings of user traffic on a Web site [Heer01]. We do this by utilizing multiple data features of user sessions in a clustering analysis. We have performed an extensive, systematic evaluation of the proposed approach, and have discovered that certain clustering schemes can achieve categorization accuracies as high as 99% [Heer02b]. In this paper, we describe the further development of this work into a prototype service called LumberJack, a push-button analysis system that is both more automated and accurate than past systems.

Keywords

Clustering Log Analysis Web Mining User Profile User Sessions World Wide Web 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Ed H. Chi
    • 1
  • Adam Rosien
    • 1
  • Jeffrey Heer
    • 1
  1. 1.PARC (Palo Alto Research Center)Palo AltoUSA

Personalised recommendations