Skip to main content

Web Usage Mining: Discovering Usage Patterns for Web Applications

  • Chapter
Advanced Techniques in Web Intelligence-2

Part of the book series: Studies in Computational Intelligence ((SCI,volume 452))

Abstract

The heterogeneous nature of the Web combined with the rapid diffusion of Web-based applications have made Web browsing an intricate activity for users. This has given rise to an urgent need for developing systems capable to assist and guide users during their navigational activity in the Web. Web Usage Mining (WUM) refers to the application of Data Mining techniques for the automatic discovery of meaningful usage patterns characterizing the browsing behavior of users, starting from access data collected from interactions of users with sites. The discovered patterns may be conveniently exploited in order to implement functionalities offering useful assistance to users. This chapter is mainly intended to provide an overview of the different stages involved in a general WUM process. As an example, a WUM approach is presented which is based on the use of fuzzy clustering to discovery user categories starting from usage patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abraham, A.: Business intelligence from web usage mining. Journal of Information & Knowledge Management 2(4), 375–390 (2003)

    Article  Google Scholar 

  2. Abraham, A.: i-Miner: A web usage mining framework using hierarchical intelligent systems. In: Proc. of the IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2003), pp. 1129–1134 (2003)

    Google Scholar 

  3. Agrawal, R., Imielinski, T., Swami, A.N.: Mining association rules between sets of items in large databases. In: Proc. of the 1993 ACM SIGMOD International Conference on Management of Data (SIGMOD 1993), pp. 207–216 (1993)

    Google Scholar 

  4. Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proc. of the Eleventh International Conference on Data Engineering (ICDE 1995), pp. 3–14 (1995)

    Google Scholar 

  5. Anderson, C.R., Domingos, P., Weld, D.S.: Adaptive Web Navigation for Wireless Devices. In: Proc. of the 17th International Joint Conference on Artificial Intelligence (IJCAI 2001), pp. 879–884 (2001)

    Google Scholar 

  6. Arotariteia, D., Mitra, S.: Web mining: a survey in the fuzzy framework. Fuzzy Sets and Systems 148(1), 5–19 (2004)

    Article  MathSciNet  Google Scholar 

  7. Bayir, M.A., Cosar, A., Toroslu, I.H., Fidan, G.: Smart Miner: A New Framework for Mining Large Scale Web Usage Data. In: Proc. of the 18th International Conference on World Wide Web, pp. 161–170 (2009)

    Google Scholar 

  8. Berendt, B.: Web usage mining, site semantics, and the support of navigation. In: Proc. of Workshop Web Mining for E-Commerce - Challenges and Opportunities, pp. 83–93 (2000)

    Google Scholar 

  9. Runkler, T.A., Bezdek, J.C.: Web mining with relational clustering. International Journal of Approximate Reasoning 32, 217–236 (2003)

    Article  MATH  Google Scholar 

  10. Borges, J.A., Levene, M.: Generating Dynamic Higher-Order Markov Models in Web Usage Mining. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 34–45. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  11. Davison, B.D.: A Web caching primer. IEEE Internet Computing 5(4), 38–45 (2001)

    Article  Google Scholar 

  12. Buchner, A.G., Mulvenna, M.D.: Discovering internet marketing intelligence through online analytical web usage mining. SIGMOD Record 27(4), 54–61 (1999)

    Article  Google Scholar 

  13. Cadez, I., Heckerman, D., Meek, C., Smyth, P., White, S.: Visualization of Navigation Patterns on a Web Site Using Model Based Clustering. Technical Report MSR-TR-00-18 (2000)

    Google Scholar 

  14. Castellano, G., Fanelli, A.M., Mencar, C., Torsello, M.A.: Log data preprocessing for mining Web browsing patterns. In: Proc. of the 8th Asian Pacific Industrial Engineering and Management Systems Conference (APIEMS 2007) (2007)

    Google Scholar 

  15. Castellano, G., Fanelli, A.M., Torsello, M.A.: Relational Fuzzy approach for Mining User Profiles. In: Aggarwal, A., Yager, R., Sandeberg, I.W. (eds.) Lectures Notes in Computational Intelligence, pp. 175–179. Wseas Press (2007)

    Google Scholar 

  16. Castellano, G., Mesto, F., Minunno, M., Torsello, M.A.: Web User Profiling Using Fuzzy Clustering. In: Masulli, F., Mitra, S., Pasi, G. (eds.) WILF 2007. LNCS (LNAI), vol. 4578, pp. 94–101. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  17. Chan, P.K.: A non-invasive learning approach to building Web user profiles. In: Proc. of 5th ACM SIGKDD International Conference, Workshop on Web Usage Analysis and User Profiling, pp. 7–12 (1999)

    Google Scholar 

  18. Chen, P., Kuo, F.: An information retrieval system based on an user profile. The Journal of Systems and Software 54, 3–8 (2000)

    Article  Google Scholar 

  19. Chitraa, V., Davamani, A.S.: A Survey on Preprocessing Methods for Web Usage Data. International Journal of Computer Science and Information Security 7(3), 78–83 (2010)

    Google Scholar 

  20. Cho, Y., Kim, J.K., Kim, S.H.: A personalized recommender system based on web usage mining and decision tree induction. Expert Systems with Applications 23(3), 329–342 (2003)

    Article  Google Scholar 

  21. Cimiano, P., Staab, S.: Learning by googling. SIGKDD Explorations Newsletter 6(2), 24–33 (2004)

    Article  Google Scholar 

  22. Cohen, E., Krishnamurthy, B., Rexford, J.: Improving end-to-end performance of the web using server volumes and proxy filters. In: Proc. of Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (ACM SIGCOMM 1998), pp. 241–253 (1998)

    Google Scholar 

  23. Cooley, R., Mobasher, B., Srivastava, J.: Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems. 1(1), 55–32 (1999)

    Google Scholar 

  24. Cooley, R.: Web usage mining: discovery and application of interesting patterns from Web data. PhD thesis, University of Minnesota (2000)

    Google Scholar 

  25. Costa, M., Gong, Z.: Web structure mining: an introduction. In: Proc. of the IEEE International Conference on Information Acquisition, pp. 590–595 (2005)

    Google Scholar 

  26. Dai, H., Mobasher, B.: Using ontologies to discover domain-level web usage profiles. In: Proc. of the 2nd Semantic Web Mining Workshop (2002)

    Google Scholar 

  27. Facca, F.M., Lanzi, P.: Mining interesting knowledge from weblogs: a survey. Data & Knowledge Engineering 53, 225–241 (2005)

    Article  Google Scholar 

  28. Frías-Martínez, E., Karamcheti, V.: A Customizable Behavior Model for Temporal Prediction of Web User Sequences. In: Zaïane, O.R., Srivastava, J., Spiliopoulou, M., Masand, B. (eds.) WebKDD 2003. LNCS (LNAI), vol. 2703, pp. 66–85. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  29. Furnkranz, J.: Web structure mining - exploiting the graph structure of the world-wide web. GAI-Journal 21(2), 17–26 (2002)

    Google Scholar 

  30. Furnkranz, J.: Web mining. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 899–920. Springer (2005)

    Google Scholar 

  31. Ghorbani, A.A., Xu, X.: A fuzzy markov model approach for predicting user navigation. Web Intelligence, 307–311 (2007)

    Google Scholar 

  32. Godoy, D., Amandi, A.: Learning browsing patterns for context-aware recommendation. In: Bramer, M. (ed.) Artificial Intelligence in Theory and Pratice, pp. 61–70. Springer (2006)

    Google Scholar 

  33. Han, J., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufmann (2001)

    Google Scholar 

  34. Krishnapuram, R., Joshi, A., Nasraoui, O., Yi, L.: Low-complexity fuzzy relational clustering algorithms for web mining. Journal IEEE-FS 9, 595–607 (2001)

    Google Scholar 

  35. Halkidi, M., Batistakis, Y., Vazirgiannis, M.: Cluster Validity Methods: Part II. SIGMOD Record 31(3), 19–27 (2002)

    Article  Google Scholar 

  36. Hansen, M., Shriver, E.: Using navigation data to improve IR functions in the context of web search. In: Proc. of the 10th International Conference on Information and Knowledge Management, pp.135–142 (2001)

    Google Scholar 

  37. Heer, J., Chi, E.H.: Identification of web user traffic composition using multi-modal clustering and information scent. In: Proc. of the Workshop on Web Mining, SIAM Conference on Data Mining, pp. 51–58 (2001)

    Google Scholar 

  38. Huang, X., Cercone, N., An, A.: Comparison of interestingness functions for learning web usage patterns. In: Proc. of the 11th International Conference on Information and Knowledge Management, pp. 617–620 (2002)

    Google Scholar 

  39. Hussain, T., Asghar, S., Masood, N.: Web usage mining: A survey on preprocessing of web log file. In: Proc. of the International Conference on Information and Emerging Technologies (ICIET), pp. 1–6 (2010)

    Google Scholar 

  40. Jespersen, S.E., Thorhauge, J., Bach Pedersen, T.: A Hybrid Approach to Web Usage Mining. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2002. LNCS, vol. 2454, pp. 73–82. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  41. Joachims, T., Freitag, D., Mitchell, T.: Webwatcher: A tour guide for the world wide web. In: Proc. of the 15th International Conference on Artificial Intelligence, pp. 770–775 (1997)

    Google Scholar 

  42. Joshi, K., Joshi, A., Yesha, Y.: On using a warehouse to analyse web logs. Distributed and Parallel Databases 13(2), 161–180 (2003)

    Article  MATH  Google Scholar 

  43. Kamdar, T., Joshi, A.: On creating adaptive web sites using web log mining. Technical report tr-cs-00-05. Department of Computer Science and Electrical Engineering University of Maryland (2000)

    Google Scholar 

  44. Khasawneh, N., Chan, C.-C.: Active user-based and ontology-based web log data preprocessing for web usage mining. In: Proc. of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 325–328 (2006)

    Google Scholar 

  45. Khasawneh, N., Chan, C.-C.: Web Usage Mining Using Rough Sets. In: Proc. of the 2005 Annual Meeting of the North American Fuzzy Information Processing Society, pp. 580–585 (2005)

    Google Scholar 

  46. Kima, J.K., Chob, Y.H., Kimc, W.J., Kimc, J.R., Suha, J.H.: A personalized recommendation procedure for Internet shopping support. Electronic Commerce Research and Applications 1, 301–313 (2002)

    Article  Google Scholar 

  47. Kosala, R., Blockeel, H.: Web mining research: a survey. ACM SIGKDD Explorations Newsletter 2, 1–15 (2000)

    Article  Google Scholar 

  48. Koutri, M., Avouris, N., Daskalaki, S.: A Survey of Web-Usage Mining: Techniques for Building Web-Based Adaptive Hypermedia Systems. In: Chen, S.Y., Magoulas, G.D. (eds.) Adaptable and Adaptive Hypermedia Systems, pp. 125–150. IRM Press (2005)

    Google Scholar 

  49. Lan, B., Bressan, S., Ooi, B.-C., Tay, Y.C.: Making Web Servers Pushier. In: Masand, B., Spiliopoulou, M. (eds.) WebKDD 1999. LNCS (LNAI), vol. 1836, pp. 112–125. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  50. Lazzerini, B., Marcelloni, F.: A hierarchical fuzzy clustering-based system to create user profiles. International Journal on Soft Computing 11, 157–168 (2007)

    Article  MATH  Google Scholar 

  51. Lieberman, H.: Letizia: An agent that assists web browsing. In: Proc. of the 1995 International Joint Conference on Artificial Intelligence, pp. 924–929 (1995)

    Google Scholar 

  52. Liu, B., Chang, K.C.C.: Editorial: Special issue on web content mining. SIGKDD Explorations special issue on Web Content Mining 6(2), 1–4 (2004)

    MATH  Google Scholar 

  53. Maheswari, V.U., Siromoney, A., Mehata, K.M.: The Variable Precision Rough Set Model for Web Usage Mining. In: Zhong, N., Yao, Y., Ohsuga, S., Liu, J. (eds.) WI 2001. LNCS (LNAI), vol. 2198, pp. 520–524. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  54. Menasalvas, E., Millan, S., Pena, J., Hadjimichael, M., Marban, O.: Subsessions: a granular approach to click path analysis. In: Proc. of the FUZZ-IEEE Fuzzy Sets and Systems Conference, pp. 12–17 (2002)

    Google Scholar 

  55. Mobasher, B.: Web usage mining and personalization. In: Singh, M.P. (ed.) Practical Handbook of Internet Computing, pp. 1–35. CRC Press (2005)

    Google Scholar 

  56. Mobasher, B., Cooley, R., Srivastava, J.: Automatic personalization based on web usage mining. Communications of the ACM 43(8), 142–151 (2000)

    Article  Google Scholar 

  57. Mobasher, B.: Web usage mining. In: Liu, B. (ed.) Web Data Mining: Exploring Hyperlinks, Contents and Usage Data, pp. 449–483. Springer, Heidelberg (2006)

    Google Scholar 

  58. Mortazavi-Asl, B.: Discovering and mining user web-page traversal patterns. Masters thesis, Simon Fraser University (2001)

    Google Scholar 

  59. Mulvenna, M., Anand, S., Buchner, A.: Personalization on the net using Web mining CACM, vol. 43, pp. 123–125 (2000)

    Google Scholar 

  60. Nanopoulos, A., Katsaros, D., Manolopoulos, Y.: Exploiting Web Log Mining for Web Cache Enhancement. In: Kohavi, R., Masand, B., Spiliopoulou, M., Srivastava, J. (eds.) WebKDD 2001. LNCS (LNAI), vol. 2356, pp. 68–87. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  61. Nanopoulos, A., Katsaros, D., Manolopoulos, Y.: Effective prediction of Web-user accesses: A data mining approach. In: Proc. of the 3rd International Workshop on Mining Web Log Data Across (2001)

    Google Scholar 

  62. Nasraoui, O., Frigui, H., Joshi, A., Krishnapuram, R.: Mining Web access log using relational competitive fuzzy clustering. Journal of Computer Engineering 1, 195–204 (1999)

    Google Scholar 

  63. Nasraoui, O., Krishnapuram, R., Joshi, A., Kamdar, T.: Automatic Web User Profiling and Personalization using a Robust Fuzzy Relational Clustering. In: Segovia, J., Szczepaniak, P., Niedzwiedzinski, M. (eds.) E-Commerce and Intelligent Methods in Studies in Fuzziness and Soft Computing. Springer (2002)

    Google Scholar 

  64. Nasraoui, O., Petenes, C.: Combining web usage mining and fuzzy inference for website personalization. In: Proc. of Workshop on Web Mining and Web Usage Analysis, pp. 37–46 (2003)

    Google Scholar 

  65. Nasraoui, O., Soliman, M., Saka, E., Badia, A., Germain, R.: A web usage mining framework for mining evolving user profiles in dynamic web sites. IEEE Transaction on Knowledge Data Engineering 20(2), 202–215 (2008)

    Article  Google Scholar 

  66. Ngu, D.S.W., Wu, X.: Sitehelper: A localized agent that helps incremental exploration of the world wide web. In: Proc. of the 6th International World Wide Web Conference, pp. 1249–1255 (1997)

    Google Scholar 

  67. Oikonomakou, N., Vazirgiannis, M.: A Review of Web Document Clustering Approaches. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 921–943. Springer (2005)

    Google Scholar 

  68. Paulakis, S., Lampos, C., Eirinaki, M., Vazirgiannis, M.: SEWeP: A Web Mining System Supporting Semantic Personalization. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 552–554. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  69. Pei, J., Han, J., Motazavi-Asl, B., Zhu, H.: Mining access patterns efficiently from web logs. In: Proc. of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 396–407 (2000)

    Google Scholar 

  70. Pierrakos, D., Paliouras, G., Papatheodorou, C., Spyropoulos, C.D.: Web usage mining as a tool for personalization: a survey. User Modeling and User-Adapted Interaction 13(4), 311–372 (2003)

    Article  Google Scholar 

  71. Piramuthu, S.: On learning to predict web traffic. Decision Support Systems 35(2), 213–229 (2003)

    Article  Google Scholar 

  72. Pitkow, J.: In search of reliable usage data on the WWW. In: Proc. of the 6th Int. World Wide Web Conference, pp. 451–463 (1997)

    Google Scholar 

  73. Rossi, F., De Carvalho, F., Lechevallier, Y., Da Silva, A.: Dissimilarities for Web Usage Mining. In: Batagelj, V., Hans-Hermann, B., Ferligoj, A., Ziberna, A. (eds.) Data Science and Classification, Studies in Classification, Data Analysis and Knowledge Organization, pp. 39–46. Springer (2006)

    Google Scholar 

  74. Roussinov, D., Zhao, J.L.: Automatic discovery of similarity relationships through web mining. Decision Support Systems 35(1), 149–166 (2003)

    Article  Google Scholar 

  75. Sarwar, B.M., Karypis, G., Konstan, J., Riedl, J.: Analysis of recommender algorithms for e-commerce. In: Proc. of the 2nd ACM E-Commerce Conference (EC 2000), pp. 158–167 (2000)

    Google Scholar 

  76. Sathiyamoorthi, V., Murali Bhaskaran, V.: Data Preparation Techniques for Web Usage Mining in World Wide Web-An Approach. International Journal of Recent Trends in Engineering 2(4), 1–4 (2009)

    Google Scholar 

  77. Schafer, J.B., Konstan, J.A., Riedl, J.: E-commerce recommendation applications. Data Mining and Knowledge Discovery 5(1-2), 115–153 (2001)

    Article  MATH  Google Scholar 

  78. Schechter, S.E., Krishnan, M., Smith, M.D.: Using path profiles to predict HTTP requests. In: Proc. of the 7th International World Wide Web Conference, pp. 457–467 (1998)

    Google Scholar 

  79. Shahabi, C., Banaei-Kashani, F., Faruque, J.: A reliable, efficient, and scalable system for web usage data acquisition. In: Proc. of WEBKDD 2001 Mining Log Data Across All Customer Touch Points (2001)

    Google Scholar 

  80. Spilipoulou, M., Mobasher, B., Berendt, B.: A framework for the Evaluation of Session Reconstruction Heuristics in Web Usage Analysis. INFORMS Journal on Computing Spring 15(2), 171–190 (2003)

    Google Scholar 

  81. Spiliopoulou, M.: Data mining for the web. In: Proc. of the 3rd European Conference on Principles and Practice of Knowledge Discovery in Databases, pp. 588–589 (1999)

    Google Scholar 

  82. Spiliopoulou, M., Faulstich, L.C.: WUM: A Web Utilization Miner. In: Proc. of the International Workshop on the Web and Databases, pp. 109–115 (1999)

    Google Scholar 

  83. Srivastava, J., Cooley, R., Deshpande, M., Tan, P.-N.: Web usage mining: Discovery and applications of usage patterns from web data. SIGKDD Explorations 1(2), 1–12 (2000)

    Article  Google Scholar 

  84. Stumme, G., Hotho, A., Berendt, B.: Usage Mining for and on the Semantic Web. Methods, pp. 461–481. AAAI Press (2004)

    Google Scholar 

  85. Suryavanshi, B., Shiri, N., Mudur, S.: An efficient technique for mining usage profiles using relational fuzzy subtractive clustering. In: Proc. of the 2005 International Workshop on Challenges in Web Information Retrieval and Integration (WIRI 2005), pp. 23–29 (2005)

    Google Scholar 

  86. Tan, P.N., Kumar, V.: Discovery of web robot sessions based on their navigational patterns. Data Mining and Knowledge Discovery 6(1), 9–35 (2002)

    Article  MathSciNet  Google Scholar 

  87. Vakali, A.I., Pokorný, J., Dalamagas, T.: An Overview of Web Data Clustering Practices. In: Lindner, W., Fischer, F., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds.) EDBT 2004. LNCS, vol. 3268, pp. 597–606. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  88. Wong, S., Pal, S.: Mining fuzzy association rules for web access case adaptation. In: Proc. of the Workshop on Soft Computing in Case-Based Reasoning (2001)

    Google Scholar 

  89. Zhizhen, L., Pengfei, S.: Similarity measures on intuitionistic fuzzy sets. Pattern Recognition Letter 24, 2687–2693 (2003)

    Article  MATH  Google Scholar 

  90. Zhang, D., Dong, Y.: A novel web usage mining approach for search engines. Computer Networks 39(3), 303–310 (2003)

    Article  Google Scholar 

  91. Zhou, B., Hui, S.C., Fong, A.C.M.: Web usage mining for semantic web personalization. In: Proc. of the Workshop on Personalization on the Semantic Web (PerSWeb 2005) (2005)

    Google Scholar 

  92. Xie, Y., Phoha, V.V.: Web user clustering from access log using belief function. In: Proc. of the First International Conference on Knowledge Capture (K-CAP 2001), pp. 202–208 (2001)

    Google Scholar 

  93. Yang, Q., Zhang, H.H.: Web-log mining for predictive web caching. IEEE Transactions on Knowledge and Data Engineering 15(4), 1050–1053 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giovanna Castellano .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Castellano, G., Fanelli, A.M., Torsello, M.A. (2013). Web Usage Mining: Discovering Usage Patterns for Web Applications. In: Velásquez, J., Palade, V., Jain, L. (eds) Advanced Techniques in Web Intelligence-2. Studies in Computational Intelligence, vol 452. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33326-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33326-2_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33325-5

  • Online ISBN: 978-3-642-33326-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics