Abstract
The heterogeneous nature of the Web combined with the rapid diffusion of Web-based applications have made Web browsing an intricate activity for users. This has given rise to an urgent need for developing systems capable to assist and guide users during their navigational activity in the Web. Web Usage Mining (WUM) refers to the application of Data Mining techniques for the automatic discovery of meaningful usage patterns characterizing the browsing behavior of users, starting from access data collected from interactions of users with sites. The discovered patterns may be conveniently exploited in order to implement functionalities offering useful assistance to users. This chapter is mainly intended to provide an overview of the different stages involved in a general WUM process. As an example, a WUM approach is presented which is based on the use of fuzzy clustering to discovery user categories starting from usage patterns.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abraham, A.: Business intelligence from web usage mining. Journal of Information & Knowledge Management 2(4), 375–390 (2003)
Abraham, A.: i-Miner: A web usage mining framework using hierarchical intelligent systems. In: Proc. of the IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2003), pp. 1129–1134 (2003)
Agrawal, R., Imielinski, T., Swami, A.N.: Mining association rules between sets of items in large databases. In: Proc. of the 1993 ACM SIGMOD International Conference on Management of Data (SIGMOD 1993), pp. 207–216 (1993)
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proc. of the Eleventh International Conference on Data Engineering (ICDE 1995), pp. 3–14 (1995)
Anderson, C.R., Domingos, P., Weld, D.S.: Adaptive Web Navigation for Wireless Devices. In: Proc. of the 17th International Joint Conference on Artificial Intelligence (IJCAI 2001), pp. 879–884 (2001)
Arotariteia, D., Mitra, S.: Web mining: a survey in the fuzzy framework. Fuzzy Sets and Systems 148(1), 5–19 (2004)
Bayir, M.A., Cosar, A., Toroslu, I.H., Fidan, G.: Smart Miner: A New Framework for Mining Large Scale Web Usage Data. In: Proc. of the 18th International Conference on World Wide Web, pp. 161–170 (2009)
Berendt, B.: Web usage mining, site semantics, and the support of navigation. In: Proc. of Workshop Web Mining for E-Commerce - Challenges and Opportunities, pp. 83–93 (2000)
Runkler, T.A., Bezdek, J.C.: Web mining with relational clustering. International Journal of Approximate Reasoning 32, 217–236 (2003)
Borges, J.A., Levene, M.: Generating Dynamic Higher-Order Markov Models in Web Usage Mining. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 34–45. Springer, Heidelberg (2005)
Davison, B.D.: A Web caching primer. IEEE Internet Computing 5(4), 38–45 (2001)
Buchner, A.G., Mulvenna, M.D.: Discovering internet marketing intelligence through online analytical web usage mining. SIGMOD Record 27(4), 54–61 (1999)
Cadez, I., Heckerman, D., Meek, C., Smyth, P., White, S.: Visualization of Navigation Patterns on a Web Site Using Model Based Clustering. Technical Report MSR-TR-00-18 (2000)
Castellano, G., Fanelli, A.M., Mencar, C., Torsello, M.A.: Log data preprocessing for mining Web browsing patterns. In: Proc. of the 8th Asian Pacific Industrial Engineering and Management Systems Conference (APIEMS 2007) (2007)
Castellano, G., Fanelli, A.M., Torsello, M.A.: Relational Fuzzy approach for Mining User Profiles. In: Aggarwal, A., Yager, R., Sandeberg, I.W. (eds.) Lectures Notes in Computational Intelligence, pp. 175–179. Wseas Press (2007)
Castellano, G., Mesto, F., Minunno, M., Torsello, M.A.: Web User Profiling Using Fuzzy Clustering. In: Masulli, F., Mitra, S., Pasi, G. (eds.) WILF 2007. LNCS (LNAI), vol. 4578, pp. 94–101. Springer, Heidelberg (2007)
Chan, P.K.: A non-invasive learning approach to building Web user profiles. In: Proc. of 5th ACM SIGKDD International Conference, Workshop on Web Usage Analysis and User Profiling, pp. 7–12 (1999)
Chen, P., Kuo, F.: An information retrieval system based on an user profile. The Journal of Systems and Software 54, 3–8 (2000)
Chitraa, V., Davamani, A.S.: A Survey on Preprocessing Methods for Web Usage Data. International Journal of Computer Science and Information Security 7(3), 78–83 (2010)
Cho, Y., Kim, J.K., Kim, S.H.: A personalized recommender system based on web usage mining and decision tree induction. Expert Systems with Applications 23(3), 329–342 (2003)
Cimiano, P., Staab, S.: Learning by googling. SIGKDD Explorations Newsletter 6(2), 24–33 (2004)
Cohen, E., Krishnamurthy, B., Rexford, J.: Improving end-to-end performance of the web using server volumes and proxy filters. In: Proc. of Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (ACM SIGCOMM 1998), pp. 241–253 (1998)
Cooley, R., Mobasher, B., Srivastava, J.: Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems. 1(1), 55–32 (1999)
Cooley, R.: Web usage mining: discovery and application of interesting patterns from Web data. PhD thesis, University of Minnesota (2000)
Costa, M., Gong, Z.: Web structure mining: an introduction. In: Proc. of the IEEE International Conference on Information Acquisition, pp. 590–595 (2005)
Dai, H., Mobasher, B.: Using ontologies to discover domain-level web usage profiles. In: Proc. of the 2nd Semantic Web Mining Workshop (2002)
Facca, F.M., Lanzi, P.: Mining interesting knowledge from weblogs: a survey. Data & Knowledge Engineering 53, 225–241 (2005)
Frías-Martínez, E., Karamcheti, V.: A Customizable Behavior Model for Temporal Prediction of Web User Sequences. In: Zaïane, O.R., Srivastava, J., Spiliopoulou, M., Masand, B. (eds.) WebKDD 2003. LNCS (LNAI), vol. 2703, pp. 66–85. Springer, Heidelberg (2003)
Furnkranz, J.: Web structure mining - exploiting the graph structure of the world-wide web. GAI-Journal 21(2), 17–26 (2002)
Furnkranz, J.: Web mining. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 899–920. Springer (2005)
Ghorbani, A.A., Xu, X.: A fuzzy markov model approach for predicting user navigation. Web Intelligence, 307–311 (2007)
Godoy, D., Amandi, A.: Learning browsing patterns for context-aware recommendation. In: Bramer, M. (ed.) Artificial Intelligence in Theory and Pratice, pp. 61–70. Springer (2006)
Han, J., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufmann (2001)
Krishnapuram, R., Joshi, A., Nasraoui, O., Yi, L.: Low-complexity fuzzy relational clustering algorithms for web mining. Journal IEEE-FS 9, 595–607 (2001)
Halkidi, M., Batistakis, Y., Vazirgiannis, M.: Cluster Validity Methods: Part II. SIGMOD Record 31(3), 19–27 (2002)
Hansen, M., Shriver, E.: Using navigation data to improve IR functions in the context of web search. In: Proc. of the 10th International Conference on Information and Knowledge Management, pp.135–142 (2001)
Heer, J., Chi, E.H.: Identification of web user traffic composition using multi-modal clustering and information scent. In: Proc. of the Workshop on Web Mining, SIAM Conference on Data Mining, pp. 51–58 (2001)
Huang, X., Cercone, N., An, A.: Comparison of interestingness functions for learning web usage patterns. In: Proc. of the 11th International Conference on Information and Knowledge Management, pp. 617–620 (2002)
Hussain, T., Asghar, S., Masood, N.: Web usage mining: A survey on preprocessing of web log file. In: Proc. of the International Conference on Information and Emerging Technologies (ICIET), pp. 1–6 (2010)
Jespersen, S.E., Thorhauge, J., Bach Pedersen, T.: A Hybrid Approach to Web Usage Mining. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2002. LNCS, vol. 2454, pp. 73–82. Springer, Heidelberg (2002)
Joachims, T., Freitag, D., Mitchell, T.: Webwatcher: A tour guide for the world wide web. In: Proc. of the 15th International Conference on Artificial Intelligence, pp. 770–775 (1997)
Joshi, K., Joshi, A., Yesha, Y.: On using a warehouse to analyse web logs. Distributed and Parallel Databases 13(2), 161–180 (2003)
Kamdar, T., Joshi, A.: On creating adaptive web sites using web log mining. Technical report tr-cs-00-05. Department of Computer Science and Electrical Engineering University of Maryland (2000)
Khasawneh, N., Chan, C.-C.: Active user-based and ontology-based web log data preprocessing for web usage mining. In: Proc. of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 325–328 (2006)
Khasawneh, N., Chan, C.-C.: Web Usage Mining Using Rough Sets. In: Proc. of the 2005 Annual Meeting of the North American Fuzzy Information Processing Society, pp. 580–585 (2005)
Kima, J.K., Chob, Y.H., Kimc, W.J., Kimc, J.R., Suha, J.H.: A personalized recommendation procedure for Internet shopping support. Electronic Commerce Research and Applications 1, 301–313 (2002)
Kosala, R., Blockeel, H.: Web mining research: a survey. ACM SIGKDD Explorations Newsletter 2, 1–15 (2000)
Koutri, M., Avouris, N., Daskalaki, S.: A Survey of Web-Usage Mining: Techniques for Building Web-Based Adaptive Hypermedia Systems. In: Chen, S.Y., Magoulas, G.D. (eds.) Adaptable and Adaptive Hypermedia Systems, pp. 125–150. IRM Press (2005)
Lan, B., Bressan, S., Ooi, B.-C., Tay, Y.C.: Making Web Servers Pushier. In: Masand, B., Spiliopoulou, M. (eds.) WebKDD 1999. LNCS (LNAI), vol. 1836, pp. 112–125. Springer, Heidelberg (2000)
Lazzerini, B., Marcelloni, F.: A hierarchical fuzzy clustering-based system to create user profiles. International Journal on Soft Computing 11, 157–168 (2007)
Lieberman, H.: Letizia: An agent that assists web browsing. In: Proc. of the 1995 International Joint Conference on Artificial Intelligence, pp. 924–929 (1995)
Liu, B., Chang, K.C.C.: Editorial: Special issue on web content mining. SIGKDD Explorations special issue on Web Content Mining 6(2), 1–4 (2004)
Maheswari, V.U., Siromoney, A., Mehata, K.M.: The Variable Precision Rough Set Model for Web Usage Mining. In: Zhong, N., Yao, Y., Ohsuga, S., Liu, J. (eds.) WI 2001. LNCS (LNAI), vol. 2198, pp. 520–524. Springer, Heidelberg (2001)
Menasalvas, E., Millan, S., Pena, J., Hadjimichael, M., Marban, O.: Subsessions: a granular approach to click path analysis. In: Proc. of the FUZZ-IEEE Fuzzy Sets and Systems Conference, pp. 12–17 (2002)
Mobasher, B.: Web usage mining and personalization. In: Singh, M.P. (ed.) Practical Handbook of Internet Computing, pp. 1–35. CRC Press (2005)
Mobasher, B., Cooley, R., Srivastava, J.: Automatic personalization based on web usage mining. Communications of the ACM 43(8), 142–151 (2000)
Mobasher, B.: Web usage mining. In: Liu, B. (ed.) Web Data Mining: Exploring Hyperlinks, Contents and Usage Data, pp. 449–483. Springer, Heidelberg (2006)
Mortazavi-Asl, B.: Discovering and mining user web-page traversal patterns. Masters thesis, Simon Fraser University (2001)
Mulvenna, M., Anand, S., Buchner, A.: Personalization on the net using Web mining CACM, vol. 43, pp. 123–125 (2000)
Nanopoulos, A., Katsaros, D., Manolopoulos, Y.: Exploiting Web Log Mining for Web Cache Enhancement. In: Kohavi, R., Masand, B., Spiliopoulou, M., Srivastava, J. (eds.) WebKDD 2001. LNCS (LNAI), vol. 2356, pp. 68–87. Springer, Heidelberg (2002)
Nanopoulos, A., Katsaros, D., Manolopoulos, Y.: Effective prediction of Web-user accesses: A data mining approach. In: Proc. of the 3rd International Workshop on Mining Web Log Data Across (2001)
Nasraoui, O., Frigui, H., Joshi, A., Krishnapuram, R.: Mining Web access log using relational competitive fuzzy clustering. Journal of Computer Engineering 1, 195–204 (1999)
Nasraoui, O., Krishnapuram, R., Joshi, A., Kamdar, T.: Automatic Web User Profiling and Personalization using a Robust Fuzzy Relational Clustering. In: Segovia, J., Szczepaniak, P., Niedzwiedzinski, M. (eds.) E-Commerce and Intelligent Methods in Studies in Fuzziness and Soft Computing. Springer (2002)
Nasraoui, O., Petenes, C.: Combining web usage mining and fuzzy inference for website personalization. In: Proc. of Workshop on Web Mining and Web Usage Analysis, pp. 37–46 (2003)
Nasraoui, O., Soliman, M., Saka, E., Badia, A., Germain, R.: A web usage mining framework for mining evolving user profiles in dynamic web sites. IEEE Transaction on Knowledge Data Engineering 20(2), 202–215 (2008)
Ngu, D.S.W., Wu, X.: Sitehelper: A localized agent that helps incremental exploration of the world wide web. In: Proc. of the 6th International World Wide Web Conference, pp. 1249–1255 (1997)
Oikonomakou, N., Vazirgiannis, M.: A Review of Web Document Clustering Approaches. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 921–943. Springer (2005)
Paulakis, S., Lampos, C., Eirinaki, M., Vazirgiannis, M.: SEWeP: A Web Mining System Supporting Semantic Personalization. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 552–554. Springer, Heidelberg (2004)
Pei, J., Han, J., Motazavi-Asl, B., Zhu, H.: Mining access patterns efficiently from web logs. In: Proc. of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 396–407 (2000)
Pierrakos, D., Paliouras, G., Papatheodorou, C., Spyropoulos, C.D.: Web usage mining as a tool for personalization: a survey. User Modeling and User-Adapted Interaction 13(4), 311–372 (2003)
Piramuthu, S.: On learning to predict web traffic. Decision Support Systems 35(2), 213–229 (2003)
Pitkow, J.: In search of reliable usage data on the WWW. In: Proc. of the 6th Int. World Wide Web Conference, pp. 451–463 (1997)
Rossi, F., De Carvalho, F., Lechevallier, Y., Da Silva, A.: Dissimilarities for Web Usage Mining. In: Batagelj, V., Hans-Hermann, B., Ferligoj, A., Ziberna, A. (eds.) Data Science and Classification, Studies in Classification, Data Analysis and Knowledge Organization, pp. 39–46. Springer (2006)
Roussinov, D., Zhao, J.L.: Automatic discovery of similarity relationships through web mining. Decision Support Systems 35(1), 149–166 (2003)
Sarwar, B.M., Karypis, G., Konstan, J., Riedl, J.: Analysis of recommender algorithms for e-commerce. In: Proc. of the 2nd ACM E-Commerce Conference (EC 2000), pp. 158–167 (2000)
Sathiyamoorthi, V., Murali Bhaskaran, V.: Data Preparation Techniques for Web Usage Mining in World Wide Web-An Approach. International Journal of Recent Trends in Engineering 2(4), 1–4 (2009)
Schafer, J.B., Konstan, J.A., Riedl, J.: E-commerce recommendation applications. Data Mining and Knowledge Discovery 5(1-2), 115–153 (2001)
Schechter, S.E., Krishnan, M., Smith, M.D.: Using path profiles to predict HTTP requests. In: Proc. of the 7th International World Wide Web Conference, pp. 457–467 (1998)
Shahabi, C., Banaei-Kashani, F., Faruque, J.: A reliable, efficient, and scalable system for web usage data acquisition. In: Proc. of WEBKDD 2001 Mining Log Data Across All Customer Touch Points (2001)
Spilipoulou, M., Mobasher, B., Berendt, B.: A framework for the Evaluation of Session Reconstruction Heuristics in Web Usage Analysis. INFORMS Journal on Computing Spring 15(2), 171–190 (2003)
Spiliopoulou, M.: Data mining for the web. In: Proc. of the 3rd European Conference on Principles and Practice of Knowledge Discovery in Databases, pp. 588–589 (1999)
Spiliopoulou, M., Faulstich, L.C.: WUM: A Web Utilization Miner. In: Proc. of the International Workshop on the Web and Databases, pp. 109–115 (1999)
Srivastava, J., Cooley, R., Deshpande, M., Tan, P.-N.: Web usage mining: Discovery and applications of usage patterns from web data. SIGKDD Explorations 1(2), 1–12 (2000)
Stumme, G., Hotho, A., Berendt, B.: Usage Mining for and on the Semantic Web. Methods, pp. 461–481. AAAI Press (2004)
Suryavanshi, B., Shiri, N., Mudur, S.: An efficient technique for mining usage profiles using relational fuzzy subtractive clustering. In: Proc. of the 2005 International Workshop on Challenges in Web Information Retrieval and Integration (WIRI 2005), pp. 23–29 (2005)
Tan, P.N., Kumar, V.: Discovery of web robot sessions based on their navigational patterns. Data Mining and Knowledge Discovery 6(1), 9–35 (2002)
Vakali, A.I., Pokorný, J., Dalamagas, T.: An Overview of Web Data Clustering Practices. In: Lindner, W., Fischer, F., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds.) EDBT 2004. LNCS, vol. 3268, pp. 597–606. Springer, Heidelberg (2004)
Wong, S., Pal, S.: Mining fuzzy association rules for web access case adaptation. In: Proc. of the Workshop on Soft Computing in Case-Based Reasoning (2001)
Zhizhen, L., Pengfei, S.: Similarity measures on intuitionistic fuzzy sets. Pattern Recognition Letter 24, 2687–2693 (2003)
Zhang, D., Dong, Y.: A novel web usage mining approach for search engines. Computer Networks 39(3), 303–310 (2003)
Zhou, B., Hui, S.C., Fong, A.C.M.: Web usage mining for semantic web personalization. In: Proc. of the Workshop on Personalization on the Semantic Web (PerSWeb 2005) (2005)
Xie, Y., Phoha, V.V.: Web user clustering from access log using belief function. In: Proc. of the First International Conference on Knowledge Capture (K-CAP 2001), pp. 202–208 (2001)
Yang, Q., Zhang, H.H.: Web-log mining for predictive web caching. IEEE Transactions on Knowledge and Data Engineering 15(4), 1050–1053 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Castellano, G., Fanelli, A.M., Torsello, M.A. (2013). Web Usage Mining: Discovering Usage Patterns for Web Applications. In: Velásquez, J., Palade, V., Jain, L. (eds) Advanced Techniques in Web Intelligence-2. Studies in Computational Intelligence, vol 452. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33326-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-33326-2_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33325-5
Online ISBN: 978-3-642-33326-2
eBook Packages: EngineeringEngineering (R0)