Exploiting Swarm Behaviour of Simple Agents for Clustering Web Users’ Session Data

Abstract

In recent years the integration and interaction of data mining and multi agent system (MAS) has become a popular approach for tackling the problem of distributed data mining. The use of intelligent optimization techniques in the form of MAS has been demonstrated to be beneficial for the performance of complex, real time, and costly data mining processes. Web session clustering, a sub domain of Web mining is one such problem, tackling the information comprehension problem of the exponentially growing World Wide Web (WWW) by grouping usage sessions on the basis of some similarity measure. In this chapter we present a novel web session clustering approach based on swarm intelligence (SI), a simple agent oriented approach based on communication and cooperation between agents. SI exploits the collective behaviour of simple agents, cooperation between the agents, and emergence on a feasible solution on the basis of their social and cognitive learning capabilities exhibited in the form of MAS. We describe the technique for web session clustering and demonstrate that our approach perform well against benchmark clustering techniques on benchmark session data.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Frawley, W., Piatetsky-Shapiro, G., Matheus, C.: Knowledge Discovery in Databases: An Overview. AI Magazine: pp. 213–228, (1992)Google Scholar
  2. 2.
    Edelstein, H.A.: Introduction to data mining and knowledge discovery (3rd ed). Two Crows Corp, Potomac, MD, (1999)Google Scholar
  3. 3.
    Cao, L., Gorodetski, V.: AREA OVERVIEW-Agent & data mining interaction (ADMI). In: WI-IAT 2006 IADM Workshop panel discussion, Hongkong (2006)Google Scholar
  4. 4.
    Cao, L., Luo, C., Zhang, C.: Agent-Mining Interaction: An Emerging Area, AIS-ADM07, LNAI 4476, 60–73, Springer, (2007)Google Scholar
  5. 5.
    Fu, Y., Sandhu K., Shih, M-Y.: A Generalization-Based Approach to Clustering of Web Usage Sessions Revised Papers from the International Workshop on Web Usage Analysis and User Profiling, p.21–38, (1999)Google Scholar
  6. 6.
    Banerjee, A., Ghosh, J.: Clickstream Clustering using Weighted Longest Common Subsequence. In: Proceedings of the 1st SIAM International Conference on Data Mining: Workshop on Web Mining. (2001)Google Scholar
  7. 7.
    Nichele, C. M., Becker, K.: Clustering Web Sessions by Levels of Page Similarity. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD ’06): 346–350. (2006).Google Scholar
  8. 8.
    Ganesan, P., Garcia-Molina, H., Widom, J.: Exploiting Hierarchical Domain Structure to Compute Similarity. ACM Transactions on Information Systems (TOIS), v.21, n.1, 64–93. (2003)CrossRefGoogle Scholar
  9. 9.
    Abraham, A., Guo, H., Liu, H.: Swarm Intelligence: Foundations, Perspectives and Applications. Swarm Intelligent Systems, Studies in Computational Intelligence, (eds.), pp. 3–25. Springer, (2006)Google Scholar
  10. 10.
    Kennedy, J., Eberhart, R.C.: Particle swarm optimization. International Conference on Neural Networks (ICNN ’95), Vol. IV, Perth, Australia (1995)Google Scholar
  11. 11.
    Engelbrecht, A.P.: Fundamentals of Computational Swarm Intelligence. John Wiley and Sons, (2005)Google Scholar
  12. 12.
    Kennedy, J., Eberhart, R.C.: Swarm intelligence. Morgan Kaufmann Publishers, (2001)Google Scholar
  13. 13.
    Altshuler, Y., Bruckstein, A.M., Wagner, I.A.: On Swarm Optimality In Dynamic And Symmetric Environments. In: Second International Conference on Informatics in Control, Automation and Robotics (ICINCO), Barcelona, Spain, (2005)Google Scholar
  14. 14.
    Cohen, S.C. M., de Castro, L. N.: Data Clustering with Particle Swarms. In: IEEE Congress on Evolutionary Computations, Vancouver, BC, Canada, (2006)Google Scholar
  15. 15.
    Van der Merwe, D.W, Engelbrecht, A.P.: Data clustering using particle swarm optimization. In: Proceedings of IEEE Congress on Evolutionary Computation 2003 (CEC 2003), Canbella, Australia. pp. 215–220, (2003)Google Scholar
  16. 16.
    Xiao, X., Dow, E.R., Eberhart, R.C., Ben Miled, Z., Oppelt, R.J.: Gene clustering using self-organizing maps and particle swarm optimization. In: Proceedings of Second IEEE International Workshop on High Performance Computational Biology, Nice, France, (2003)Google Scholar
  17. 17.
    Chen, C.-Y., Ye, F.: Particle swarm optimization algorithm and its application to clustering analysis. In: Proceedings of IEEE International Conference on Networking, Sensing and Control. pp. 789–794, (2004)Google Scholar
  18. 18.
    Omran, M. G. H., Salman, A., Engelbrecht, A. P.,: Dynamic Clustering Using Particle Swarm Optimization with Application in Image Segmentation. Pattern Analysis and Applications, Vol. 8, pp. 2–344, (2005)Google Scholar
  19. 19.
    Alam, S., Dobbie, G., Riddle, P.: An Evolutionary Particle Swarm Optimization Algorithm For Data Clustering. In: Proceedings of IEEE International Swarm Intelligence Symposium. Missouri, USA, (2008)Google Scholar
  20. 20.
    Abraham, A.: Natural Computation for Business Intelligence from Web Usage Mining. In: Seventh International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC’05), pp. 3–10, (2005)Google Scholar
  21. 21.
    Labroche, N., Monarch, N., Venturini, G.: AntClust: Ant clustering and web usage mining. Genetic and Evolutionary Computation. Chicago, IL. Lecture Notes in Computer Science 2723. Berlin, Heidelberg, Germany: Springer-Verlag. pp 25–36, (2003)Google Scholar
  22. 22.
    Steven Schockaert, Martine De Cock, Chris Cornelis, Etienne E. Kerre: Clustering web search results using fuzzy ants. International Journal of Intelligent Systems, Volume 22, Issue 5, Pages 455–474, (2007)CrossRefMATHGoogle Scholar
  23. 23.
    Chen, J. Z., Huiying : Research on Application of Clustering Algorithm Based on PSO for the Web Usage Pattern. Wireless Communications, Networking and Mobile Computing, (2007)Google Scholar
  24. 24.
    Alam, S., Dobbie, G., Riddle, P.: Particle Swarm Optimization Based Clustering of Web Usage Data. In: ACM/WIC/IEEE International conference on web intelligence. Sydney, Australia, (2008)Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  1. 1.Department of Computer ScienceUniversity of AucklandAucklandNew Zealand

Personalised recommendations