Agent-Based Distributed Data Mining: A Survey

  • Chayapol Moemeng
  • Vladimir Gorodetsky
  • Ziye Zuo
  • Yong Yang
  • Chengqi Zhang


Distributed data mining is originated from the need of mining over decentralised data sources. Data mining techniques involving in such complex environment must encounter great dynamics due to changes in the system can affect the overall performance of the system. Agent computing whose aim is to deal with complex systems has revealed opportunities to improve distributed data mining systems in a number of ways. This paper surveys the integration of multi-agent system and distributed data mining, also known as agent-based distributed data mining, in terms of significance, system overview, existing systems, and research trends.


Data Mining Multiagent System Mobile Agent Swarm Intelligence Data Mining Technique 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Ajith Abraham, Crina Grosan, and Vitorino Ramos, editors. Swarm Intelligence in Data Mining, volume 34 of Studies in Computational Intelligence. Springer, 2006.Google Scholar
  2. 2.
    Sung W. Baik, Jerzy W. Bala, and Ju S. Cho. Agent based distributed data mining. Lecture Notes in Computer Science, 3320:42–45, 2004.CrossRefGoogle Scholar
  3. 3.
    S. Bailey, R. Grossman, H. Sivakumar, and A. Turinsky. Papyrus: a system for data mining over local and wide area clusters and super-clusters. In Supercomputing ’99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), page 63, New York, NY, USA, 1999. ACM.CrossRefGoogle Scholar
  4. 4.
    R. J. Bayardo, W. Bohrer, R. Brice, A. Cichocki, J. Fowler, A. Helal, V. Kashyap, T. Ksiezyk, G. Martin, M. Nodine, and Others. InfoSleuth: agent-based semantic integration of information in open and dynamic environments. ACM SIGMOD Record, 26(2): 195–206, 1997.CrossRefGoogle Scholar
  5. 5.
    F. Bergenti, M. P. Gleizes, and F. Zambonelli. Methodologies And Software Engineering For Agent Systems: The Agent-oriented Software Engineering Handbook. Kluwer Academic Publishers, 2004.Google Scholar
  6. 6.
    A. Bordetsky. Agent-based Support for Collaborative Data Mining in Systems Management. In Proceedings Of The Annual Hawaii International Conference On System Sciences, page 68, 2001.Google Scholar
  7. 7.
    R. Bose and V. Sugumaran. IDM: an intelligent software agent based data mining environment. 1998 IEEE International Conference on Systems, Man, and Cybernetics, 3, 1998.Google Scholar
  8. 8.
    L. Cao, C. Luo, and C. Zhang. Agent-Mining Interaction: An Emerging Area. Lecture Notes in Computer Science, 4476:60, 2007.CrossRefGoogle Scholar
  9. 9.
    L. Cao, J. Ni, J. Wang, and C. Zhang. Agent Services-Driven Plug and Play in the F-TRADE. In 17th Australian Joint Conference on Artificial Intelligence, volume 3339, pages 917–922. Springer, 2004.Google Scholar
  10. 10.
    J. Dasilva, C. Giannella, R. Bhargava, H. Kargupta, and M. Klusch. Distributed data mining and agents. Engineering Applications of Artificial Intelligence, 18(7):791–807, October 2005.CrossRefGoogle Scholar
  11. 11.
    S. Datta, K. Bhaduri, C. Giannella, R. Wolff, and H. Kargupta. Distributed data mining in peer-to-peer networks. Internet Computing, IEEE, 10(4):18–26, 2006.CrossRefGoogle Scholar
  12. 12.
    W. Davies and P. Edwards. Distributed Learning: An Agent-Based Approach to Data-Mining. In Proceedings of Machine Learning 95 Workshop on Agents that Learn from Other Agents, 1995.Google Scholar
  13. 13.
    U. Fayyad, R. Uthurusamy, and Others. Data mining and knowledge discovery in databases. Communications of the ACM, 39(11):24–26, 1996.CrossRefGoogle Scholar
  14. 14.
    C. Giannella, R. Bhargava, and H. Kargupta. Multi-agent Systems and Distributed Data Mining. Lecture Notes in Computer Science, pages 1–15, 2004.Google Scholar
  15. 15.
    V. Gorodetskiy. Interaction of agents and data mining in ubiquitous environment. In Proceedings of the 2008 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT’08), 2008.Google Scholar
  16. 16.
    V. Gorodetsky, O. Karsaev, and V. Samoilov. Multi-Agent Data and Information Fusion. Nato Science Series Sub Series Iii Computer And Systems Sciences, 198:308, 2005.Google Scholar
  17. 17.
    V. Gorodetsky, O. Karsaev, and V. Samoilov. Infrastructural Issues for Agent-Based Distributed Learning. In Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology, pages 3–6. IEEE Computer Society Washington, DC, USA, 2006.Google Scholar
  18. 18.
    V. Gorodetsky, O. Karsaev, V. Samoylov, and S. Serebryakov. P2P Agent Platform: Implementation and Testing. In Proceedings International Workshop “Agent and Peer-to-Peer Computing” (AP2PC-2007) associated with AAMAS-07. Honolulu, Hawaii, pages 21–32, 2007.Google Scholar
  19. 19.
    V. Gorodetsky and I. Kotenko. The Multi-agent Systems for Computer Network Security Assurance: frameworks and case studies. In IEEE International Conference on Artificial Intelligence Systems, 2002, pages 297–302, 2002.Google Scholar
  20. 20.
    Vladimir Gorodetsky, Oleg Karsaev, and Vladimir Samoilov. Multi-agent technology for distributed data mining and classification. In IAT, pages 438–441. IEEE Computer Society, 2003.Google Scholar
  21. 21.
    Sven A. Brueckner H. Van Dyke Parunak. Engineering swarming systems. Methodologies and Software Engineering for Agent Systems, pages 341–376, 2004.Google Scholar
  22. 22.
    H. Kargupta, I. Hamzaoglu, and B. Stafford. Scalable, distributed data mining using an agent based architecture. In Proceedings the Third International Conference on the Knowledge Discovery and Data Mining, AAAI Press, Menlo Park, California, pages 211–214, 1997.Google Scholar
  23. 23.
    H. Kargupta and K. Sivakumar. Existential pleasures of distributed data mining. Data Mining: Next Generation Challenges and Future Directions, pages 1–25, 2004.Google Scholar
  24. 24.
    Hillol Kargupta, Byung-Hoon Park, Daryl Hershberger, and Erik Johnson. Collective data mining: A new perspective toward distributed data mining. In Advances in Distributed and Parallel Knowledge Discovery, pages 133–184, 1999.Google Scholar
  25. 25.
    M. Klusch, S. Lodi, and G. Moro. Agent-based distributed data mining: The KDEC scheme. AgentLink, Springer Lecture Notes in Computer Science, 2586, 2003.Google Scholar
  26. 26.
    M. Klusch, S. Lodi, and G. Moro. The role of agents in distributed data mining: Issues and benefits. In Proceedings of the 2003 IEEE/WIC International Conference on Intelligent Agent Technology (IAT 2003), pages 211–217, 2003.Google Scholar
  27. 27.
    Matthias Klusch, Stefano Lodi, and Gianluca Moro. Issues of agent-based distributed data mining. In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-03), pages 1034–1035. ACM, 2003.Google Scholar
  28. 28.
    Raymond S. T. Lee and James N. K. Liu. ijade web-miner: An intelligent agent framework for internet shopping. IEEE Transactions on Knowledge and Data Engineering, 16(4):461–473, 2004.CrossRefGoogle Scholar
  29. 29.
    Xining Li and Jingbo Ni. Deploying mobile agents in distributed data mining. Lecture Notes in Computer Science, 4819:322–331, 2007.CrossRefGoogle Scholar
  30. 30.
    F. Menczer, R. K. Belew, and W. Willuhn. Artificial life applied to adaptive information agents. In AAAI Spring Symposium on Information Gathering, pages 128–132, 1995.Google Scholar
  31. 31.
    S. Merugu and J. Ghosh. A distributed learning framework for heterogeneous data sources. In Conference on Knowledge Discovery in Data, pages 208–217. ACM Press New York, NY, USA, 2005.Google Scholar
  32. 32.
    Peerapol Moemeng, Longbing Cao, and Chengqi Zhang. F-TRADE 3.0: An Agent-Based Integrated Framework for Data Mining Experiments. In Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, volume 3, pages 612–615, Los Alamitos, CA, USA, 2008. IEEE Computer Society.Google Scholar
  33. 33.
    L. Panait and S. Luke. Cooperative Multi-Agent Learning: The State of the Art. Autonomous Agents and Multi-Agent Systems, 11(3):387–434, 2005.CrossRefGoogle Scholar
  34. 34.
    Byung H. Park and Hillol Kargupta. Distributed data mining: Algorithms, systems, and applications. In Nong Ye, editor, Data Mining Handbook, pages 341–358. Lawrence Earlbaum Associates, 2002.Google Scholar
  35. 35.
    M. Penang. Distributed Data Mining From Heterogeneous Healthcare Data Repositories: Towards an Intelligent Agent-Based Framework. In Proceedings of the 15th IEEE Symposium on Computer-Based Medical Systems:(CBMS 2002): 4–7 June, 2002, Maribor, Slovenia. IEEE Computer Society, 2002.Google Scholar
  36. 36.
    Andreas Leonidas Prodromidis. Management of intelligent learning agents in distributed data mining systems. PhD thesis, Columbia University, New York, NY, USA, 1999.Google Scholar
  37. 37.
    S. Stolfo, A. L. Prodromidis, S. Tselepis, W. Lee, D. W. Fan, and P. K. Chan. JAM: Java agents for meta-learning over distributed databases. In Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, pages 74–81, 1997.Google Scholar
  38. 38.
    H. Van Dyke Parunak Sven A. Brueckner. Swarming distributed pattern detection and classification. In Environments for Multi-Agent Systems, pages 232–245. Springer, 2005.Google Scholar
  39. 39.
    J. Tozicka, M. Rovatsos, and M. Pechoucek. A framework for agent-based distributed machine learning and data mining. In Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems. ACM New York, NY, USA, 2007.Google Scholar
  40. 40.
    Jan Tozicka, Michal Jakob, and Michal Pechoucek. Market-inspired approach to collaborative learning. Lecture Notes in Computer Science, 4149:213–227, 2006.CrossRefGoogle Scholar
  41. 41.
    Frank E. Walter, Stefano Battiston, and Frank Schweitzer. A model of a trust-based recommendation system on a social network, Nov 2006.Google Scholar
  42. 42.
    Gerhard Weiss. A multiagent perspective of parallel and distributed machine learning. In Agents, pages 226–230, 1998.Google Scholar
  43. 43.
    M. Wooldridge and N. R. Jennings. Agent Theories, Architectures, and Languages: A Survey. Intelligent Agents, 22, 1995.Google Scholar
  44. 44.
    M. Wooldridge, N. R. Jennings, and D. Kinny. The Gaia Methodology for Agent-Oriented Analysis and Design. Autonomous Agents and Multi-Agent Systems, 3(3):285–312, 2000.CrossRefGoogle Scholar
  45. 45.
    S. Z. H. Zaidi, S. S. R. Abidi, S. Manikam, and Cheah Yu-N. Admi: a multi-agent architecture to autonomously generate data mining services. In Intelligent Systems, 2004. Proceedings. 2004 2nd International IEEE Conference, volume 1, pages 273–279 Vol.1, 2004.Google Scholar
  46. 46.
    Mohammed J. Zaki. Parallel and distributed association mining: A survey. IEEE Concurrency, 7(4):14–25, 1999.CrossRefGoogle Scholar
  47. 8.
    Gorodetskiy, V.; Karsaev, O.; Samoilov, V.; Serebryakov, S., Agent-based Service-Oriented Intelligent P2P Networks for Distributed Classification Hybrid Information Technology, 2006.Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  • Chayapol Moemeng
  • Vladimir Gorodetsky
    • 1
  • Ziye Zuo
  • Yong Yang
  • Chengqi Zhang
  1. 1.University of TechnologySydneyAustralia

Personalised recommendations