Abstract
This paper proposes a novel Distributed Data Mining (DDM) approach based on the Agents and Artifacts paradigm, as implemented in CArtAgO [9], where artifacts encapsulate data mining tools, inherited from Weka, that agents can use while engaged in collaborative, distributed learning processes. Target hypothesis are currently constrained to decision trees built with J48, but the approach is flexible enough to allow different kinds of learning models. The twofold contribution of this work includes: i) JaCA-DDM: an extensible tool implemented in the agent oriented programming language Jason [2] and CArtAgO [10,9] to experiment DDM agent-based approaches on different, well known training sets. And ii) A collaborative protocol where an agent builds an initial decision tree, and then enhances this initial hypothesis using instances from other agents that are not covered yet (counter examples); reducing in this way the number of instances communicated, while preserving accuracy when compared to full centralized approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bache, K., Lichman, M.: UCI machine learning repository (2013)
Bordini, R.H., Hübner, J.F., Wooldridge, M.: Programming multi-agent systems in Agent Speak using Jason, vol. 8. Wiley-Interscience (2007)
Bourgne, G., El Fallah Segrouchni, A., Soldano, H.: SMILE: Sound multi-agent incremental learning. In: Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems, p. 38. ACM (2007)
Chan, P.K., Stolfo, S.J.: On the accuracy of meta-learning for scalable data mining. Journal of Intelligent Information Systems 8(1), 5–28 (1997)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. ACM SIGKDD Explorations Newsletter 11(1), 10–18 (2009)
Prodromidis, A., Chan, P., Stolfo, S.: Meta-learning in distributed data mining systems: Issues and approaches. Advances in Distributed and Parallel Knowledge Discovery 3 (2000)
Rao, V.S.: Multi agent-based distributed data mining: An overview. International Journal of Reviews in Computing 3, 83–92 (2009)
Rao, V.S., Vidyavathi, S., Ramaswamy, G.: Distributed data mining and agent mining interaction and integration: A novel approach (2010)
Ricci, A., Piunti, M., Viroli, M.: Environment programming in multi-agent systems: an artifact-based perspective. Autonomous Agents and Multi-Agent Systems 23(2), 158–192 (2011)
Ricci, A., Viroli, M., Omicini, A.: Construenda est CArtAgO: Toward an infrastructure for artifacts in MAS. Cybernetics and Systems 2, 569–574 (2006)
Secretan, J.: An Architecture for High-Performance Privacy-Preserving and Distributed Data Mining. PhD thesis, University of Central Florida, Orlando, Florida (2009)
Stolfo, S., Prodromidis, A.L., Tselepis, S., Lee, W., Fan, D.W., Chan, P.K.: Jam: Java agents for meta-learning over distributed databases. In: Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, pp. 74–81 (1997)
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann (2005)
Zeng, L., Li, L., Duan, L., Lu, K., Shi, Z., Wang, M., Wu, W., Luo, P.: Distributed data mining: a survey. Information Technology and Management 13(4), 403–409 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Limón, X., Guerra-Hernández, A., Cruz-Ramírez, N., Grimaldo, F. (2013). An Agents and Artifacts Approach to Distributed Data Mining. In: Castro, F., Gelbukh, A., González, M. (eds) Advances in Soft Computing and Its Applications. MICAI 2013. Lecture Notes in Computer Science(), vol 8266. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45111-9_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-45111-9_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45110-2
Online ISBN: 978-3-642-45111-9
eBook Packages: Computer ScienceComputer Science (R0)