Multiagent Systems for Large Data Clustering

Babu, T. Ravindra; Murty, M. Narasimha; Subrahmanya, S. V.

doi:10.1007/978-1-4419-0522-2_15

T. Ravindra Babu²,
M. Narasimha Murty³ &
S. V. Subrahmanya⁴

1366 Accesses
4 Citations

Abstract

Multiagent system is an applied research area encompassing many disciplines. With increasing computing power and easy availability of storage devices vast volumes of data is available containing enormous amount of hidden information. Generating abstractions from such large data is a challenging data mining task. Efficient large data clustering schemes are important in dealing with such large data. In the current work we provide two different efficient approaches of multiagent based large pattern clustering that would generate abstraction with single database scan, integrating domain knowledge, multiagent systems, data mining and intelligence through agent-mining interaction. We illustrate the approaches based on implementation on practical data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abonyi, J. and Feil, B. and Abraham, A.: Computational Intelligence in Data Mining. In: Informatica(Slovenia), 29, 1, 3–12 (2005)
Google Scholar
Agent-Mining Interaction and Integration(AMII): http://www.agentmining.org
Agogino, A., Tumer, K.: Efficient Agent-Based Clustering Ensembles. AAMAS’ 06, 1079–1086 (2006)
Google Scholar
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. Proc. 1993 ACM-SIGMOD Int. Conf. Management of Data(SIGMOD’93), Washington D.C., 266–271 (1993)
Google Scholar
Allwein, E.L., Schapire, R.E., Singer, Y.: Reducing Multiclass to Binary - A Unifying Approach for Margin Classifiers. Machine Learning. No. 1, 113–141 (2000)
MathSciNet Google Scholar
Baghshah, M.S., Shouraki, S.B., Lucas, C.: An agent-based clustering algorithm using potential fields. AICCSA, IEEE, 551–558 (2008)
Google Scholar
Bajaj, C.: Data Visualization Techniques. John Wiley & Sons, John Wiley & Sons, New York (1999)
Google Scholar
Bekkerman, R., Zilberstein, S., Allan, J.: Web page clustering using heuristic search in the web graph. IJCAI, 2280–2285 (2007)
Google Scholar
Tian, Z., Ramakrishnan R., Micon, L.: BIRCH: An efficient data clustering method for very large databases. Proceedings of ACM SIGMOD, 103–114 (1996)
Google Scholar
Bradley, P. and Fayyad, U.M., Reina, C., Scaling clustering algorithms to large databases, Proceedings of 4th Intl. Conf. Knowledge Discovery and Data Mining, AAAI Press, New York, 9–15 (1998)
Google Scholar
Breban, S., Vissileva, J.: A coalition formation mechanism based on inter-agent trust relationships. In proc. of the 1st Conference on Autonomous Agents and Multi-Agent Systems, Italy, 306–308 (2002)
Google Scholar
Buccafurri, F., Rosaci, D., Sarne, G.M.L., Ursino, D.: An agent-based hierarchical clustering approach for e-commerce environments. In Proceedings of E-Commerce and Web Technologies, 3rd International Conference (EC-Web 2002), France. Lecture Notes in Computer Science, Vol.2455. Springer, 109–118 (2002)
Google Scholar
Cao, L., Zhang, C.: F-Trade: An agent-mining symbiont for financial services. AAMAS’07, May 14–18, Hawaii, USA (2007)
Google Scholar
Cao, L., Yu, P.S., Zhang, C., Zhao, Y., Williams, G.: DDDM2007: Domain Driven Data Mining, SIGKDD Explorations, Vol.9. Issue 2, 84–86 (2007)
Article Google Scholar
Cover, T.M., Hart, P.: Nearest Neighbour pattern classification. IEEE Transactions on Information Theory, Vol 13, 21–27 (1967)
Article MATH Google Scholar
Devijver, P.A., J. Kittler, J.: Pattern Recognition: A Statistical Approach, Prentice Hall, Englewood Cliffs (1986)
Google Scholar
Distributed Data Mining Bibliography, http://www.cs.umbc.edu/∼hillol/DDMBIB/
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, John Wiley & Sons, Wiley-interscience (2000)
Google Scholar
DuMouchel, W., Volinksy, C., Johnson, T., Cortez, C., Pregibon, D.: Squashing Flat Files Flatter. Proc. 5th Int. Conf. on Knowledge Discovery and Data Mining, AAAI Press, San Diego, CA. 6–15, (1999)
Google Scholar
Durfee, E.H., Rosenschein, J.S.: Distrubuted Problem Solving and Multi-Agent Systems - Comparisons and Examples. ftp://www.eecs.umich.edu/people/durfee/daiw94-dr.ps.Z (1994)
Google Scholar
Edwards, P., W. Davies, W.: A Heterogeneous multi-agent learning system. In Proc. of the special interest group on cooperating knowledge based systems, 163–184 (1993)
Google Scholar
Ferber, J.: Multi-Agent Systems. Addison-Wesley, Harlow (1999)
Google Scholar
Freias, A.A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer, New York. (2002)
Book Google Scholar
Ghosh, J., Strehl, A., Merugu, S.: A concensus framework for integrating distributed clusterings under limited knowledge sharing. In NSF Workshop on Next Generation Data Mining, 99–108 (2002)
Google Scholar
Integration of Agents and Data Mining http://www-staff.it.uts.edu.au/∼lbcao/publication/IntegrationofAgentandDataMining.ppt
Garruzzo, S., Rasaci, D.: Agent Clustering Based on Semantic Negotiatiion. ACM Trans. on Autonomous and Adaptive Systems, Vol.3, No.2, Article 7, 7:1–7:40 (2008)
Article Google Scholar
Golfarelli, M., Rizzi, S.: Spatio-Temporal Clustering of Tasks for Swap-Based Negotiation Protocols in Multi-Agent Systems.
Google Scholar
Gomez, J., Dasgupta, D., Nasraoui, O.: A new gravitational clustering algorithm. In Proc. of the Third SIAM International Conference on Data MINING, San Francisco, 83–94 (2003)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation, Proceedings of ACM SIGMOD International Conference of Management of Data(SIGMOD 00), Dallas, Texas, USA, 1–12 (2000)
Google Scholar
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufman, San Francisco, CA (2001)
Google Scholar
Hart, P.E.: The condensed nearest neighbour rule, IEEE Transactions on Information Theory, Vol 14, 515–516 (1968)
Article Google Scholar
Jain, A.K., and Murty, M.N. and P.J. Flynn.: Data Clustering: A Review, ACM Computing Review, 264–323 (1999)
Google Scholar
Jan Tozicka, Michael Rovatsos, Michal Pechoucek: A Framework for Agent-Based Distributed Machine Learning and Data Mining. In Autonomous Agents and Multi-Agent Systems, Article No.96, (AAMAS 2007) ACM Press (2007)
Google Scholar
KDDCup99 Data, http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html (1999)
Kaufman, L., Rouseeuw, P.: Finding Groups in Data - An Introduction to Cluster Analysis. John Wiley & Sons, New York (1990)
Google Scholar
Kazienko, P.: Multiagent system for web advertising, www.zsi.pwr.wroc.pl/∼kazienko/pub/2005/KazienkoKES05.pdf
Kearns, M., Valiant, L.G.: Leaving Boolean formulae or finite automata is as hard as factoring. Harward University Aiken Computation Laboratory. TR-14-88. (1988)
Google Scholar
Kearns, M., Valiant, L.G.: Cryptographic limitations on learning Boolean formulae and finite automata. Journal of the Association for Computing MachineryVol. 41, No.1, 67–95 (1994)
Article MATH MathSciNet Google Scholar
Meyer, J., Intelligent Systems Group: http://www.cs.uu.nl/groups/IS/agents/agents.html
Mitra, S., Acharya, T.: Data Mining: Multimedia, Soft Computing, and Bioinformatics. John Wiley, New York. (2003)
Google Scholar
Mitra, P., and Pal, S.K.: Density based Multiscale Data Condensation, IEEE Trans. on Patter Analysis and Machine Intelligence, Vol.24, No.6, 734–747 (2002)
Article Google Scholar
Multiagent Research Group:http://www.cs.wustl.edu/$\sim$mas
Multiagent Systems: http://www.aaai.org/AITopics/pmwiki/pmwiki.php/AITopics/MultiAgentSystems
Ogston, E., Overreinder, R., van Steen, M., Brazier, F.: A method for decentralizing clustering in large multi-agent systems. AAMAS’03, Australia (2003)
Google Scholar
Ogston, E., Overreinder, R., van Steen, M., Brazier, F.: Group formation among peer-to-peer agents: Learning group characteristics. In 2nd International Workshop, on Agents and Peer-to-peer computing. Lecture Notes in Computer Science, Vol. 2872, Springer, 59–70 (2003)
Google Scholar
Pal, S.K., and Ghosh, A.: Soft computing data mining. In: Information Sciences, Vol. 163, No. 1–3, pp. 1–3 (2004)
Google Scholar
Pal, S.K., Mitra, P.: Pattern Recognition Algorithms for Data Mining, Chapman & Hall/CRC (2004)
Google Scholar
Park, J., Oh, K.: Multi-Agent Systems for Intelligent Clustering. Proc. of World Academy of Science, Engineering and Technology, Vol. 11, February 2006, 97–102 (2006)
Google Scholar
Piraveenan, M., Prokopenko, M., Wang, P. and Zeman, A.: Decentralized multi-agent clustering in scale-free sensor networks. Studies in Computational Intelligence, 115, 485–515 (2008)
Article Google Scholar
Ravindra Babu, T., Narasimha Murty, M.: Comparison of Genetic Algorithms Based Prototype Selection Schemes. Pattern Recognition, 34(2), 523–525 (2001)
Article Google Scholar
Ravindra Babu, T., Narasimha Murty, M., Agrawal, V.K.: Hybrid Learning Scheme for Data Mining Applications, Proceedings of the Fourth International Conference on Hybrid Intelligent Systems, IEEE Computer Society, Los Alamitos, California, 266–271 (2004)
Google Scholar
Ravindra Babu, T., Narasimha Murty, M., Agrawal, V.K.: Adaptive boosting with leader based learners for classification of large handwritten data. Proceedings of the Fourth International Conference on Hybrid Intelligent Systems, IEEE Computer Society, Los Alamitos, California, 266–271 (2004)
Google Scholar
Rosaci, D.: An ontology-based two-level clustering for supporting e-commerce agents activities. In Proceedings of E-Commerce and Web Technologies, Sixth International Conference (EC-Web 2005). Lecture Notes in Computer Science, Vol. 3590. Springer, 31–40 (2005)
Google Scholar
Schapire, R.E.: Theoretical views of Boosting and Applications. Proceedings of Algorithmic Learning Theory. (1999)
Google Scholar
Sen, S., Saha, S., Airiau, S., Candale, T., Banerjee, D., Chakraborty, D., Mukherjee, P., and Gursel, A.: Robust Agent Communities. In Autonomous Intelligent Systems: Agents and Data Mining, V. Gorodetsky, C. Zhang, V.A. Skormin, and L. Cao (Editors), pages 28–45, Lecture Notes in Artificial Intelligence, volume 4476, Springer. (2007)
Google Scholar
Sian, S.: Extending learning to multiple agents: Issues and a model for multi-agent machine learning. In Y. Kodratoff (ed), Machine Learning - EWSL-91, pp 440–456. Springer-Verlag. (1991)
Google Scholar
Spath, H.: Cluster Analysis - Algorithms for Data Reduction and Classification of Objects, West Sussex, UK, Ellis Horwood Limited. (1980)
Google Scholar
Tozicka, J., Rovatsos, M., Pechoucek, M.: A Framework for Agent-Based Distributed Machine Learning and Data Mining. AAMAS’07, pp. 678–685 (2007), May 14–18. (2007)
Google Scholar
Viaenne, S., Darrig, R.A., Dedene, G.: A case study of applying boosting Naive Bayes to claim fraud diagnosis. IEEE Transactions on Knowledge and Data Engineering. Vol. 16, No. 5, 612–620 (2004)
Article Google Scholar
Viswanath, P., Narasimha Murty, M. Shalabh Bhatnagar: Overlap Pattern Synthesis with an efficient nearest neighbor classifier. Pattern Recognition. Vol. 38, No. 8, 1187–1195 (2005)
Article MATH Google Scholar
Weiss, G. (ed). Multiagent Systems - A modern approach to Distributed Artificial Intelligence. The MIT Press (2000)
Google Scholar
Wooldridge, M., Jennings, N.R.: Towards a theory of cooperative problem solving. In proc. of the Workshop of Distributed Software Agents and Applications, Denmark, 40–53 (1994)
Google Scholar
Yoshida, K., Pedrycz: Recent developments in hybrid intelligent systems, In: Int. Journal on Hybrid Intelligent Systems, Vol 2, No. 4, pp 235–236 (2005) Vol 34, 523–525 (2001)
Google Scholar
Zhao, Y., Zhang H., Figueiredo, F., Cao, L., Zhang C.: Mining for combined association rules for multiple datasets. Proc. of 2007 International workshop on domain driven data mining. 18–23 (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

E-Comm Research Lab, Infosys Technologies Limited, 560100, Bangalore, India
T. Ravindra Babu
Department of Computer Science and Automation, Indian Institute of Science, 560012, Bangalore, India
M. Narasimha Murty
E-Comm Research Lab, Infosys Technologies Limited, 560100, Bangalore, India
S. V. Subrahmanya

Authors

T. Ravindra Babu
View author publications
You can also search for this author in PubMed Google Scholar
M. Narasimha Murty
View author publications
You can also search for this author in PubMed Google Scholar
S. V. Subrahmanya
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to T. Ravindra Babu .

Editor information

Editors and Affiliations

Faculty of Engineering and Information Technology, University of Technology, Sydney, Broadway, NSW 2007, Australia
Longbing Cao

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Babu, T.R., Murty, M.N., Subrahmanya, S.V. (2009). Multiagent Systems for Large Data Clustering. In: Cao, L. (eds) Data Mining and Multi-agent Integration. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-0522-2_15

Download citation

DOI: https://doi.org/10.1007/978-1-4419-0522-2_15
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-0521-5
Online ISBN: 978-1-4419-0522-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics