Skip to main content

Cluster Integration for the Cluster-Based Instance Selection

  • Conference paper
Computational Collective Intelligence. Technologies and Applications (ICCCI 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6421))

Included in the following conference series:

Abstract

The problem addressed in this paper concerns data reduction through instance selection. The paper proposes an approach based on instance selection from clusters. The process of selection and learning is executed by a team of agents. The approach aims at obtaining a compact representation of the dataset, where the upper bound on the size of data is determined by the user. The basic assumption is that the instance selection is carried out after the training data have been grouped into clusters. The cluster initialization and integration strategies are proposed and experimentally evaluated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Asuncion, A., Newman, D.J.: UCI Machine Learning Repository, School of Information and Computer Science. University of California, Irvine (2007), http://www.ics.uci.edu/~mlearn/MLRepository.html

    Google Scholar 

  2. Barbucha, D., Czarnowski, I., Jędrzejowicz, P., Ratajczak-Ropel, E., Wierzbowska, I.: e-JABAT - An Implementation of the Web-Based A-Team. In: Nguyen, N.T., Jain, I.C. (eds.) Intelligent Agents in the Evolution of Web and Applications, Studies in Computational Intelligence, vol. 167, pp. 57–86. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  3. Bellifemine, F., Caire, G., Poggi, A., Rimassa, G.: JADE. A white paper, Exp. 3(3), 6–20 (2003)

    MATH  Google Scholar 

  4. Czarnowski, I., Jędrzejowicz, P.: An Approach to Instance Reduction in Supervised Learning. In: Coenen, F., Preece, A., Macintosh, A. (eds.) Research and Development in Intelligent Systems XX, pp. 267–282. Springer, London (2004)

    Chapter  Google Scholar 

  5. Czarnowski, I.: Cluster-based instance selection for machine classification. Knowledge and Information Systems (to appear, 2010)

    Google Scholar 

  6. Datasets used for classification: comparison of results. In. directory of data sets, http://www.is.umk.pl/projects/datasets.html (accessed 1 September 2009)

  7. Hamo, Y., Markovitch, S.: The COMPSET Algorithm for Subset Selection. In: Proceedings of The Nineteenth International Joint Conference for Artificial Intelligence, Edinburgh, Scotland, pp. 728–733 (2005)

    Google Scholar 

  8. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, pp. 520–528. Springer, New York (2009)

    Book  MATH  Google Scholar 

  9. Klusch, M., Lodi, S., Moro, G.-L.: Agent-Based Distributed Data Mining: The KDEC Scheme. In: Klusch, M., et al. (eds.) Intelligent Information Agents. LNCS (LNAI), vol. 2586, pp. 104–122. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  10. Krishnaswamy, S., Zaslavsky, A., Loke, S.W.: Techniques for Estimating the Computation and Communication Costs of Distributed Data Mining. In: Sloot, P.M.A., et al. (eds.) ICCS-ComputSci 2002. LNCS, vol. 2329, pp. 603–612. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Liu, H., Lu, H., Yao, J.: Identifying Relevant Databases for Multidatabase Mining. In: Proceedings of Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 210–221 (1998)

    Google Scholar 

  12. Quinlan, J.R.: C4.5: Programs for machine learning. Morgan Kaufmann Publishers, SanMateo (1993)

    Google Scholar 

  13. Rozsypal, A., Kubat, M.: Selecting Representative Examples and Attributes by a Genetic Algorithm. Intelligent Data Analysis 7(4), 291–304 (2003)

    MATH  Google Scholar 

  14. Silva, J., Giannella, C., Bhargava, R., Kargupta, H., Klusch, M.: Distributed Data Mining and Agents. Engineering Applications of Artificial Intelligence Journal 18, 791–807 (2005)

    Article  Google Scholar 

  15. Skalak, D.B.: Prototype and Feature Selection by Sampling and Random Mutation Hill Climbing Algorithm. In: Proceedings of the International Conference on Machine Learning, pp. 293–301 (1994)

    Google Scholar 

  16. Stolfo, S., Prodromidis, A.L., Tselepis, S., Lee, W., Fan, D.W.: JAM: Java Agents for Meta-learning over Distributed Databases. In: Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, Newport Beach, CA, pp. 74–81. AAAI Press, Menlo Park (1997)

    Google Scholar 

  17. Talukdar, S., Baerentzen, L., Gove, A., de Souza P.: Asynchronous Teams: Co-operation Schemes for Autonomous, Computer-Based Agents, Technical Report EDRC 18-59-96, Carnegie Mellon University, Pittsburgh (1996)

    Google Scholar 

  18. Tsoumakas, G., Angelis, L., Vlahavas, I.: Clustering Classifiers for Knowledge Discovery from Physical Distributed Databased. Data and Knowledge Enginering 49(3) (2004)

    Google Scholar 

  19. Vucetic, S., Obradovic, Z.: Performance Controlled Data Reduction for Knowledge Discovery in Distributed Databases. In: Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 29–39 (2000)

    Google Scholar 

  20. Wilson, D.R., Martinez, T.R.: Reduction Techniques for Instance-based Learning Algorithm. Machine Learning 33(3), 257–286 (2000)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Czarnowski, I., Jędrzejowicz, P. (2010). Cluster Integration for the Cluster-Based Instance Selection. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science(), vol 6421. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16693-8_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16693-8_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16692-1

  • Online ISBN: 978-3-642-16693-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics