Skip to main content

Stacking-Based Integrated Machine Learning with Data Reduction

  • Conference paper
  • First Online:
Intelligent Decision Technologies 2017 (IDT 2017)

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 72))

Included in the following conference series:

Abstract

Integrated machine learning is understood as integration of the data reduction with the learning process. Such integration allows to introduce adaptation mechanisms within the learning process by modification of the data with a view to finding its better representation from the point of view of the learning performance criterion. Data modification can be carried out through data reduction in both dimensions, i.e. the feature and the instance ones producing the set of prototypes. Currently, data reduction has become a crucial technique for big data analysis and improvement of the machine learning process results. In this paper the stacking technique has been proposed for improving the process of the integrated machine classification and to assure diversification among prototypes. To validate the proposed approach we have carried-out computational experiment. The paper includes the description of the approach and the discussion of the validating experiment results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. University of California, School of Information and Computer Science, Irvine, CA (2007). http://www.ics.uci.edu/~mlearn/MLRepository.html

  2. Barbucha, D., Czarnowski, I., Jędrzejowicz, P., Ratajczak-Ropel, E., Wierzbowska, I.: e-JABAT – an implementation of the web-based A-Team, In: Nguyen, N.T., Jain, L.C. (eds.) Intelligence Agents in the Evolution of Web and Applications. SCI, vol. 167, pp. 57–86. Springer, Heidelberg (2009). doi:10.1007/978-3-540-88071-4_4

  3. Bull, L.: Learning classifier systems: a brief introduction, applications of learning classifier systems. In: Bull, L. (ed.) STUDFUZZ. Springer (2004)

    Google Scholar 

  4. Cano, J.R., Herrera, F., Lozano, M.: On the combination of evolutionary algorithms and stratified strategies for training set selection in data mining. Appl. Soft Comput. 6, 323–332 (2004)

    Article  Google Scholar 

  5. Carbonera, J.L., Abel, M.: A density-based approach for instance selection. In: Proceedings of the 2015 IEEE 27th International Conference on Tool with Artificial Intelligence, pp. 768–774 (2015). doi:10.1109/ICTAI.2015.114

  6. Czarnowski, I., Jędrzejowicz, P.: Agent-based data reduction using ensemble technique, In: Badica, C., Nguyen, N.T., Brezovan, M. (Eds.): Computational Collective Intelligence. Technologies and Applications, ICCCI 2013. LNAI, vol. 8083, pp. 447–456. Springer, Heidelberg (2013)

    Google Scholar 

  7. Czarnowski, I., Jędrzejowicz, P.: An approach to machine classification based on stacked generalization and instance selection. In: Proceedings of 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016, Budapest, Hungary, 9–12 October, 2016, pp. 4836–4841. IEEE (2016)

    Google Scholar 

  8. Czarnowski, I., Jędrzejowicz, P.: Experimental evaluation of the agent-based population learning algorithm for the cluster-based instance selection, In: Jędrzejowicz P., Nguyen N.T., Hoang K. (eds.): Computational Collective Intelligence, Technologies and Applications, ICCCI 2011. LNAI, vol. 6923, pp. 301–310. Springer, Heidelberg (2011)

    Google Scholar 

  9. Czarnowski, I., Jędrzejowicz, P.: Learning from examples with data reduction and stacked generalization. J. Intell. Fuzzy Syst. 32(2), 1401–1411 (2017)

    Article  Google Scholar 

  10. Czarnowski, I.: Distributed learning with data reduction, In: Nguyen, N.T. (ed.) Transactions on CCI IV. LNCS, vol. 6660, pp. 3–121. Springer, Heidelberg (2011)

    Google Scholar 

  11. Czarnowski, I.: Cluster-based instance selection for machine classification. Knowl.-Based Inf. Syst. 30(1), 113–133 (2012)

    Article  Google Scholar 

  12. Dash, M., Liu, H.: Feature selection for classification. Intell. Data Anal. 1(3), 131–156 (1997)

    Article  Google Scholar 

  13. Datasets used for classification: comparison of results. Directory of Data Sets. http://www.is.umk.pl/projects/datasets.html. Accessed 1 Sep 2009

  14. Ho, T.K.: Data complexity analysis for classifier combination, In: Kittler, J., Roli, F. (eds.) MCS 2001. LNCS, vol. 2096, pp. 53–67. Springer, London (2001)

    Google Scholar 

  15. Holland, J.H.: Adaptation, In: Rosen, R. and Snell, F.M. (eds.) Progress in Theoretical Biology 4, Plenum (1976)

    Google Scholar 

  16. Jędrzejowicz, J., Jędrzejowicz, P.: Cellular GEP-induced classifiers, In: Pan, J.-S., Chen, S.-M., Nguyen, N.T. (eds.) ICCCI 2010, Part I. LNAI, vol. 6421, pp. 343–352. Springer, Heidelberg (2010)

    Google Scholar 

  17. Kim, S.-W., Oommen, B.J.: A brief taxonomy and ranking of creative prototype reduction schemes. Pattern Anal. Appl. 6, 232–244 (2003)

    Article  MathSciNet  Google Scholar 

  18. Li, Z., Tang, S., Xue, J., Jiang, J.: Modified FCM clustering based on kernel mapping. In: Proceedings of the International Conference on Society for Optical Engineering, vol. 4554, pp. 241–245 (2001). doi:10.1117/12.441658

  19. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, SanMateo (1993)

    Google Scholar 

  20. Sikora, R., Al-laymoun, O.H.: A modified stacking ensemble machine learning algorithm using genetic algorithms. J. Int. Technol. Inf. Manage. 23(1), 1–11 (2014)

    Google Scholar 

  21. Skalak, D.B.: Prototype selection for composite neighbor classifiers, University of Massachusetts Amherst (1997). https://web.cs.umass.edu/publication/docs/1996/UM-CS-1996-089.pdf

  22. Stefanowski, J.: Multiple and hybrid classifiers. In: Polkowski, L. (ed.) Formal Methods and Intelligent Techniques in Control, Decision Making. Multimedia and Robotics, Warszawa, pp. 174–188 (2001)

    Google Scholar 

  23. Tsoumakas, G., Angelis, L., Vlahavas, I.: Clustering classifiers for knowledge discovery from physically distributed databases. Data Knowl. Eng. 49, 223–242 (2004)

    Article  Google Scholar 

  24. Wilson, D.R., Martinez, T.R.: An integrated instance-based learning algorithm. Comput. Intell. 16, 1–28 (2000)

    Article  MathSciNet  Google Scholar 

  25. Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-based learning algorithm. Mach. Learn. 33(3), 257–286 (2000)

    Article  MATH  Google Scholar 

  26. Wolpert, D.: Stacked Generalization. Neural Netw. 5, 241–259 (1992)

    Article  Google Scholar 

  27. Yıldırım, A.A., Özdoğan, C., Watson, D.: Parallel data reduction techniques for big datasets. In: Hu, W.-C., Kaabouch, N. (eds.) Big Data Management, Technologies, and Applications. IGI Global, pp. 72–93 (2014)

    Google Scholar 

  28. Zhou, S., Gan, J.Q.: Mercel kernel fuzzy c-means algorithm and prototypes of clusters. In: Proceedings of the International Conference on Data Engineering and Automated Learning. LNCS, vol. 3177, pp. 613–618 (2004). doi:10.1007/978-3-540-28651-6_90

  29. Zhu, X., Wu, X.: Scalable representative instance selection and ranking. In: IEEE Proceedings of the 18th International Conference on Pattern Recognition, vol. 3, pp. 352–355 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ireneusz Czarnowski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Czarnowski, I., Jędrzejowicz, P. (2018). Stacking-Based Integrated Machine Learning with Data Reduction. In: Czarnowski, I., Howlett, R., Jain, L. (eds) Intelligent Decision Technologies 2017. IDT 2017. Smart Innovation, Systems and Technologies, vol 72. Springer, Cham. https://doi.org/10.1007/978-3-319-59421-7_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-59421-7_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-59420-0

  • Online ISBN: 978-3-319-59421-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics