Stacking-Based Integrated Machine Learning with Data Reduction

Czarnowski, Ireneusz; Jędrzejowicz, Piotr

doi:10.1007/978-3-319-59421-7_9

Ireneusz Czarnowski⁶ &
Piotr Jędrzejowicz⁶

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 72))

Included in the following conference series:

International Conference on Intelligent Decision Technologies

1003 Accesses
1 Citations

Abstract

Integrated machine learning is understood as integration of the data reduction with the learning process. Such integration allows to introduce adaptation mechanisms within the learning process by modification of the data with a view to finding its better representation from the point of view of the learning performance criterion. Data modification can be carried out through data reduction in both dimensions, i.e. the feature and the instance ones producing the set of prototypes. Currently, data reduction has become a crucial technique for big data analysis and improvement of the machine learning process results. In this paper the stacking technique has been proposed for improving the process of the integrated machine classification and to assure diversification among prototypes. To validate the proposed approach we have carried-out computational experiment. The paper includes the description of the approach and the discussion of the validating experiment results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. University of California, School of Information and Computer Science, Irvine, CA (2007). http://www.ics.uci.edu/~mlearn/MLRepository.html
Barbucha, D., Czarnowski, I., Jędrzejowicz, P., Ratajczak-Ropel, E., Wierzbowska, I.: e-JABAT – an implementation of the web-based A-Team, In: Nguyen, N.T., Jain, L.C. (eds.) Intelligence Agents in the Evolution of Web and Applications. SCI, vol. 167, pp. 57–86. Springer, Heidelberg (2009). doi:10.1007/978-3-540-88071-4_4
Bull, L.: Learning classifier systems: a brief introduction, applications of learning classifier systems. In: Bull, L. (ed.) STUDFUZZ. Springer (2004)
Google Scholar
Cano, J.R., Herrera, F., Lozano, M.: On the combination of evolutionary algorithms and stratified strategies for training set selection in data mining. Appl. Soft Comput. 6, 323–332 (2004)
Article Google Scholar
Carbonera, J.L., Abel, M.: A density-based approach for instance selection. In: Proceedings of the 2015 IEEE 27th International Conference on Tool with Artificial Intelligence, pp. 768–774 (2015). doi:10.1109/ICTAI.2015.114
Czarnowski, I., Jędrzejowicz, P.: Agent-based data reduction using ensemble technique, In: Badica, C., Nguyen, N.T., Brezovan, M. (Eds.): Computational Collective Intelligence. Technologies and Applications, ICCCI 2013. LNAI, vol. 8083, pp. 447–456. Springer, Heidelberg (2013)
Google Scholar
Czarnowski, I., Jędrzejowicz, P.: An approach to machine classification based on stacked generalization and instance selection. In: Proceedings of 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016, Budapest, Hungary, 9–12 October, 2016, pp. 4836–4841. IEEE (2016)
Google Scholar
Czarnowski, I., Jędrzejowicz, P.: Experimental evaluation of the agent-based population learning algorithm for the cluster-based instance selection, In: Jędrzejowicz P., Nguyen N.T., Hoang K. (eds.): Computational Collective Intelligence, Technologies and Applications, ICCCI 2011. LNAI, vol. 6923, pp. 301–310. Springer, Heidelberg (2011)
Google Scholar
Czarnowski, I., Jędrzejowicz, P.: Learning from examples with data reduction and stacked generalization. J. Intell. Fuzzy Syst. 32(2), 1401–1411 (2017)
Article Google Scholar
Czarnowski, I.: Distributed learning with data reduction, In: Nguyen, N.T. (ed.) Transactions on CCI IV. LNCS, vol. 6660, pp. 3–121. Springer, Heidelberg (2011)
Google Scholar
Czarnowski, I.: Cluster-based instance selection for machine classification. Knowl.-Based Inf. Syst. 30(1), 113–133 (2012)
Article Google Scholar
Dash, M., Liu, H.: Feature selection for classification. Intell. Data Anal. 1(3), 131–156 (1997)
Article Google Scholar
Datasets used for classification: comparison of results. Directory of Data Sets. http://www.is.umk.pl/projects/datasets.html. Accessed 1 Sep 2009
Ho, T.K.: Data complexity analysis for classifier combination, In: Kittler, J., Roli, F. (eds.) MCS 2001. LNCS, vol. 2096, pp. 53–67. Springer, London (2001)
Google Scholar
Holland, J.H.: Adaptation, In: Rosen, R. and Snell, F.M. (eds.) Progress in Theoretical Biology 4, Plenum (1976)
Google Scholar
Jędrzejowicz, J., Jędrzejowicz, P.: Cellular GEP-induced classifiers, In: Pan, J.-S., Chen, S.-M., Nguyen, N.T. (eds.) ICCCI 2010, Part I. LNAI, vol. 6421, pp. 343–352. Springer, Heidelberg (2010)
Google Scholar
Kim, S.-W., Oommen, B.J.: A brief taxonomy and ranking of creative prototype reduction schemes. Pattern Anal. Appl. 6, 232–244 (2003)
Article MathSciNet Google Scholar
Li, Z., Tang, S., Xue, J., Jiang, J.: Modified FCM clustering based on kernel mapping. In: Proceedings of the International Conference on Society for Optical Engineering, vol. 4554, pp. 241–245 (2001). doi:10.1117/12.441658
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, SanMateo (1993)
Google Scholar
Sikora, R., Al-laymoun, O.H.: A modified stacking ensemble machine learning algorithm using genetic algorithms. J. Int. Technol. Inf. Manage. 23(1), 1–11 (2014)
Google Scholar
Skalak, D.B.: Prototype selection for composite neighbor classifiers, University of Massachusetts Amherst (1997). https://web.cs.umass.edu/publication/docs/1996/UM-CS-1996-089.pdf
Stefanowski, J.: Multiple and hybrid classifiers. In: Polkowski, L. (ed.) Formal Methods and Intelligent Techniques in Control, Decision Making. Multimedia and Robotics, Warszawa, pp. 174–188 (2001)
Google Scholar
Tsoumakas, G., Angelis, L., Vlahavas, I.: Clustering classifiers for knowledge discovery from physically distributed databases. Data Knowl. Eng. 49, 223–242 (2004)
Article Google Scholar
Wilson, D.R., Martinez, T.R.: An integrated instance-based learning algorithm. Comput. Intell. 16, 1–28 (2000)
Article MathSciNet Google Scholar
Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-based learning algorithm. Mach. Learn. 33(3), 257–286 (2000)
Article MATH Google Scholar
Wolpert, D.: Stacked Generalization. Neural Netw. 5, 241–259 (1992)
Article Google Scholar
Yıldırım, A.A., Özdoğan, C., Watson, D.: Parallel data reduction techniques for big datasets. In: Hu, W.-C., Kaabouch, N. (eds.) Big Data Management, Technologies, and Applications. IGI Global, pp. 72–93 (2014)
Google Scholar
Zhou, S., Gan, J.Q.: Mercel kernel fuzzy c-means algorithm and prototypes of clusters. In: Proceedings of the International Conference on Data Engineering and Automated Learning. LNCS, vol. 3177, pp. 613–618 (2004). doi:10.1007/978-3-540-28651-6_90
Zhu, X., Wu, X.: Scalable representative instance selection and ranking. In: IEEE Proceedings of the 18th International Conference on Pattern Recognition, vol. 3, pp. 352–355 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Systems, Gdynia Maritime University, Morska 83, 81-225, Gdynia, Poland
Ireneusz Czarnowski & Piotr Jędrzejowicz

Authors

Ireneusz Czarnowski
View author publications
You can also search for this author in PubMed Google Scholar
Piotr Jędrzejowicz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ireneusz Czarnowski .

Editor information

Editors and Affiliations

Maritime University , Gdynia, Poland
Ireneusz Czarnowski
Bournemouth University and KES International, Poole, Dorset, United Kingdom
Robert J. Howlett
University of Canberra, Canberra, Aust Capital Terr, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Czarnowski, I., Jędrzejowicz, P. (2018). Stacking-Based Integrated Machine Learning with Data Reduction. In: Czarnowski, I., Howlett, R., Jain, L. (eds) Intelligent Decision Technologies 2017. IDT 2017. Smart Innovation, Systems and Technologies, vol 72. Springer, Cham. https://doi.org/10.1007/978-3-319-59421-7_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-59421-7_9
Published: 26 May 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59420-0
Online ISBN: 978-3-319-59421-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics