Abstract
Integrated machine learning is understood as integration of the data reduction with the learning process. Such integration allows to introduce adaptation mechanisms within the learning process by modification of the data with a view to finding its better representation from the point of view of the learning performance criterion. Data modification can be carried out through data reduction in both dimensions, i.e. the feature and the instance ones producing the set of prototypes. Currently, data reduction has become a crucial technique for big data analysis and improvement of the machine learning process results. In this paper the stacking technique has been proposed for improving the process of the integrated machine classification and to assure diversification among prototypes. To validate the proposed approach we have carried-out computational experiment. The paper includes the description of the approach and the discussion of the validating experiment results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. University of California, School of Information and Computer Science, Irvine, CA (2007). http://www.ics.uci.edu/~mlearn/MLRepository.html
Barbucha, D., Czarnowski, I., Jędrzejowicz, P., Ratajczak-Ropel, E., Wierzbowska, I.: e-JABAT – an implementation of the web-based A-Team, In: Nguyen, N.T., Jain, L.C. (eds.) Intelligence Agents in the Evolution of Web and Applications. SCI, vol. 167, pp. 57–86. Springer, Heidelberg (2009). doi:10.1007/978-3-540-88071-4_4
Bull, L.: Learning classifier systems: a brief introduction, applications of learning classifier systems. In: Bull, L. (ed.) STUDFUZZ. Springer (2004)
Cano, J.R., Herrera, F., Lozano, M.: On the combination of evolutionary algorithms and stratified strategies for training set selection in data mining. Appl. Soft Comput. 6, 323–332 (2004)
Carbonera, J.L., Abel, M.: A density-based approach for instance selection. In: Proceedings of the 2015 IEEE 27th International Conference on Tool with Artificial Intelligence, pp. 768–774 (2015). doi:10.1109/ICTAI.2015.114
Czarnowski, I., Jędrzejowicz, P.: Agent-based data reduction using ensemble technique, In: Badica, C., Nguyen, N.T., Brezovan, M. (Eds.): Computational Collective Intelligence. Technologies and Applications, ICCCI 2013. LNAI, vol. 8083, pp. 447–456. Springer, Heidelberg (2013)
Czarnowski, I., Jędrzejowicz, P.: An approach to machine classification based on stacked generalization and instance selection. In: Proceedings of 2016 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2016, Budapest, Hungary, 9–12 October, 2016, pp. 4836–4841. IEEE (2016)
Czarnowski, I., Jędrzejowicz, P.: Experimental evaluation of the agent-based population learning algorithm for the cluster-based instance selection, In: Jędrzejowicz P., Nguyen N.T., Hoang K. (eds.): Computational Collective Intelligence, Technologies and Applications, ICCCI 2011. LNAI, vol. 6923, pp. 301–310. Springer, Heidelberg (2011)
Czarnowski, I., Jędrzejowicz, P.: Learning from examples with data reduction and stacked generalization. J. Intell. Fuzzy Syst. 32(2), 1401–1411 (2017)
Czarnowski, I.: Distributed learning with data reduction, In: Nguyen, N.T. (ed.) Transactions on CCI IV. LNCS, vol. 6660, pp. 3–121. Springer, Heidelberg (2011)
Czarnowski, I.: Cluster-based instance selection for machine classification. Knowl.-Based Inf. Syst. 30(1), 113–133 (2012)
Dash, M., Liu, H.: Feature selection for classification. Intell. Data Anal. 1(3), 131–156 (1997)
Datasets used for classification: comparison of results. Directory of Data Sets. http://www.is.umk.pl/projects/datasets.html. Accessed 1 Sep 2009
Ho, T.K.: Data complexity analysis for classifier combination, In: Kittler, J., Roli, F. (eds.) MCS 2001. LNCS, vol. 2096, pp. 53–67. Springer, London (2001)
Holland, J.H.: Adaptation, In: Rosen, R. and Snell, F.M. (eds.) Progress in Theoretical Biology 4, Plenum (1976)
Jędrzejowicz, J., Jędrzejowicz, P.: Cellular GEP-induced classifiers, In: Pan, J.-S., Chen, S.-M., Nguyen, N.T. (eds.) ICCCI 2010, Part I. LNAI, vol. 6421, pp. 343–352. Springer, Heidelberg (2010)
Kim, S.-W., Oommen, B.J.: A brief taxonomy and ranking of creative prototype reduction schemes. Pattern Anal. Appl. 6, 232–244 (2003)
Li, Z., Tang, S., Xue, J., Jiang, J.: Modified FCM clustering based on kernel mapping. In: Proceedings of the International Conference on Society for Optical Engineering, vol. 4554, pp. 241–245 (2001). doi:10.1117/12.441658
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, SanMateo (1993)
Sikora, R., Al-laymoun, O.H.: A modified stacking ensemble machine learning algorithm using genetic algorithms. J. Int. Technol. Inf. Manage. 23(1), 1–11 (2014)
Skalak, D.B.: Prototype selection for composite neighbor classifiers, University of Massachusetts Amherst (1997). https://web.cs.umass.edu/publication/docs/1996/UM-CS-1996-089.pdf
Stefanowski, J.: Multiple and hybrid classifiers. In: Polkowski, L. (ed.) Formal Methods and Intelligent Techniques in Control, Decision Making. Multimedia and Robotics, Warszawa, pp. 174–188 (2001)
Tsoumakas, G., Angelis, L., Vlahavas, I.: Clustering classifiers for knowledge discovery from physically distributed databases. Data Knowl. Eng. 49, 223–242 (2004)
Wilson, D.R., Martinez, T.R.: An integrated instance-based learning algorithm. Comput. Intell. 16, 1–28 (2000)
Wilson, D.R., Martinez, T.R.: Reduction techniques for instance-based learning algorithm. Mach. Learn. 33(3), 257–286 (2000)
Wolpert, D.: Stacked Generalization. Neural Netw. 5, 241–259 (1992)
Yıldırım, A.A., Özdoğan, C., Watson, D.: Parallel data reduction techniques for big datasets. In: Hu, W.-C., Kaabouch, N. (eds.) Big Data Management, Technologies, and Applications. IGI Global, pp. 72–93 (2014)
Zhou, S., Gan, J.Q.: Mercel kernel fuzzy c-means algorithm and prototypes of clusters. In: Proceedings of the International Conference on Data Engineering and Automated Learning. LNCS, vol. 3177, pp. 613–618 (2004). doi:10.1007/978-3-540-28651-6_90
Zhu, X., Wu, X.: Scalable representative instance selection and ranking. In: IEEE Proceedings of the 18th International Conference on Pattern Recognition, vol. 3, pp. 352–355 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Czarnowski, I., Jędrzejowicz, P. (2018). Stacking-Based Integrated Machine Learning with Data Reduction. In: Czarnowski, I., Howlett, R., Jain, L. (eds) Intelligent Decision Technologies 2017. IDT 2017. Smart Innovation, Systems and Technologies, vol 72. Springer, Cham. https://doi.org/10.1007/978-3-319-59421-7_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-59421-7_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59420-0
Online ISBN: 978-3-319-59421-7
eBook Packages: EngineeringEngineering (R0)