Skip to main content

The Sample and Instance Selection for Data Dimensionality Reduction

  • Conference paper
  • First Online:
Recent Advances in Systems, Control and Information Technology (SCIT 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 543))

Abstract

The paper proposes tools for data dimensionality reduction containing sample selection method and instance informativity indicators based on the evolutionary search, which is modified to speed up the search through the creation of special operators, taking into account a priori information about the data sample and concentrating search on the most perspective solution areas. This allows preserving the stochastic nature of the search to accelerate the obtainment of acceptable solutions through the introduction of deterministic component in the search strategy. The proposed methods are experimentally studied. On the results of experiments the comparative characteristics and recommendations for the use of the proposed methods are given.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lavrakas, P.J.: Encyclopedia of Survey Research Methods. Sage Publications, Thousand Oaks (2008)

    Book  Google Scholar 

  2. Korobiichuk, I., Podchashinskiy, Y., Shapovalova, O., Shadura, V., Nowicki, M., Szewczyk, R.: Precision increase in automated digital image measurement systems of geometric values. In: Jabłoński, R., Brezina, T. (eds.) Advanced Mechatronics Solutions. AISC, vol. 393, pp. 335–340. Springer, Heidelberg (2016). doi:10.1007/978-3-319-23923-1_51

    Chapter  Google Scholar 

  3. Hansen, M.H., Hurtz, W.N., Madow, W.G.: Sample Survey Methods and Theory. Wiley, New York (1953)

    Google Scholar 

  4. Bernard, H.R.: Social Research Methods: Qualitative and Quantitative Approaches. Sage Publications, Thousand Oaks (2006)

    Google Scholar 

  5. Korobiichuk, I., Bezvesilna, O., Ilchenko, A., Shadura, V., Nowicki, M., Szewczyk, R.: A mathematical model of the thermo-anemometric flowmeter. Sensors 15, 22899–22913 (2015). doi:10.3390/s150922899

    Article  Google Scholar 

  6. Subbotin, S.: The instance and feature selection for neural network based diagnosis of chronic obstructive bronchitis. In: Bris, R., Majernik, J., Pancerz, K., Zaitseva, E. (eds.) Applications of Computational Intelligence in Biomedical Technology. SCI, vol. 606, pp. 215–228. Springer, Heidelberg (2016). doi:10.1007/978-3-319-19147-8_13

    Chapter  Google Scholar 

  7. Ghosh, S.: Multivariate Analysis, Design of Experiments, and Survey Sampling. Marcel Dekker Inc., New York (1999)

    MATH  Google Scholar 

  8. Subbotin, S.A.: The training set quality measures for neural network learning. Opt. Memory Neural Netw. (Inf. Opt.) 2(19), 126–139 (2010)

    Article  Google Scholar 

  9. Smith, G.: A deterministic approach to partitioning neural network training data for the classification problem: dissertation. Virginia Polytechnic Institute & State University, Blacksburg (2006)

    Google Scholar 

  10. Subbotin, S.A.: The sample properties evaluation for pattern recognition and intelligent diagnosis. In: 10th International Conference on Digital Technologies, pp. 321–332. IEEE, Zilina (2014)

    Google Scholar 

  11. Plutowski, M.: Selecting training exemplars for neural network learning: dissertation. University of California, San Diego (1994)

    Google Scholar 

  12. Oliinyk, A., Zaiko, T., Subbotin, S.: Training sample reduction based on association rules for neuro-fuzzy networks synthesis. Opt. Memory Neural Netw. (Inf. Opt.) 2(23), 89–95 (2014)

    Article  Google Scholar 

  13. Chaudhuri, A., Stenger, H.: Survey Sampling Theory and Methods. Chapman & Hall, New York (2005)

    Book  MATH  Google Scholar 

  14. Subbotin, S.A.: Methods of sampling based on exhaustive and evolutionary search. Autom. Control Comput. Sci. 3(47), 113–121 (2013)

    Article  Google Scholar 

  15. Tenne, Y., Goh, C.-K.: Computational Intelligence in Expensive Optimization Problems. Springer, Berlin (2010)

    MATH  Google Scholar 

  16. Talbi, E.: Metaheuristics: from Design to Implementation. Wiley, Hoboken (2009)

    Book  MATH  Google Scholar 

  17. Iris Data Set. https://archive.ics.uci.edu/ml/datasets/Iris

Download references

Acknowledgements

This paper is prepared with partial support of “Centers of Excellence for young RESearchers” (CERES) project (Reference Number 544137-TEMPUS-1-2013-1-SK-TEMPUS-JPHES) of Tempus Programme of the European Union.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sergey Subbotin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Subbotin, S., Oliinyk, A. (2017). The Sample and Instance Selection for Data Dimensionality Reduction. In: Szewczyk, R., Kaliczyńska, M. (eds) Recent Advances in Systems, Control and Information Technology. SCIT 2016. Advances in Intelligent Systems and Computing, vol 543. Springer, Cham. https://doi.org/10.1007/978-3-319-48923-0_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48923-0_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48922-3

  • Online ISBN: 978-3-319-48923-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics