Skip to main content

Dimensionality Reduction Methods in Machine Learning

  • Chapter
  • First Online:
Machine Learning in Biological Sciences
  • 1030 Accesses

Abstract

The goal of machine learning algorithm is to understand the basic features of a complex system. If the dataset is large and the number of features is large as well, it is possible that one can get features or input variables easily identified. In case the dataset is small, there may be a circumstance that one may miss some of the observations and then eventually ignore some features and find the minimal set of features to define the system adequately. In this chapter we will cover the feature selection methods that choose a subset of important features and skip the rest and the feature extraction methods that form the minimally accepted feature from original set of features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Aydin Z, Kaynar O, Görmez Y (2018) Dimensionality reduction for protein secondary structure and solvent accessibility prediction. J Bioinforma Comput Biol 16(5):1850020. https://doi.org/10.1142/S0219720018500208

    Article  CAS  Google Scholar 

  • Lehrmann A, Huber M, Polatkan AC et al (2013) Visualizing dimensionality reduction of systems biology data. Data Min Knowl Disc 27:146–165

    Article  Google Scholar 

  • Teodoro ML, Phillips GN, Kavraki LE (2002) A dimensionality reduction approach to modeling protein flexibility. In RECOMB '02: proceedings of the sixth annual international conference on Computational biology, pp 299–308. https://doi.org/10.1145/565196.565235

Further Reading

  • Fukunaga K (1990) Introduction to statistical pattern recognition. Academic, San Diego

    Google Scholar 

  • Jimenez LO, Landgrebe DA (1997) Supervised classification in high-dimensional space: geometrical, statistical, and asymptotical properties of multivariate data. IEEE Trans Syst Man Cybern 28(1):39–54

    Article  Google Scholar 

  • Naanaa W, Nuzillard J-M (2005) Blind source separation of positive and partially correlated data. Signal Process 85:1711–1722

    Article  Google Scholar 

  • Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag 2:559–572

    Article  Google Scholar 

  • Spearman C (1904) General intelligence objectively determined and measured. Am J Psychol 15:206–221

    Google Scholar 

  • Torgerson WS (1952) Multidimensional scaling I: theory and method. Psychometrika 17:401–419

    Article  Google Scholar 

  • Winter ME (1999) N-FINDR: an algorithm for fast autonomous spectral end-member determination in hyperspectral data. Imaging Spectrom 3753:266–275

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Ghosh, S., Dasgupta, R. (2022). Dimensionality Reduction Methods in Machine Learning. In: Machine Learning in Biological Sciences. Springer, Singapore. https://doi.org/10.1007/978-981-16-8881-2_7

Download citation

Publish with us

Policies and ethics