Abstract
The goal of machine learning algorithm is to understand the basic features of a complex system. If the dataset is large and the number of features is large as well, it is possible that one can get features or input variables easily identified. In case the dataset is small, there may be a circumstance that one may miss some of the observations and then eventually ignore some features and find the minimal set of features to define the system adequately. In this chapter we will cover the feature selection methods that choose a subset of important features and skip the rest and the feature extraction methods that form the minimally accepted feature from original set of features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aydin Z, Kaynar O, Görmez Y (2018) Dimensionality reduction for protein secondary structure and solvent accessibility prediction. J Bioinforma Comput Biol 16(5):1850020. https://doi.org/10.1142/S0219720018500208
Lehrmann A, Huber M, Polatkan AC et al (2013) Visualizing dimensionality reduction of systems biology data. Data Min Knowl Disc 27:146–165
Teodoro ML, Phillips GN, Kavraki LE (2002) A dimensionality reduction approach to modeling protein flexibility. In RECOMB '02: proceedings of the sixth annual international conference on Computational biology, pp 299–308. https://doi.org/10.1145/565196.565235
Further Reading
Fukunaga K (1990) Introduction to statistical pattern recognition. Academic, San Diego
Jimenez LO, Landgrebe DA (1997) Supervised classification in high-dimensional space: geometrical, statistical, and asymptotical properties of multivariate data. IEEE Trans Syst Man Cybern 28(1):39–54
Naanaa W, Nuzillard J-M (2005) Blind source separation of positive and partially correlated data. Signal Process 85:1711–1722
Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag 2:559–572
Spearman C (1904) General intelligence objectively determined and measured. Am J Psychol 15:206–221
Torgerson WS (1952) Multidimensional scaling I: theory and method. Psychometrika 17:401–419
Winter ME (1999) N-FINDR: an algorithm for fast autonomous spectral end-member determination in hyperspectral data. Imaging Spectrom 3753:266–275
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Ghosh, S., Dasgupta, R. (2022). Dimensionality Reduction Methods in Machine Learning. In: Machine Learning in Biological Sciences. Springer, Singapore. https://doi.org/10.1007/978-981-16-8881-2_7
Download citation
DOI: https://doi.org/10.1007/978-981-16-8881-2_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8880-5
Online ISBN: 978-981-16-8881-2
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)