Abstract
Image analysis is a prolific field of research which has been broadly studied in the last decades, successfully applied to a great number of disciplines. Since the apparition of Big Data, the number of digital images is explosively growing, and a large amount of multimedia data is publicly available. Not only is it necessary to deal with this increasing number of images, but also to know which features extract from them, and feature selection can help in this scenario. The goal of this paper is to survey the most recent feature selection methods developed and/or applied to image analysis, covering the most popular fields such as image classification, image segmentation, etc. Finally, an experimental evaluation on several popular datasets using well-known feature selection methods is presented, bearing in mind that the aim is not to provide the best feature selection method, but to facilitate comparative studies for the research community.
Similar content being viewed by others
Notes
http://www.image-net.org/challenges/LSVRC/.
References
Barbu A, She Y, Ding L, Gramajo G (2017) Feature selection with annealing for computer vision and big data learning. IEEE Trans Pattern Anal Mach Intell 39(2):272–286
Blake CL, Merz CJ (1998) UCI machine learning repository, vol 55. Department of Information and Computer Science, University of California. http://archive.ics.uci.edu/ml/. Accessed August 2019
Bolón-Canedo V, Ataer-Cansizoglu E, Erdogmus D, Kalpathy-Cramer J, Fontenla-Romero O, Alonso-Betanzos A, Chiang M (2015a) Dealing with inter-expert variability in retinopathy of prematurity: a machine learning approach. Comput Methods Programs Biomed 122(1):1–15
Bolon-Canedo V, Sanchez-Marono N, Alonso-Betanzos A (2015b) Feature selection for high-dimensional data. Springer, Berlin
Bolón-Canedo V, Sánchez-Maroño N, Alonso-Betanzos A (2015c) Recent advances and emerging challenges of feature selection in the context of big data. Knowl Based Syst 86:33–45
Bossard L, Guillaumin M, Van Gool L (2014) Food-101—mining discriminative components with random forests. In: European conference on computer vision, pp 446–461
Brown G, Pocock A, Zhao MJ, Luján M (2012) Conditional likelihood maximisation: a unifying framework for information theoretic feature selection. J Mach Learn Res 13:27–66
Chen EL, Chung PC, Chen CL, Tsai HM, Chang CI (1998) An automatic diagnostic system for CT liver image classification. IEEE Trans Biomed Eng 45(6):783–794
Chen L, Chen B, Chen Y (2011) Image feature selection based on ant colony optimization. In: AI 2011: advances in artificial intelligence, pp 580–589
Chen B, Chen L, Chen Y (2013) Efficient ant colony optimization for image feature selection. Signal Process 93(6):1566–1576
Chen X, Liu W, Su F, Shao G (2016) Semi-supervised multiview feature selection with label learning for VHR remote sensing images. In: IEEE international geoscience and remote sensing symposium, pp 2372–2375
Cheng HD, Jiang X, Sun Y, Wang J (2001) Color image segmentation: advances and prospects. Pattern Recognit 34(12):2259–2281
Cheng MM, Liu Y, Hou Q, Bian J, Torr P, Hu SM, Tu Z (2016) HFS: hierarchical feature selection for efficient image segmentation. In: European conference on computer vision, pp 867–882
Chua TS, Tang J, Hong R, Li H, Luo Z, Zheng Y (2009) NUS-WIDE: a real-world web image database from National University of Singapore. In: ACM international conference on image and video retrieval, p 48
Cong Y, Wang S, Fan B, Yang Y, Yu H (2016) UDSFS: unsupervised deep sparse feature selection. Neurocomputing 196:150–158
Dash M, Liu H (2003) Consistency-based search in feature selection. Artif Intell 151(1):155–176
Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):5
Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition, pp 248–255
Deselaers T, Keysers D, Ney H (2008) Features for image retrieval: an experimental comparison. Inf Retr 11(2):77–107
du Buf JMH, Kardan M, Spann M (1990) Texture feature performance for image segmentation. Pattern Recognit 23(3–4):291–309
Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The PASCAL visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338
Everingham M, Eslami SA, Van Gool L, Williams CK, Winn J, Zisserman A (2015) The pascal visual object classes challenge: a retrospective. Int J Comput Vis 111(1):98–136
Fahmi H, Zen RA, Sanabila HR, Nurhaida I, Arymurthy AM (2016) Feature selection and reduction for Batik image retrieval. In: Proceedings of the fifth international conference on network, communication and computing, pp 47–52
Fernández-Delgado M, Cernadas E, Barro S, Amorim D (2014) Do we need hundreds of classifiers to solve real world classification problems. J Mach Learn Res 15(1):3133–3181
Gao W, Hu L, Zhang P (2018a) Class-specific mutual information variation for feature selection. Pattern Recognit 79:328–339
Gao W, Hu L, Zhang P, Wang F (2018b) Feature selection by integrating two groups of feature evaluation criteria. Expert Syst Appl 110:11–19
Ghamisi P, Benediktsson JA (2015) Feature selection based on hybridization of genetic algorithm and particle swarm optimization. IEEE Geosci Remote Sens Lett 12(2):309–313
Gonzalez RC, Woods RE (2008) Digital image processing, 3rd edn. Pearson, Prentice Hall, Englewood Cliffs
Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset
Guo G, Fu Y, Dyer CR, Huang TS (2008) Image-based human age estimation by manifold learning and locally adjusted robust regression. IEEE Trans Image Process 17(7):1178–1188
Guyon I, Weston J, Barnhill S, Vapnik V (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46(1–3):389–422
Guyon I, Gunn S, Ben-Hur A, Dror G (2005) Result analysis of the NIPS 2003 feature selection challenge. In: Advances in neural information processing systems, pp 545–552
Guyon I, Gunn S, Nikravesh M, Zadeh LA (2006) Feature extraction: foundations and applications. Springer, Berlin
Hall MA (1999) Correlation-based feature selection for machine learning. Ph.D. thesis, The University of Waikato
Hall MA, Smith LA (1998) Practical feature subset selection for machine learning. Comput Sci 98:181–191
Haralick RM, Shanmugam K, Dinstein I (1973) Texture features for image classification. IEEE Trans Syst Man Cybern 3:610–621
Izadipour A, Akbari B, Mojaradi B (2016) A feature selection approach for segmentation of very high-eesolution satellite images. Photogramm Eng Remote Sens 82(3):213–222
Jain AK, Vailaya A (1996) Image retrieval using color and shape. Pattern Recognit 29(8):1233–1244
Jia S, Qian Y, Li J, Liu W, Ji Z (2010) Feature extraction and selection hybrid algorithm for hyperspectral imagery classification. In: IEEE international geoscience and remote sensing symposium, pp 72–75
Jia Y, Huang C, Darrell T (2012) Beyond spatial pyramids: receptive field learning for pooled image features. In: IEEE conference on computer vision and pattern recognition, pp 3370–3377
Jia S, Zhu Z, Shen L, Li Q (2014) A two-stage feature selection framework for hyperspectral image classification using few labeled samples. IEEE J Sel Top Appl Earth Obs Remote Sens 7(4):1023–1035
Jin C, Jin SW (2015) Automatic image annotation using feature selection based on improving quantum particle swarm optimization. Signal Process 109:172–181
Jin C, Liu J, Guo J (2015) A hybrid model based on mutual information and support vector machine for automatic image annotation. In: Artificial intelligence perspectives and applications, pp 29–38
Juan L, Gwun O (2009) A comparison of SIFT, PCA-SIFT and SURF. Int J Image Process 3(4):143–152
Kerroum MA, Hammouch A, Aboutajdine D (2010) Textural feature selection by joint mutual information based on Gaussian mixture model for multispectral image classification. Pattern Recognit Lett 31(10):1168–1174
Kira K, Rendell LA (1992) The feature selection problem: traditional methods and a new algorithm. In: National conference on artificial intelligence, pp 129–129
Kong T, Yao A, Chen Y, Sun F (2016) Hypernet: towards accurate region proposal generation and joint object detection. In: IEEE conference on computer vision and pattern recognition, pp 845–853
Kononenko I (1994) Estimating attributes: analysis and extensions of RELIEF. In: Machine learning: ECML-94, pp 171–182
Korytkowski M, Rutkowski L, Scherer R (2016) Fast image classification by boosting fuzzy classifiers. Inf Sci 327:175–182
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Kuo BC, Ho HH, Li CH, Hung CC, Taur JS (2014) A kernel-based feature selection method for SVM with RBF kernel for hyperspectral image classification. IEEE J Sel Top Appl Earth Obs Remote Sens 7(1):317–326
Laliberte AS, Browning D, Rango A (2012) A comparison of three feature selection methods for object-based classification of sub-decimeter resolution UltraCam-L imagery. Int J Appl Earth Obs Geoinf 15:70–78
Landgrebe DA (1980) The development of a spectral-spatial classifier for earth observational data. Pattern Recognit 12(3):165–175
Learned-Miller E, Huang GB, RoyChowdhury A, Li H, Hua G (2016) Labeled faces in the wild: a survey. In: Advances in face detection and facial image analysis, pp 189–248
LeCun Y, Cortes C, Burges CJ (2010) MNIST handwritten digit database, vol 2. AT&T Labs. http://yann.lecun.com/exdb/mnist. Accessed August 2019
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Lee J, Weger R, Sengupta S, Welch R (1990) A neural network approach to cloud classification. IEEE Trans Geosci Remote Sens 28(5):846–855
Levin A, Weiss Y (2009) Learning to combine bottom-up and top-down segmentation. Int J Comput Vis 81(1):105–118
Li Z, Tang J (2015) Unsupervised feature selection via nonnegative spectral analysis and redundancy control. IEEE Trans Image Process 24(12):5343–5355
Li R, Lu J, Zhang Y, Zhao T (2010) Dynamic adaboost learning with feature selection based on parallel genetic algorithm for image annotation. Knowl Based Syst 23(3):195–201
Li S, Yu H, Yuan L (2016a) A novel approach to remote sensing image retrieval with multi-feature VP-tree indexing and online feature selection. In: IEEE second international conference on multimedia big data, pp 133–136
Li Y, Shi X, Du C, Liu Y, Wen Y (2016b) Manifold regularized multi-view feature selection for social image annotation. Neurocomputing 204:135–141
Liang Y, Zhang M, Browne WN (2017) Image feature selection using genetic programming for figure-ground segmentation. Eng Appl Artif Intell 62:96–108
Lim YW, Lee SU (1990) On the color image segmentation algorithm based on the thresholding and the fuzzy c-means techniques. Pattern Recognit 23(9):935–952
Lin TY, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755
Liu Y, Zhang D, Lu G, Ma WY (2007) A survey of content-based image retrieval with high-level semantics. Pattern Recognit 40(1):262–282
Liu Y, Cheng MM, Hu X, Wang K, Bai X (2017) Richer convolutional features for edge detection. In: IEEE conference on computer vision and pattern recognition, pp 5872–5881
Lotfabadi MS, Shiratuddin MF, Wong KW (2015) Utilising fuzzy rough set based on mutual information decreasing method for feature reduction in an image retrieval system. In: Innovations and advances in computing, informatics, systems sciences, networking and engineering, pp 177–184
Loughrey J, Cunningham P (2005) Overfitting in wrapper-based feature subset selection: the harder you try the worse it gets. In: Research and development in intelligent systems XXI, pp 33–43
Lu D, Weng Q (2007) A survey of image classification methods and techniques for improving classification performance. Int J Remote Sens 28(5):823–870
Lu J, Zhao T, Zhang Y (2008) Feature selection based-on genetic algorithm for image annotation. Knowl Based Syst 21(8):887–891
Ma Z, Nie F, Yang Y, Uijlings JR, Sebe N (2012) Web image annotation via subspace-sparsity collaborated feature selection. IEEE Trans Multimed 14(4):1021–1030
Ma L, Li M, Gao Y, Chen T, Ma X, Qu L (2017) A novel wrapper approach for feature selection in object-based image classification using polygon-based cross-validation. IEEE Geosci Remote Sens Lett 14(3):409–413
Maaten LVD, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605
Makadia A, Pavlovic V, Kumar S (2008) A new baseline for image annotation. In: European conference on computer vision, pp 316–329
Mui JK, Fu KS (1980) Automated classification of nucleated blood cells using a binary tree classifier. IEEE Trans Pattern Anal Mach Intell 2(5):429–443
Ng WW, Dorado A, Yeung DS, Pedrycz W, Izquierdo E (2007) Image classification with the use of radial basis function neural networks and the minimization of the localized generalization error. Pattern Recognit 40(1):19–32
Núñez J, Llacer J (2003) Astronomical image segmentation by self-organizing neural networks and wavelets. Neural Netw 16(3):411–417
Pal M, Foody GM (2010) Feature selection for classification of hyperspectral data by SVM. IEEE Trans Geosci Remote Sens 48(5):2297–2307
Patil U, Mudengudi U (2011) Image fusion using hierarchical PCA. In: International conference on image information Processing, pp 1–6
Perez CA, Estévez PA, Vera PA, Castillo LE, Aravena CM, Schulz DA, Medina LE (2011) Ore grade estimation by feature selection and voting using boundary detection in digital image analysis. Int J Miner Process 101(1):28–36
Picard RW, Minka TP (1995) Vision texture for annotation. Multimed Syst 3(1):3–14
Porebski A, Vandenbroucke N, Macaire L (2010) Comparison of feature selection schemes for color texture classification. In: International conference on image processing theory tools and applications, pp 32–37
Qi C, Zhou Z, Sun Y, Song H, Hu L, Wang Q (2017) Feature selection and multiple kernel boosting framework based on PSO with mutation mechanism for hyperspectral classification. Neurocomputing 220:181–190
Raut SA, Raghuwanshi M, Dharaskar R, Raut A (2009) Image segmentation–a state-of-art survey for prediction. In: International conference on advanced computer control, pp 420–424
Remeseiro B, Penas M, Barreira N, Mosquera A, Novo J, García-Resúa C (2013) Automatic classification of the interferential tear film lipid layer using colour texture analysis. Comput Methods Programs Biomed 111(1):93–103
Remeseiro B, Bolon-Canedo V, Peteiro-Barral D, Alonso-Betanzos A, Guijarro-Berdinas B, Mosquera A, Penedo MG, Sanchez-Marono N (2014) A methodology for improving tear film lipid layer classification. IEEE J Biomed Health Inform 18(4):1485–1493
Roffo G, Melzi S, Cristani M (2015) Infinite feature selection. In: IEEE international conference on computer vision, pp 4202–4210
Russell BC, Torralba A, Murphy KP, Freeman WT (2008) LabelMe: a database and web-based tool for image annotation. Int J Comput Vis 77(1):157–173
Sankaran A, Jain A, Vashisth T, Vatsa M, Singh R (2017) Adaptive latent fingerprint segmentation using feature selection and random decision forest classification. Inf Fusion 34:1–15
Schreiber AT, Dubbeldam B, Wielemaker J, Wielinga B (2001) Ontology-based photo annotation. IEEE Intell Syst 3:66–74
Schroff F, Criminisi A, Zisserman A (2008) Object class segmentation using random forests. In: British machine vision conference, pp 1–10
Shafarenko L, Petrou M, Kittler J (1997) Automatic watershed segmentation of randomly textured color images. IEEE Trans Image Process 6(11):1530–1544
Shang C, Barnes D (2013) Fuzzy-rough feature selection aided support vector machines for mars image classification. Comput Vis Image Underst 117(3):202–213
Shang C, Barnes D, Shen Q (2011) Facilitating efficient mars terrain image classification with fuzzy-rough feature selection. Int J Hybrid Intell Syst 8(1):3–13
Shen L, Zhu Z, Jia S, Zhu J, Sun Y (2013) Discriminative Gabor feature selection for hyperspectral image classification. IEEE Geosci Remote Sens Lett 10(1):29–33
Shi C, Ruan Q, Guo S, Tian Y (2015) Sparse feature selection based on L 2, 1/2-matrix norm for web image annotation. Neurocomputing 151:424–433
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: IEEE conference on computer vision and pattern recognition, pp 1–9
Thomaz CE, Giraldi GA (2010) A new ranking method for principal components analysis and its application to face image analysis. Image Vis Comput 28(6):902–913
Tuia D, Camps-Valls G, Matasci G, Kanevski M (2010) Learning relevant image features with multiple-kernel classification. IEEE Trans Geosci Remote Sens 48(10):3780–3791
Vergara JR, Estévez PA (2014) A review of feature selection methods based on mutual information. Neural Comput Appl 24(1):175–186
Wang K, He R, Wang L, Wang W, Tan T (2016a) Joint feature selection and subspace learning for cross-modal retrieval. IEEE Trans Pattern Anal Mach Intell 38(10):2010–2023
Wang Z, Feng Y, Qi T, Yang X, Zhang JJ (2016b) Adaptive multi-view feature selection for human motion retrieval. Signal Process 120:691–701
Weinberger KQ, Saul LK (2006) Unsupervised learning of image manifolds by semidefinite programming. Int J Comput Vis 70(1):77–90
Wen X, Shao L, Fang W, Xue Y (2015) Efficient feature selection and classification for vehicle detection. IEEE Trans Circuits Syst Video Technol 25(3):508–517
Xue B, Zhang M, Browne W, Yao X (2016) A survey on evolutionary computation approaches to feature selection. IEEE Trans Evol Comput 20(4):606–626
Yao C, Han J, Nie F, Xiao F, Li X (2018) Local regression and global information-embedded dimension reduction. IEEE Trans Neural Netw Learn Syst 29(10):4882–4893
Zeng Z, Wang X, Chen Y (2017) Multimedia annotation via semi-supervised shared-subspace feature selection. J Vis Commun Image Represent 48:386–395
Zhang H, Fritts JE, Goldman SA (2008) Image segmentation evaluation: a survey of unsupervised methods. Comput Vis Image Underst 110(2):260–280
Zhang D, Islam MM, Lu G (2012a) A review on automatic image annotation techniques. Pattern Recognit 45(1):346–362
Zhang X, Wang W, Li Y, Jiao L (2012b) PSO-based automatic relevance determination and feature selection system for hyperspectral image classification. Electron Lett 48(20):1263–1265
Zhang R, Nie F, Li X (2018) Self-weighted supervised discriminative feature selection. IEEE Trans Neural Netw Learn Syst 29(8):3913–3918
Zhao W, Du S (2016) Spectral-spatial feature extraction for hyperspectral image classification: a dimension reduction and deep learning approach. IEEE Trans Geosci Remote Sens 54(8):4544–4554
Zhao ZA, Liu H (2011) Spectral feature selection for data mining. CRC Press, Boca Raton
Zheng W, Zhu X, Zhu Y, Zhang S (2018) Robust feature selection on incomplete data. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence, pp 3191–3197
Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A (2014) Learning deep features for scene recognition using places database. In: Advances in neural information processing systems, pp 487–495
Zhou X, Gao X, Wang J, Yu H, Wang Z, Chi Z (2017) Eye tracking data guided feature selection for image classification. Pattern Recognit 63:56–70
Zhu C, Jia H, Lu T, Tao L, Song J, Xiang G, Li Y, Xie X (2017) Adaptive feature selection based on local descriptor distinctive degree for vehicle retrieval application. In: IEEE international conference on consumer electronics, pp 66–69
Zou Q, Ni L, Zhang T, Wang Q (2015) Deep learning based feature selection for remote sensing scene classification. IEEE Geosci Remote Sens Lett 12(11):2321–2325
Acknowledgements
This research has been financially supported in part by European Union FEDER funds, by the Spanish Ministerio de Economía y Competitividad (research project TIN2015-65069-C2), by the Consellería de Industria of the Xunta de Galicia (research project GRC2014/035), and by the Principado de Asturias (research project IDI-2018-000176). Financial support from the Xunta de Galicia (Centro singular de investigación de Galicia accreditation 2016–2019) and the European Union (European Regional Development Fund—ERDF), is gratefully acknowledged (research project ED431G/01). We are particularly grateful to Brais Cancela and Amparo Alonso-Betanzos for our stimulating discussions and their comments on the manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bolón-Canedo, V., Remeseiro, B. Feature selection in image analysis: a survey. Artif Intell Rev 53, 2905–2931 (2020). https://doi.org/10.1007/s10462-019-09750-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-019-09750-3