Abstract
Object recognition is one of the research area in the field of computer vision and image processing because of its varied applications in surveillance and security systems, biometrics, intelligent vehicle system, content based image retrieval, etc. Many researchers have already done a lot of work in this area, but still there are many issues like scale, rotation, illumination, viewpoint, occlusion, background clutter among many more that draw the attention of the researchers. Object recognition is the task of recognizing the object and labeling the object in an image. The main goal of this survey is to present a comprehensive study in the field of 2D object recognition. An object is recognized by extracting the features of object like color of the object, texture of the object or shape or some other features. Then based on these features, objects are classified into various classes and each class is assigned a name. In this paper, various feature extraction techniques and classification algorithms are discussed which are required for object recognition. As the deep learning has made a tremendous improvement in object recognition process, so the paper also presents the recognition results achieved with various deep learning methods. The survey also includes the applications of object recognition system and various challenges faced while recognizing the object. Pros and cons of feature extraction and classification algorithms are also discussed which may help other researchers during their initial period of study. In this survey, the authors have also reported an analysis of various researches that describes the techniques used for object recognition with the accuracy achieved on particular image dataset. Finally, this paper ends with concluding notes and future directions. The aim of this study is to introduce the researchers about various techniques used for object recognition system.
Similar content being viewed by others
References
Arnow T, Bovik AC (2008) Foveated object recognition using corners. In: Proceedings of the IEEE southwest symposium on image analysis and interpretation, 53–56
Atabay HA (2016) Binary shape classification using convolutional neural networks. IIOAB J 7(5):332–336
Bay H, Ess A, Tuytelaars T, Van-Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Understand 110(3):346–359
Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509–522
Bora DJ, Gupta AK (2014) A comparative study between fuzzy clustering algorithm and hard clustering algorithm. Int J Comput Trends Technol (IJCTT) 10(2):108–113
Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: Proceedings of the IEEE 11th international conference on computer vision (ICCV), 1–8
Bouza M, Cernuschi FB (2015) Common visual pattern recognition using hierarchical clustering methods with asymmetric networks. In: Argentine symposium on artificial intelligence (ASAI), vol 44, pp 192–199
Chandhok C, Chaturvedi S, Khurshid AA (2012) An approach to image segmentation using K-means clustering algorithm. Int J Inf Technol (IJIT) 1(1):11–17
Chao WL, Changpinyo S, Gong B, Sha F (2016) An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. In: European conference on computer vision, 52–68
Charaniya NA, Rathod MY (2016) Object recognition using image segmentation. Int J Electr Electr Eng (IJEEE) 8(1):12–18
Cheong S, Oh SH, Lee SY (2004) Support vector machines with binary tree architecture for multi-class classification. Neural Inf Processing-Lett Rev 2(3):47–51
Chum O, Zisserman A (2007) An exemplar model for learning object classes. In: Proceedings of IEEE conference on computer vision and pattern recognition, 1–8
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the international conference on computer vision and pattern recognition (CVPR), vol 1, pp 886–893
Dalal N, Triggs B (2006) Object detection using histograms of oriented gradients. In: Proceedings of the Pascal VOC Workshop, ECCV, 1–5
Dell’Agnello D, Carneiro G, Chin TJ, Castellano G, Fanelli AM (2013) Fuzzy clustering based encoding for visual object classification. In: Proceedings of the In Joint IFSA world congress and NAFIPS annual meeting (IFSA/NAFIPS), 1439–1444
Diplaros A, Gevers T, Patras I (2016) Combining color and shape information for illumination-viewpoint invariant object recognition. IEEE Trans Image Process 15(1):1–11
Duanmu X (2010) Image retrieval using color moment invariant. In: Proceedings of the seventh international conference on information technology, 200–203
Eitel A, Springenberg JT, Spinello L, Riedmiller M, Burgard W (2015) Multimodal deep learning for robust RGB-D object recognition. In: International conference on intelligent robots and systems (IROS), 681–687
Elazary L, Itti L (2010) A Bayesian model for efficient visual search and recognition. Vis Res 50(14):1338–1352
Fergus R, Perona P, Zisserman A (2003) Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 1–8
Frias-Martinez E, Sanchez A, Velez J (2006) Support vector machines versus multi-layer perceptrons for efficient off-line signature recognition. Eng Appl Artif Intell 19(6):693–704
Hatori T, Sato-Ilic M (2014) A Fuzzy clustering method using the relative structure of the belongingness of objects to clusters. Proc Comput Sci 35:994–1002
Helmer S, Lowe DG (2004) Object class recognition with many local features. In: Proceedings of the international conference on computer vision and pattern recognition, 187–187
Huang J, Kumar SK, Mitra M, Zhu W, Zabih R (1997) Image indexing using color correlograms. In: Proceedings of the international conference on computer vision and pattern recognition, 762–768
Huang FJ, LeCun Y (2006) Large-scale learning with SVM and convolutional for generic object categorization. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit 1:284–291
Ijjina EP, Mohan CK (2014) View and illumination invariant object classification based on 3D color histogram using convolutional neural networks. In: Proceedings of the Asian conference on computer vision, 316–327
Jurie F, Triggs B (2005) Creating efficient codebooks for visual recognition. Proc Tenth IEEE Int Conf Comput Vis (ICCV) 1:604–610
Khotanzad A, Lu JH (1990) Classification of invariant image representations using a neural network. IEEE Trans Acoust Speech Signal Process 38(6):1028–1038
Kim J, Kim BS, Savarese S (2012) Comparing image classification methods: K-nearest-neighbor and support-vector-machines
Kitti T, Jaruwan T, Chaiyapon T (2012) An object recognition and identification system using the harris corner detection method. Int J Mach Learn Comput 2(4):462–465
Kumar I, Rawat J, Bhadauria HS (2014) A conventional study of edge detection technique in digital image processing. Int J Comput Sci Mob Comput 3(4):328–334
Leibe B, Mikolajczyk K, Schiele B (2006) Efficient clustering and matching for object class recognition. In: Proceedings of the British machine vision conference, 789–798
Lepetit V, Fua P (2006) Keypoint recognition using randomized trees. IEEE Trans Pattern Anal Mach Intell 28(9):1465–1479
Lowe D (1999) Object recognition from local scale invariant features. In: Proceedings of the international conference on computer vision, 1–8
Madzarov G, Gjorgjevikj D, Chorbev I (2009) A multi-class SVM classifier utilizing binary decision tree. Informatica 33(2):233–241
Madzarov G, Gjorgjevikj D (2010) Evaluation of distance measures for multi-class classification in binary SVM decision tree. In: Proceedings of the international conference on artificial intelligence and soft computing (ICAISC), 437–444
McCann S, Lowe DG (2012) Local naive Bayes nearest neighbor for image classification. In: Proceedings of the international conference on computer vision and pattern recognition (CVPR), 3650–3656
Mele K, Maver J (2003) Object recognition using hierarchical SVMs. In: Proceedings of the workshop on computer vision, 109–114
Miller D, Nicholson L, Dayoub F, Sünderhauf N (2017) Dropout sampling for robust object detection in open-set conditions. In: Proceedings of the IEEE international conference on robotics and automation. Arxiv preprint arXiv:1710.06677
Muralidharan R (2014) Object recognition using K-nearest neighbor supported by eigen value generated from the features of an image. Int J Innov Res Comput Commun Eng (IJIRCCE) 2(8):5521–5528
Namratha M, Prajwala TR (2012) A comprehensive overview of clustering algorithms in pattern recognition. IOSR J Comput Eng (IOSRJCE) 4(6):23–30
Prajapati N, Nandanwar AK, Prajapati GS (2016) Edge histogram descriptor, geometric moment and sobel edge detector combined features based object recognition and retrieval system. Int J Comput Sci Inf Technol (IJCSIT) 7(1):407–412
Qiu G, Sudirman S (2002) A binary color vision framework for content-based image indexing. In: Proceedings of the international conference on advances in visual information systems, 50–60
Ramadevi Y, Sridevi T, Poornima B, Kalyani B (2010) Segmentation and object recognition using edge detection techniques. Int J Comput Sci Inf Technol (IJCSIT) 2(6):153–161
Rastegari M, Ordonez V, Redmon J, Farhadi A (2016) Xnor-net: imagenet classification using binary convolutional neural networks. In: European conference on computer vision, 525–542
Rosten E, Drummond T (2006) Machine learning for high-speed corner detection. In: Proceedings of the European conference on computer vision, 430–443
Roy K, Mukherjee J (2013) Image similarity measure using color histogram, color coherence vector, and sobel method. Int J Sci Res (IJSR) 2(1):538–543
Saraswat M, Goswami AK, Tiwari A (2013) Object recognition using texture based analysis. Int J Comput Sci Inf Technol (IJCSIT) 4(6):775–782
Shivakanth AM (2014) Object recognition using SIFT. Int J Innov Sci Eng Technol (IJISET) 1(4):378–381
Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their location in images. In: Proceedings of the 10th international conference on computer vision, vol 1, pp 370–377
Stricker M, Orengo M (1995) Similarity of color images. In: Proceedings of the SPIE storage and retrieval for still image and video databases III, 381–392
Sun S, Chen Z, Chia T (2002) Invariant feature extraction and object shape matching using gabor filtering. Recent Advances in Visual Information Systems
Swain MJ, Ballard DH (1991) Color Indexing. Int J Comput Vis 7(1):11–32
Syahputra H, Harjoko A, Wardoyo R, Pulungan R (2014) Plant recognition using stereo leaf image using gray-level co-occurrence matrix. J Comput Sci 10(4):697–704
Tao J, Hung Y (2002) A Bayesian method for content-based image retrieval by use of relevance feedback. In: Proceedings of the international conference on advances in visual information systems, 76–87
Thureson J, Carlsson S (2004) Appearance based qualitative image description for object class recognition. In: Proceedings of the European conference on computer vision, 518–529
Uijlings JRR, Van de Sande KEA, Gevers T (2013) Selective search for object recognition. Int J Comput Vis (IJCA) 104:1–14
Wu W, Wang MJ (1999) Two-dimensional object recognition through two-stage string matching. IEEE Trans Image Process 8(7):978–981
Vashaee A, Jafari R, Ziou D, Rashidi MM (2016) Rotation invariant HOG for object localization in web images. Signal Process 125:304–314
Vondrick C, Khosla A, Pirsiavash H, Malisiewicz T, Torralba A (2016) Visualizing object detection features. Int J Comput Vis 119(2):145–158
Wei H, Chengzhuan Y, Yu Q (2017) Contour segment grouping for object detection. J Vis Commun Image Represent 48:292–309
Wu X, Bhanu B (1997) Gabor wavelet representation for 3-D object recognition. IEEE Trans Image Process 6(1):47–64
Yakhnenko O, Honavar V (2009) Multi-modal hierarchical Dirichlet process model for predicting image annotation and image-object label correspondence. In: Proceedings of the SIAM international conference on data mining, 281–294
Yu H, Li M, Zhang H, Feng J (2002) Color texture moments for content-based image retrieval. In: Proceedings of the IEEE international conference on image processing, 929–932
Zaboli H, Rahmati M, Mirzaei A (2008) Shape recognition by clustering and matching of skeletons. J Comput 3(5):24–33
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
We have surveyed on various object recognition techniques. Papers surveyed are experimented on standard datasets like Caltech-101, COIL-100, Pascal VOC 2007, etc.
Conflict of interest
Authors have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bansal, M., Kumar, M. & Kumar, M. 2D Object Recognition Techniques: State-of-the-Art Work. Arch Computat Methods Eng 28, 1147–1161 (2021). https://doi.org/10.1007/s11831-020-09409-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11831-020-09409-1