2D Object Recognition Techniques: State-of-the-Art Work

Bansal, Monika; Kumar, Munish; Kumar, Manish

doi:10.1007/s11831-020-09409-1

2D Object Recognition Techniques: State-of-the-Art Work

Original Paper
Published: 18 February 2020

Volume 28, pages 1147–1161, (2021)
Cite this article

Archives of Computational Methods in Engineering Aims and scope Submit manuscript

Monika Bansal¹,
Munish Kumar² &
Manish Kumar³

2571 Accesses
44 Citations
3 Altmetric
Explore all metrics

Abstract

Object recognition is one of the research area in the field of computer vision and image processing because of its varied applications in surveillance and security systems, biometrics, intelligent vehicle system, content based image retrieval, etc. Many researchers have already done a lot of work in this area, but still there are many issues like scale, rotation, illumination, viewpoint, occlusion, background clutter among many more that draw the attention of the researchers. Object recognition is the task of recognizing the object and labeling the object in an image. The main goal of this survey is to present a comprehensive study in the field of 2D object recognition. An object is recognized by extracting the features of object like color of the object, texture of the object or shape or some other features. Then based on these features, objects are classified into various classes and each class is assigned a name. In this paper, various feature extraction techniques and classification algorithms are discussed which are required for object recognition. As the deep learning has made a tremendous improvement in object recognition process, so the paper also presents the recognition results achieved with various deep learning methods. The survey also includes the applications of object recognition system and various challenges faced while recognizing the object. Pros and cons of feature extraction and classification algorithms are also discussed which may help other researchers during their initial period of study. In this survey, the authors have also reported an analysis of various researches that describes the techniques used for object recognition with the accuracy achieved on particular image dataset. Finally, this paper ends with concluding notes and future directions. The aim of this study is to introduce the researchers about various techniques used for object recognition system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Object detection using YOLO: challenges, architectural successors, datasets and applications

Article 08 August 2022

SSD: Single Shot MultiBox Detector

CBAM: Convolutional Block Attention Module

References

Arnow T, Bovik AC (2008) Foveated object recognition using corners. In: Proceedings of the IEEE southwest symposium on image analysis and interpretation, 53–56
Atabay HA (2016) Binary shape classification using convolutional neural networks. IIOAB J 7(5):332–336
Google Scholar
Bay H, Ess A, Tuytelaars T, Van-Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Understand 110(3):346–359
Article Google Scholar
Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509–522
Article Google Scholar
Bora DJ, Gupta AK (2014) A comparative study between fuzzy clustering algorithm and hard clustering algorithm. Int J Comput Trends Technol (IJCTT) 10(2):108–113
Article Google Scholar
Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: Proceedings of the IEEE 11th international conference on computer vision (ICCV), 1–8
Bouza M, Cernuschi FB (2015) Common visual pattern recognition using hierarchical clustering methods with asymmetric networks. In: Argentine symposium on artificial intelligence (ASAI), vol 44, pp 192–199
Chandhok C, Chaturvedi S, Khurshid AA (2012) An approach to image segmentation using K-means clustering algorithm. Int J Inf Technol (IJIT) 1(1):11–17
Google Scholar
Chao WL, Changpinyo S, Gong B, Sha F (2016) An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. In: European conference on computer vision, 52–68
Charaniya NA, Rathod MY (2016) Object recognition using image segmentation. Int J Electr Electr Eng (IJEEE) 8(1):12–18
Google Scholar
Cheong S, Oh SH, Lee SY (2004) Support vector machines with binary tree architecture for multi-class classification. Neural Inf Processing-Lett Rev 2(3):47–51
Google Scholar
Chum O, Zisserman A (2007) An exemplar model for learning object classes. In: Proceedings of IEEE conference on computer vision and pattern recognition, 1–8
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the international conference on computer vision and pattern recognition (CVPR), vol 1, pp 886–893
Dalal N, Triggs B (2006) Object detection using histograms of oriented gradients. In: Proceedings of the Pascal VOC Workshop, ECCV, 1–5
Dell’Agnello D, Carneiro G, Chin TJ, Castellano G, Fanelli AM (2013) Fuzzy clustering based encoding for visual object classification. In: Proceedings of the In Joint IFSA world congress and NAFIPS annual meeting (IFSA/NAFIPS), 1439–1444
Diplaros A, Gevers T, Patras I (2016) Combining color and shape information for illumination-viewpoint invariant object recognition. IEEE Trans Image Process 15(1):1–11
Article Google Scholar
Duanmu X (2010) Image retrieval using color moment invariant. In: Proceedings of the seventh international conference on information technology, 200–203
Eitel A, Springenberg JT, Spinello L, Riedmiller M, Burgard W (2015) Multimodal deep learning for robust RGB-D object recognition. In: International conference on intelligent robots and systems (IROS), 681–687
Elazary L, Itti L (2010) A Bayesian model for efficient visual search and recognition. Vis Res 50(14):1338–1352
Article Google Scholar
Fergus R, Perona P, Zisserman A (2003) Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 1–8
Frias-Martinez E, Sanchez A, Velez J (2006) Support vector machines versus multi-layer perceptrons for efficient off-line signature recognition. Eng Appl Artif Intell 19(6):693–704
Article Google Scholar
Hatori T, Sato-Ilic M (2014) A Fuzzy clustering method using the relative structure of the belongingness of objects to clusters. Proc Comput Sci 35:994–1002
Article Google Scholar
Helmer S, Lowe DG (2004) Object class recognition with many local features. In: Proceedings of the international conference on computer vision and pattern recognition, 187–187
Huang J, Kumar SK, Mitra M, Zhu W, Zabih R (1997) Image indexing using color correlograms. In: Proceedings of the international conference on computer vision and pattern recognition, 762–768
Huang FJ, LeCun Y (2006) Large-scale learning with SVM and convolutional for generic object categorization. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit 1:284–291
Google Scholar
Ijjina EP, Mohan CK (2014) View and illumination invariant object classification based on 3D color histogram using convolutional neural networks. In: Proceedings of the Asian conference on computer vision, 316–327
Jurie F, Triggs B (2005) Creating efficient codebooks for visual recognition. Proc Tenth IEEE Int Conf Comput Vis (ICCV) 1:604–610
Article Google Scholar
Khotanzad A, Lu JH (1990) Classification of invariant image representations using a neural network. IEEE Trans Acoust Speech Signal Process 38(6):1028–1038
Article Google Scholar
Kim J, Kim BS, Savarese S (2012) Comparing image classification methods: K-nearest-neighbor and support-vector-machines
Kitti T, Jaruwan T, Chaiyapon T (2012) An object recognition and identification system using the harris corner detection method. Int J Mach Learn Comput 2(4):462–465
Article Google Scholar
Kumar I, Rawat J, Bhadauria HS (2014) A conventional study of edge detection technique in digital image processing. Int J Comput Sci Mob Comput 3(4):328–334
Google Scholar
Leibe B, Mikolajczyk K, Schiele B (2006) Efficient clustering and matching for object class recognition. In: Proceedings of the British machine vision conference, 789–798
Lepetit V, Fua P (2006) Keypoint recognition using randomized trees. IEEE Trans Pattern Anal Mach Intell 28(9):1465–1479
Article Google Scholar
Lowe D (1999) Object recognition from local scale invariant features. In: Proceedings of the international conference on computer vision, 1–8
Madzarov G, Gjorgjevikj D, Chorbev I (2009) A multi-class SVM classifier utilizing binary decision tree. Informatica 33(2):233–241
MathSciNet MATH Google Scholar
Madzarov G, Gjorgjevikj D (2010) Evaluation of distance measures for multi-class classification in binary SVM decision tree. In: Proceedings of the international conference on artificial intelligence and soft computing (ICAISC), 437–444
McCann S, Lowe DG (2012) Local naive Bayes nearest neighbor for image classification. In: Proceedings of the international conference on computer vision and pattern recognition (CVPR), 3650–3656
Mele K, Maver J (2003) Object recognition using hierarchical SVMs. In: Proceedings of the workshop on computer vision, 109–114
Miller D, Nicholson L, Dayoub F, Sünderhauf N (2017) Dropout sampling for robust object detection in open-set conditions. In: Proceedings of the IEEE international conference on robotics and automation. Arxiv preprint arXiv:1710.06677
Muralidharan R (2014) Object recognition using K-nearest neighbor supported by eigen value generated from the features of an image. Int J Innov Res Comput Commun Eng (IJIRCCE) 2(8):5521–5528
Google Scholar
Namratha M, Prajwala TR (2012) A comprehensive overview of clustering algorithms in pattern recognition. IOSR J Comput Eng (IOSRJCE) 4(6):23–30
Article Google Scholar
Prajapati N, Nandanwar AK, Prajapati GS (2016) Edge histogram descriptor, geometric moment and sobel edge detector combined features based object recognition and retrieval system. Int J Comput Sci Inf Technol (IJCSIT) 7(1):407–412
Google Scholar
Qiu G, Sudirman S (2002) A binary color vision framework for content-based image indexing. In: Proceedings of the international conference on advances in visual information systems, 50–60
Ramadevi Y, Sridevi T, Poornima B, Kalyani B (2010) Segmentation and object recognition using edge detection techniques. Int J Comput Sci Inf Technol (IJCSIT) 2(6):153–161
Google Scholar
Rastegari M, Ordonez V, Redmon J, Farhadi A (2016) Xnor-net: imagenet classification using binary convolutional neural networks. In: European conference on computer vision, 525–542
Rosten E, Drummond T (2006) Machine learning for high-speed corner detection. In: Proceedings of the European conference on computer vision, 430–443
Roy K, Mukherjee J (2013) Image similarity measure using color histogram, color coherence vector, and sobel method. Int J Sci Res (IJSR) 2(1):538–543
Google Scholar
Saraswat M, Goswami AK, Tiwari A (2013) Object recognition using texture based analysis. Int J Comput Sci Inf Technol (IJCSIT) 4(6):775–782
Google Scholar
Shivakanth AM (2014) Object recognition using SIFT. Int J Innov Sci Eng Technol (IJISET) 1(4):378–381
Google Scholar
Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their location in images. In: Proceedings of the 10th international conference on computer vision, vol 1, pp 370–377
Stricker M, Orengo M (1995) Similarity of color images. In: Proceedings of the SPIE storage and retrieval for still image and video databases III, 381–392
Sun S, Chen Z, Chia T (2002) Invariant feature extraction and object shape matching using gabor filtering. Recent Advances in Visual Information Systems
Swain MJ, Ballard DH (1991) Color Indexing. Int J Comput Vis 7(1):11–32
Article Google Scholar
Syahputra H, Harjoko A, Wardoyo R, Pulungan R (2014) Plant recognition using stereo leaf image using gray-level co-occurrence matrix. J Comput Sci 10(4):697–704
Article Google Scholar
Tao J, Hung Y (2002) A Bayesian method for content-based image retrieval by use of relevance feedback. In: Proceedings of the international conference on advances in visual information systems, 76–87
Thureson J, Carlsson S (2004) Appearance based qualitative image description for object class recognition. In: Proceedings of the European conference on computer vision, 518–529
Uijlings JRR, Van de Sande KEA, Gevers T (2013) Selective search for object recognition. Int J Comput Vis (IJCA) 104:1–14
Article Google Scholar
Wu W, Wang MJ (1999) Two-dimensional object recognition through two-stage string matching. IEEE Trans Image Process 8(7):978–981
Article Google Scholar
Vashaee A, Jafari R, Ziou D, Rashidi MM (2016) Rotation invariant HOG for object localization in web images. Signal Process 125:304–314
Article Google Scholar
Vondrick C, Khosla A, Pirsiavash H, Malisiewicz T, Torralba A (2016) Visualizing object detection features. Int J Comput Vis 119(2):145–158
Article MathSciNet Google Scholar
Wei H, Chengzhuan Y, Yu Q (2017) Contour segment grouping for object detection. J Vis Commun Image Represent 48:292–309
Article Google Scholar
Wu X, Bhanu B (1997) Gabor wavelet representation for 3-D object recognition. IEEE Trans Image Process 6(1):47–64
Article Google Scholar
Yakhnenko O, Honavar V (2009) Multi-modal hierarchical Dirichlet process model for predicting image annotation and image-object label correspondence. In: Proceedings of the SIAM international conference on data mining, 281–294
Yu H, Li M, Zhang H, Feng J (2002) Color texture moments for content-based image retrieval. In: Proceedings of the IEEE international conference on image processing, 929–932
Zaboli H, Rahmati M, Mirzaei A (2008) Shape recognition by clustering and matching of skeletons. J Comput 3(5):24–33
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Punjabi University, Patiala, India
Monika Bansal
Department of Computational Sciences, Maharaja Ranjit Singh Punjab Technical University, Bathinda, Punjab, India
Munish Kumar
Department of Computer Science, Baba Farid College, Bathinda, Punjab, India
Manish Kumar

Authors

Monika Bansal
View author publications
You can also search for this author in PubMed Google Scholar
Munish Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Manish Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Munish Kumar.

Ethics declarations

We have surveyed on various object recognition techniques. Papers surveyed are experimented on standard datasets like Caltech-101, COIL-100, Pascal VOC 2007, etc.

Conflict of interest

Authors have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bansal, M., Kumar, M. & Kumar, M. 2D Object Recognition Techniques: State-of-the-Art Work. Arch Computat Methods Eng 28, 1147–1161 (2021). https://doi.org/10.1007/s11831-020-09409-1

Download citation

Received: 26 July 2019
Accepted: 10 February 2020
Published: 18 February 2020
Issue Date: May 2021
DOI: https://doi.org/10.1007/s11831-020-09409-1

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

2D Object Recognition Techniques: State-of-the-Art Work

Abstract

Access this article

Similar content being viewed by others

Object detection using YOLO: challenges, architectural successors, datasets and applications

SSD: Single Shot MultiBox Detector

CBAM: Convolutional Block Attention Module

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Navigation

2D Object Recognition Techniques: State-of-the-Art Work

Abstract

Access this article

Similar content being viewed by others

Object detection using YOLO: challenges, architectural successors, datasets and applications

SSD: Single Shot MultiBox Detector

CBAM: Convolutional Block Attention Module

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation