Skip to main content
Log in

2D Object Recognition Techniques: State-of-the-Art Work

  • Original Paper
  • Published:
Archives of Computational Methods in Engineering Aims and scope Submit manuscript

Abstract

Object recognition is one of the research area in the field of computer vision and image processing because of its varied applications in surveillance and security systems, biometrics, intelligent vehicle system, content based image retrieval, etc. Many researchers have already done a lot of work in this area, but still there are many issues like scale, rotation, illumination, viewpoint, occlusion, background clutter among many more that draw the attention of the researchers. Object recognition is the task of recognizing the object and labeling the object in an image. The main goal of this survey is to present a comprehensive study in the field of 2D object recognition. An object is recognized by extracting the features of object like color of the object, texture of the object or shape or some other features. Then based on these features, objects are classified into various classes and each class is assigned a name. In this paper, various feature extraction techniques and classification algorithms are discussed which are required for object recognition. As the deep learning has made a tremendous improvement in object recognition process, so the paper also presents the recognition results achieved with various deep learning methods. The survey also includes the applications of object recognition system and various challenges faced while recognizing the object. Pros and cons of feature extraction and classification algorithms are also discussed which may help other researchers during their initial period of study. In this survey, the authors have also reported an analysis of various researches that describes the techniques used for object recognition with the accuracy achieved on particular image dataset. Finally, this paper ends with concluding notes and future directions. The aim of this study is to introduce the researchers about various techniques used for object recognition system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Arnow T, Bovik AC (2008) Foveated object recognition using corners. In: Proceedings of the IEEE southwest symposium on image analysis and interpretation, 53–56

  2. Atabay HA (2016) Binary shape classification using convolutional neural networks. IIOAB J 7(5):332–336

    Google Scholar 

  3. Bay H, Ess A, Tuytelaars T, Van-Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Understand 110(3):346–359

    Article  Google Scholar 

  4. Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509–522

    Article  Google Scholar 

  5. Bora DJ, Gupta AK (2014) A comparative study between fuzzy clustering algorithm and hard clustering algorithm. Int J Comput Trends Technol (IJCTT) 10(2):108–113

    Article  Google Scholar 

  6. Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: Proceedings of the IEEE 11th international conference on computer vision (ICCV), 1–8

  7. Bouza M, Cernuschi FB (2015) Common visual pattern recognition using hierarchical clustering methods with asymmetric networks. In: Argentine symposium on artificial intelligence (ASAI), vol 44, pp 192–199

  8. Chandhok C, Chaturvedi S, Khurshid AA (2012) An approach to image segmentation using K-means clustering algorithm. Int J Inf Technol (IJIT) 1(1):11–17

    Google Scholar 

  9. Chao WL, Changpinyo S, Gong B, Sha F (2016) An empirical study and analysis of generalized zero-shot learning for object recognition in the wild. In: European conference on computer vision, 52–68

  10. Charaniya NA, Rathod MY (2016) Object recognition using image segmentation. Int J Electr Electr Eng (IJEEE) 8(1):12–18

    Google Scholar 

  11. Cheong S, Oh SH, Lee SY (2004) Support vector machines with binary tree architecture for multi-class classification. Neural Inf Processing-Lett Rev 2(3):47–51

    Google Scholar 

  12. Chum O, Zisserman A (2007) An exemplar model for learning object classes. In: Proceedings of IEEE conference on computer vision and pattern recognition, 1–8

  13. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the international conference on computer vision and pattern recognition (CVPR), vol 1, pp 886–893

  14. Dalal N, Triggs B (2006) Object detection using histograms of oriented gradients. In: Proceedings of the Pascal VOC Workshop, ECCV, 1–5

  15. Dell’Agnello D, Carneiro G, Chin TJ, Castellano G, Fanelli AM (2013) Fuzzy clustering based encoding for visual object classification. In: Proceedings of the In Joint IFSA world congress and NAFIPS annual meeting (IFSA/NAFIPS), 1439–1444

  16. Diplaros A, Gevers T, Patras I (2016) Combining color and shape information for illumination-viewpoint invariant object recognition. IEEE Trans Image Process 15(1):1–11

    Article  Google Scholar 

  17. Duanmu X (2010) Image retrieval using color moment invariant. In: Proceedings of the seventh international conference on information technology, 200–203

  18. Eitel A, Springenberg JT, Spinello L, Riedmiller M, Burgard W (2015) Multimodal deep learning for robust RGB-D object recognition. In: International conference on intelligent robots and systems (IROS), 681–687

  19. Elazary L, Itti L (2010) A Bayesian model for efficient visual search and recognition. Vis Res 50(14):1338–1352

    Article  Google Scholar 

  20. Fergus R, Perona P, Zisserman A (2003) Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, 1–8

  21. Frias-Martinez E, Sanchez A, Velez J (2006) Support vector machines versus multi-layer perceptrons for efficient off-line signature recognition. Eng Appl Artif Intell 19(6):693–704

    Article  Google Scholar 

  22. Hatori T, Sato-Ilic M (2014) A Fuzzy clustering method using the relative structure of the belongingness of objects to clusters. Proc Comput Sci 35:994–1002

    Article  Google Scholar 

  23. Helmer S, Lowe DG (2004) Object class recognition with many local features. In: Proceedings of the international conference on computer vision and pattern recognition, 187–187

  24. Huang J, Kumar SK, Mitra M, Zhu W, Zabih R (1997) Image indexing using color correlograms. In: Proceedings of the international conference on computer vision and pattern recognition, 762–768

  25. Huang FJ, LeCun Y (2006) Large-scale learning with SVM and convolutional for generic object categorization. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit 1:284–291

    Google Scholar 

  26. Ijjina EP, Mohan CK (2014) View and illumination invariant object classification based on 3D color histogram using convolutional neural networks. In: Proceedings of the Asian conference on computer vision, 316–327

  27. Jurie F, Triggs B (2005) Creating efficient codebooks for visual recognition. Proc Tenth IEEE Int Conf Comput Vis (ICCV) 1:604–610

    Article  Google Scholar 

  28. Khotanzad A, Lu JH (1990) Classification of invariant image representations using a neural network. IEEE Trans Acoust Speech Signal Process 38(6):1028–1038

    Article  Google Scholar 

  29. Kim J, Kim BS, Savarese S (2012) Comparing image classification methods: K-nearest-neighbor and support-vector-machines

  30. Kitti T, Jaruwan T, Chaiyapon T (2012) An object recognition and identification system using the harris corner detection method. Int J Mach Learn Comput 2(4):462–465

    Article  Google Scholar 

  31. Kumar I, Rawat J, Bhadauria HS (2014) A conventional study of edge detection technique in digital image processing. Int J Comput Sci Mob Comput 3(4):328–334

    Google Scholar 

  32. Leibe B, Mikolajczyk K, Schiele B (2006) Efficient clustering and matching for object class recognition. In: Proceedings of the British machine vision conference, 789–798

  33. Lepetit V, Fua P (2006) Keypoint recognition using randomized trees. IEEE Trans Pattern Anal Mach Intell 28(9):1465–1479

    Article  Google Scholar 

  34. Lowe D (1999) Object recognition from local scale invariant features. In: Proceedings of the international conference on computer vision, 1–8

  35. Madzarov G, Gjorgjevikj D, Chorbev I (2009) A multi-class SVM classifier utilizing binary decision tree. Informatica 33(2):233–241

    MathSciNet  MATH  Google Scholar 

  36. Madzarov G, Gjorgjevikj D (2010) Evaluation of distance measures for multi-class classification in binary SVM decision tree. In: Proceedings of the international conference on artificial intelligence and soft computing (ICAISC), 437–444

  37. McCann S, Lowe DG (2012) Local naive Bayes nearest neighbor for image classification. In: Proceedings of the international conference on computer vision and pattern recognition (CVPR), 3650–3656

  38. Mele K, Maver J (2003) Object recognition using hierarchical SVMs. In: Proceedings of the workshop on computer vision, 109–114

  39. Miller D, Nicholson L, Dayoub F, Sünderhauf N (2017) Dropout sampling for robust object detection in open-set conditions. In: Proceedings of the IEEE international conference on robotics and automation. Arxiv preprint arXiv:1710.06677

  40. Muralidharan R (2014) Object recognition using K-nearest neighbor supported by eigen value generated from the features of an image. Int J Innov Res Comput Commun Eng (IJIRCCE) 2(8):5521–5528

    Google Scholar 

  41. Namratha M, Prajwala TR (2012) A comprehensive overview of clustering algorithms in pattern recognition. IOSR J Comput Eng (IOSRJCE) 4(6):23–30

    Article  Google Scholar 

  42. Prajapati N, Nandanwar AK, Prajapati GS (2016) Edge histogram descriptor, geometric moment and sobel edge detector combined features based object recognition and retrieval system. Int J Comput Sci Inf Technol (IJCSIT) 7(1):407–412

    Google Scholar 

  43. Qiu G, Sudirman S (2002) A binary color vision framework for content-based image indexing. In: Proceedings of the international conference on advances in visual information systems, 50–60

  44. Ramadevi Y, Sridevi T, Poornima B, Kalyani B (2010) Segmentation and object recognition using edge detection techniques. Int J Comput Sci Inf Technol (IJCSIT) 2(6):153–161

    Google Scholar 

  45. Rastegari M, Ordonez V, Redmon J, Farhadi A (2016) Xnor-net: imagenet classification using binary convolutional neural networks. In: European conference on computer vision, 525–542

  46. Rosten E, Drummond T (2006) Machine learning for high-speed corner detection. In: Proceedings of the European conference on computer vision, 430–443

  47. Roy K, Mukherjee J (2013) Image similarity measure using color histogram, color coherence vector, and sobel method. Int J Sci Res (IJSR) 2(1):538–543

    Google Scholar 

  48. Saraswat M, Goswami AK, Tiwari A (2013) Object recognition using texture based analysis. Int J Comput Sci Inf Technol (IJCSIT) 4(6):775–782

    Google Scholar 

  49. Shivakanth AM (2014) Object recognition using SIFT. Int J Innov Sci Eng Technol (IJISET) 1(4):378–381

    Google Scholar 

  50. Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their location in images. In: Proceedings of the 10th international conference on computer vision, vol 1, pp 370–377

  51. Stricker M, Orengo M (1995) Similarity of color images. In: Proceedings of the SPIE storage and retrieval for still image and video databases III, 381–392

  52. Sun S, Chen Z, Chia T (2002) Invariant feature extraction and object shape matching using gabor filtering. Recent Advances in Visual Information Systems

  53. Swain MJ, Ballard DH (1991) Color Indexing. Int J Comput Vis 7(1):11–32

    Article  Google Scholar 

  54. Syahputra H, Harjoko A, Wardoyo R, Pulungan R (2014) Plant recognition using stereo leaf image using gray-level co-occurrence matrix. J Comput Sci 10(4):697–704

    Article  Google Scholar 

  55. Tao J, Hung Y (2002) A Bayesian method for content-based image retrieval by use of relevance feedback. In: Proceedings of the international conference on advances in visual information systems, 76–87

  56. Thureson J, Carlsson S (2004) Appearance based qualitative image description for object class recognition. In: Proceedings of the European conference on computer vision, 518–529

  57. Uijlings JRR, Van de Sande KEA, Gevers T (2013) Selective search for object recognition. Int J Comput Vis (IJCA) 104:1–14

    Article  Google Scholar 

  58. Wu W, Wang MJ (1999) Two-dimensional object recognition through two-stage string matching. IEEE Trans Image Process 8(7):978–981

    Article  Google Scholar 

  59. Vashaee A, Jafari R, Ziou D, Rashidi MM (2016) Rotation invariant HOG for object localization in web images. Signal Process 125:304–314

    Article  Google Scholar 

  60. Vondrick C, Khosla A, Pirsiavash H, Malisiewicz T, Torralba A (2016) Visualizing object detection features. Int J Comput Vis 119(2):145–158

    Article  MathSciNet  Google Scholar 

  61. Wei H, Chengzhuan Y, Yu Q (2017) Contour segment grouping for object detection. J Vis Commun Image Represent 48:292–309

    Article  Google Scholar 

  62. Wu X, Bhanu B (1997) Gabor wavelet representation for 3-D object recognition. IEEE Trans Image Process 6(1):47–64

    Article  Google Scholar 

  63. Yakhnenko O, Honavar V (2009) Multi-modal hierarchical Dirichlet process model for predicting image annotation and image-object label correspondence. In: Proceedings of the SIAM international conference on data mining, 281–294

  64. Yu H, Li M, Zhang H, Feng J (2002) Color texture moments for content-based image retrieval. In: Proceedings of the IEEE international conference on image processing, 929–932

  65. Zaboli H, Rahmati M, Mirzaei A (2008) Shape recognition by clustering and matching of skeletons. J Comput 3(5):24–33

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Munish Kumar.

Ethics declarations

We have surveyed on various object recognition techniques. Papers surveyed are experimented on standard datasets like Caltech-101, COIL-100, Pascal VOC 2007, etc.

Conflict of interest

Authors have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bansal, M., Kumar, M. & Kumar, M. 2D Object Recognition Techniques: State-of-the-Art Work. Arch Computat Methods Eng 28, 1147–1161 (2021). https://doi.org/10.1007/s11831-020-09409-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11831-020-09409-1

Navigation