Pose Invariant Generic Object Recognition with Orthogonal Axis Manifolds in Linear Subspace

  • Manisha Kalra
  • P. Deepti
  • R. Abhilash
  • Sukhendu Das
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4338)


This paper addresses the problem of pose invariant Generic Object Recognition by modeling the perceptual capability of human beings. We propose a novel framework using a combination of appearance and shape cues to recognize the object class and viewpoint (axis of rotation) as well as determine its pose (angle of view). The appearance model of the object from multiple viewpoints is captured using Linear Subspace Analysis techniques and is used to reduce the search space to a few rank-ordered candidates. We have used a decision-fusion based combination of 2D PCA and ICA to integrate the complementary information of classifiers and improve recognition accuracy. For matching based on shape features, we propose the use of distance transform based correlation. A decision fusion using ‘Sum Rule’ of 2D PCA and ICA subspace classifiers, and distance transform based correlation is then used to verify the correct object class and determine its viewpoint and pose. Experiments were conducted on COIL-100 and IGOIL (IITM Generic Object Image Library) databases which contain objects with complex appearance and shape characteristics. IGOIL database was captured to analyze the appearance manifolds along two orthogonal axes of rotation.


Object Recognition Recognition Rate Recognition Accuracy Appearance Model Orthogonal Axis 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Murase, H., Nayar, S.: Visual Learning and recognition of 3-D Objects from Appearance. International Journal of Computer Vision 14, 5–24 (1995)CrossRefGoogle Scholar
  2. 2.
    Nagabhushan, P., Guru, D., Shekar, B.: Visual Learning and recognition of 3-D Objects from Appearance using two-dimensional principal components analysis: A robust and an efficient approach. Pattern Recognition 39, 721–725 (2006)MATHCrossRefGoogle Scholar
  3. 3.
    Rosin, P.L., Marshall, D.: Object recognition using local affine frames on distinguished regions. In: Proceedings of the British Machine Vision Conference, London, UK, pp. 113–122 (2002)Google Scholar
  4. 4.
    Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: 3D Object Modeling and Recognition Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints. International Journal of Computer Vision 66 (2006)Google Scholar
  5. 5.
    Biederman, I.: Recognition by Components: A theory of Human Image Understanding. Psychological Review 94, 115–147 (1987)CrossRefGoogle Scholar
  6. 6.
    Leonardis, A., Bischof, H.: Robust recognition using eigenimages. Computer Vision and Image Understanding 78, 99–118 (2000)CrossRefGoogle Scholar
  7. 7.
    Pontil, M., Verri, A.: Support Vector Machines for 3D Object Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 637–646 (1998)CrossRefGoogle Scholar
  8. 8.
    Zhang, H., Berg, A., Mair, M., Malik, J.: SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition. Computer Vision and Pattern Recognition 2, 2126–2136 (2006)Google Scholar
  9. 9.
    Belongie, P.J., Malik, J.: Shape Matching and Object Recognition using Shape Contexts. IEEE transactions on Pattern Analysis and Machine Intelligence 24, 509–522 (2002)CrossRefGoogle Scholar
  10. 10.
    Bohm, J., Bernner, C., Guhring, J., Fritsch, D.: Automated Extraction of features from CAD models for 3-D object recognition. In: International Society for Photogrammetry and Remote Sensing Congress, Amsterdam, Netherlands, vol. 33 (2000)Google Scholar
  11. 11.
    Edelman, S., Buelthoff, H.: Orientation dependence in the recognition of familiar and novel views of three dimensional objects. Vision Research 32, 2385–2400 (1992)CrossRefGoogle Scholar
  12. 12.
    Kanwisher, N.: Domain Specificity in face perception. Nature Neuroscience 3, 759–776 (2000)CrossRefGoogle Scholar
  13. 13.
    Hyvarinen, A.: Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Trans. on Neural Networks 10, 626–634 (1999)CrossRefGoogle Scholar
  14. 14.
    Zhang, J.Y., Frangi, A.F., Yang, J.Y.: Two-Dimensional PCA: A New Approach to Appearance-Based Face Representation and Recognition. IEEE Tran. on Pattern Analysis and Machine Intelligence 26, 131–137 (2004)CrossRefGoogle Scholar
  15. 15.
    Kittler, J., Duin, P., Matas, J.: On Combining Classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence 20, 226–239 (1998)CrossRefGoogle Scholar
  16. 16.
    Sanjay, M., Das, S., Yegnanarayana, B.: Robust Template Matching for noisy bitmap images invariant to translation and rotation. In: Indian Conference on Computer Vision, Graphics and Image Processing, New Delhi, India, pp. 82–84 (1998)Google Scholar
  17. 17.
    Nene, S.A., Nayar, S.K., Murase, H.: COIL 100 Database (1996),
  18. 18.
    Kalra, M., Das, S.: IITM Generic Object Image Library (2006),
  19. 19.
    Rother, C., Kolomogorov, V., Blake, A.: GrabCut- Interactive Foreground extraction using iterated Graph Cuts. ACM transactions on Graphics (SIGGRAPH), 309–314 (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Manisha Kalra
    • 1
  • P. Deepti
    • 1
  • R. Abhilash
    • 1
  • Sukhendu Das
    • 1
  1. 1.Visualization and Perception Laboratory, Department of Computer Science and EngineeringIndian Institute of Technology – MadrasChennaiIndia

Personalised recommendations