Real-Time Visual Recognition of Objects and Scenes Using P-Channel Matching

  • Michael Felsberg
  • Johan Hedborg
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4522)


In this paper we propose a new approach to real-time view-based object recognition and scene registration. Object recognition is an important sub-task in many applications, as e.g., robotics, retrieval, and surveillance. Scene registration is particularly useful for identifying camera views in databases or video sequences. All of these applications require a fast recognition process and the possibility to extend the database with new material, i.e., to update the recognition system online.

The method that we propose is based on P-channels, a special kind of information representation which combines advantages of histograms and local linear models. Our approach is motivated by its similarity to information representation in biological systems but its main advantage is its robustness against common distortions as clutter and occlusion. The recognition algorithm extracts a number of basic, intensity invariant image features, encodes them into P-channels, and compares the query P-channels to a set of prototype P-channels in a database. The algorithm is applied in a cross-validation experiment on the COIL database, resulting in nearly ideal ROC curves. Furthermore, results from scene registration with a fish-eye camera are presented.


object recognition scene registration P-channels real-time processing view-based computer vision 


  1. 1.
    Obdržálek, Š., Matas, J.: Sub-linear indexing for large scale object recognition. In: Clocksin, W.F., Fitzgibbon, A.W., Torr, P.H.S. (eds.) BMVC 2005: Proceedings of the 16th British Machine Vision Conference, vol. 1, September 2005, pp. 1–10. BMVA, London (2005)Google Scholar
  2. 2.
    Nene, S.A., Nayar, S.K., Murase, H.: Columbia object image library (coil-100). Technical Report CUCS-006-96 (1996)Google Scholar
  3. 3.
    Pontil, M., Verri, A.: Support vector machines for 3d object recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(6), 637–646 (1998)CrossRefGoogle Scholar
  4. 4.
    Roobaert, D., Zillich, M., Eklundh, J.O.: A pure learning approach to background-invariant object recognition using pedagogical support vector learning. In: IEEE Computer Vision and Pattern Recognition, vol. 2, pp. 351–357 (2001)Google Scholar
  5. 5.
    Murphy-Chutorian, E., Aboutalib, S., Triesch, J.: Analysis of a biologically-inspired system for real-time object recognition. Cognitive Science Online 3, 1–14 (2005)Google Scholar
  6. 6.
    Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, New York (1991)zbMATHGoogle Scholar
  7. 7.
    Bishop, C.M.: Neural Networks for Pattern Recognition. Oxford University Press, New York (1995)Google Scholar
  8. 8.
    Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Heidelberg (2006)zbMATHGoogle Scholar
  9. 9.
    Granlund, G.H.: An Associative Perception-Action Structure Using a Localized Space Variant Information Representation. In: Sommer, G., Zeevi, Y.Y. (eds.) AFPAC 2000. LNCS, vol. 1888, pp. 48–68. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  10. 10.
    Snippe, H.P., Koenderink, J.J.: Discrimination thresholds for channel-coded systems. Biological Cybernetics 66, 543–551 (1992)zbMATHCrossRefGoogle Scholar
  11. 11.
    Forssén, P.E.: Low and Medium Level Vision using Channel Representations. PhD thesis, Linköping University, Sweden (2004)Google Scholar
  12. 12.
    Johansson, B., Elfving, T., Kozlov, V., Censor, Y., Forssén, P.E., Granlund, G.: The application of an oblique-projected landweber method to a model of supervised learning. Mathematical and Computer Modelling 43, 892–909 (2006)zbMATHCrossRefMathSciNetGoogle Scholar
  13. 13.
    Felsberg, M., Forssén, P.E., Scharr, H.: Channel smoothing: Efficient robust smoothing of low-level signal features. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(2), 209–222 (2006)CrossRefGoogle Scholar
  14. 14.
    Jonsson, E., Felsberg, M.: Reconstruction of Probability Density Functions from Channel Representations. In: Kalviainen, H., Parkkinen, J., Kaarna, A. (eds.) SCIA 2005. LNCS, vol. 3540, pp. 491–500. Springer, Heidelberg (2005)Google Scholar
  15. 15.
    Unser, M.: Splines – a perfect fit for signal and image processing. IEEE Signal Processing Magazine 16, 22–38 (1999)CrossRefGoogle Scholar
  16. 16.
    Felsberg, M., Granlund, G.: P-channels: Robust multivariate m-estimation of large datasets. In: International Conference on Pattern Recognition, Hong Kong (August 2006)Google Scholar
  17. 17.
    Granlund, G.H., Knutsson, H.: Signal Processing for Computer Vision. Kluwer Academic Publishers, Dordrecht (1995)Google Scholar
  18. 18.
    Brand, M.: Incremental singular value decomposition of uncertain data with missing values. Technical Report TR-2002-24, Mitsubishi Electric Research Laboratory (2002)Google Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Michael Felsberg
    • 1
  • Johan Hedborg
    • 1
  1. 1.Computer Vision Laboratory, Linköping University, S-58183 LinköpingSweden

Personalised recommendations