Skip to main content
Log in

Coding of Image Feature Descriptors for Distributed Rate-efficient Visual Correspondences

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

Establishing visual correspondences is a critical step in many computer vision tasks involving multiple views of a scene. In a dynamic environment and when cameras are mobile, visual correspondences need to be updated on a recurring basis. At the same time, the use of wireless links between camera motes imposes tight rate constraints. This combination of issues motivates us to consider the problem of establishing visual correspondences in a distributed fashion between cameras operating under rate constraints. We propose a solution based on constructing distance preserving hashes using binarized random projections. By exploiting the fact that descriptors of regions in correspondence are highly correlated, we propose a novel use of distributed source coding via linear codes on the binary hashes to more efficiently exchange feature descriptors for establishing correspondences across multiple camera views. A systematic approach is used to evaluate rate vs visual correspondences retrieval performance; under a stringent matching criterion, our proposed methods demonstrate superior performance to a baseline scheme employing transform coding of descriptors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Ahlswede, R., & Csiszár, I. (1981). To get a bit of information may be as hard as to get full information. IEEE Transactions on Information Theory, 27(4), 398–408.

    Article  MATH  Google Scholar 

  • Avidan, S., & Shashua, A. (1998). Novel view synthesis by cascading trilinear tensors. IEEE Transactions on Visualization and Computer Graphics, 4(4), 293–306.

    Article  Google Scholar 

  • Barton-Sweeney, A., Lymberopoulos, D., & Savvides, A. (2006). Sensor localization and camera calibration in distributed camera sensor networks. In Proc. IEEE basenets.

    Google Scholar 

  • Berg, A., Berg, T., & Malik, J. (2005). Shape matching and object recognition using low distortion correspondence. In Proc. IEEE conference on computer vision and pattern recognition (Vol. 1, pp. 26–33).

    Google Scholar 

  • Bickel, P. J., & Doksum, K. A. (2000). Mathematical statistics: basic ideas and selected topics, 2nd edn. (Vol. 1). New York: Prentice Hall.

    Google Scholar 

  • Cai, H., Mikolajczyk, K., & Matas, J. (2008). Learning linear discriminant projections for dimensionality reduction of image descriptors. In Proc. British machine vision conf.

  • Chandrasekhar, V., Takacs, G., Chen, D., Tsai, S. S., Grzeszczuk, R., & Girod, B. (2009a). CHoG: compressed histogram of gradients. In Conference on computer vision and pattern recognition, Miami, FL, USA (pp. 2504–2511).

    Chapter  Google Scholar 

  • Chandrasekhar, V., Takacs, G., Chen, D., Tsai, S. S., Singh, J., & Girod, B. (2009b). Transform coding of image feature descriptors. In Proc. SPIE visual communication and image processing.

    Google Scholar 

  • Charikar, M. S. (2002). Similarity estimation techniques from rounding algorithms. In Proc. ACM symposium on theory of computing (pp. 380–388).

    Google Scholar 

  • Chen, P. W. C., Ahammad, P., Boyer, C., Huang, S. I., Lin, L., Lobaton, E. J., Meingast, M. L., Oh, S., Wang, S., Yan, P., Yang, A., Yeo, C., Chang, L. C., Tygar, D., & Sastry, S. S. (2008). Citric: a low-bandwidth wireless camera network platform. Tech. Rep. UCB/EECS-2008-50, EECS Department, University of California, Berkeley. http://www.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-50.html.

  • Cheng, Z., Devarajan, D., & Radke, R. J. (2007). Determining vision graphs for distributed camera networks using feature digests. EURASIP Journal on Advances in Signal Processing, 2007, Article ID 57,034, 11 pages.

    Article  Google Scholar 

  • Cover, T., & Thomas, J. (1991). Elements of information theory. New York: Wiley.

    Book  MATH  Google Scholar 

  • Devarajan, D., & Radke, R. J. (2004). Distributed metric calibration of large camera networks. In Proc. workshop on broadband advanced sensor networks.

    Google Scholar 

  • Downes, I., Rad, L. B., & Aghajan, H. (2006). Development of a mote for wireless image sensor networks. In Proc. COGnitive systems with Interactive Sensors (COGIS).

    Google Scholar 

  • Ferrari, V., Tuytelaars, T., & Van Gool, L. (2004). Simultaneous object recognition and segmentation by image exploration. In Proc. European conference on computer vision (Vol. 1, pp. 40–54). Berlin: Springer.

    Google Scholar 

  • Fischler, M. A., & Bolles, R. C. (1981). Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6), 381–395. http://doi.acm.org/10.1145/358669.358692.

    Article  MathSciNet  Google Scholar 

  • Franke, U., & Joos, A. (2000). Real-time stereo vision for urban traffic scene understanding. In Proc. IEEE intelligent vehicles symposium (pp. 273–278).

    Google Scholar 

  • Gallager, R. G. (1963). Low-density parity-check codes. Cambridge: MIT Press.

    Google Scholar 

  • Goemans, M. X., & Williamson, D. P. (1995). Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. Journal of the ACM, 42(6), 1115–1145.

    Article  MathSciNet  MATH  Google Scholar 

  • Hartley, R., & Zisserman, A. (2000). Multiple view geometry in computer vision. Cambridge: Cambridge University Press.

    MATH  Google Scholar 

  • Indyk, P., & Motwani, R. (1998). Approximate nearest neighbors: towards removing the curse of dimensionality. In Proceedings of the thirtieth annual ACM symposium on theory of computing (pp. 604–613). New York: ACM.

    Chapter  Google Scholar 

  • Jain, P., Kulis, B., & Grauman, K. (2008). Fast image search for learned metrics. In IEEE conference on computer vision and pattern recognition, 2008. CVPR 2008 (pp. 1–8).

    Google Scholar 

  • Körner, J., & Marton, K. (1979). How to encode the modulo-two sum of binary sources. IEEE Transactions on Information Theory, 25(2), 219–221.

    Article  MATH  Google Scholar 

  • Larsen, B., & Aone, C. (1999). Fast and effective text mining using linear-time document clustering. In Proc. ACM SIGKDD international conference on knowledge discovery and data mining (pp. 16–22). New York: ACM Press.

    Chapter  Google Scholar 

  • Lee, H., & Aghajan, H. (2006). Collaborative node localization in surveillance networks using opportunistic target observations. In Proc. ACM international workshop on video surveillance and sensor networks (pp. 9–18). New York: ACM Press.

    Chapter  Google Scholar 

  • Lin, Y. C., Varodayan, D., & Girod, B. (2007). Image authentication based on distributed source coding. In Proc. IEEE international conference on image processing.

    Google Scholar 

  • Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.

    Article  Google Scholar 

  • Ma, Y., Soatto, S., Kosecka, J., & Sastry, S. S. (2004). An invitation to 3-D vision: from images to geometric models. Berlin: Springer.

    MATH  Google Scholar 

  • Martinian, E., Yekhanin, S., & Yedidia, J. S. (2005). Secure biometrics via syndromes. In Proc. Allerton conference on communications, control and computing.

    Google Scholar 

  • Matusik, W., & Pfister, H. (2004). 3D TV: a scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes. ACM Transactions on Graphics, 23(3), 814–824.

    Article  Google Scholar 

  • Mikolajczyk, K., & Matas, J. (2007). Improving descriptors for fast tree matching by optimal linear projection. In IEEE 11th international conference on computer vision, 2007 (pp. 1–8).

    Chapter  Google Scholar 

  • Mikolajczyk, K., & Schmid, C. (2004). Scale and affine invariant interest point detectors. International Journal of Computer Vision, 60(1), 63–86.

    Article  Google Scholar 

  • Mikolajczyk, K., & Schmid, C. (2005). A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(10), 1615–1630.

    Article  Google Scholar 

  • Oh, S., Schenato, L., Chen, P., & Sastry, S. (2007). Tracking and coordination of multiple agents using sensor networks: system design, algorithms and experiments. Proceedings of the IEEE, 95, 234–254.

    Article  Google Scholar 

  • Rahimi, M., Baer, R., Iroezi, O., Garcia, J., Warrior, J., Estrin, D., & Srivastava, M. (2005). Cyclops: in situ image sensing and interpretation. In Proc. ACM conference on embedded networked sensor systems.

    Google Scholar 

  • Richardson, T. J., & Urbanke, R. L. (2001). The capacity of low-density parity-check codes under message-passing decoding. IEEE Transactions on Information Theory, 47(2), 599–618.

    Article  MathSciNet  MATH  Google Scholar 

  • Roy, S., & Sun, Q. (2007). Robust hash for detecting and localizing image tampering. In Proc. IEEE international conference on image processing.

    Google Scholar 

  • Salakhutdinov, R., & Hinton, G. (2009). Semantic hashing. International Journal of Approximate Reasoning, 50(7), 969–978.

    Article  Google Scholar 

  • Schaffalitzky, F., & Zisserman, A. (2002). Multi-view matching for unordered image sets, or “how do I organize my holiday snaps?”. In Proc. European conference on computer vision (Vol. 1, pp. 414–431). Berlin: Springer.

    Google Scholar 

  • Se, S., Lowe, D., & Little, J. (2002). Global localization using distinctive visual features. In Proc. IEEE/RSJ international conference on intelligent robots and system (Vol. 1).

    Google Scholar 

  • Shum, H., & Kang, S. B. (2000). A review of image-based rendering techniques. In Proc. SPIE visual communications and image processing (pp. 2–13). Bellingham: SPIE.

    Google Scholar 

  • Slepian, D., & Wolf, J. (1973). Noiseless coding of correlated information sources. IEEE Transactions on Information Theory, 19(4), 471–480.

    Article  MathSciNet  MATH  Google Scholar 

  • Szewczyk, R., Osterweil, E., Polastre, J., Hamilton, M., Mainwaring, A. M., & Estrin, D. (2004). Habitat monitoring with sensor networks. Communications of the ACM, 47(6), 34–40.

    Article  Google Scholar 

  • Teixeira, T., Lymberopoulos, D., Culurciello, E., Aloimonos, Y., & Savvides, A. (2006). A lightweight camera sensor network operating on symbolic information. In Proc. workshop on distributed smart cameras, Boulder, Colorado.

    Google Scholar 

  • Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. Advances in Neural Information Processing Systems, 21, 1753–1760.

    Google Scholar 

  • Winder, S. A. J., & Brown, M. (2007). Learning local image descriptors. In IEEE conference on computer vision and pattern recognition, 2007. CVPR’07 (pp. 1–8).

    Chapter  Google Scholar 

  • Wyner, A. D., & Ziv, J. (1976). The rate distortion function for source coding with side information at the decoder. IEEE Transactions on Information Theory, 22(1), 1–10.

    Article  MathSciNet  MATH  Google Scholar 

  • Yeo, C., Ahammad, P., & Ramchandran, K. (2008a). A rate-efficient approach for establishing visual correspondences via distributed source coding. In Proc. SPIE visual communications and image processing.

    Google Scholar 

  • Yeo, C., Ahammad, P., & Ramchandran, K. (2008b). Rate-efficient visual correspondences using random projections. In Proc. IEEE international conference on image processing.

    Google Scholar 

  • Yeo, C., Ahammad, P., Zhang, H., & Ramchandran, K. (2009). Rate-constrained distributed distance testing and its applications. In Proc. IEEE international conference on acoustics, speech, and signal processing.

    Google Scholar 

  • Zhang, Z. (1998). Determining the epipolar geometry and its uncertainty: a review. International Journal of Computer Vision, 27(2), 161–195.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chuohao Yeo.

Additional information

This work has been presented in part in Yeo et al. (2008a, 2008b, 2009).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yeo, C., Ahammad, P. & Ramchandran, K. Coding of Image Feature Descriptors for Distributed Rate-efficient Visual Correspondences. Int J Comput Vis 94, 267–281 (2011). https://doi.org/10.1007/s11263-011-0427-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11263-011-0427-1

Keywords

Navigation