Coding of Image Feature Descriptors for Distributed Rate-efficient Visual Correspondences

Yeo, Chuohao; Ahammad, Parvez; Ramchandran, Kannan

doi:10.1007/s11263-011-0427-1

Published: 03 March 2011

Volume 94, pages 267–281 (2011)

International Journal of Computer Vision

Abstract

Establishing visual correspondences is a critical step in many computer vision tasks involving multiple views of a scene. In a dynamic environment and when cameras are mobile, visual correspondences need to be updated on a recurring basis. At the same time, the use of wireless links between camera motes imposes tight rate constraints. This combination of issues motivates us to consider the problem of establishing visual correspondences in a distributed fashion between cameras operating under rate constraints. We propose a solution based on constructing distance-preserving hashes using binarized random projections. By exploiting the fact that descriptors of regions in correspondence are highly correlated, we propose a novel use of distributed source coding via linear codes on the binary hashes to more efficiently exchange feature descriptors for establishing correspondences across multiple camera views. A systematic approach is used to evaluate the trade-off between rate and visual correspondence retrieval performance; under a stringent matching criterion, our proposed methods demonstrate superior performance to a baseline scheme employing transform coding of descriptors.
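
The two ingredients named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the shared random seed, and the choice of a (7,4) Hamming code as the linear code are all assumptions made for illustration, and the abstract does not say which codes or hash lengths the paper uses. The first part hashes descriptors with binarized random projections, so that the Hamming distance between two hashes tracks the angle between the descriptors. The second part shows syndrome-based coding: a camera sends only the 3-bit syndrome of a 7-bit hash block, and a receiver holding a correlated block (differing in at most one bit) recovers the sender's block.

```python
import numpy as np


def random_projection_hash(descriptors, num_bits, seed=0):
    """Hash each row of `descriptors` to `num_bits` bits.

    Bit k is 1 when the descriptor lies on the non-negative side of the
    k-th random hyperplane. Both cameras must use the same `seed` so that
    they draw identical hyperplanes (an assumption of this sketch).
    """
    rng = np.random.default_rng(seed)
    planes = rng.standard_normal((descriptors.shape[1], num_bits))
    return (descriptors @ planes >= 0).astype(np.uint8)


def hamming_distance(a, b):
    """Number of positions in which two bit vectors differ."""
    return int(np.count_nonzero(a != b))


def estimated_angle(a, b):
    """Angle estimate in radians: each bit differs with probability angle / pi."""
    return np.pi * hamming_distance(a, b) / a.size


# Toy linear code (illustrative stand-in): parity-check matrix of the
# (7,4) Hamming code. Column j (1-based) holds the binary digits of j,
# least significant bit in the first row.
H = np.array([[(j >> k) & 1 for j in range(1, 8)] for k in range(3)],
             dtype=np.uint8)


def syndrome(bits):
    """3-bit syndrome of a 7-bit block; this is all the sender transmits."""
    return (H @ bits) % 2


def decode_with_side_info(sent_syndrome, side_info):
    """Recover the sender's block from its syndrome and a correlated block.

    Correct only if the two blocks differ in at most one bit.
    """
    # Syndrome of the difference pattern between the sender's block and ours.
    diff = (sent_syndrome + syndrome(side_info)) % 2
    # For this H, a single-bit difference at 1-based position p yields the
    # binary digits of p; an all-zero result means the blocks are identical.
    position = int(diff[0]) + 2 * int(diff[1]) + 4 * int(diff[2])
    recovered = side_info.copy()
    if position:
        recovered[position - 1] ^= 1
    return recovered
```

In this toy setting the sender transmits 3 bits per 7-bit block instead of 7. The saving depends entirely on the receiver's block being close in Hamming distance, which is the correlation between corresponding descriptors that the abstract refers to; a practical system would need longer codes that tolerate more differing bits.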

References

Ahlswede, R., & Csiszár, I. (1981). To get a bit of information may be as hard as to get full information. IEEE Transactions on Information Theory, 27(4), 398–408.
Avidan, S., & Shashua, A. (1998). Novel view synthesis by cascading trilinear tensors. IEEE Transactions on Visualization and Computer Graphics, 4(4), 293–306.
Barton-Sweeney, A., Lymberopoulos, D., & Savvides, A. (2006). Sensor localization and camera calibration in distributed camera sensor networks. In Proc. IEEE basenets.
Berg, A., Berg, T., & Malik, J. (2005). Shape matching and object recognition using low distortion correspondence. In Proc. IEEE conference on computer vision and pattern recognition (Vol. 1, pp. 26–33).
Bickel, P. J., & Doksum, K. A. (2000). Mathematical statistics: basic ideas and selected topics, 2nd edn. (Vol. 1). New York: Prentice Hall.
Cai, H., Mikolajczyk, K., & Matas, J. (2008). Learning linear discriminant projections for dimensionality reduction of image descriptors. In Proc. British machine vision conf.
Chandrasekhar, V., Takacs, G., Chen, D., Tsai, S. S., Grzeszczuk, R., & Girod, B. (2009a). CHoG: compressed histogram of gradients. In Conference on computer vision and pattern recognition, Miami, FL, USA (pp. 2504–2511).
Chandrasekhar, V., Takacs, G., Chen, D., Tsai, S. S., Singh, J., & Girod, B. (2009b). Transform coding of image feature descriptors. In Proc. SPIE visual communication and image processing.
Charikar, M. S. (2002). Similarity estimation techniques from rounding algorithms. In Proc. ACM symposium on theory of computing (pp. 380–388).
Chen, P. W. C., Ahammad, P., Boyer, C., Huang, S. I., Lin, L., Lobaton, E. J., Meingast, M. L., Oh, S., Wang, S., Yan, P., Yang, A., Yeo, C., Chang, L. C., Tygar, D., & Sastry, S. S. (2008). Citric: a low-bandwidth wireless camera network platform. Tech. Rep. UCB/EECS-2008-50, EECS Department, University of California, Berkeley. http://www.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-50.html.
Cheng, Z., Devarajan, D., & Radke, R. J. (2007). Determining vision graphs for distributed camera networks using feature digests. EURASIP Journal on Advances in Signal Processing, 2007, Article ID 57034, 11 pages.
Cover, T., & Thomas, J. (1991). Elements of information theory. New York: Wiley.
Devarajan, D., & Radke, R. J. (2004). Distributed metric calibration of large camera networks. In Proc. workshop on broadband advanced sensor networks.
Downes, I., Rad, L. B., & Aghajan, H. (2006). Development of a mote for wireless image sensor networks. In Proc. COGnitive systems with Interactive Sensors (COGIS).
Ferrari, V., Tuytelaars, T., & Van Gool, L. (2004). Simultaneous object recognition and segmentation by image exploration. In Proc. European conference on computer vision (Vol. 1, pp. 40–54). Berlin: Springer.
Fischler, M. A., & Bolles, R. C. (1981). Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6), 381–395. http://doi.acm.org/10.1145/358669.358692.
Franke, U., & Joos, A. (2000). Real-time stereo vision for urban traffic scene understanding. In Proc. IEEE intelligent vehicles symposium (pp. 273–278).
Gallager, R. G. (1963). Low-density parity-check codes. Cambridge: MIT Press.
Goemans, M. X., & Williamson, D. P. (1995). Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. Journal of the ACM, 42(6), 1115–1145.
Hartley, R., & Zisserman, A. (2000). Multiple view geometry in computer vision. Cambridge: Cambridge University Press.
Indyk, P., & Motwani, R. (1998). Approximate nearest neighbors: towards removing the curse of dimensionality. In Proceedings of the thirtieth annual ACM symposium on theory of computing (pp. 604–613). New York: ACM.
Jain, P., Kulis, B., & Grauman, K. (2008). Fast image search for learned metrics. In IEEE conference on computer vision and pattern recognition, 2008. CVPR 2008 (pp. 1–8).
Körner, J., & Marton, K. (1979). How to encode the modulo-two sum of binary sources. IEEE Transactions on Information Theory, 25(2), 219–221.
Larsen, B., & Aone, C. (1999). Fast and effective text mining using linear-time document clustering. In Proc. ACM SIGKDD international conference on knowledge discovery and data mining (pp. 16–22). New York: ACM Press.
Lee, H., & Aghajan, H. (2006). Collaborative node localization in surveillance networks using opportunistic target observations. In Proc. ACM international workshop on video surveillance and sensor networks (pp. 9–18). New York: ACM Press.
Lin, Y. C., Varodayan, D., & Girod, B. (2007). Image authentication based on distributed source coding. In Proc. IEEE international conference on image processing.
Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.
Ma, Y., Soatto, S., Kosecka, J., & Sastry, S. S. (2004). An invitation to 3-D vision: from images to geometric models. Berlin: Springer.
Martinian, E., Yekhanin, S., & Yedidia, J. S. (2005). Secure biometrics via syndromes. In Proc. Allerton conference on communications, control and computing.
Matusik, W., & Pfister, H. (2004). 3D TV: a scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes. ACM Transactions on Graphics, 23(3), 814–824.
Mikolajczyk, K., & Matas, J. (2007). Improving descriptors for fast tree matching by optimal linear projection. In IEEE 11th international conference on computer vision, 2007 (pp. 1–8).
Mikolajczyk, K., & Schmid, C. (2004). Scale and affine invariant interest point detectors. International Journal of Computer Vision, 60(1), 63–86.
Mikolajczyk, K., & Schmid, C. (2005). A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(10), 1615–1630.
Oh, S., Schenato, L., Chen, P., & Sastry, S. (2007). Tracking and coordination of multiple agents using sensor networks: system design, algorithms and experiments. Proceedings of the IEEE, 95, 234–254.
Rahimi, M., Baer, R., Iroezi, O., Garcia, J., Warrior, J., Estrin, D., & Srivastava, M. (2005). Cyclops: in situ image sensing and interpretation. In Proc. ACM conference on embedded networked sensor systems.
Richardson, T. J., & Urbanke, R. L. (2001). The capacity of low-density parity-check codes under message-passing decoding. IEEE Transactions on Information Theory, 47(2), 599–618.
Roy, S., & Sun, Q. (2007). Robust hash for detecting and localizing image tampering. In Proc. IEEE international conference on image processing.
Salakhutdinov, R., & Hinton, G. (2009). Semantic hashing. International Journal of Approximate Reasoning, 50(7), 969–978.
Schaffalitzky, F., & Zisserman, A. (2002). Multi-view matching for unordered image sets, or “how do I organize my holiday snaps?”. In Proc. European conference on computer vision (Vol. 1, pp. 414–431). Berlin: Springer.
Se, S., Lowe, D., & Little, J. (2002). Global localization using distinctive visual features. In Proc. IEEE/RSJ international conference on intelligent robots and system (Vol. 1).
Shum, H., & Kang, S. B. (2000). A review of image-based rendering techniques. In Proc. SPIE visual communications and image processing (pp. 2–13). Bellingham: SPIE.
Slepian, D., & Wolf, J. (1973). Noiseless coding of correlated information sources. IEEE Transactions on Information Theory, 19(4), 471–480.
Szewczyk, R., Osterweil, E., Polastre, J., Hamilton, M., Mainwaring, A. M., & Estrin, D. (2004). Habitat monitoring with sensor networks. Communications of the ACM, 47(6), 34–40.
Teixeira, T., Lymberopoulos, D., Culurciello, E., Aloimonos, Y., & Savvides, A. (2006). A lightweight camera sensor network operating on symbolic information. In Proc. workshop on distributed smart cameras, Boulder, Colorado.
Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. Advances in Neural Information Processing Systems, 21, 1753–1760.
Winder, S. A. J., & Brown, M. (2007). Learning local image descriptors. In IEEE conference on computer vision and pattern recognition, 2007. CVPR’07 (pp. 1–8).
Wyner, A. D., & Ziv, J. (1976). The rate distortion function for source coding with side information at the decoder. IEEE Transactions on Information Theory, 22(1), 1–10.
Yeo, C., Ahammad, P., & Ramchandran, K. (2008a). A rate-efficient approach for establishing visual correspondences via distributed source coding. In Proc. SPIE visual communications and image processing.
Yeo, C., Ahammad, P., & Ramchandran, K. (2008b). Rate-efficient visual correspondences using random projections. In Proc. IEEE international conference on image processing.
Yeo, C., Ahammad, P., Zhang, H., & Ramchandran, K. (2009). Rate-constrained distributed distance testing and its applications. In Proc. IEEE international conference on acoustics, speech, and signal processing.
Zhang, Z. (1998). Determining the epipolar geometry and its uncertainty: a review. International Journal of Computer Vision, 27(2), 161–195.

Author information

Authors and Affiliations

Signal Processing Department, Institute for Infocomm Research, Singapore, 118936, Singapore
Chuohao Yeo
Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn, VA, 20147, USA
Parvez Ahammad
Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, 94720, USA
Kannan Ramchandran

Corresponding author

Correspondence to Chuohao Yeo.

Additional information

This work has been presented in part in Yeo et al. (2008a, 2008b, 2009).

About this article

Cite this article

Yeo, C., Ahammad, P. & Ramchandran, K. Coding of Image Feature Descriptors for Distributed Rate-efficient Visual Correspondences. Int J Comput Vis 94, 267–281 (2011). https://doi.org/10.1007/s11263-011-0427-1

Received: 27 April 2009
Accepted: 17 February 2011
Published: 03 March 2011
Issue Date: September 2011
DOI: https://doi.org/10.1007/s11263-011-0427-1
