Alignment of 3D Models to Images Using Region-Based Mutual Information and Neighborhood Extended Gaussian Images

  • Conference paper
Computer Vision – ACCV 2006 (ACCV 2006)

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 3851)

Abstract

Mutual information has been used for matching and registering 3D models to 2D images. However, in Viola’s original framework [1], surface albedo variance is assumed to be minimal when measuring similarity between 3D models and 2D image data using mutual information. In reality, most objects have textured surfaces whose albedo varies across the surface, and direct application of this method in such circumstances will fail. To solve this problem, we propose to incorporate spatial information into the original formulation by using histogram-based features of local regions that are robust to local but significant albedo variation. Neighborhood Extended Gaussian Images (NEGI) are used as descriptors to represent local surface regions on the 3D model, while pixel intensity data are considered within corresponding region windows on the image. Experiments on aligning 3D car models in cluttered scenes using this new framework demonstrate substantial improvement compared with the original pixel-wise mutual information approach.
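For orientation, the pixel-wise baseline mentioned in the abstract rests on the standard definition of mutual information, I(U;V) = Σ p(u,v) log [p(u,v) / (p(u) p(v))], estimated from paired samples. The sketch below is a minimal histogram-based estimator of that quantity only; it is not the region-based NEGI formulation proposed in the paper. The function name `mutual_information`, its parameters (`bins`, `lo`, `hi`), and the toy data are assumptions made for illustration.

```python
import math
from collections import Counter


def mutual_information(u, v, bins=8, lo=0.0, hi=1.0):
    """Estimate I(U;V) in bits from paired samples using a joint histogram.

    Hypothetical helper: u could hold a per-pixel model-side value and v the
    corresponding image intensity, both assumed to lie in [lo, hi].
    """
    if len(u) != len(v) or len(u) == 0:
        raise ValueError("u and v must be non-empty and of equal length")

    def bin_index(x):
        # Clamp to [lo, hi], then map to one of `bins` equal-width bins.
        t = (min(max(x, lo), hi) - lo) / (hi - lo)
        return min(int(t * bins), bins - 1)

    pairs = [(bin_index(a), bin_index(b)) for a, b in zip(u, v)]
    n = len(pairs)
    joint = Counter(pairs)                 # joint histogram counts
    count_u = Counter(i for i, _ in pairs)  # marginal counts for u
    count_v = Counter(j for _, j in pairs)  # marginal counts for v

    mi = 0.0
    for (i, j), c in joint.items():
        # p(i,j) * log2(p(i,j) / (p(i) p(j))), written in terms of counts.
        mi += (c / n) * math.log2(c * n / (count_u[i] * count_v[j]))
    return mi


# Toy data (assumed): image values that are a fixed function of the model
# values carry information about them; a constant image carries none.
model = [0.1, 0.1, 0.9, 0.9, 0.5, 0.5]
image_dependent = [0.8, 0.8, 0.2, 0.2, 0.5, 0.5]
image_constant = [0.3] * 6

mi_dependent = mutual_information(model, image_dependent)
mi_constant = mutual_information(model, image_constant)
```

With these toy inputs the dependent pair yields log2(3) bits, because the three distinct model values map one-to-one onto three image values, and the constant image yields zero. Note that the estimator treats each sample pair independently and ignores spatial layout, which is the limitation the abstract's region-based features are meant to address.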

References

  1. Viola, P.: Alignment by Maximization of Mutual Information. PhD thesis, Massachusetts Institute of Technology (1995)

  2. Pluim, J., Maintz, J., Viergever, M.: Mutual information based registration of medical images: A survey. IEEE Transactions on Medical Imaging 22, 986–1004 (2003)

  3. Campbell, R., Flynn, P.: A survey of free-form object representation and recognition techniques. Computer Vision and Image Understanding 81, 166–210 (2001)

  4. Suveg, I., Vosselman, G.: Mutual information based evaluation of 3D building models. In: Proceedings of the International Conference on Pattern Recognition, Quebec City, Canada, vol. 3, pp. 188–197 (2002)

  5. Kollnig, H., Nagel, H.-H.: 3D pose estimation by directly matching polyhedral models to gray value gradients. International Journal of Computer Vision 23, 283–302 (1997)

  6. Tan, T., Sullivan, G., Baker, K.: Model-based localization and recognition of road vehicles. International Journal of Computer Vision 27, 5–25 (1998)

  7. Maes, F., Collignon, A., Vandermeulen, D., Marchal, G., Suetens, P.: Multi-modality image registration by maximization of mutual information. IEEE Transactions on Medical Imaging 16, 187–198 (1997)

  8. Cover, T., Thomas, J.: Elements of Information Theory. John Wiley, Chichester (1991)

  9. Russakoff, D., Tomasi, C., Rohlfing, T., Maurer, C.: Image similarity using mutual information of regions. In: Proceedings of the European Conference on Computer Vision, Prague, Czech Republic, pp. 596–607 (2004)

  10. Leventon, M., Wells III, W., Grimson, W.: Multiple view 2D-3D mutual information registration. In: DARPA Image Understanding Workshop, pp. 625–630 (1997)

  11. Shannon, C.: A mathematical theory of communication. The Bell System Technical Journal 27, 379–423 (1948)

  12. Horn, B.: Extended Gaussian images. Proceedings of the IEEE 72, 1656–1678 (1984)

  13. Pong, H., Cham, T.: Object detection using a cascade of 3D models. In: Proceedings of the Asian Conference on Computer Vision, Hyderabad, India (2006)

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pong, HK., Cham, TJ. (2006). Alignment of 3D Models to Images Using Region-Based Mutual Information and Neighborhood Extended Gaussian Images. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3851. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612032_7

  • DOI: https://doi.org/10.1007/11612032_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31219-2

  • Online ISBN: 978-3-540-32433-1

  • eBook Packages: Computer Science (R0)
