Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 98))

Abstract

The paper demonstrates how a collection of random images gathered in an unknown environment can be transformed, using a combination of techniques reported in our previous papers, into a limited-scale visual model of that environment. The model generally consists of template images of the typical “visual objects” identified in the explored world. Both the concepts of objects and their templates are formed without any assumptions about the content of the acquired images, i.e. the semantics is built from the pictorial data only (although users may subsequently identify the real-world semantics of the formed objects). From the image processing perspective, the method consists in detecting near-duplicate fragments in random images, where photometric/geometric distortions and partial occlusions are allowed. It is envisaged that such a proposal can be instrumental in assisting both autonomous agents and visually impaired humans (including both blind people and people unable to understand perceived visual data) facing unfamiliar worlds. The paper focuses on the practical aspects of the problem (exemplary results, computational efficiency, etc.), although a substantial amount of theoretical background is also included.

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Śluzek, A., Paradowski, M. (2012). Towards Vision-Based Understanding of Unknown Environments. In: Hippe, Z.S., Kulikowski, J.L., Mroczek, T. (eds) Human – Computer Systems Interaction: Backgrounds and Applications 2. Advances in Intelligent and Soft Computing, vol 98. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23187-2_25

  • DOI: https://doi.org/10.1007/978-3-642-23187-2_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23186-5

  • Online ISBN: 978-3-642-23187-2

  • eBook Packages: Engineering, Engineering (R0)
