Abstract
The paper demonstrates how to transform (using a combination of techniques reported in our previous papers) a collection of random images gathered in an unknown environment into a limited-scale visual model of that environment. The model generally consists of the template images of the typical “visual objects” identified in the explored world. Both the concepts of objects and their templates are formed without any assumptions about the content of acquired images, i.e. the semantics is built using the pictorial data only (although users may subsequently identify the real-world semantics of the formed objects). From the image processing perspective, the method consists in detecting near-duplicate (i.e. photometric/geometric distortions and partial occlusions are allowed) fragments in random images. It is envisaged that such a proposal can be instrumental in assisting both autonomous agents and visually impaired humans (including both blind people and people unable to understand perceived visual data) facing unfamiliar worlds. The paper focuses on the practical aspects of the problem (exemplary results, computational efficiency, etc.) although a substantial amount of theoretical background is also included.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Biederman, I.: Recognition-by-components: A theory of human image understanding. Psychological Review 94(2), 115–147 (1987)
Bourbakis, N.G., Kavraki, D.: An intelligent assistant for navigation of visually impaired people. In: 2nd IEEE Int. Symp. on Bioinformatics and Bioengineering, Bethesda, p. 230 (2001)
Ke, Y., Sukthankar, R., Huston, L.: Efficient near-duplicate detection and sub-image retrieval. In: ACM Multimedia Conference, New York, pp. 869–876 (2004)
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide baseline stereo from maximally stable extremal regions. In: British Machine Vision Conf., Cardiff, pp. 384–393 (2002)
Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. Int. J. of Computer Vision 60, 63–86 (2004)
Paradowski, M., Śluzek, A.: Local keypoints and global affine geometry: Triangles and ellipses for image fragment matching. In: Kwaśnicka, H., Jain, L.C. (eds.) Innovations in Intelligent Image Analysis. SCI, vol. 339, pp. 195–224. Springer, Heidelberg (2011)
Paradowski, M., Śluzek, A.: Automatic visual object formation using image fragment matching. In: 5th Int. Symp. Advances in Artificial Intelligence & Applications, Wisła, pp. 97–104 (2010)
Riesenhuber, M., Poggio, T.: Models of object recognition. Nature Neuroscience 3, 1199–1204 (2000)
Zhao, W.-L., Ngo, C.-W.: Scale-rotation invariant pattern entropy for keypoint-based near-duplicate detection. IEEE Trans. on Image Processing 18(2), 412–423 (2009)
Zhao, W.-L., Ngo, C.-W., Tan, H.-K., Wu, X.: Near-duplicate keyframe identification with interest point matching and pattern learning. IEEE Trans. on Multimedia 9(5), 1037–1048 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Śluzek, A., Paradowski, M. (2012). Towards Vision-Based Understanding of Unknown Environments. In: Hippe, Z.S., Kulikowski, J.L., Mroczek, T. (eds) Human – Computer Systems Interaction: Backgrounds and Applications 2. Advances in Intelligent and Soft Computing, vol 98. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23187-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-23187-2_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23186-5
Online ISBN: 978-3-642-23187-2
eBook Packages: EngineeringEngineering (R0)