Abstract
In this paper, we propose a holistic classification scheme for different room types, like office or meeting room, based on 3D features. Such a categorization of scenes provides a rich source of information about potential objects, object locations, and activities typically found in them. Scene categorization is a challenging task. While outdoor scenes can be sufficiently characterized by color and texture features, indoor scenes consist of human-made structures that vary in terms of color and texture across different individual rooms of the same category. Nevertheless, humans tend to have an immediate impression in which room type they are. We suggest that such a decision could be based on the coarse spatial layout of a scene. Therefore, we present a system that categorizes different room types based on 3D sensor data extracted by a Time-of-Flight (ToF) camera. We extract planar structures combining region growing and RANSAC approaches. Then, feature vectors are defined on statistics over the relative sizes of the planar patches, the angles between pairs of (close) patches, and the ratios between sizes of pairs of patches to train classifiers. Experiments in a mobile robot scenario study the performance in classifying a room based on a single percept.
Chapter PDF
References
Strat, T.M., Fischler, M.A.: Context-based vision: Recognizing objects using both 2D and 3D imaging. PAMI 13, 1050–1065 (1991)
McKeown, D.M., Harvey, W.A., McDermott, J.: Rule-based interpretation of aerial imagery. In: Readings in Computer Vision: Issues, Problems, Principles, and Paradigms, pp. 415–430 (1987)
Murphy, K., Torralba, A., Freeman, W.T.: Using the forest to see the trees: A graphical model relating features, objects, and scenes. In: Advances in Neural Information Processing Systems, vol. 16 (2003)
Hoiem, D., Efros, A.A., Hebert, M.: Putting objects in perspective. CVPR 2, 2137–2144 (2006)
Szummer, M., Picard, R.W.: Indoor-outdoor image classification. In: CAIVD, pp. 42–51 (1998)
Paek, S., Chang, S.F.: A knowledge engineering approach for image classification based on probabilistic reasoning systems. ICME 2, 1133–1136 (2000)
Torralba, A.: Contextual priming for object detection. IJCV 53, 153–167 (2003)
Weingarten, J., Gruener, G., Siegwart, R.: A state-of-the-art 3D sensor for robot navigation. In: IROS (2004)
Swadzba, A., Liu, B., Penne, J., Jesorsky, O., Kompe, R.: A comprehensive system for 3D modeling from range images acquired from a 3D ToF sensor. In: ICVS (2007)
Fischler, M.A., Bolles, R.C.: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 381–395 (1981)
Besl, P.J., McKay, N.D.: A method for registration of 3D shapes. PAMI 14, 239–256 (1992)
Nüchter, A., Surmann, H., Hertzberg, J.: Automatic model refinement for 3D reconstruction with mobile robots. In: 3DIM, pp. 394–401 (2003)
Lee, S., Jang, D., Kim, E., Hong, S., Han, J.: A real-time 3D workspace modeling with stereo camera. In: IROS, pp. 2140–2147 (2005)
Liu, Y., Emery, R., Chakrabarti, D., Burgard, W., Thurn, S.: Using EM to learn 3D models of indoor environments with mobile robots. In: ICML (2001)
Lakaemper, R., Latecki, L.J.: Using extended EM to segment planar structures in 3D. In: ICPR, pp. 1077–1082 (2006)
Hähnel, D., Burgard, W., Thrun, S.: Learning compact 3D models of indoor and outdoor environments with a mobile robot. Robotics and Autonomous Systems 44, 15–27 (2003)
Adams, R., Bischof, L.: Seeded region growing. PAMI 16, 641–647 (1994)
Fua, P.: From multiple stereo views to multiple 3D surfaces. IJCV 24, 19–35 (1997)
Stamos, I., Allen, P.K.: Geometry and texture recovery of scenes of large scale. CVIU 88, 94–118 (2002)
Lourenco, A., Freitas, P., Ribeiro, M.I., Marques, J.S.: Detection and classification of 3D moving objects. In: Mediterranean Conf. on Control and Automation (2002)
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation. Parallel Data Processing 1, 318–362 (1986)
Vapnik, V.N.: The Nature of Statistical Learning Theory (1995)
Joachims, T.: Learning to Classify Text Using Support Vector Machines. PhD thesis, Cornell University (2002)
Fink, G.A.: Developing HMM-based recognizers with ESMERALDA. In: Matoušek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds.) TSD 1999. LNCS (LNAI), vol. 1692, pp. 229–234. Springer, Heidelberg (1999)
Haasch, A., Hohenner, S., Hüwel, S., Kleinehagenbrock, M., Lang, S., Toptsis, I., Fink, G.A., Fritsch, J., Wrede, B., Sagerer, G.: BIRON – The Bielefeld Robot Companion. In: ASER, pp. 27–32 (2004)
COGNIRON: The cognitive robot companion (FP6-IST-002020) (2004), http://www.cogniron.org
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Swadzba, A., Wachsmuth, S. (2008). Categorizing Perceptions of Indoor Rooms Using 3D Features. In: da Vitoria Lobo, N., et al. Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2008. Lecture Notes in Computer Science, vol 5342. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89689-0_77
Download citation
DOI: https://doi.org/10.1007/978-3-540-89689-0_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89688-3
Online ISBN: 978-3-540-89689-0
eBook Packages: Computer ScienceComputer Science (R0)