Abstract
Doors are a significant object for the visually impaired and robots to enter and exit buildings. Although the accuracy of door detection is reported high in indoor scenes, it has become a difficult problem in outdoor scenes in computer vision. The reason may lie in the fact that such properties of a simple ordinary door such as handles, corners, and the gap between the door and the ground may not be visible due to the great variety of doors in outdoor environments. In this paper, we present a vision-based method for detecting building entrances in outdoor images. After extracting the lines and deleting the extra ones, regions between the vertical lines are specified and the features including height, width, location, color, texture and the number of lines inside the regions are obtained. Finally, some additional knowledge such as door existence at the bottom of the image, a reasonable height and width of a door, the difference between color and texture of the doors and those of the neighboring regions, and numerous lines on doors is used to decide on door detection. The method was tested on the eTRIMS dataset, door images from the ImageNet dataset, and our own dataset including doors of houses, apartments, and stores leading to acceptable results. The obtained results show that our approach outperforms comparable state-of-the-art approaches.
Similar content being viewed by others
References
Anguelov D, Koller D, Parker E, Thrun S (2004) Detecting and modeling doors with mobile robots. In: Proceedings of the IEEE International Conference on Robotics and Automation, New Orleans, pp. 3777–3784
Chen Z, Li Y, Birchfield ST (2011) Visual detection of lintel-occluded doors by integrating multiple cues using data-driven Markov chain Monte Carlo process. Robot Auton Syst 59(11):966–976
Cohen A, Schwing AG, Pollefeys M (2014) Efficient structured parsing of facades using dynamic programming. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, pp. 3206–3213
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 886–893
Dollar P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell (TPAMI) 36(8):1532–1545
Garcia-Garcia A, Orts-Escolano S, Oprea SO, Villena-Martinez V, Garcia-Rodriguez J (2017) A review on deep learning techniques applied to semantic segmentation. arXiv 1704.06857
Girshick R (2015) Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587
Gould S, He X (2014) Scene understanding by labeling pixels. Commun ACM 57(11):68–77
Grompone von Gioi R, Jakubowicz J, Morel JM, Randall G (2010) LSD: A fast line segment detector with a false detection control. IEEE Trans Pattern Anal Mach Intell (TPAMI) 32(4):722–732
Grompone von Gioi R, Jakubowicz J, Morel JM, Randall G (2012) LSD: A line segment detector. Image Processing on Line 2:35–55
He Z, You X, Yuan Y (2009) Texture image retrieval based on non-tensor product wavelet filter banks. Signal Process 89(8):1501–1510
Hensler J, Blaich M, Bittel O (2009) Real-time door detection based on AdaBoost learning algorithm. In: Proceedings of the International Conference on Research and Education in Robotics, La Ferte-Bernard, pp. 61–73
Hoiem D, Efros AA, Hebert M (2007) Recovering surface layout from an image. Int J Comput Vis (IJCV) 75(1):151–172
Jain AK, Farrokhnia F (1991) Unsupervised texture segmentation using Gabor filters. Pattern Recogn 24(12):1167–1186
Kang SJ, Trinh HH, Kim DN, Jo KH (2010) Entrance detection of buildings using multiple cues. In: Proceedings of the International Conference on Intelligent Information and Database Systems, Hue, pp. 251–260
Korc F, Forstner W (2009) eTRIMS image database for interpreting images of man-made scenes. Technical Report, University of Bonn
Leung T, Malik J (2001) Representing and recognizing the visual appearance of materials using three-dimensional textons. Int J Comput Vis (IJCV) 43(1):29–44
Liu J, Korah T, Hedau V, Parameswaran V, Grzeszczuk R, Liu Y (2014) Entrance detection from street-view images. In: IEEE International Conference on Computer Vision and Pattern Recognition Workshop (CVPR), Columbus
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis (IJCV) 60(2):91–110
Mathias M, Martinovic A, Van Gool L (2016) ATLAS: A three-layered approach to facade parsing. Int J Comput Vis (IJCV) 118(1):22–48
Murillo AC, Kosecka J, Guerrero JJ, Sagues C (2008) Visual door detection integrating appearance and shape cues. Robot Auton Syst 56(6):512–521
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: Towards real-time object detection with region proposal networks. In: Proceedings of the International Conference on Neural Information Processing Systems (NIPS), pp. 91–99
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, Fei-Fei L (2015) ImageNet large scale visual recognition challenge. Int J Comput Vis (IJCV) 115(3):211–252
Sekkal R, Pasteau F, Babel M, Brun B, Leplumey I (2013) Simple monocular door detection and tracking. In: Proceedings of the IEEE International Conference on Image Processing, Melbourne, pp. 3929–3933
Shuai B, Zuo Z, Wang G, Wang B (2016) Scene parsing with integration of parametric and non-parametric models. IEEE Trans Image Process 25(5):2379–2391
Szeliski R (2011) Computer vision: Algorithms and applications. Springer, London
Teboul O (2010) Ecole Centrale Paris Facades Database. Available: http://vision.mas.ecp.fr/Personnel/teboul/data.php
Teboul O, Kokkinos I, Simon L, Koutsourakis P, Paragios N (2013) Parsing facades with shape grammars and reinforcement learning. IEEE Trans Pattern Anal Mach Intell (TPAMI) 35(7):1744–1756
Tighe J, Niethammer M, Lazebnik S (2015) Scene parsing with object instance inference using regions and per-exemplar detectors. Int J Comput Vis (IJCV) 112(2):150–171
Yang MY, Förstner W (2011) Regionwise classification of building facade images. In: Proceedings of the ISPRS Conference on Photogrammetric Image Analysis, Munich, pp. 209–220
Zhang D, You X, Wang P, Yanushkevich SN, Tang YY (2009) Facial biometrics using non-tensor product wavelet and 2D discriminant techniques. Int J Pattern Recognit Artif Intell 23(3):1–21
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Talebi, M., Vafaei, A. & Monadjemi, A. Vision-based entrance detection in outdoor scenes. Multimed Tools Appl 77, 26219–26238 (2018). https://doi.org/10.1007/s11042-018-5846-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-5846-3