Automatic Semantic and Geometric Enrichment of CityGML Building Models Using HOG-Based Template Matching
- 1.1k Downloads
Semantically rich 3D building models give the potential for a wealth of rich geo-spatially-enabled applications such as cultural heritage augmented reality, urban planning, radio network planning and personal navigation. However, the majority of existing building models lack much if any semantic detail. This work demonstrates a novel method for automatically locating subclasses of windows and doors, using computer vision techniques including the histogram of oriented gradient (HOG) template matching, and automatically creating enriched CityGML content for the matched windows and doors. Good results were achieved for class identification with potential for further refinement of subclasses of windows and doors and other architectural features. It is part of a wider project to bring even richer semantic content to 3D geo-spatial building models.
KeywordsSemantic Geometric CityGML HOG Template matching
Funded by an EPSRC Industrial CASE studentship with Ordnance Survey, GB; special thanks go to Isabel Sargent and David Holland from Ordnance Survey. Aside from templates 13 and 14 (see Fig. 4) all data used in this work are already publicly available at the locations referenced in the text.
- Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. Paper presented at the 17th IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, USA 20–26 June 2005.Google Scholar
- Dawe, S. (2013). King of the castles: Britain’s built heritage rules Huffington post. http://www.huffingtonpost.co.uk. Retrieved July 11, 2014.
- de Fornel, P., & Sizun, H. (2006). Radio wave propagation for telecommunication applications. Berlin: Springer.Google Scholar
- Debevec, P. E., Taylor, C. J., & Malik, J. (1996). Modeling and rendering architecture from photographs: A hybrid geometry-and-image-based approach. Paper presented at the 23rd International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), New Orleans, USA, 04–09 August 1996.Google Scholar
- Döllner, J., & Hagedorn, B. (2007). Integrating urban GIS, CAD, and BIM data by service based virtual 3D city models. Urban and regional data management—annual. Leiden: Taylor & Francis.Google Scholar
- Dore, C., & Murphy, M. (2014). Semi-automatic techniques for generating BIM façade models of historic buildings. Journal of Information Technology in Construction, 19(2), 20–46.Google Scholar
- Gröger, G., Kolbe, T., Nagel, C., & Häfele, K. (2012). OGC city geography markup language (CityGML) en-coding standard. Open Geospatial Consortium.Google Scholar
- Ham, Y., & Golparvar-Fard, M. (2015). Mapping actual thermal properties to building elements in gbXML-based BIM for reliable building energy performance modeling. Automation in Construction, 49(Part B), 214–224.Google Scholar
- Hampson, K., Kraatz, J. A., & Sanchez, A. X. (2014). The global construction industry and R&D. R&D Investment and Impact in the Global Construction Industry. Abingdon: Taylor & Francis.Google Scholar
- Isikdag, U., & Zlatanova, S. (2009). Towards defining a framework for automatic generation of buildings in CityGML using building information models. Lecture Notes in Geoinformation and Cartography—3D Geo-Information Sciences. Berlin: Springer.Google Scholar
- Johansson, B., & Kahl, F. (2002). Detecting windows in city scenes. Lecture Notes in Computer Science—Pattern Recognition with Support Vector Machines. Berlin: Springer.Google Scholar
- Jones, C. B., Rosin, P. L., & Slade, J. (2014). Semantic and geometric enrichment of 3D geo-spatial models with captioned photos and labelled illustrations. Paper presented at the 25th International Conference on Computational Linguistics (COLING)—3rd Workshop on Vision and Language (VL), Dublin, Ireland, 23 August 2014.Google Scholar
- Kolbe, T. H. (2009). Representing and Exchanging 3D City Models with CityGML. Lecture Notes in Geoinformation and Cartography—3D Geo-Information Sciences. Berlin: Springer.Google Scholar
- Koziński, M., & Marlet, R. (2014). Image parsing with graph grammars and Markov Random Fields applied to facade analysis. Paper presented at the 14th IEEE Winter Conference on Applications of Computer Vision (WACV), Steamboat Springs, USA, 24–26 March 2014.Google Scholar
- Kroon, D.-J. (2011). Fast/robust template matching. MathWorks Inc. http://uk.mathworks.com. Retrieved September 01, 2014.
- Mayer, H., & Reznik, S. (2005). Building facade interpretation from image sequences. Paper presented at the ISPRS Workshop on Object Extraction for 3D City Models, Road Databases, and Traffic Monitoring—Concepts, Algorithms, and Evaluation (CMRT)—WG III/4–5 IV/3, Vienna, Austria, 29–30 August 2005.Google Scholar
- Meixner, P., Leberl, F., & Brédif, M. (2011). Interpretation of 2D and 3D building details on facades and roofs. Paper presented at the 3rd Conference on Photogrammetric Image Analysis (PIA)—ISPRS Technical Commission III Symposium, München, Germany, 5–7 October 2011.Google Scholar
- Ok, D., Kozinski, M., Marlet, R., & Paragios, N. (2012). High-level bottom-up cues for top-down parsing of facade images. Paper presented at the 2nd Joint 3DIM/3DPVT International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT), Zürich, Switzerland, 13–15 October 2012.Google Scholar
- Pevsner, N., Harris, J., & Antram, N. (1989). Lincolnshire. London: Yale University Press.Google Scholar
- Reznik, S., & Mayer, H. (2007). Implicit shape models, model selection, and plane sweeping for 3D facade interpretation. Paper presented at the 2nd Conference on Photogrammetric Image Analysis (PIA)—ISPRS Technical Commission III Symposium, München, Germany, 19–21 September 2007.Google Scholar
- Ross, L., Bolling, J., Döllner, J., & Kleinschmit, B. (2009). Enhancing 3D city models with heterogeneous spatial information: Towards 3D land information systems. Lecture Notes in Geoinformation and Cartography—Advances in GIScience—12th AGILE Conference. Berlin: Springer.Google Scholar
- Sivic, J., & Efros, A. A. (2014). Urban-scale quantitative visual analysis. ERCIM News—Special Theme: Smart Cities, 98, 43–44.Google Scholar
- Sonka, M., Hlaváč, V., & Boyle, R. (2014). Image processing, analysis, and machine vision (4th ed.). Boston: Cengage Learning.Google Scholar
- Stadler, A., & Kolbe, T. H. (2007). Spatio-semantic coherence in the integration of 3D city models. Paper presented at the 5th International Symposium on Spatial Data Quality (ISSDQ), Enschede, The Netherlands, 13–15 June 2007.Google Scholar
- Whiteside, A. (2009). Definition identifier URNs in OGC namespace. OpenGIS Best Practice document.Google Scholar
- Xiao, J. (2013). HOG-based template matching. http://vision.princeton.edu/code.html#templateMatching. Retrieved January 10, 2015.
- Zhang, Y., Xiao, J., Hays, J., & Tan, P. (2013). FrameBreak: Dramatic image extrapolation by guided shift-maps. Paper presented at the 25th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Portland, USA, 23–28 June 2013.Google Scholar