Entwicklung eines Human-in-the-Loop-Systems zur Objekterkennung in Grundrissen

Jakubik, Johannes; Hemmer, Patrick; Vössing, Michael; Blumenstiel, Benedikt; Bartos, Andrea; Mohr, Kamilla

doi:10.1007/978-3-658-42796-2_7

Johannes Jakubik⁵,
Patrick Hemmer⁵,
Michael Vössing⁵,
Benedikt Blumenstiel⁵,
Andrea Bartos⁵ &
…
Kamilla Mohr⁵

704 Accesses

Zusammenfassung

In den vergangenen Jahren haben Unternehmen der Baubranche damit begonnen künstliche Intelligenz (KI) einzusetzen, um zeitaufwändige und repetitive Prozesse zu beschleunigen. Ein Anwendungsfall, der von der Einführung von KI profitieren kann, ist die Ermittlung von Mengen in Grundrissen. Diese Informationen werden für verschiedene Planungs- und Konstruktionsschritte benötigt, ihre Erhebung erfordert allerdings einen erheblichen manuellen Aufwand. Aktuell kann die Mengenermittlung nur mithilfe von digitalen Grundrissen automatisiert werden. Für Bestandsbauten liegen allerdings keine digitalen Grundrisse vor und für Neubauten können die digitalen Formate aufgrund fehlender Standardisierung häufig nicht verarbeitet werden. In diesem Beitrag schlagen wir ein KI-basiertes System zur Erkennung und Klassifizierung von Symbolen in Grundrissen vor, wobei ein Domänenexperte KI Entscheidungen korrigieren kann (sog. Human-in-the-Loop-System). Das entwickelte KI-Modell berechnet ein Unsicherheitsmaß für die Klassifizierung von jedem erkannten Symbol. Symbole mit hohem Unsicherheitswert werden dann an Domänenexperten geleitet und von diesen klassifiziert. Wir evaluieren den vorgeschlagenen Ansatz mithilfe eines realen Datensatzes, der von einem Industriepartner zur Verfügung gestellt wurde. In einer Reihe von Experimenten stellen wir fest, dass das selektive Hinzuziehen von menschlichem Expertenwissen die Leistung des Systems um bis zu 12,9 % steigern kann, was zu einer Systemgenauigkeit von durchschnittlich 92,1 % führt (Dieser Beitrag basiert auf der Veröffentlichung von Jakubik et al. (2022b)).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Literatur

Amershi, S.; Cakmak, M.; Knox, W. B.; und Kulesza, T. 2014. Power to the People: The Role of Humans in Interactive Machine Learning. AI Magazine, 35(4): 105–120.
Article Google Scholar
Blanco, J. L.; Fuchs, S.; Parsons, M.; und Ribeirinho, M. J. 2018. Artificial Intelligence: Construction Technology’s next Frontier. Building Economist, 1: 7–13.
Google Scholar
Budd, S.; Robinson, E. C.; und Kainz, B. 2021. A Survey on Active Learning and Human-in-the-Loop Deep Learning for Medical Image Analysis. Medical Image Analysis, 102062.
Google Scholar
Dellermann, D.; Ebel, P.; Sollner, M.; und Leimeister, J. M.¨ 2019. Hybrid Intelligence. Business & Information Systems Engineering, 61(5): 637–643.
Article Google Scholar
Dong, S.; Wang, W.; Li, W.; und Zou, K. 2021. Vectorization of Floor Plans Based on EdgeGAN. Information, 12(5): 206.
Article Google Scholar
Dwibedi, D.; Misra, I.; und Hebert, M. 2017. Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection. In International Conference on Computer Vision, 1310–1319.
Google Scholar
Everingham, M.; Eslami, S. M. A.; Van Gool, L.; Williams, C. K. I.; und Zisserman, A. 2015. The Pascal Visual Object Classes Challenge: A Retrospective. International Journal of Computer Vision, 111(111): 98–136.
Article Google Scholar
Everingham, M.; und Winn, J. 2011. The Pascal Visual Object Classes Challenge 2012 (VOC2012) Development Kit. Pattern Analysis, Statistical Modelling and Computational Learning, Tech. Rep, 8: 5.
Google Scholar
Fan, Z.; Zhu, L.; Li, H.; Chen, X.; Zhu, S.; und Tan, P. 2021. FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol Spotting. arXiv preprint arXiv:2105.07147.
Gal, Y.; und Ghahramani, Z. 2016. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. In International Conference on Machine Learning, 1050–1059.
Google Scholar
Gal, Y.; Islam, R.; und Ghahramani, Z. 2017. Deep Bayesian Active Learning with Image Data. In International Conference on Machine Learning, 1183–1192.
Google Scholar
Goyal, S.; Chattopadhyay, C.; und Bhatnagar, G. 2021. Knowledge-driven Description Synthesis for Floor Plan Interpretation. International Journal on Document Analysis and Recognition, 1–14.
Google Scholar
Goyal, S.; Mistry, V.; Chattopadhyay, C.; und Bhatnagar, G. 2019. BRIDGE: Building Plan Repository for Image Description Generation, and Evaluation. In International Conference on Document Analysis and Recognition, 1071–1076.
Google Scholar
Grønsund, T.; und Aanestad, M. 2020. Augmenting the Algorithm: Emerging Human-in-the-Loop Work Configurations. The Journal of Strategic Information Systems, 29(2).
Google Scholar
Hemmer, P.; Kühl, N.; und Schöffer, J. 2022. Utilizing Active Machine Learning for Quality Assurance: A Case Study of Virtual Car Renderings in the Automotive Industry. In Proceedings of the 55th Hawaii International Conference on System Sciences.
Google Scholar
Hinterstoisser, S.; Lepetit, V.; Wohlhart, P.; und Konolige, K. 2018. On Pre-Trained Image Features and Synthetic Images for Deep Learning. In European Conference on Computer Vision..
Google Scholar
Holzinger, A. 2016.Interactive Machine Learning for Health Informatics: When do we Need the Human-in-theLoop? Brain Informatics, 3(2): 119–131.
Article Google Scholar
Houlsby, N.; Huszar, F.; Ghahramani, Z.; und Lengyel, M. 2011. Bayesian Active Learning for Classification and Preference Learning. arXiv preprint arXiv:1112.5745.
Jakubik, J.; Blumenstiel, B.; Vossing, M.; und Hemmer, P. 2022a. Instance Selection Mechanisms for Human-in-the-Loop Systems in Few-Shot Learning. In International Conference on Wirtschaftsinformatik.
Google Scholar
Jakubik, J.; Hemmer, P.; Vössing, M.; Blumenstiel, B.; Bartos, A.; und Mohr, K. 2022b. Designing a Human-in-the-Loop System for Object Detection in Floor Plans. In Innovative Applications of Artificial Intelligence.
Google Scholar
Karim, M. M.; Qin, R.; Chen, G.; und Yin, Z. 2021. A Semi Supervised Self-Training Method to Develop Assistive Intelligence for Segmenting Multiclass Bridge Elements from Inspection Videos. Structural Health Monitoring.
Google Scholar
Kuncheva, L. I.; und Whitaker, C. J. 2003. Measures of Diversity in Classifier Ensembles and their Relationship with the Ensemble Accuracy. Machine Learning, 51(2): 181–207.
Article Google Scholar
Lakshminarayanan, B.; Pritzel, A.; und Blundell, C. 2017. Simple and Scalable Predictive Uncertainty Estimation Using Deep Ensembles. In Neural Information Processing Systems, 6405–6416.
Google Scholar
Liu, C.; Loy, C. C.; Gong, S.; und Wang, G. 2013. POP: Person Re-identification Post-rank Optimisation. In International Conference on Computer Vision, 441–448.
Google Scholar
Nikolenko, S. 2021. Synthetic Data for Basic Computer Vision Problems, 161–194. Springer.
Google Scholar
Ren, S.; He, K.; Girshick, R.; und Sun, J. 2017.Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Transactions on Pattern Analysis and Machine Intelligence, 39(6): 1137–1149.
Article Google Scholar
Rezvanifar, A.; Cote, M.; und Albu, A. B. 2020. Symbol Spotting on Digital Architectural Floor Plans Using a Deep Learning-based Framework. In Computer Vision and Pattern Recognition Workshops, 568–569.
Google Scholar
Smith, R. 2007. An Overview of the Tesseract OCR Engine. In International Conference on Document Analysis and Recognition, volume 2, 629–633.
Google Scholar
Wang, H.; Gong, S.; Zhu, X.; und Xiang, T. 2016. Humanin-the-Loop Person Re-identification. In European Conference on Computer Vision, 405–422.
Google Scholar
Zhu, R.; Shen, J.; Deng, X.; Wallden, M.; und Ino, F. 2020. Training Strategies for CNN-based Models to Parse Complex Floor Plans. In International Conference on Software and Computer Applications, 11–16.
Google Scholar
Ziran, Z.; und Marinai, S. 2018. Object Detection in Floor Plan Images. In IAPR Workshop on Artificial Neural Networks in Pattern Recognition, 383–394.
Google Scholar

Download references

Danksagung

Wir bedanken uns bei digitales bauen Teil von DREES & SOMMER für die Ermöglichung und Unterstützung dieser Forschung. Die Autoren bedanken sich für die finanzielle Unterstützung durch das Bundesministerium für Wirtschaft und Klimaschutz im Rahmen des Projektes „Smart Design and Construction“ (Projektnummer 01MK20016F).

Author information

Authors and Affiliations

Karlsruhe Digital Service Research and Innovation Hub (KSRI), Karlsruher Institut für Technologie (KIT), Karlsruhe, Deutschland
Johannes Jakubik, Patrick Hemmer, Michael Vössing, Benedikt Blumenstiel, Andrea Bartos & Kamilla Mohr

Authors

Johannes Jakubik
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Hemmer
View author publications
You can also search for this author in PubMed Google Scholar
Michael Vössing
View author publications
You can also search for this author in PubMed Google Scholar
Benedikt Blumenstiel
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Bartos
View author publications
You can also search for this author in PubMed Google Scholar
Kamilla Mohr
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Johannes Jakubik .

Editor information

Editors and Affiliations

TMB, Karlsruher Inst. für Technologie (KIT), Karlsruhe, Baden-Württemberg, Deutschland
Shervin Haghsheno
IISM, Karlsruhe Institute of Technology, Karlsruhe, Baden-Württemberg, Deutschland
Gerhard Satzger
TMB, Karlsruhe Institute of Technology, Karlsruhe, Deutschland
Svenja Lauble
IISM, Karlsruher Institut für Technologie, Karlsruhe, Baden-Württemberg, Deutschland
Michael Vössing

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Jakubik, J., Hemmer, P., Vössing, M., Blumenstiel, B., Bartos, A., Mohr, K. (2024). Entwicklung eines Human-in-the-Loop-Systems zur Objekterkennung in Grundrissen. In: Haghsheno, S., Satzger, G., Lauble, S., Vössing, M. (eds) Künstliche Intelligenz im Bauwesen . Springer Vieweg, Wiesbaden. https://doi.org/10.1007/978-3-658-42796-2_7

Download citation

DOI: https://doi.org/10.1007/978-3-658-42796-2_7
Published: 10 May 2024
Publisher Name: Springer Vieweg, Wiesbaden
Print ISBN: 978-3-658-42795-5
Online ISBN: 978-3-658-42796-2
eBook Packages: Computer Science and Engineering (German Language)

Publish with us

Policies and ethics