Skip to main content

Parallelization Strategies for Hierarchical Density-Based Clustering Algorithm Using OpenMP for Scan-To-BIM Applications

  • Conference paper
  • First Online:
Proceedings of the Canadian Society of Civil Engineering Annual Conference 2021 (CSCE 2021)

Part of the book series: Lecture Notes in Civil Engineering ((LNCE,volume 247))

Included in the following conference series:

  • 694 Accesses

Abstract

Clustering is an unsupervised learning method that provides insights by investigating unknown structures in a dataset without exploiting any ground truth target information. For constructing an as-built Building Information Models (BIM) from captured laser-scanned datasets, the segmentation process precedes modeling, which provides a baseline to be traced for obtaining 3D models from point clouds. For the segmentation process, a clustering algorithm can be effectively applied so that it can group the points having similar features without predefined criteria which, in turn, segments can be easily separated from the entire scene. Amongst various types of clustering algorithms, Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) was developed as a density-based and hierarchical clustering algorithm which provides a simplified tree of significant clusters. This algorithm has several distinct advantages over other clustering algorithms: (1) avoids “flat” (i.e. non-hierarchical) labeling of data objects, (2) automatically simplifies the hierarchy into the most significant clusters, and (3) requires a single input parameter (i.e. minimum number of points) for density threshold. However, this algorithm has an overall computation time complexity represented as a quadratic form (i.e., \({\text{O}}\left( {dn^2 } \right)\)) which suffers from the computational efficiency issue especially for massive amounts of data such as those found in 3D point clouds. To ease the applicability of HDBSCAN to Scan-to-BIM applications, this research aims to parallelize major time-consuming components of HDBSCAN algorithm. OpenMP interface was adopted for thread parallelization and parallel efficiency was measured by calculating speedup and efficiency from strong and weak scaling results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 279.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Agapaki E, Nahangi M (2020) Scene understanding and model generation. In: Infrastructure Computer Vision, pp 65–167. Elsevier

    Google Scholar 

  2. Boruvka O (1926) "O jistém problému minimálním." Práce Mor. Prırodved. Spol. v Brne (Acta Societ. Scienc. Natur. Moravicae) 3(3): 37–58

    Google Scholar 

  3. F Bosché M Ahmed Y Turkan CT Haas R Haas 2015 The value of integrating Scan-to-BIM and Scan-vs-BIM techniques for construction monitoring using laser scanning and BIM: the case of cylindrical MEP components Autom Constr 49 201 213

    Article  Google Scholar 

  4. Campello RJ, Kröger P, Sander J, Zimek A (2020) Density‐based clustering. Wiley Interdisc Rev: Data Min Knowl Discovery 10(2):e1343

    Google Scholar 

  5. Campello RJ, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. In: Proceedings of the 17th Pacific-Asia conference on knowledge discovery and data mining (PAKDD), pp 160–172

    Google Scholar 

  6. RJGB Campello D Moulavi A Zimek J Sander 2015 Hierarchical density estimates for data clustering, visualization, and outlier detection ACM Trans Knowl Discovery Data (TKDD) 10 1 1 51

    Article  Google Scholar 

  7. T Cheng J Teizer 2014 Modeling tower crane operator visibility to minimize the risk of limited situational awareness J Comput Civ Eng 28 3 04014004

    Article  Google Scholar 

  8. T Czerniawski B Sankaran M Nahangi C Haas F Leite 2018 6D DBSCAN-based segmentation of building point clouds for planar object classification Autom Constr 88 44 58

    Article  Google Scholar 

  9. Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. Kdd 96:226-231

    Google Scholar 

  10. V Estivill-Castro 2002 Why so many clustering algorithms: a position paper ACM SIGKDD Explor Newsl 4 1 65 75

    Article  MathSciNet  Google Scholar 

  11. Jung J, Stachniss C, Ju S, Heo J (2018) Automated 3D volumetric reconstruction of multiple-room building interiors for as-built BIM. Adv Eng Inform 38:811–825

    Google Scholar 

  12. JB Kruskal 1956 On the shortest spanning subtree of a graph and the traveling salesman problem Proc Am Math Soc 7 1 48 50

    Article  MathSciNet  Google Scholar 

  13. Ma JW, Czerniawski T, Leite F (2020) Semantic segmentation of point clouds of building interiors with deep learning: augmenting training datasets with synthetic BIM-based point clouds. Autom Constr 113:103144

    Google Scholar 

  14. RC Prim 1957 Shortest connection networks and some generalizations Bell Syst Tech J 36 6 1389 1401

    Article  Google Scholar 

  15. Radenski A (2011) Shared memory, message passing, and hybrid merge sorts for standalone and clustered SMPs. In: 2011 International conference on parallel and distributed processing techniques and applications, vol 11, pp 367–373

    Google Scholar 

  16. Richter R, Döllner J (2010) Out-of-core real-time visualization of massive 3D point clouds. In: Proceedings of the 7th international conference on computer graphics, virtual reality, visualisation and interaction in Africa, pp 121–128

    Google Scholar 

  17. KH Rosen 2017 Handbook of discrete and combinatorial mathematics CRC Press

    Google Scholar 

  18. Shun J, Blelloch GE, Fineman GT, Gibbons PB (2013) Reducing contention through priority updates. In: Proceedings of the twenty-fifth annual ACM symposium on Parallelism in algorithms and architectures, pp 152–163

    Google Scholar 

  19. Q Wang M-K Kim JCP Cheng H Sohn 2016 Automated quality assessment of precast concrete elements with geometry irregularities using terrestrial laser scanning Autom Constr 68 170 182

    Article  Google Scholar 

  20. Zhou W (2017) A practical scalable shared-memory parallel algorithm for computing minimum spanning trees. Master’s thesis, Karlsruher Institut für Technologie (KIT)

    Google Scholar 

Download references

Acknowledgements

This research was supported, in part, by the National Science Foundation (NSF) under award number 1562438. Their support is gratefully acknowledged. Any opinions, findings and conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation. Mention of trade names in this article does not imply endorsement by the University of Texas at Austin or NSF.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to J. Ma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Canadian Society for Civil Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ma, J., Leite, F. (2023). Parallelization Strategies for Hierarchical Density-Based Clustering Algorithm Using OpenMP for Scan-To-BIM Applications. In: Walbridge, S., et al. Proceedings of the Canadian Society of Civil Engineering Annual Conference 2021. CSCE 2021. Lecture Notes in Civil Engineering, vol 247. Springer, Singapore. https://doi.org/10.1007/978-981-19-0968-9_43

Download citation

  • DOI: https://doi.org/10.1007/978-981-19-0968-9_43

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-0967-2

  • Online ISBN: 978-981-19-0968-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics