Abstract
With the advance of the internet of things and building management system (BMS) in modern buildings, there is an opportunity of using the data to extend the use of building energy modeling (BEM) beyond the design phase. Potential applications include retrofit analysis, measurement and verification, and operations and controls. However, while BMS is collecting a vast amount of operation data, different suppliers and sensor installers typically apply their own customized or even random non-uniform rules to define the metadata, i.e., the point tags. This results in a need to interpret and manually map any BMS data before using it for energy analysis. The mapping process is labor-intensive, error-prone, and requires comprehensive prior knowledge. Additionally, BMS metadata typically has considerable variety and limited context information, limiting the applicability of existing interpreting methods. In this paper, we proposed a text mining framework to facilitate interpreting and mapping BMS points to EnergyPlus variables. The framework is based on unsupervised density-based clustering (DBSCAN) and a novel fuzzy string matching algorithm “X-gram”. Therefore, it is generalizable among different buildings and naming conventions. We compare the proposed framework against commonly used baselines that include morphological analysis and widely used text mining techniques. Using two building cases from Singapore and two from the United States, we demonstrated that the framework outperformed baseline methods by 25.5%, with the measurement extraction F-measure of 87.2% and an average mapping accuracy of 91.4%.
Similar content being viewed by others
References
Ahn KU, Kim YJ, Park CS, Kim I, Lee K (2014). BIM interface for full vs. semi-automated building energy simulation. Energy and Buildings, 68: 671–678.
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research, 25: 3389–3402.
Augenbroe G (2004). Trends in building simulation. In: Malkawi A, Augenbroe G (eds), Advanced Building Simulation. London: Routledge. pp. 18–38.
Balaji B, Bhattacharya A, Fierro G, Gao J, Gluck J, et al. (2018). Brick: Metadata schema for portable smart building applications. Applied Energy, 226: 1273–1292.
Bhattacharya AA, Hong D, Culler D, Ortiz J, Whitehouse K, Wu E (2015a). Automated metadata construction to support portable building applications. In: Proceedings of the 2nd ACM International Conference on Embedded Systems for Energy-Efficient Built Environments.
Bhattacharya A, Ploennigs J, Culler D (2015b). Analyzing metadata schemas for buildings: The good, the bad, and the ugly. In: Proceedings of the 2nd ACM International Conference on Embedded Systems for Energy-Efficient Built Environments.
Cheng JC, Deng Y, Anumba C (2015). Mapping BIM schema and 3D GIS schema semi-automatically utilizing linguistic and text mining techniques. Journal of Information Technology in Construction (ITcon), 20: 193–212.
Chong A, Xu W, Chao S, Ngo NT (2019). Continuous-time Bayesian calibration of energy models using BIM and energy data. Energy and Buildings, 194: 177–190.
EnergyPlus (2019). Available at https://energyplus.net/documentation
Ester M, Kriegel HP, Sander J, Xu X (1996). A density-based algorithm for discovering clusters in large spatial databases with noise.
Gaizauskas R, Demetriou G, Artymiuk PJ, Willett P (2003). Protein structures and information extraction from biological texts: the PASTA system. Bioinformatics, 19: 135–143.
Gao H, Koch C, Wu Y (2019). Building information modelling based building energy modelling: A review. Applied Energy, 238: 320–343.
Hakenberg J, Bickel S, Plake C, Brefeld U, Zahn H, Faulstich L, Leser U, Scheffer T (2005). Systematic feature evaluation for gene name recognition. BMC Bioinformatics, 6: S9.
Hong D, Wang H, Ortiz J, Whitehouse K (2015). The building adapter: Towards quickly applying building analytics at scale. In: Proceedings of the 2nd ACM International Conference on Embedded Systems for Energy-Efficient Built Environments.
Kim H, Shen Z, Kim I, Kim K, Stumpf A, Yu J (2016). BIM IFC information mapping to building energy analysis (BEA) model with manually extended material information. Automation in Construction, 68: 183–193.
Kinoshita S, Cohen KB, Ogren PV, Hunter L (2005). BioCreAtIvE Task1A: Entity identification with a stochastic tagger. BMC Bioinformatics, 6: S4.
Koh J, Balaji B, Sengupta D, McAuley J, Gupta R, Agarwal Y (2018). Scrabble: Transferrable semi-automated semantic metadata normalization using intermediate representation. in: Proceedings of the 5th Conference on Systems for Built Environments.
Leser U, Hakenberg J (2005). What makes a gene name? Named entity recognition in the biomedical literature. Briefings in Bioinformatics, 6: 357–369.
Levenshtein V (1965). Leveinshtein Distance.
Pan SJ, Yang Q (2010). A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22: 1345–1359.
Van Rijsbergen CJ (1979). Information Retrieval. London: ButterworthHeinemann.
Welle B, Haymaker J, Rogers Z (2011). ThermalOpt: a methodology for automated BIM-based multidisciplinary thermal simulation for use in optimization environments. Building Simulation, 4: 293–313.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhan, S., Chong, A. & Lasternas, B. Automated recognition and mapping of building management system (BMS) data points for building energy modeling (BEM). Build. Simul. 14, 43–52 (2021). https://doi.org/10.1007/s12273-020-0612-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12273-020-0612-7