Research and Applications of Data Mining Techniques for Improving Building Operational Performance

  • Cheng Fan
  • Fu XiaoEmail author
  • Chengchu Yan
Building Sustainability (N Nord, Section Editor)
Part of the following topical collections:
  1. Topical Collection on Building Sustainability


Purpose of Review

This paper reviews the data mining (DM)-related research and applications at the building operation stage. It aims to summarize DM-based solutions for building energy management and reveal current research and development outcomes in analyzing massive building operational data using advanced DM techniques.

Recent Findings

Previous studies mainly adopt DM techniques for two tasks, i.e., (1) predictive modeling; (2) fault detection and diagnosis. The knowledge discovered has been successfully utilized to facilitate the decision-making during building operations. Domain expertise play the dominant role in the knowledge discovery process, which limits the chance of discovering novel knowledge.


DM is a promising technology for the development of intelligent and automated building management systems. Despite encouraging results, more research efforts should be made in (1) exploring the usefulness of unsupervised DM, (2) developing generic analytic frameworks, and (3) analyzing unstructured and multi-relational data sets.


Big data Data mining Knowledge discovery Building operational performance Building energy management Intelligent building 



The authors gratefully acknowledge the support of this research by the Research Grant Council of the Hong Kong SAR (152181/14E) and the Natural Science Foundation of SZU (Grant No. 2017061).

Compliance with Ethical Standards

Conflict of Interest

The authors declare that they have no conflict of interest.

Human and Animal Rights and Informed Consent

This article does not contain any studies with human or animal subjects performed by any of the authors.


Papers of particular interest, published recently, have been highlighted as: • Of importance •• Of major importance

  1. 1.
    Ramesh T, Prakash R, Shukla KK. Life cycle energy analysis of buildings: an overview. Energy Build. 2010;42:1592–600.CrossRefGoogle Scholar
  2. 2.
    Waide P, Ure J, Karagianni N, Smith G, Bordass B. The scope for energy and CO2 savings in the EU through the use of building automation technology. Final Report for the European Copper Institute, August 2013.Google Scholar
  3. 3.
    Gantz J, Reinsel D. The digital universe in 2020: big data, bigger digital shadows, and biggest growth in the far east. International Data Corporation, IDC iView: IDC Analyze the Future, 2012.Google Scholar
  4. 4.
    Han JW, Kamber M. Data mining: concepts and techniques. The Morgan Kaufmann Series in Data Management Systems; 2011.Google Scholar
  5. 5.
    Mikut R, Reischl M. Data mining tools. Data Min Knowl Discov. 2011;5:431–43.CrossRefGoogle Scholar
  6. 6.
    Saxena A, Prasad M, Gupta A, et al. A review of clustering techniques and development. Neurocomputing. 2017;267:664–81.CrossRefGoogle Scholar
  7. 7.
    •• Fan C, Xiao F, Li ZD, Wang JY. Unsupervised data analytics in mining big building operational data for energy efficiency enhancement: a review. Energy Build. 2018;159:296–308. The paper provides a comprehensive review on the use of unsupervised data analytics in analyzing big building operational data. CrossRefGoogle Scholar
  8. 8.
    Dalene F. Technology and information management for low-carbon building. J Renew Sustain Ener. 2012;4:041402. Scholar
  9. 9.
    • Wei YX, Zhang XX, Shi Y, Xia L, et al. A review of data-driven approaches for prediction and classification of building energy consumption. Renew Sust Energ Rev. 2018;82:1027–47. The paper serves as an updated review on the status-quo of data-driven techniques for building energy consumption. CrossRefGoogle Scholar
  10. 10.
    Ding Y, Zhang Q, Yuan TH. Research on short-term and ultra-short-term cooling load prediction models for office buildings. Energy Build. 2017;154:254–67.CrossRefGoogle Scholar
  11. 11.
    Ahmad AS, Hassan MY, Abdullah MP, Rahman HA, Hussin F, Abdullah H, et al. A review on applications of ANN and SVM for building electrical energy consumption forecasting. Renew Sust Energ Rev. 2014;33:102–9.CrossRefGoogle Scholar
  12. 12.
    •• Amasyali K. El-Gohary NM. A review of data-driven building energy consumption prediction studies. Renew Sust Energ Rev. 2018;81:1192–205. The paper provides a review on various prediction methods in analyzing building energy consumption data. CrossRefGoogle Scholar
  13. 13.
    Kim G, Schaefer L, Lim TS, Kim JT. Thermal comfort prediction of an underfloor air distribution system in a large indoor environment. Energy Build. 2013;64:323–31.CrossRefGoogle Scholar
  14. 14.
    Ahmed A, Korres NE, Ploennigs J, Elhadi H, Menzel K. Mining building performance data for energy-efficient operation. Adv Eng Inform. 2011;25:341–54.CrossRefGoogle Scholar
  15. 15.
    Kucuksille EU, Selbas R, Sencan A. Prediction of thermodynamic properties of refrigerants using data mining. Energy Convers Manag. 2011;52:836–48.CrossRefGoogle Scholar
  16. 16.
    Chou JS, Hsu YC, Lin LT. Smart meter monitoring and data mining techniques for predicting refrigeration system performance. Expert Syst Appl. 2014;41:2144–56.CrossRefGoogle Scholar
  17. 17.
    • Dong B, Cao C, Lee SE. Applying support vector machines to predict building energy consumption in tropical region. Energy Build. 2005;37:545–53. It is the first attempt in utilizing support vector machine to predict building energy consumption. CrossRefGoogle Scholar
  18. 18.
    Robinson C, Dilkina B, Hubbs J, et al. Machine learning approaches for estimating commercial building energy consumption. Appl Enregy. 2017;208:889–904.CrossRefGoogle Scholar
  19. 19.
    Rafe Biswas MA, Robinson MD, Fumo N. Prediction of residential building energy consumption: a neural network approach. Energy. 2016;117:84–92.CrossRefGoogle Scholar
  20. 20.
    Deb C, Eang LS, Yang JJ, Santamouris M. Forecasting diurnal cooling energy load for institutional buildings using artificial neural networks. Energy Build. 2016;121:284–97.CrossRefGoogle Scholar
  21. 21.
    Yu Z, Haghighat F, Fung CM, Yoshino H. A decision tree method for building energy demand modeling. Energy Build. 2010;42:1637–46.CrossRefGoogle Scholar
  22. 22.
    Fan C, Xiao F, Wang SW. Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques. Appl Energy. 2014;127:1–10.CrossRefGoogle Scholar
  23. 23.
    Jetcheva JG, Majidpour M, Chen WP. Neural network model ensembles for building-level electricity load forecasts. Energy Build. 2014;84:214–23.CrossRefGoogle Scholar
  24. 24.
    Chen YB, Tan HW. Short-term prediction of electric demand in building sector via hybrid support vector regression. Appl Energy. 2017;204:1363–74.CrossRefGoogle Scholar
  25. 25.
    •• Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3:1157–82. The paper provides a detailed description on different variable selection methods. zbMATHGoogle Scholar
  26. 26.
    Zhao HX, Magoules F. Feature selection for predicting building energy consumption based on statistical learning method. J Algorithm Comput Technol. 2012;6:59–77.CrossRefGoogle Scholar
  27. 27.
    Saeys Y, Inza I, Larranaga P. A review of feature selection techniques in bioinformatics. Bioinform. 2007;23:2507–17.CrossRefGoogle Scholar
  28. 28.
    Kapetanakis DS, Mangina E, Finn DP. Input variable selection for thermal load predictive models of commercial buildings. Energy Build. 2017;137:13–26.CrossRefGoogle Scholar
  29. 29.
    Antonucci D, Oberegger UF, Pasut W, Gasparella A. Building performance evaluation through a novel feature selection algorithm for automated arx model identification procedures. Energy Build. 2017;150:432–46.CrossRefGoogle Scholar
  30. 30.
    Cui C, Wu T, Hu MQ, Wier JD, Li XW. Short-term building energy model recommendation system: a meta-learning approach. Appl Energy. 2016;172:251–63.CrossRefGoogle Scholar
  31. 31.
    Matijas M, Suykens JAK, Krajcar S. Load forecasting using multivariate meta-learning system. Expert Syst Appl. 2013;40:4427–37.CrossRefGoogle Scholar
  32. 32.
    • Fan C, Xiao F, Zhao Y. A short-term building cooling load prediction method using deep learning algorithms. Appl Energy. 2017;195:222–33. The paper investigates the performance of deep learning in predicting building cooling load. It validates the power of unsupervised deep learning in deriving useful high-level input variables. CrossRefGoogle Scholar
  33. 33.
    Ren XX, Yan D, Hong TZ. Data mining of space heating system performance in affordable housing. Build Environ. 2015;89:1–13.CrossRefGoogle Scholar
  34. 34.
    Tang F, Kusiak A, Wei XP. Modeling and short-term prediction of HVAC system with a clustering algorithm. Energy Build. 2014;82:310–21.CrossRefGoogle Scholar
  35. 35.
    Jota PRS, Silva VRB, Jota FG. Building load management using cluster and statistical analyses. Int J Electr Power. 2011;33:1498–505.CrossRefGoogle Scholar
  36. 36.
    Magoules F, Zhao HX, Elizondo D. Development of an RDP neural network for building energy consumption fault detection and diagnosis. Energy Build. 2013;62:133–8.CrossRefGoogle Scholar
  37. 37.
    Zhao Y, Wang SW, Xiao F. A statistical fault detection and diagnosis method for centrifugal chillers based on exponentially-weighted moving average control charts and support vector regression. Appl Therm Eng. 2013;51:560–72.CrossRefGoogle Scholar
  38. 38.
    Wang SW, Xiao F. AHU sensor fault diagnosis using principal component analysis. Energy Build. 2004;36:147–60.CrossRefGoogle Scholar
  39. 39.
    Li D, Hu GQ, Spanos CJ. A data-driven strategy for detection and diagnosis of building chiller faults using linear discriminant analysis. Energy Build. 2016;128:519–29.CrossRefGoogle Scholar
  40. 40.
    Chang HH. Non-intrusive fault identification of power distribution systems in intelligent buildings based on power-spectrum-based wavelet transform. Energy Build. 2016;127:930–41.CrossRefGoogle Scholar
  41. 41.
    Capozzoli A, Lauro F, Khan I. Fault detection analysis using data mining techniques for a cluster of smart office buildings. Expert Syst Appl. 2015;42:4324–38.CrossRefGoogle Scholar
  42. 42.
    Yan K, Shen W, Mulumba T, Afshari A. ARX model based fault detection and diagnosis for chillers using support vector machines. Energy Build. 2014;81:287–95.CrossRefGoogle Scholar
  43. 43.
    Hu YP, Chen HX, Xie JL, Yang XS, Zhou C. Chiller sensor fault detection using a self-adaptive principal component analysis method. Energy Build. 2012;54:252–8.CrossRefGoogle Scholar
  44. 44.
    Wen J, Li S. Application of pattern matching method for detecting faults in air handling unit system. Autom Constr. 2014;43:49–58.CrossRefGoogle Scholar
  45. 45.
    Zhao Y, Xiao F, Wang SW. An intelligent chiller fault detection and diagnosis methodology using Bayesian belief network. Energy Build. 2013;57:278–88.CrossRefGoogle Scholar
  46. 46.
    Xiao F, Zhao Y, Wen J, Wang SW. Bayesian network based FDD strategy for variable air volume terminals. Autom Constr. 2014;41:106–18.CrossRefGoogle Scholar
  47. 47.
    • Seem JE. Using intelligent data analysis to detect abnormal energy consumption in buildings. Energy Build. 2007;39:52–8. The paper firstly investigates the potential of generalized extreme studentized deviate in finding anomalies in building energy data. CrossRefGoogle Scholar
  48. 48.
    Yu Z, Haghighat F, Fung CM, Zhou L. A novel methodology for knowledge discovery through mining associations between building operational data. Energy Build. 2012;47:430–40.CrossRefGoogle Scholar
  49. 49.
    Cabrera DFM, Zareipour H. Data association mining for identifying lighting energy waste patterns in educational institutes. Energy Build. 2013;62:210–6.CrossRefGoogle Scholar
  50. 50.
    Fan C, Xiao F, Yan CC. A framework for knowledge discovery in massive building automation data and its application in building diagnostics. Automat Constr. 2015;50:81–90.CrossRefGoogle Scholar
  51. 51.
    Fan C, Xiao F, Madsen H, Wang D. Temporal knowledge discovery in big BAS data for building energy management. Energy Build. 2015;109:75–89.CrossRefGoogle Scholar
  52. 52.
    Du ZM, Fan B, Jin XQ, Chi JL. Fault detection and diagnosis for buildings and HVAC systems using combined neural networks and subtractive clustering analysis. Build Environ. 2014;73:1–11.CrossRefGoogle Scholar
  53. 53.
    Ma ZJ, Yan R, Nord N. A variation focused cluster analysis strategy to identify typical daily heating load profiles of higher educational buildings. Energy. 2017;134:90–102.CrossRefGoogle Scholar
  54. 54.
    Capozzoli A, Lauro F, Khan I. Fault detection analysis using data mining techniques for a cluster of smart office buildings. Expert Syst Appl. 2015;42:4324–38.CrossRefGoogle Scholar
  55. 55.
    Miller C, Nagy Z, Schlueter A. Automated daily pattern filtering of measured building performance data. Autom Constr. 2015;49:1–17.CrossRefGoogle Scholar
  56. 56.
    Xue PN, Zhou ZG, Fang XM, Chen X, Liu L, Liu YW, et al. Fault detection and operation optimization in district heating substations based on data mining techniques. Appl Energy. 2017;205:926–40.CrossRefGoogle Scholar
  57. 57.
    • Fan C, Xiao F, Zhao Y. Wang JY. Analytical investigation of autoencoder-based methods for unsupervised anomaly detection in building energy data. Appl Eenrgy. 2018;211:1123–35. The paper investigates the power of different autoencoders in detecting anomalies in building energy data in an unsupervised way. CrossRefGoogle Scholar
  58. 58.
    Araya DB, Grolinger K, El Yamany HF, Capretz MAM, Bitsuamlak G. An ensemble learning framework for anomaly detection in building energy consumption. Energy Build. 2017;144:191–206.CrossRefGoogle Scholar
  59. 59.
    •• Molina-Solana M, Ros M, Ruiz MD, Gomez-Romero J. Martin-Bautista MJ. Data science for building energy management: a review. Renew Sust Energy Rev. 2017;70:598–609. The paper presents a comprehensive review on the power of data science in building energy management. CrossRefGoogle Scholar
  60. 60.
    •• Miller C, Nagy Z, Schlueter A. A review of unsupervised statistical learning and visual analytics techniques applied to performance analysis of non-residential buildings. Renew Sust Energy Rev. 2017;81:1365–77. The paper summarizes the applications of unsupervised data analytics and visualization techniques in analyzing building data. CrossRefGoogle Scholar
  61. 61.
    Shen LY, Yan H, Fan HQ, Wu Y, Zhang Y. An integrated system of text mining technique and case-based reasoning for supporting green building design. Build Environ. 2017;124:388–401.CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Department of Construction Management and Real EstateShenzhen UniversityShenzhenChina
  2. 2.Department of Building Services EngineeringThe Hong Kong Polytechnic UniversityHong KongHong Kong, China
  3. 3.College of Urban ConstructionNanjing Tech UniversityNanjingChina

Personalised recommendations