Skip to main content
Log in

Energy characteristics of urban buildings: Assessment by machine learning

  • Research Article
  • Published:
Building Simulation Aims and scope Submit manuscript

Abstract

Machine learning techniques have attracted more attention as advanced data analytics in building energy analysis. However, most of previous studies are only focused on the prediction capability of machine learning algorithms to provide reliable energy estimation in buildings. Machine learning also has great potentials to identify energy patterns for urban buildings except for model prediction. Therefore, this paper explores energy characteristic of London domestic properties using ten machine learning algorithms from three aspects: tuning process of learning model; variable importance; spatial analysis of model discrepancy. The results indicate that the combination of these three aspects can provide insights on energy patterns for urban buildings. The tuning process of these models indicates that gas use models should have more terms in comparison with electricity in London and the interaction terms should be considered in both gas and electricity models. The rankings of important variables are very different for gas and electricity prediction in London residential buildings, which suggests that gas and electricity use are affected by different physical and social factors. Moreover, the importance levels for these key variables are markedly different for gas and electricity consumption. There are much more important variables for electricity use in comparison with gas use for the importance levels over 40. The areas with larger model discrepancies can be determined using the local spatial analysis based on these machine learning models. These identified areas have significantly different energy patterns for gas and electricity use. More research is required to understand these unusual patterns of energy use in these areas.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ahmad T, Chen H, Guo Y, Wang J (2018). A comprehensive overview on the data driven and large scale based approaches for forecasting of building energy demand: A review. Energy and Buildings, 165: 301–320.

    Google Scholar 

  • Amasyali K, El-Gohary NM (2018). A review of data-driven building energy consumption prediction studies. Renewable and Sustainable Energy Reviews, 81: 1192–1205.

    Google Scholar 

  • Anselin L, Syabri I, Kho Y (2006). GeoDa: An introduction to spatial data analysis. Geographical Analysis, 38: 5–22.

    Google Scholar 

  • Anselin L (2019). A local indicator of multivariate spatial association: Extending Geary’s C. Geographical Analysis, 51: 133–150.

    Google Scholar 

  • ASHRAE (2014). Guideline 14–2014 Measurement of Energy, Demand, and Water Savings. Atlanta, GA, USA: American Society of Heating, Refrigerating and Air-Conditioning Engineers.

    Google Scholar 

  • Bivand RS, Wong DWS (2018). Comparing implementations of global and local indicators of spatial association. TEST, 27: 716–748.

    MathSciNet  MATH  Google Scholar 

  • Caputo B, Sim K, Furesjo F, Smola A (2002). Appearance-based object recognition using SVMs: Which kernel should I use? In: Proceedings of NIPS Workshop on Statistical Methods for Computational Experiments in Visual Processing and Computer Vision.

  • de Wilde P, Tian W (2009). Identification of key factors for uncertainty in the prediction of the thermal performance of an office building under climate change. Building Simulation, 2: 157–174.

    Google Scholar 

  • Doshi-Velez F, Kim B (2017). Towards a rigorous science of interpretable machine learning. arXiv:1702.08608.

  • Fan C, Xiao F, Zhao Y (2017). A short-term building cooling load prediction method using deep learning algorithms. Applied Energy, 195: 222–233.

    Google Scholar 

  • Fan C, Xiao F, Zhao Y, Wang J (2018). Analytical investigation of autoencoder-based methods for unsupervised anomaly detection in building energy data. Applied Energy, 211: 1123–1135.

    Google Scholar 

  • Fan C, Xiao F, Yan C, Liu C, Li Z, Wang J (2019). A novel methodology to explain and evaluate data-driven building energy performance models based on interpretable machine learning. Applied Energy, 235: 1551–1560.

    Google Scholar 

  • Fischer MM, Getis A (2010). Handbook of Applied Spatial Analysis: Software Tools, Methods and Applications. Berlin: Springer.

    Google Scholar 

  • Gevrey M, Dimopoulos I, Lek S (2003). Review and comparison of methods to study the contribution of variables in artificial neural network models. Ecological Modelling, 160: 249–264.

    Google Scholar 

  • Grömping U (2015). Variable importance in regression models. Wiley Interdisciplinary Reviews: Computational Statistics, 7: 137–152.

    MathSciNet  Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer.

    MATH  Google Scholar 

  • Houborg R, McCabe MF (2018). A hybrid training approach for leaf area index estimation via Cubist and random forests machine-learning. ISPRS Journal of Photogrammetry and Remote Sensing, 135: 173–188.

    Google Scholar 

  • Huang S, Zuo W, Sohn MD (2018). A Bayesian Network model for predicting cooling load of commercial buildings. Building Simulation, 11: 87–101.

    Google Scholar 

  • Kuhn M (2008). Building predictive models in R using the caret package. Journal of Statistical Software 28(5): 1–26.

    Google Scholar 

  • Kuhn M, Johnson K (2013). Applied Predictive Modeling. New York: Springer.

    MATH  Google Scholar 

  • Li Z, Dai J, Chen H, Lin B (2019). An ANN-based fast building energy consumption prediction method for complex architectural form at the early design stage. Building Simulation, 12: 665–681.

    Google Scholar 

  • Lim H, Zhai ZJ (2017). Review on stochastic modeling methods for building stock energy prediction. Building Simulation, 10: 607–624.

    Google Scholar 

  • London Datastore (2019). LSOA Atlas, Greater London Authority, UK. Available at https://data.london.gov.uk/dataset/lsoa-atlas. Accessed 22 Jul 2019.

    Google Scholar 

  • Milborrow S (2013). R package Earth: Multivariate Adaptive Regression Spline Models Version 3.2–6.

  • Miller C, Nagy Z, Schlueter A (2015). Automated daily pattern filtering of measured building performance data. Automation in Construction, 49: 1–17.

    Google Scholar 

  • Miller C, Nagy Z, Schlueter A (2018). A review of unsupervised statistical learning and visual analytics techniques applied to performance analysis of non-residential buildings. Renewable and Sustainable Energy Reviews, 81: 1365–1377.

    Google Scholar 

  • Molnar C (2019). Interpretable machine learning: A guide for making black box models explainable. Available at https://christophm.github.io/interpretable-ml-book/.

  • Ngo N-T (2019). Early predicting cooling loads for energy-efficient design in office buildings by machine learning. Energy and Buildings, 182: 264–273.

    Google Scholar 

  • Potočnik P, Vidrih B, Kitanovski A, Govekar E (2019). Neural network, ARX, and extreme learning machine models for the short-term prediction of temperature in buildings. Building Simulation, 12: 1077–1093.

    Google Scholar 

  • Quinlan JR (1993). Combining instance-based and model-based learning. In: Proceedings of the 10th International Conference on Machine Learning.

    Google Scholar 

  • Ripley BD (2007). Pattern Recognition and Neural Networks. Cambridge, UK: Cambridge University Press.

    MATH  Google Scholar 

  • Seyedzadeh S, Pour Rahimian F, Rastogi P, Glesk I (2019). Tuning machine learning models for prediction of building energy loads. Sustainable Cities and Society, 47: 101484.

    Google Scholar 

  • Tian W, Choudhary R (2012). A probabilistic energy model for non-domestic building sectors applied to analysis of school buildings in greater London. Energy and Buildings, 54: 1–11.

    Google Scholar 

  • Tian W (2013). A review of sensitivity analysis methods in building energy analysis. Renewable and Sustainable Energy Reviews, 20: 411–419.

    Google Scholar 

  • Tian W, Song J, Li Z (2014). Spatial regression analysis of domestic energy in urban areas. Energy, 76: 629–640.

    Google Scholar 

  • Tian W, Choudhary R, Augenbroe G, Lee SH (2015a). Importance analysis and meta-model construction with correlated variables in evaluation of thermal performance of campus buildings. Building and Environment, 92: 61–74.

    Google Scholar 

  • Tian W, Yang S, Wei L, Meng Q (2015b). Spatial Variation of Urban Building Energy Analysis. In: Proceedings of the 14th Inernational IBPSA Building Simulation Conference, Hyderabad, India.

  • Tian W, Liu Y, Heo Y, Yan D, Li Z, An J, Yang S (2016a). Relative importance of factors influencing building energy in urban environment. Energy, 111: 237–250.

    Google Scholar 

  • Tian W, Yang S, Li Z, Wei S, Pan W, Liu Y (2016b). Identifying informative energy data in Bayesian calibration of building energy models. Energy and Buildings, 119: 363–376.

    Google Scholar 

  • Tian W, Yang S, Zuo J, Li Z, Liu Y (2017). Relationship between built form and energy performance of office buildings in a severe cold Chinese region. Building Simulation, 10: 11–24.

    Google Scholar 

  • Tian W, de Wilde P, Li Z, Song J, Yin B (2018). Uncertainty and sensitivity analysis of energy assessment for office buildings based on Dempster-Shafer theory. Energy Conversion and Management, 174: 705–718.

    Google Scholar 

  • Wei L, Tian W, Silva EA, Choudhary R, Meng Q, Yang S (2015). Comparative study on machine learning for urban building energy analysis. Procedia Engineering, 121: 285–292.

    Google Scholar 

  • Wei Y, Zhang X, Shi Y, Xia L, Pan S, Wu J, Han M, Zhao X (2018). A review of data-driven approaches for prediction and classification of building energy consumption. Renewable and Sustainable Energy Reviews, 82: 1027–1047.

    Google Scholar 

  • Westermann P, Evins R (2019). Surrogate modelling for sustainable building design—A review. Energy and Buildings, 198: 170–186.

    Google Scholar 

  • Xu X, Wang W, Hong T, Chen J (2019). Incorporating machine learning with building network analysis to predict multi-building energy use. Energy and Buildings, 186: 80–97.

    Google Scholar 

  • Zendehboudi A, Baseer MA, Saidur R (2018). Application of support vector machine models for forecasting solar and wind energy resources: A review. Journal of Cleaner Production, 199: 272–285.

    Google Scholar 

Download references

Acknowledgements

This research was supported by the National Natural Science Foundation of China (No. 51778416) and the Key Projects of Philosophy and Social Sciences Research, Ministry of Education (China) “Research on Green Design in Sustainable Development” (contract No. 16JZDH014, approval No. 16JZD014).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Tian.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tian, W., Zhu, C., Sun, Y. et al. Energy characteristics of urban buildings: Assessment by machine learning. Build. Simul. 14, 179–193 (2021). https://doi.org/10.1007/s12273-020-0608-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12273-020-0608-3

Keywords

Navigation