Skip to main content
Log in

Diagnosis of building energy consumption in the 2012 CBECS data using heterogeneous effect of energy variables: A recursive partitioning approach

  • Research Article
  • Published:
Building Simulation Aims and scope Submit manuscript

Abstract

Numerous previous literature has attempted to apply machine learning techniques to analyze relationships between energy variables in energy consumption. However, most machine learning methods are primarily used for prediction through complicated learning processes at the expense of interpretability. Those methods have difficulties in evaluating the effect of energy variables on energy consumption and especially capturing their heterogeneous relationship. Therefore, to identify the energy consumption of the heterogeneous relationships in actual buildings, this study applies the MOdel-Based recursive partitioning (MOB) algorithm to the 2012 CBECS survey data, which would offer representative information about actual commercial building characteristics and energy consumption. With resultant tree-structured subgroups, the MOB tree reveals the heterogeneous effect of energy variables and mutual influences on building energy consumptions. The results of this study would provide insights for architects and engineers to develop energy conservative design and retrofit in U.S. office buildings.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Afram A, Janabi-Sharifi F (2015). Gray-box modeling and validation of residential HVAC system for control system design. Applied Energy, 137: 134–150.

    Article  Google Scholar 

  • Ahmad AS, Hassan MY, Abdullah MP, et al. (2014). A review on applications of ANN and SVM for building electrical energy consumption forecasting. Renewable and Sustainable Energy Reviews, 33: 102–109.

    Article  Google Scholar 

  • Ahmad T, Chen H, Guo Y, et al. (2018). A comprehensive overview on the data driven and large scale based approaches for forecasting of building energy demand: A review. Energy and Buildings, 165: 301–320.

    Article  Google Scholar 

  • Ali U, Shamsi MH, Bohacek M, Hoare C, et al. (2020). A data-driven approach to optimize urban scale energy retrofit decisions for residential buildings. Applied Energy, 267: 114861.

    Article  Google Scholar 

  • Amasyali K, El-Gohary NM (2018). A review of data-driven building energy consumption prediction studies. Renewable and Sustainable Energy Reviews, 81: 1192–1205.

    Article  Google Scholar 

  • Ascione F, Bianco N, de Stasio C, et al. (2017). Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: A novel approach. Energy, 118: 999–1017.

    Article  Google Scholar 

  • ASHRAE (2017). 2017 ASHRAE Handbook—Fundamentals. Atlanta, GA, USA: American Society of Heating, Refrigerating, and Air-conditioning Engineers.

    Google Scholar 

  • ASHRAE (2019a). ANSI/ASHRAE/IES Standard 90.1–2019. Atlanta, GA, USA: American Society of Heating, Refrigerating, and Air-conditioning Engineers.

    Google Scholar 

  • ASHRAE (2019b). Advanced Energy Design Guides for Small to Medium Office Buildings: Achieving Zero Energy. Atlanta, GA, USA: American Society of Heating, Refrigerating, and Air-conditioning Engineers.

    Google Scholar 

  • Aydinalp M, Ismet Ugursal V, Fung AS (2004). Modeling of the space and domestic hot-water heating energy-consumption in the residential sector using neural networks. Applied Energy, 79: 159–178.

    Article  Google Scholar 

  • Berthou T, Stabat P, Salvazet R, et al. (2014). Development and validation of a gray box model to predict thermal behavior of occupied office buildings. Energy and Buildings, 74: 91–100.

    Article  Google Scholar 

  • Bourdeau M, Zhai X, Nefzaoui E, et al. (2019). Modeling and forecasting building energy consumption: A review of data-driven techniques. Sustainable Cities and Society, 48: 101533.

    Article  Google Scholar 

  • BRE (2017). BREEAM International New Construction 2016, Technical Manual.

  • Breiman L, Friedman J, Olshen R, et al. (1998). Classification and Regression Trees. Boca Raton, FL, USA: Chapman & Hall/CRC.

    MATH  Google Scholar 

  • Catalina T, Virgone J, Blanco E (2008). Development and validation of regression models to predict monthly heating demand for residential buildings. Energy and Buildings, 40: 1825–1832.

    Article  Google Scholar 

  • Catalina T, Iordache V, Caracaleanu B (2013). Multiple regression model for fast prediction of the heating energy demand. Energy and Buildings, 57: 302–312.

    Article  Google Scholar 

  • Chaudhuri P, Huang M-C, Loh W-Y, et al. (1994). Piecewise-polynomial regression trees. Statistica Sinica, 4: 143–167.

    MATH  Google Scholar 

  • Cheng M-Y, Cao M-T (2014). Accurately predicting building energy performance using evolutionary multivariate adaptive regression splines. Applied Soft Computing, 22: 178–188.

    Article  Google Scholar 

  • Choi D, Li L, Liu H, Zeng L (2020). A recursive partitioning approach for subgroup identification in brain-behaviour correlation analysis. Pattern Analysis and Applications, 23: 161–177.

    Article  MathSciNet  Google Scholar 

  • Choi D, Zeng L (2020). Robust logistic regression tree for subgroup identification in healthcare outcome modeling. IISE Transactions on Healthcare Systems Engineering, 10: 184–199.

    Article  Google Scholar 

  • Ciulla G, D’Amico A (2019). Building energy performance forecasting: A multiple linear regression approach. Applied Energy, 253: 113500.

    Article  Google Scholar 

  • Deb C, Lee SE (2018). Determining key variables influencing energy consumption in office buildings through cluster analysis of pre- and post-retrofit building data. Energy and Buildings, 159: 228–245.

    Article  Google Scholar 

  • Deng H, Fannon D, Eckelman MJ (2018). Predictive modeling for US commercial building energy use: A comparison of existing statistical and machine learning algorithms using CBECS microdata. Energy and Buildings, 163: 34–43.

    Article  Google Scholar 

  • Dong B, Cao C, Lee SE (2005). Applying support vector machines to predict building energy consumption in tropical region. Energy and Buildings, 37: 545–553.

    Article  Google Scholar 

  • Ferlito S, Atrigna M, Graditi G, et al. (2015). Predictive models for building’s energy consumption: An Artificial Neural Network (ANN) approach. In: Proceedings of 2015 XVIII AISEM Annual Conference, Trento, Italy.

  • Gao X, Malkawi A (2014). A new methodology for building energy performance benchmarking: An approach based on intelligent clustering algorithm. Energy and Buildings, 84: 607–616.

    Article  Google Scholar 

  • Harb H, Boyanov N, Hernandez L, et al. (2016). Development and validation of grey-box models for forecasting the thermal response of occupied buildings. Energy and Buildings, 117: 199–207.

    Article  Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn. New York: Springer.

    Book  MATH  Google Scholar 

  • Haykin SS (1999). Neural Networks: A Comprehensive Foundation, 2nd edn. Englewood Cliffs, NJ, USA: Prentice Hall.

    MATH  Google Scholar 

  • Hoffman AJ (1998). Peak demand control in commercial buildings with target peak adjustment based on load forecasting. In: Proceedings of the 1998 IEEE International Conference on Control Applications (Cat. No. 98CH36104), Trieste, Italy.

  • Hong S-M, Paterson G, Mumovic D, et al. (2014). Improved benchmarking comparability for energy consumption in schools. Building Research & Information, 42: 47–61.

    Article  Google Scholar 

  • Hothorn T, Hornik K, Zeileis A (2007). Party: A Laboratory for Recursive Part(y)itioning. R Package, Version 0.9–0, Available at http://CRAN.r-Project.Org.

  • Hou Z, Lian Z (2009). An application of support vector machines in cooling load prediction. In: Proceedings of 2009 International Workshop on Intelligent Systems and Applications, Wuhan, China.

  • Jiménez MJ, Heras MR (2005). Application of multi-output ARX models for estimation of the U and g values of building components in outdoor testing. Solar Energy, 79: 302–310.

    Article  Google Scholar 

  • Jung HC, Kim JS, Heo H (2015). Prediction of building energy consumption using an improved real coded genetic algorithm based least squares support vector machine approach. Energy and Buildings, 90: 76–84.

    Article  Google Scholar 

  • Kamel E, Sheikh S, Huang X (2020). Data-driven predictive models for residential building energy use based on the segregation of heating and cooling days. Energy, 206: 118045.

    Article  Google Scholar 

  • Karatasou S, Santamouris M, Geros V (2006). Modeling and predicting building’s energy use with artificial neural networks: Methods and results. Energy and Buildings, 38: 949–958.

    Article  Google Scholar 

  • Karhade AV, Schwab JH, Bedair HS (2019). Development of machine learning algorithms for prediction of sustained postoperative opioid prescriptions after total hip arthroplasty. The Journal of Arthroplasty, 34: 2272–2277.e1.

    Article  Google Scholar 

  • KICT (2020). Green Standard for Energy & Environmental Design (GSEED) reference manual for non-residential buildings. ver 2016–5 v2.

  • Kimbara A, Kurosu S, Endo R, et al.(1995). On-line prediction for load profile of an air-conditioning system. ASHRAE Transactions, 101(2): 198–207.

    Google Scholar 

  • Kopf J, Augustin T, Strobl C (2013). The potential of model-based recursive partitioning in the social sciences: Revisiting Ockham’s razor. In: McArdle JJ, Ritschard G (eds), Contemporary Issues in Exploratory Data Mining in the Behavioral Sciences. New York: Routledge. pp. 97–117.

    Google Scholar 

  • Korolija I, Zhang Y, Marjanovic-Halburd L, et al. (2013). Regression models for predicting UK office building energy consumption from heating and cooling demands. Energy and Buildings, 59: 214–227.

    Article  Google Scholar 

  • Lai F, Magoulès F, Lherminier F (2008). Vapnik’s learning theory applied to energy consumption forecasts in residential buildings. International Journal of Computer Mathematics, 85: 1563–1588.

    Article  MathSciNet  MATH  Google Scholar 

  • Lam JC, Wan KKW, Liu D, et al. (2010). Multiple regression models for energy use in air-conditioned office buildings in different climates. Energy Conversion and Management, 51: 2692–2697.

    Article  Google Scholar 

  • LBNL and JJA (2015). DOE-2.2, Building Energy Use and Cost Analysis Program, Volume 3: Topics. Lawrence Berkeley National Laboratory, James J. Hirsch & Associates.

  • Li Q, Meng Q, Cai J, et al. (2009a). Applying support vector machine to predict hourly cooling load in the building. Applied Energy, 86: 2249–2256.

    Article  Google Scholar 

  • Li Q, Meng Q, Cai J, et al. (2009b). Predicting hourly cooling load in the building: a comparison of support vector machine and different artificial neural networks. Energy Conversion and Management, 50: 90–96.

    Article  Google Scholar 

  • Loh W-Y (2002). Regression trees with unbiased variable selection and interaction detection. Statistica Sinica, 12: 361–386.

    MathSciNet  MATH  Google Scholar 

  • Loh W-Y (2014). Fifty years of classification and regression trees. International Statistical Review, 82: 329–348.

    Article  MathSciNet  MATH  Google Scholar 

  • Ma Y, Yu J, Yang C, Wang L (2010). Study on power energy consumption model for large-scale public building. In: Proceedings of 2010 the 2nd International Workshop on Intelligent Systems and Applications, Wuhan, China.

  • Morgan JN, Sonquist JA (1963). Problems in the analysis of survey data, and a proposal. Journal of the American Statistical Association, 58: 415–434.

    Article  MATH  Google Scholar 

  • Newsham GR, Birt BJ (2010). Building-level occupancy data to improve ARIMA-based electricity use forecasts. In: Proceedings of the 2nd ACM Workshop on Embedded Sensing Systems for Energy-Efficiency in Building (BuildSys’10), Zurich, Switzerland.

  • Nghiem TX, Jones CN (2017). Data-driven demand response modeling and control of buildings with Gaussian Processes. In: Proceedings of 2017 American Control Conference (ACC), Seattle, WA, USA.

  • Nguyen A-T, Reiter S (2015). A performance comparison of sensitivity analysis methods for building energy models. Building Simulation, 8: 651–664.

    Article  Google Scholar 

  • Noh H, Rajagopal R (2013). Data-driven forecasting algorithms for building energy consumption. In: Proceedings of SPIE 8692, Sensors and Smart Structures Technologies for Civil, Mechanical, and Aerospace Systems.

  • Pisello AL, Taylor JE, Xu X, et al. (2012). Inter-building effect: Simulating the impact of a network of buildings on the accuracy of building energy performance predictions. Building and Environment, 58: 37–45.

    Article  Google Scholar 

  • PNNL, U.S. DOE, (2019). Commercial Prototype Building Models. Available at https://www.energycodes.gov/development/commercial/prototype_models. Accessed 23 Oct 2019.

  • Rastogi P, Khan ME, Andersen M (2017). Gaussian-process-based emulators for building performance simulation. In: Proceedings the 15th International IBPSA Building Simulation Conference, San Francisco, CA, USA.

  • Robinson C, Dilkina B, Hubbs J, et al. (2017). Machine learning approaches for estimating commercial building energy consumption. Applied Energy, 208: 889–904.

    Article  Google Scholar 

  • Rusch T, Zeileis A (2013). Gaining insight with recursive partitioning of generalized linear models. Journal of Statistical Computation and Simulation, 83: 1301–1315.

    Article  MathSciNet  MATH  Google Scholar 

  • Seyedzadeh S, Rahimian FP, Glesk I, Roper M (2018). Machine learning for estimation of building energy consumption and performance: A review. Visualization in Engineering, 6: 5.

    Article  Google Scholar 

  • Seyedzadeh S, Pour Rahimian F, Oliver S, et al. (2020a). Machine learning modelling for predicting non-domestic buildings energy performance: a model to support deep energy retrofit decision-making. Applied Energy, 279: 115908.

    Article  Google Scholar 

  • Seyedzadeh S, Rahimian FP, Oliver S, et al. (2020b). Data driven model improved by multi-objective optimisation for prediction of building energy loads. Automation in Construction, 116: 103188.

    Article  Google Scholar 

  • Sholahudin, Alam AG, Baek CI, et al. (2016). Prediction and analysis of building energy efficiency using artificial neural network and design of experiments. Applied Mechanics and Materials, 819: 541–545.

    Article  Google Scholar 

  • Silva AS, Ghisi E (2020). Estimating the sensitivity of design variables in the thermal and energy performance of buildings through a systematic procedure. Journal of Cleaner Production, 244: 118753.

    Article  Google Scholar 

  • Tian Z, Wei S, Shi X (2020). Developing data-driven models for energy-efficient heating design in office buildings. Journal of Building Engineering, 32: 101778.

    Article  Google Scholar 

  • U.S.DOE (2018). EnergyPlus Version 9.0.1 Documentation—Engineering Reference. U.S. Department of Energy.

  • U.S.EIA (2016a). 2012 CBECS Survey Data: Public Use Microdata File.

  • U.S.EIA (2016b). 2012 Commercial Buildings Energy Consumption Survey: Energy Usage Summary (CBECS). U.S. Energy Information Administration.

  • U.S.EIA (2016c). User’s Guide to the 2012 CBECS Public Use Microdata File.

  • U.S.EIA (2016d). Survey Questionnaire (EIA-871A).

  • U.S.EIA (2016e). Variable and Response Codebook.

  • U.S.EIA (2018). Energy Use in Commercial Buildings. Available at https://www.eia.gov/energyexplained/use-of-energy/commercial-buildings.php. Accessed 23 Oct 2019.

  • U.S.EIA (n.d.). Commercial Building Energy Consumption Survey (CBECS): Maps. Available at https://www.eia.gov/consumption/commercial/maps.php

  • USGBC (2020). LEED v4.1 Building Design and Construction, Getting Started Guide for Beta Participants.

  • Wang E (2017). Decomposing core energy factor structure of US commercial buildings through clustering around latent variables with Random Forest on large-scale mixed data. Energy Conversion and Management, 153: 346–361.

    Article  Google Scholar 

  • Wang Z, Wang Y, Srinivasan RS (2018). A novel ensemble learning approach to support building energy use prediction. Energy and Buildings, 159: 109–122.

    Article  Google Scholar 

  • Wang W, Chen J (2019). Investigation on correlation of energy consumption of multi-buildings on campus area. Energy Procedia, 158: 3559–3564.

    Article  Google Scholar 

  • Wei Y, Zhang X, Shi Y, et al. (2018). A review of data-driven approaches for prediction and classification of building energy consumption. Renewable and Sustainable Energy Reviews, 82: 1027–1047.

    Article  Google Scholar 

  • Zeileis A, Hothorn T, Hornik K (2008). Model-based recursive partitioning. Journal of Computational and Graphical Statistics, 17: 492–514.

    Article  MathSciNet  Google Scholar 

  • Zhang X-P, Rui G (2007). Electrical energy consumption forecasting based on cointegration and a support vector machine in China. WSEAS Transactions on Mathematics, 12: 878–883.

    Google Scholar 

  • Zhang Y, O’Neill Z, Wagner T, Augenbroe G (2013). An inverse model with uncertainty quantification to estimate the energy performance of an office building. In: Proceedings the 13th International IBPSA Building Simulation Conference.

  • Zhao H, Magoulès F (2010). Parallel support vector machines applied to the prediction of multiple buildings energy consumption. Journal of Algorithms & Computational Technology, 4: 231–249.

    Article  Google Scholar 

  • Zhao H, Magoulès F (2012). A review on the prediction of building energy consumption. Renewable and Sustainable Energy Reviews, 16: 3586–3592.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chul Kim.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Choi, D., Kim, C. Diagnosis of building energy consumption in the 2012 CBECS data using heterogeneous effect of energy variables: A recursive partitioning approach. Build. Simul. 14, 1737–1755 (2021). https://doi.org/10.1007/s12273-021-0777-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12273-021-0777-8

Keywords

Navigation