Abstract
Numerous previous literature has attempted to apply machine learning techniques to analyze relationships between energy variables in energy consumption. However, most machine learning methods are primarily used for prediction through complicated learning processes at the expense of interpretability. Those methods have difficulties in evaluating the effect of energy variables on energy consumption and especially capturing their heterogeneous relationship. Therefore, to identify the energy consumption of the heterogeneous relationships in actual buildings, this study applies the MOdel-Based recursive partitioning (MOB) algorithm to the 2012 CBECS survey data, which would offer representative information about actual commercial building characteristics and energy consumption. With resultant tree-structured subgroups, the MOB tree reveals the heterogeneous effect of energy variables and mutual influences on building energy consumptions. The results of this study would provide insights for architects and engineers to develop energy conservative design and retrofit in U.S. office buildings.
Similar content being viewed by others
References
Afram A, Janabi-Sharifi F (2015). Gray-box modeling and validation of residential HVAC system for control system design. Applied Energy, 137: 134–150.
Ahmad AS, Hassan MY, Abdullah MP, et al. (2014). A review on applications of ANN and SVM for building electrical energy consumption forecasting. Renewable and Sustainable Energy Reviews, 33: 102–109.
Ahmad T, Chen H, Guo Y, et al. (2018). A comprehensive overview on the data driven and large scale based approaches for forecasting of building energy demand: A review. Energy and Buildings, 165: 301–320.
Ali U, Shamsi MH, Bohacek M, Hoare C, et al. (2020). A data-driven approach to optimize urban scale energy retrofit decisions for residential buildings. Applied Energy, 267: 114861.
Amasyali K, El-Gohary NM (2018). A review of data-driven building energy consumption prediction studies. Renewable and Sustainable Energy Reviews, 81: 1192–1205.
Ascione F, Bianco N, de Stasio C, et al. (2017). Artificial neural networks to predict energy performance and retrofit scenarios for any member of a building category: A novel approach. Energy, 118: 999–1017.
ASHRAE (2017). 2017 ASHRAE Handbook—Fundamentals. Atlanta, GA, USA: American Society of Heating, Refrigerating, and Air-conditioning Engineers.
ASHRAE (2019a). ANSI/ASHRAE/IES Standard 90.1–2019. Atlanta, GA, USA: American Society of Heating, Refrigerating, and Air-conditioning Engineers.
ASHRAE (2019b). Advanced Energy Design Guides for Small to Medium Office Buildings: Achieving Zero Energy. Atlanta, GA, USA: American Society of Heating, Refrigerating, and Air-conditioning Engineers.
Aydinalp M, Ismet Ugursal V, Fung AS (2004). Modeling of the space and domestic hot-water heating energy-consumption in the residential sector using neural networks. Applied Energy, 79: 159–178.
Berthou T, Stabat P, Salvazet R, et al. (2014). Development and validation of a gray box model to predict thermal behavior of occupied office buildings. Energy and Buildings, 74: 91–100.
Bourdeau M, Zhai X, Nefzaoui E, et al. (2019). Modeling and forecasting building energy consumption: A review of data-driven techniques. Sustainable Cities and Society, 48: 101533.
BRE (2017). BREEAM International New Construction 2016, Technical Manual.
Breiman L, Friedman J, Olshen R, et al. (1998). Classification and Regression Trees. Boca Raton, FL, USA: Chapman & Hall/CRC.
Catalina T, Virgone J, Blanco E (2008). Development and validation of regression models to predict monthly heating demand for residential buildings. Energy and Buildings, 40: 1825–1832.
Catalina T, Iordache V, Caracaleanu B (2013). Multiple regression model for fast prediction of the heating energy demand. Energy and Buildings, 57: 302–312.
Chaudhuri P, Huang M-C, Loh W-Y, et al. (1994). Piecewise-polynomial regression trees. Statistica Sinica, 4: 143–167.
Cheng M-Y, Cao M-T (2014). Accurately predicting building energy performance using evolutionary multivariate adaptive regression splines. Applied Soft Computing, 22: 178–188.
Choi D, Li L, Liu H, Zeng L (2020). A recursive partitioning approach for subgroup identification in brain-behaviour correlation analysis. Pattern Analysis and Applications, 23: 161–177.
Choi D, Zeng L (2020). Robust logistic regression tree for subgroup identification in healthcare outcome modeling. IISE Transactions on Healthcare Systems Engineering, 10: 184–199.
Ciulla G, D’Amico A (2019). Building energy performance forecasting: A multiple linear regression approach. Applied Energy, 253: 113500.
Deb C, Lee SE (2018). Determining key variables influencing energy consumption in office buildings through cluster analysis of pre- and post-retrofit building data. Energy and Buildings, 159: 228–245.
Deng H, Fannon D, Eckelman MJ (2018). Predictive modeling for US commercial building energy use: A comparison of existing statistical and machine learning algorithms using CBECS microdata. Energy and Buildings, 163: 34–43.
Dong B, Cao C, Lee SE (2005). Applying support vector machines to predict building energy consumption in tropical region. Energy and Buildings, 37: 545–553.
Ferlito S, Atrigna M, Graditi G, et al. (2015). Predictive models for building’s energy consumption: An Artificial Neural Network (ANN) approach. In: Proceedings of 2015 XVIII AISEM Annual Conference, Trento, Italy.
Gao X, Malkawi A (2014). A new methodology for building energy performance benchmarking: An approach based on intelligent clustering algorithm. Energy and Buildings, 84: 607–616.
Harb H, Boyanov N, Hernandez L, et al. (2016). Development and validation of grey-box models for forecasting the thermal response of occupied buildings. Energy and Buildings, 117: 199–207.
Hastie T, Tibshirani R, Friedman J (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn. New York: Springer.
Haykin SS (1999). Neural Networks: A Comprehensive Foundation, 2nd edn. Englewood Cliffs, NJ, USA: Prentice Hall.
Hoffman AJ (1998). Peak demand control in commercial buildings with target peak adjustment based on load forecasting. In: Proceedings of the 1998 IEEE International Conference on Control Applications (Cat. No. 98CH36104), Trieste, Italy.
Hong S-M, Paterson G, Mumovic D, et al. (2014). Improved benchmarking comparability for energy consumption in schools. Building Research & Information, 42: 47–61.
Hothorn T, Hornik K, Zeileis A (2007). Party: A Laboratory for Recursive Part(y)itioning. R Package, Version 0.9–0, Available at http://CRAN.r-Project.Org.
Hou Z, Lian Z (2009). An application of support vector machines in cooling load prediction. In: Proceedings of 2009 International Workshop on Intelligent Systems and Applications, Wuhan, China.
Jiménez MJ, Heras MR (2005). Application of multi-output ARX models for estimation of the U and g values of building components in outdoor testing. Solar Energy, 79: 302–310.
Jung HC, Kim JS, Heo H (2015). Prediction of building energy consumption using an improved real coded genetic algorithm based least squares support vector machine approach. Energy and Buildings, 90: 76–84.
Kamel E, Sheikh S, Huang X (2020). Data-driven predictive models for residential building energy use based on the segregation of heating and cooling days. Energy, 206: 118045.
Karatasou S, Santamouris M, Geros V (2006). Modeling and predicting building’s energy use with artificial neural networks: Methods and results. Energy and Buildings, 38: 949–958.
Karhade AV, Schwab JH, Bedair HS (2019). Development of machine learning algorithms for prediction of sustained postoperative opioid prescriptions after total hip arthroplasty. The Journal of Arthroplasty, 34: 2272–2277.e1.
KICT (2020). Green Standard for Energy & Environmental Design (GSEED) reference manual for non-residential buildings. ver 2016–5 v2.
Kimbara A, Kurosu S, Endo R, et al.(1995). On-line prediction for load profile of an air-conditioning system. ASHRAE Transactions, 101(2): 198–207.
Kopf J, Augustin T, Strobl C (2013). The potential of model-based recursive partitioning in the social sciences: Revisiting Ockham’s razor. In: McArdle JJ, Ritschard G (eds), Contemporary Issues in Exploratory Data Mining in the Behavioral Sciences. New York: Routledge. pp. 97–117.
Korolija I, Zhang Y, Marjanovic-Halburd L, et al. (2013). Regression models for predicting UK office building energy consumption from heating and cooling demands. Energy and Buildings, 59: 214–227.
Lai F, Magoulès F, Lherminier F (2008). Vapnik’s learning theory applied to energy consumption forecasts in residential buildings. International Journal of Computer Mathematics, 85: 1563–1588.
Lam JC, Wan KKW, Liu D, et al. (2010). Multiple regression models for energy use in air-conditioned office buildings in different climates. Energy Conversion and Management, 51: 2692–2697.
LBNL and JJA (2015). DOE-2.2, Building Energy Use and Cost Analysis Program, Volume 3: Topics. Lawrence Berkeley National Laboratory, James J. Hirsch & Associates.
Li Q, Meng Q, Cai J, et al. (2009a). Applying support vector machine to predict hourly cooling load in the building. Applied Energy, 86: 2249–2256.
Li Q, Meng Q, Cai J, et al. (2009b). Predicting hourly cooling load in the building: a comparison of support vector machine and different artificial neural networks. Energy Conversion and Management, 50: 90–96.
Loh W-Y (2002). Regression trees with unbiased variable selection and interaction detection. Statistica Sinica, 12: 361–386.
Loh W-Y (2014). Fifty years of classification and regression trees. International Statistical Review, 82: 329–348.
Ma Y, Yu J, Yang C, Wang L (2010). Study on power energy consumption model for large-scale public building. In: Proceedings of 2010 the 2nd International Workshop on Intelligent Systems and Applications, Wuhan, China.
Morgan JN, Sonquist JA (1963). Problems in the analysis of survey data, and a proposal. Journal of the American Statistical Association, 58: 415–434.
Newsham GR, Birt BJ (2010). Building-level occupancy data to improve ARIMA-based electricity use forecasts. In: Proceedings of the 2nd ACM Workshop on Embedded Sensing Systems for Energy-Efficiency in Building (BuildSys’10), Zurich, Switzerland.
Nghiem TX, Jones CN (2017). Data-driven demand response modeling and control of buildings with Gaussian Processes. In: Proceedings of 2017 American Control Conference (ACC), Seattle, WA, USA.
Nguyen A-T, Reiter S (2015). A performance comparison of sensitivity analysis methods for building energy models. Building Simulation, 8: 651–664.
Noh H, Rajagopal R (2013). Data-driven forecasting algorithms for building energy consumption. In: Proceedings of SPIE 8692, Sensors and Smart Structures Technologies for Civil, Mechanical, and Aerospace Systems.
Pisello AL, Taylor JE, Xu X, et al. (2012). Inter-building effect: Simulating the impact of a network of buildings on the accuracy of building energy performance predictions. Building and Environment, 58: 37–45.
PNNL, U.S. DOE, (2019). Commercial Prototype Building Models. Available at https://www.energycodes.gov/development/commercial/prototype_models. Accessed 23 Oct 2019.
Rastogi P, Khan ME, Andersen M (2017). Gaussian-process-based emulators for building performance simulation. In: Proceedings the 15th International IBPSA Building Simulation Conference, San Francisco, CA, USA.
Robinson C, Dilkina B, Hubbs J, et al. (2017). Machine learning approaches for estimating commercial building energy consumption. Applied Energy, 208: 889–904.
Rusch T, Zeileis A (2013). Gaining insight with recursive partitioning of generalized linear models. Journal of Statistical Computation and Simulation, 83: 1301–1315.
Seyedzadeh S, Rahimian FP, Glesk I, Roper M (2018). Machine learning for estimation of building energy consumption and performance: A review. Visualization in Engineering, 6: 5.
Seyedzadeh S, Pour Rahimian F, Oliver S, et al. (2020a). Machine learning modelling for predicting non-domestic buildings energy performance: a model to support deep energy retrofit decision-making. Applied Energy, 279: 115908.
Seyedzadeh S, Rahimian FP, Oliver S, et al. (2020b). Data driven model improved by multi-objective optimisation for prediction of building energy loads. Automation in Construction, 116: 103188.
Sholahudin, Alam AG, Baek CI, et al. (2016). Prediction and analysis of building energy efficiency using artificial neural network and design of experiments. Applied Mechanics and Materials, 819: 541–545.
Silva AS, Ghisi E (2020). Estimating the sensitivity of design variables in the thermal and energy performance of buildings through a systematic procedure. Journal of Cleaner Production, 244: 118753.
Tian Z, Wei S, Shi X (2020). Developing data-driven models for energy-efficient heating design in office buildings. Journal of Building Engineering, 32: 101778.
U.S.DOE (2018). EnergyPlus Version 9.0.1 Documentation—Engineering Reference. U.S. Department of Energy.
U.S.EIA (2016a). 2012 CBECS Survey Data: Public Use Microdata File.
U.S.EIA (2016b). 2012 Commercial Buildings Energy Consumption Survey: Energy Usage Summary (CBECS). U.S. Energy Information Administration.
U.S.EIA (2016c). User’s Guide to the 2012 CBECS Public Use Microdata File.
U.S.EIA (2016d). Survey Questionnaire (EIA-871A).
U.S.EIA (2016e). Variable and Response Codebook.
U.S.EIA (2018). Energy Use in Commercial Buildings. Available at https://www.eia.gov/energyexplained/use-of-energy/commercial-buildings.php. Accessed 23 Oct 2019.
U.S.EIA (n.d.). Commercial Building Energy Consumption Survey (CBECS): Maps. Available at https://www.eia.gov/consumption/commercial/maps.php
USGBC (2020). LEED v4.1 Building Design and Construction, Getting Started Guide for Beta Participants.
Wang E (2017). Decomposing core energy factor structure of US commercial buildings through clustering around latent variables with Random Forest on large-scale mixed data. Energy Conversion and Management, 153: 346–361.
Wang Z, Wang Y, Srinivasan RS (2018). A novel ensemble learning approach to support building energy use prediction. Energy and Buildings, 159: 109–122.
Wang W, Chen J (2019). Investigation on correlation of energy consumption of multi-buildings on campus area. Energy Procedia, 158: 3559–3564.
Wei Y, Zhang X, Shi Y, et al. (2018). A review of data-driven approaches for prediction and classification of building energy consumption. Renewable and Sustainable Energy Reviews, 82: 1027–1047.
Zeileis A, Hothorn T, Hornik K (2008). Model-based recursive partitioning. Journal of Computational and Graphical Statistics, 17: 492–514.
Zhang X-P, Rui G (2007). Electrical energy consumption forecasting based on cointegration and a support vector machine in China. WSEAS Transactions on Mathematics, 12: 878–883.
Zhang Y, O’Neill Z, Wagner T, Augenbroe G (2013). An inverse model with uncertainty quantification to estimate the energy performance of an office building. In: Proceedings the 13th International IBPSA Building Simulation Conference.
Zhao H, Magoulès F (2010). Parallel support vector machines applied to the prediction of multiple buildings energy consumption. Journal of Algorithms & Computational Technology, 4: 231–249.
Zhao H, Magoulès F (2012). A review on the prediction of building energy consumption. Renewable and Sustainable Energy Reviews, 16: 3586–3592.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Choi, D., Kim, C. Diagnosis of building energy consumption in the 2012 CBECS data using heterogeneous effect of energy variables: A recursive partitioning approach. Build. Simul. 14, 1737–1755 (2021). https://doi.org/10.1007/s12273-021-0777-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12273-021-0777-8