Abstract
The available modelling data shortage issue makes it difficult to guarantee the performance of data-driven building energy prediction (BEP) models for both the newly built buildings and existing information-poor buildings. Both knowledge transfer learning (KTL) and data incremental learning (DIL) can address the data shortage issue of such buildings. For new building scenarios with continuous data accumulation, the performance of BEP models has not been fully investigated considering the data accumulation dynamics. DIL, which can learn dynamic features from accumulated data adapting to the developing trend of new building time-series data and extend BEP model’s knowledge, has been rarely studied. Previous studies have shown that the performance of KTL models trained with fixed data can be further improved in scenarios with dynamically changing data. Hence, this study proposes an improved transfer learning cross-BEP strategy continuously updated using the coarse data incremental (CDI) manner. The hybrid KTL-DIL strategy (LSTM-DANN-CDI) uses domain adversarial neural network (DANN) for KLT and long short-term memory (LSTM) as the Baseline BEP model. Performance evaluation is conducted to systematically qualify the effectiveness and applicability of KTL and improved KTL-DIL. Real-world data from six-type 36 buildings of six types are adopted to evaluate the performance of KTL and KTL-DIL in data-driven BEP tasks considering factors like the model increment time interval, the available target and source building data volumes. Compared with LSTM, results indicate that KTL (LSTM-DANN) and the proposed KTL-DIL (LSTM-DANN-CDI) can significantly improve the BEP performance for new buildings with limited data. Compared with the pure KTL strategy LSTM-DANN, the improved KTL-DIL strategy LSTM-DANN-CDI has better prediction performance with an average performance improvement ratio of 60%.
Similar content being viewed by others
Abbreviations
- b :
-
bias matrix
- c t :
-
storage cell
- f t :
-
forget gate
- G λ :
-
gradient reversal layer
- h t :
-
hidden layer
- i t :
-
input gate
- I :
-
unit matrix
- k :
-
number of iterations
- l i :
-
domain labels for true values
- \(\overline {{l_i}} \) :
-
domain labels for predicted values
- L d :
-
domain classification loss value
- L(D t ) :
-
loss function of fine-tune
- L ft :
-
weight parameter of the change layer
- L MMD :
-
MMD loss
- L y :
-
regression prediction loss value
- m :
-
total number of iterations
- N :
-
amount of training data
- o t :
-
output gate
- P :
-
data distributions
- r :
-
length of the minimum total batch in the training data
- R :
-
function symbol
- G f :
-
feature extractor
- G y :
-
regression predictor
- G d :
-
domain classifier
- tanh:
-
activation function tanh
- x :
-
input data
- y i :
-
true values of energy consumption
- \(\overline {{y_i}} \) :
-
predicted values of energy consumption
- u :
-
weight matrix
- w :
-
weight matrix
- α :
-
hyperparameter
- θ f :
-
network connection weights of feature extractor
- θ y :
-
network connection weights of regression predictor
- θ d :
-
network connection weights of domain classifier
- i :
-
number of data
- S:
-
source domain
- t :
-
moment of input data
- T:
-
target domain
- BAS:
-
building automation system
- BED:
-
building energy data
- BEP:
-
building energy prediction
- BES:
-
building energy system
- CDI:
-
coarse incremental learning
- CV_RMSE:
-
coefficient of variation of root-mean squared error
- DANN:
-
domain adversarial neural network
- DIL:
-
data incremental learning
- DTL:
-
deep transfer learning
- LSTM:
-
long short-term memory
- MAPE:
-
mean absolute percentage error
- KTL:
-
knowledge transfer learning
- RMSE:
-
root mean square error
- PIR:
-
performance improvement ratio
References
Ahmad MW, Mourshed M, Rezgui Y (2017). Trees vs Neurons: Comparison between random forest and ANN for high-resolution prediction of building energy consumption. Energy and Buildings, 147: 77–89.
Ahmad T, Zhang H, Yan B (2020). A review on renewable energy and electricity requirement forecasting models for smart grid and buildings. Sustainable Cities and Society, 55: 102052.
Chen Y, Xu P, Chu Y, et al. (2017). Short-term electrical load forecasting using the Support Vector Regression (SVR) model to calculate the demand response baseline for office buildings. Applied Energy, 195: 659–670.
Fan C, Xiao F, Zhao Y (2017). A short-term building cooling load prediction method using deep learning algorithms. Applied Energy, 195: 222–233.
Fan C, Sun Y, Xiao F, et al. (2020). Statistical investigations of transfer learning-based methodology for short-term building energy predictions. Applied Energy, 262: 114499.
Fang X, Gong G, Li G, et al. (2021a). A general multi-source ensemble transfer learning framework integrate of LSTM-DANN and similarity metric for building energy prediction. Energy and Buildings, 252: 111435.
Fang X, Gong G, Li G, et al. (2021b). A hybrid deep transfer learning strategy for short term cross-building energy prediction. Energy, 215: 119208.
Fang X, Gong G, Li G, et al. (2022). Deep reinforcement learning optimal control strategy for temperature setpoint real-time reset in multi-zone building HVAC system. Applied Thermal Engineering, 212: 118552.
Feng R, Grana D, Balling N (2021). Imputation of missing well log data by random forest and its uncertainty analysis. Computers & Geosciences, 152: 104763.
Foucquier A, Robert S, Suard F, et al. (2013). State of the art in building modelling and energy performances prediction: A review. Renewable and Sustainable Energy Reviews, 23: 272–288.
Fu G (2018). Deep belief network based ensemble approach for cooling load forecasting of air-conditioning system. Energy, 148: 269–282.
Ganin Y, Ustinova E, Ajakan H, et al. (2017). Domain-adversarial training of neural networks. In: Csurka G (Ed), Domain Adaptation in Computer Vision Applications. Cham, Switzerland: Springer.
Gao Y, Ruan Y, Fang C, et al. (2020). Deep learning and transfer learning models of energy consumption forecasting for a building with poor information data. Energy and Buildings, 223: 110156.
García-Laencina PJ, Sancho-Gómez J-L, Figueiras-Vidal AR (2010). Pattern classification with missing data: A review. Neural Computing and Applications, 19: 263–282.
González-Vidal A, Mendoza-Bernal J, Niu S, et al. (2023). A transfer learning framework for predictive energy-related scenarios in smart buildings. IEEE Transactions on Industry Applications, 59: 26–37.
Jin Y, Acquah MA, Seo M, Han S (2022). Short-term electric load prediction using transfer learning with interval estimate adjustment. Energy and Buildings, 258: 111846.
Li A, Xiao F, Fan C, et al. (2021). Development of an ANN-based building energy model for information-poor buildings using transfer learning. Building Simulation, 14: 89–101.
Li G, Zhao X, Fan C, et al. (2021). Assessment of long short-term memory and its modifications for enhanced short-term building energy predictions. Journal of Building Engineering, 43: 103182.
Li A, Fan C, Xiao F, et al. (2022a). Distance measures in building informatics: An in-depth assessment through typical tasks in building energy management. Energy and Buildings, 258: 111817.
Li G, Li F, Ahmad T, et al. (2022b). Performance evaluation of sequence-to-sequence-Attention model for short-term multi-step ahead building energy predictions. Energy, 259: 124915.
Li G, Li F, Xu C, et al. (2022c). A spatial-temporal layer-wise relevance propagation method for improving interpretability and prediction accuracy of LSTM building energy prediction. Energy and Buildings, 271: 112317.
Li G, Wu Y, Liu J, et al. (2022d). Performance evaluation of short-term cross-building energy predictions using deep transfer learning strategies. Energy and Buildings, 275: 112461.
Li J, Zhang C, Zhao Y, et al. (2022e). Federated learning-based short-term building energy consumption prediction method for solving the data silos problem. Building Simulation, 15: 1145–1159.
Li G, Chen L, Liu J, et al. (2023a). Comparative study on deep transfer learning strategies for cross-system and cross-operation-condition building energy systems fault diagnosis. Energy, 263: 125943.
Li W, Tang R, Wang S (2023b). A fully distributed robust optimal control approach for air-conditioning systems considering uncertainties of communication link in IoT-enabled building automation systems. Energy and Built Environment
Lissa P, Schukat M, Keane M, et al. (2021). Transfer learning applied to DRL-Based heat pump control to leverage microgrid energy efficiency. Smart Energy, 3: 100044.
Liu J, Zhang Q, Li X, et al. (2021). Transfer learning-based strategies for fault diagnosis in building energy systems. Energy and Buildings, 250: 111256.
Low R, Tekler ZD, Cheah L (2020). Predicting commercial vehicle parking duration using generative adversarial multiple imputation networks. Transportation Research Record: Journal of the Transportation Research Board, 2674: 820–831.
Ma J, Cheng JCP, Jiang F, et al. (2020). A bi-directional missing data imputation scheme based on LSTM and transfer learning for building energy data. Energy and Buildings, 216: 109941.
Masana M, Liu X, Twardowski B, et al. (2023). Class-incremental learning: Survey and performance evaluation on image classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45: 5513–5533.
Mat Daut MA, Hassan MY, Abdullah H, et al. (2017). Building electrical energy consumption forecasting analysis using conventional and artificial intelligence methods: A review. Renewable and Sustainable Energy Reviews, 70: 1108–1118.
Miller C, Kathirgamanathan A, Picchetti B, et al. (2020). The Building Data Genome Project 2, energy meter data from the ASHRAE Great Energy Predictor III competition. Scientific Data, 7: 368.
Miller C, Picchetti B, Fu C, et al. (2022). Limitations of machine learning for building energy prediction: ASHRAE Great Energy Predictor III Kaggle competition error analysis. Science and Technology for the Built Environment, 28: 610–627.
Ng RW, Begam KM, Rajkumar RK, et al. (2021). An improved self-organizing incremental neural network model for short-term time-series load prediction. Applied Energy, 292: 116912.
O’Neill Z, Eisenhower B (2013). Leveraging the analysis of parametric uncertainty for building energy model calibration. Building Simulation, 6: 365–377.
Parisi GI, Kemker R, Part JL, et al. (2019). Continual lifelong learning with neural networks: A review. Neural Networks, 113: 54–71.
Park H, Park DY, Noh B, et al. (2022). Stacking deep transfer learning for short-term cross building energy prediction with different seasonality and occupant schedule. Building and Environment, 218: 109060.
Peirelinck T, Kazmi H, Mbuwir BV, et al. (2022). Transfer learning in demand response: A review of algorithms for data-efficient modelling and control. Energy and AI, 7: 100126.
Pinto G, Messina R, Li H, et al. (2022a). Sharing is caring: An extensive analysis of parameter-based transfer learning for the prediction of building thermal dynamics. Energy and Buildings, 276: 112530.
Pinto G, Wang Z, Roy A, et al. (2022b). Transfer learning for smart buildings: A critical review of algorithms, applications, and future perspectives. Advances in Applied Energy, 5: 100084.
Sayed AN, Himeur Y, Bensaali F (2022). Deep and transfer learning for building occupancy detection: A review and comparative analysis. Engineering Applications of Artificial Intelligence, 115: 105254.
Sendra-Arranz R, Gutiérrez A (2020). A long short-term memory artificial neural network to predict daily HVAC consumption in buildings. Energy and Buildings, 216: 109952.
Singh MM, Singaravel S, Geyer P (2021). Machine learning for early stage building energy prediction: Increment and enrichment. Applied Energy, 304: 117787.
Skillington K, Crawford RH, Warren-Myers G, et al. (2022). A review of existing policy for reducing embodied energy and greenhouse gas emissions of buildings. Energy Policy, 168: 112920.
Somu N, M R GR, Ramamritham K (2020). A hybrid model for building energy consumption forecasting using long short term memory networks. Applied Energy, 261: 114131.
Stekhoven DJ, Bühlmann P (2012). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28: 112–118.
Sun Y, Haghighat F, Fung BCM (2020). A review of the-state-of-the-art in data-driven approaches for building energy prediction. Energy and Buildings, 221: 110022.
Sun Y, Haghighat F, Fung BCM (2022). Trade-off between accuracy and fairness of data-driven building and indoor environment models: A comparative study of pre-processing methods. Energy, 239: 122273.
Sun Y, Li J, Xu Y, et al. (2023). Deep learning versus conventional methods for missing data imputation: A review and comparative study. Expert Systems with Applications, 227: 120201.
van Buuren S, Oudshoorn CGM (2000). Multivariate Imputation by Chained Equations: Mice V1.0 User’s manual. Netherlands Organization for Applied Scientific Research (TNO).
Vincent P, Larochelle H, Bengio Y, et al. (2008). Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th International Conference on Machine learning, Helsinki, Finland.
Wang Z, Srinivasan RS (2017). A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models. Renewable and Sustainable Energy Reviews, 75: 796–808.
Wang Z, Wang Y, Zeng R, et al. (2018). Random Forest based hourly building energy prediction. Energy and Buildings, 171: 11–25.
Wang W, Hong T, Xu X, et al. (2019). Forecasting district-scale energy dynamics through integrating building network and long short-term memory learning algorithm. Applied Energy, 248: 217–230.
Wang Z, Liu J, Zhang Y, et al. (2021). Practical issues in implementing machine-learning models for building energy efficiency: Moving beyond obstacles. Renewable and Sustainable Energy Reviews, 143: 110929.
Wang C, Yuan J, Huang K, et al. (2022a). Research on thermal load prediction of district heating station based on transfer learning. Energy, 239: 122309.
Wang Z, Xia L, Yuan H, et al. (2022b). Principles, research status, and prospects of feature engineering for data-driven building energy prediction: A comprehensive review. Journal of Building Engineering, 58: 105028.
Xiao T, Xu P, He R, et al. (2022a). Status quo and opportunities for building energy prediction in limited data Context—Overview from a competition. Applied Energy, 305: 117829.
Xiao Z, Gang W, Yuan J, et al. (2022b). Impacts of data preprocessing and selection on energy consumption prediction model of HVAC systems based on deep learning. Energy and Buildings, 258: 111832.
Yoon J, Jordon J, van der Schaar M (2018). GAIN: Missing data imputation using generative adversarial nets. arXiv: 1806.02920.
Zhang L, Wen J (2021). Active learning strategy for high fidelity short-term data-driven building energy forecasting. Energy and Buildings, 244: 111026.
Zhang X, Sun Y, Gao D, et al. (2022). Similarity-based grouping method for evaluation and optimization of dataset structure in machine-learning based short-term building cooling load prediction without measurable occupancy information. Applied Energy, 327: 120144.
Zhao Y, Li T, Zhang X, et al. (2019). Artificial intelligence-based fault detection and diagnosis methods for building energy systems: Advantages, challenges and the future. Renewable and Sustainable Energy Reviews, 109: 85–101.
Zhuang F, Qi Z, Duan K, et al. (2021). A comprehensive survey on transfer learning. Proceedings of the IEEE, 109: 43–76.
Acknowledgements
This work was jointly supported by the Opening Fund of Key Laboratory of Low-grade Energy Utilization Technologies and Systems of Ministry of Education of China (Chongqing University) (LLEUTS-202305), the Opening Fund of State Key Laboratory of Green Building in Western China (LSKF202316), the open Foundation of Anhui Province Key Laboratory of Intelligent Building and Building Energy-saving (IBES2022KF11), “The 14th Five-Year Plan” Hubei Provincial advantaged characteristic disciplines (groups) project of Wuhan University of Science and Technology (2023D0504, 2023D0501), the National Natural Science Foundation of China (51906181), the 2021 Construction Technology Plan Project of Hubei Province (2021-83), and the Science and Technology Project of Guizhou Province: Integrated Support of Guizhou [2023] General 393.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Original draft, review, editing and funding acquisition were performed by Guannan Li. Review, editing and project administration were performed by Chengchu Yan. Data preparation and software were performed by Yubei Wu and Zixi Wang. Review and editing were performed by Xi Fang, Tao Li, Jiajia Gao and Chengliang Xu. The first draft of the manuscript was written by Yubei Wu and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted
Rights and permissions
About this article
Cite this article
Li, G., Wu, Y., Yan, C. et al. An improved transfer learning strategy for short-term cross-building energy prediction using data incremental. Build. Simul. 17, 165–183 (2024). https://doi.org/10.1007/s12273-023-1053-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12273-023-1053-x