Real-Time Stream Mining Electric Power Consumption Data Using Hoeffding Tree with Shadow Features
Many energy load forecasting models have been established from batch-based supervised learning models where the whole data must be loaded to learn. Due to the sheer volumes of the accumulated consumption data which arrive in the form of continuous data streams, such batch-mode learning requires a very long time to rebuild the model. Incremental learning, on the other hand, is an alternative for online learning and prediction which learns the data stream in segments. However, it is known that its prediction performance falls short when compared to batch learning. In this paper, we propose a novel approach called Shadow Features (SF) which offer extra dimensions of information about the data streams. SF are relatively easy to compute, suitable for lightweight online stream mining.
KeywordsElectric power consumption prediction Data stream mining Shadow features
The authors are thankful for the financial support from the Research Grant Temporal Data Stream Mining by Using Incrementally Optimized Very Fast Decision Forest (iOVFDF), Grant no. MYRG2015-00128-FST, offered by the University of Macau, FST, and RDAO.
- 1.Getty Museum, J.P.: Photography: Discovery and Invention. ISBN 0-89236-177-8 (1990)Google Scholar
- 2.Vishwakarma, D.K., Rawat, P., Kapoor, R.: Human activity recognition using gabor wavelet transform and ridgelet transform. In: 3rd International Conference on Recent Trends in Computing 2015 (ICRTC-2015), vol. 57, pp. 630–636 (2015)Google Scholar
- 3.Zhang, M., Sawchuk, A.A.: A feature selection-based framework for human activity recognition using wearable multimodal sensors. In: Proceedings of the 6th International Conference on Body Area Networks, pp. 92–98 (2011)Google Scholar
- 4.Fong, S.: Adaptive forecasting of earthquake time series by incremental decision tree algorithm. Inf. J. 16(12), 8387–8395 (2013). International Information Institute (Tokyo)Google Scholar
- 6.Zhou, N.: Earthquake Forecasting Using Dynamic Hurst Coefficiency, MSc thesis, Department of Computer and Information Science, University of Macau, Macau SAR (2013)Google Scholar
- 7.Holmes, B.A.: Bernhard Pfahringer, Philipp Kranen, Hardy Kremer, Timm Jansen, Thomas Seidl. In: MOA: Massive Online Analysis, a Framework for Stream Classification and Clustering. Workshop and Conference Proceedings. vol. 11: Workshop on Applications of Pattern Analysis, pp. 1–14 (2010)Google Scholar