A Study of Machine Learning Techniques for Daily Solar Energy Forecasting Using Numerical Weather Models
Forecasting solar energy is becoming an important issue in the context of renewable energy sources, and machine learning algorithms play an important role in this field. The prediction of solar energy can be addressed as a time series prediction problem using historical data. Alternatively, solar energy forecasts can be derived from numerical weather prediction (NWP) models. Our interest is focused on the latter approach. We focus on the problem of predicting solar energy from NWP computed from GEFS, the Global Ensemble Forecast System, which predicts meteorological variables for points in a grid. In this context, it is useful to know how prediction accuracy improves with the number of grid nodes used as input for the machine learning techniques. However, using the variables from a large number of grid nodes can produce many attributes, which might degrade the generalization performance of the learning algorithms. In this paper both issues are studied using data supplied by Kaggle for the State of Oklahoma, comparing Support Vector Machines and Gradient Boosted Regression. In addition, three feature selection methods have been tested: Linear Correlation, the ReliefF algorithm, and a new method based on local information analysis.
Keywords: Support Vector Machine, Ensemble Member, Grid Node, Machine Learning Technique, Feature Selection Method
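As an illustration of the simplest of the feature selection methods named in the abstract, the following is a minimal sketch of ranking attributes by linear correlation with the target and keeping the top k. It is not the authors' implementation: the data are synthetic stand-ins rather than the Kaggle/GEFS data, and the attribute names (`node0_radiation`, `node0_cloud_cover`, `node1_noise`), the helper functions, and the choice of k are assumptions made only for this example.

```python
import math
import random


def pearson(xs, ys):
    """Pearson linear correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant attribute carries no linear information
    return cov / math.sqrt(var_x * var_y)


def select_by_correlation(columns, target, k):
    """Return the names of the k attributes with the largest |correlation|."""
    scores = {name: abs(pearson(values, target)) for name, values in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Synthetic stand-in data (NOT the Kaggle/GEFS data): 200 days, three attributes.
rng = random.Random(0)
days = 200
columns = {
    "node0_radiation": [rng.uniform(0, 1) for _ in range(days)],
    "node0_cloud_cover": [rng.uniform(0, 1) for _ in range(days)],
    "node1_noise": [rng.uniform(0, 1) for _ in range(days)],
}
# Hypothetical target: energy rises with radiation and falls with cloud cover.
energy = [
    3.0 * r - 2.0 * c + rng.gauss(0, 0.1)
    for r, c in zip(columns["node0_radiation"], columns["node0_cloud_cover"])
]

# Keep the two attributes most linearly correlated with the target.
selected = select_by_correlation(columns, energy, k=2)
print(selected)
```

A filter of this kind scores each attribute independently of the others, so it is cheap to apply when many grid nodes contribute attributes, but it cannot detect nonlinear or joint effects.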