Skip to main content

Short Term Wind Turbine Power Output Prediction

  • Conference paper
  • First Online:
Performance Evaluation Methodologies and Tools (VALUETOOLS 2022)

Abstract

In the wind energy industry, it is of great importance to develop models that accurately forecast the power output of a wind turbine, as such predictions are used for wind farm location assessment or power pricing and bidding, monitoring, and preventive maintenance. As a first step, and following the guidelines of the existing literature, we use the supervisory control and data acquisition (SCADA) data to model the wind turbine power curve (WTPC). We explore various parametric and non-parametric approaches for the modeling of the WTPC, such as parametric logistic functions, and non-parametric piecewise linear, polynomial, or cubic spline interpolation functions. We demonstrate that all aforementioned classes of models are rich enough (with respect to their relative complexity) to accurately model the WTPC, as their mean squared error (MSE) is close to the MSE lower bound calculated from the historical data. However, all aforementioned models, when it comes to forecasting, seem to have an intrinsic limitation, due to their inability to capture the inherent auto-correlation of the data. To avoid this conundrum, we show that adding a properly scaled ARMA modeling layer increases short-term prediction performance, while keeping the long-term prediction capability of the model. We further enhance the accuracy of our proposed model, by incorporating additional environmental factors that affect the power output, such as the ambient temperature and the wind direction.

The authors acknowledge the Daisy4offshore consortium for the provision of the data. The work of Sándor Kolumbán and Stella Kapodistria is supported by NWO through the Gravitation-grant NETWORKS-024.002.003. Sándor Kolumbán also received funding from the Institute for Complex Molecular Systems (ICMS) in Eindhoven. Furthermore, the research of Stella Kapodistria was partly done in the framework of the TKI-WoZ: Daisy4offshore project.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Akaike, H.: A new look at the statistical model identification. IEEE Trans. Autom. Control 19(6), 716–723 (1974)

    Article  MathSciNet  MATH  Google Scholar 

  2. Belsley, D.A., Kuh, E., Welsch, R.E.: Regression Diagnostics: Identifying Influential Data and Sources of Collinearity, vol. 571. Wiley, Hoboken (2005)

    MATH  Google Scholar 

  3. Carrillo, C., Obando Montaño, A., Cidrás, J., Díaz-Dorado, E.: Review of power curve modelling for wind turbines. Renew. Sustain. Energy Rev. 21, 572–581 (2013)

    Article  Google Scholar 

  4. De Boor, C.: A Practical Guide to Splines. Applied Mathematical Sciences, vol. 27, 1st edn. Springer, New York (1978)

    Book  MATH  Google Scholar 

  5. Pelletier, F., Masson, C., Tahan, A.: Wind turbine power curve modelling using artificial neural network. Renew. Energy 89, 207–214 (2016)

    Article  Google Scholar 

  6. IEC 61400–12-1, E.: Wind turbines - part 12-1: Power performance measurements of electricity producing wind turbines. Technical report, 88/244/FDIS (2005)

    Google Scholar 

  7. Kang, H., Chen, F., Li, Y., Deng, J., Yang, Z.: Knot calculation for spline fitting via sparse optimization. Comput.-Aided Design 58, 179–188 (2015). Solid and Physical Modeling 2014

    Article  MathSciNet  Google Scholar 

  8. Kusiak, A., Zheng, H., Song, Z.: On-line monitoring of power curves. Renew. Energy 34(6), 1487–1493 (2009)

    Article  Google Scholar 

  9. Lee, G., Ding, Y., Genton, M.G., Xie, L.: Power curve estimation with multivariate environmental factors for inland and offshore wind farms. J. Am. Stat. Assoc. 110(509), 56–67 (2015)

    Article  MathSciNet  Google Scholar 

  10. Li, S., Wunsch, D.C., O’Hair, E., Giesselmann, M.G.: Comparative analysis of regression and artificial neural network models for wind turbine power curve estimation. J. Sol. Energy Eng. 123, 327–332 (2001)

    Article  Google Scholar 

  11. Ljung, L.: System Identification Toolbox User’s Guide. The MathWorks Inc. (2010)

    Google Scholar 

  12. Lydia, M., Kumar, S.S., Selvakumar, A.I., Prem Kumar, G.E.: A comprehensive review on wind turbine power curve modeling techniques. Renew. Sustain. Energy Rev. 30, 452–460 (2014)

    Article  Google Scholar 

  13. Lydia, M., Selvakumar, A., Kumar, S., Kumar, G.: Advanced algorithms for wind turbine power curve modeling. IEEE Trans. Sustain. Energy 4(3), 827–835 (2013)

    Article  Google Scholar 

  14. Manwell, J.F., McGowan, J.G., Rogers, A.L.: Wind Energy Explained: Theory, Design and Application. Wiley, Hoboken (2010)

    Google Scholar 

  15. Nesti, T., Nair, J., Zwart, A.P.: Reliability of dc power grids under uncertainty: a large deviations approach. arXiv preprint arXiv:1606.02986 (2016)

  16. Nesti, T., Zocca, A., Zwart, A.P.: Line failure probability bounds for power grids. arXiv preprint arXiv:1611.02338 (2016)

  17. R. Lázaro, N.Y., Melero, J. (eds.): Wind turbine power curve modelling using Gaussian mixture copula, ANN regressive and BANN. In: The Science of Making Torque from Wind (TORQUE 2022) Journal of Physics: Conference Series (2022)

    Google Scholar 

  18. Ricketts, J., Head, G.: A five-parameter logistic equation for investigating asymmetry of curvature in baroreflex studies. Am. J. Physiol. 277(2), 441–454 (1999)

    Google Scholar 

  19. Schumaker, L.: Spline Functions: Basic Theory. Cambridge Mathematical Library, 3rd edn. (2007)

    Google Scholar 

  20. Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 6(2), 461–464 (1978)

    Article  MathSciNet  MATH  Google Scholar 

  21. Shi, J., Qu, X., Zeng, S.: Short-term wind power generation forecasting: direct versus indirect ARIMA-based approaches. Int. J. Green Energy 8(1), 100–112 (2011)

    Article  Google Scholar 

  22. Shokrzadeh, S., Jafari Jozani, M., Bibeau, E.: Wind turbine power curve modeling using advanced parametric and nonparametric methods. IEEE Trans. Sustain. Energy 5(4), 1262–1269 (2014)

    Article  Google Scholar 

  23. Sohoni, V., Gupta, S.C., Nema, R.K.: A critical review on wind turbine power curve modelling techniques and their applications in wind based energy systems. J. Energy 2016, 18 p., 8519785 (2016). https://doi.org/10.1155/2016/8519785

  24. Stukel, T.: Generalized logistic models. J. Am. Stat. Assoc. 83(402), 426–431 (1988)

    Article  MathSciNet  Google Scholar 

  25. Tümse, S., İlhan, A., Bilgili, M., Şahin, B.: Estimation of wind turbine output power using soft computing models. Energy Sour. Part A: Recovery Util. Environ. Effects 44(2), 3757–3786 (2022)

    Google Scholar 

  26. Vestas: Vestas Manual: V80-2.0 MW, V90–1.8/2.0 MW, V100-1.8 MW (2011)

    Google Scholar 

  27. Widén, J., et al.: Variability assessment and forecasting of renewables: a review for solar, wind, wave and tidal resources. Renew. Sustain. Energy Rev. 44, 356–375 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stella Kapodistria .

Editor information

Editors and Affiliations

A Appendix: Data

A Appendix: Data

The goal of this section is to describe the features of the data used in this study. The data was obtained by the supervisory control and data acquisition (SCADA) system of a wind turbine operator in the Netherlands. The data was collected from an off-shore Vestas V80-2.0MW wind turbine, with a rated capacity of 2MW. Vestas V80-2.0MW joins the grid connection at a wind speed of 4 m/s, has a rated actual power output of 2 MW (typically achieved) at a wind speed of 16 ms, and it is disconnected at a wind speed of 25 ms. See Fig. 1 for a depiction of the theoretical WTPC. These are suggested values offered by the manufacturer, but they might change due to wear of the turbine or due to installation or geographic circumstances.

The data used for the analysis presented in this paper spans across two years and the dataset contains recordings of the environmental conditions, as well as the physical state, and power output of the turbine.

There are two important features of SCADA datasets, which are not specific to the data of our study but are common amongst SCADA datasets recorded throughout the wind industry. One of these is the 10 min reported frequency of the SCADA observations; although the signals of interest are collected at a relatively high frequency, only processed observations calculated on a 10 min window are recorded in the SCADA databases. These processed signals contain the average, maximum, minimum and standard deviation of the wind speed, and the power output amongst other quantities of interest. The second important feature is that the data are strongly quantized due to the rounding of the reported number. As a result, the observations are recorded up to one decimal digit. Some of our finding are consequences of these two properties which correspond to the quasi industry standard. Because of this, we expect that our results are also applicable to similar data coming from other wind turbine operators or wind turbine service providers.

1.1 A.1 Description of the Raw Dataset

All graphs and figures were produced using two seasonal parts of the available dataset. Throughout the paper, we refer to the data recorded between June 1, 2013, and August 31, 2013, as the training data, and the corresponding period of year 2014 as the validation data. Although, we have access to the full two year data set, we choose to restrict our analysis in a specific season of the year, as this reduces seasonality effects, while still maintaining a significant amount of data, and it permits a full decoupling between the training and the validation data. It is important to note that the results presented in the paper still hold when we perform the same analysis using the full year 2013 as training data and the data from 2014 for validation.

The dataset contains observations of various signals every 10 min. Some of the signals contained in the dataset are the ambient wind speed, say \(w_t\), the relative direction of the wind speed with respect to the nacelle, say \(\phi _t\), the ambient temperature, say \(T_t\), and the power output produced by the turbine, say \(p_t\), at time t, \(t\ge 0\). Besides the aforementioned continuous valued signals, there are some nominal variables with a discrete support, such as the variable pertaining to the different operational states of the turbine. Such variables help to identify time periods during which the turbine is out of use (maintenance, free run, blades turned into low resistance position) or if the wind turbine is in a state different from normal operational condition.

In the first part of the paper, we suppress the subscript t as we deal with static models, while in the second part of the paper we deal with dynamic models, and we, therefore, reinstate the subscript t notation.

Table 8. Summary report of the data cleaning
Fig. 18.
figure 18

The power output, p, against the wind speed, w, in the raw (red) and the cleaned (blue) dataset from 2013 (Color figure online)

1.2 A.2 Cleaning the data

The quality of the available SCADA data is extremely good, nevertheless it requires some pre-processing before creating the forecasting models. We list below the cleaning rules implemented in this study, according to which we disregard observations:

  1. 1.

    Missing entries (NAs): there are a few timestamps that are completely missing from the 10 min sampling sequence.

  2. 2.

    Incomplete entries (IN): if one or more of the signal values, e.g., the power output, the wind speed, etc., are missing from a data record, then the full record corresponding to this time stamp is discarded.

  3. 3.

    Not normal operation (NNO): based on the value of the state variables we can disregard states that do not correspond to normal operational conditions, e.g. free rotation of the wind turbine without connection to the grid, derated operation, etc.

  4. 4.

    Outliers: Firstly, all observations of wind power corresponding to the same wind speed are grouped together and the corresponding box plot is generated. Then, for every given wind speed value, all points with power generation outside the whiskers of the box plot (i.e., all observations falling outside the interval \((Q_1-3\text {IQR},Q_3+3\text {IQR})\)) are discarded.

Table 8 contains the summary report of the data cleaning procedure. It shows that approximately 5% of the original data is discarded, still leaving a trove of data to be used for estimation purposes. The scatter plot of the power output, p, against the wind speed, w, is shown in Fig. 18. In this figure, we have color-characterized the training data by depicting in red the raw data, and in blue the cleaned dataset used for the analysis.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kolumbán, S., Kapodistria, S., Nooraee, N. (2023). Short Term Wind Turbine Power Output Prediction. In: Hyytiä, E., Kavitha, V. (eds) Performance Evaluation Methodologies and Tools. VALUETOOLS 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 482. Springer, Cham. https://doi.org/10.1007/978-3-031-31234-2_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-31234-2_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-31233-5

  • Online ISBN: 978-3-031-31234-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics