# Comparative assessments of binned and support vector regression-based blade pitch curve of a wind turbine for the purpose of condition monitoring

## Abstract

The unexpected failure of wind turbine components leads to significant downtime and loss of revenue. To prevent this, supervisory control and data acquisition (SCADA) based condition monitoring is considered as a cost-effective approach. In several studies, the wind turbine power curve has been used as a critical indicator for power performance assessment. In contrast, the application of the blade pitch angle curve has hardly been explored for wind turbine condition monitoring purposes. The blade pitch angle curve describes the nonlinear relationship between pitch angle and hub height wind speed and can be used for the detection of faults. A support vector machine (SVM) is an improved version of an artificial neural networks (ANN) and is widely used for classification- and regression-related problems. Support vector regression is a data-driven approach based on statistical learning theory and a structural risk minimization principle which provides useful nonlinear system modeling. In this paper, a support vector regression (a nonparametric machine learning approach)-based pitch curve is presented and its application to anomaly detection explored for wind turbine condition monitoring. A radial basis function (RBF) was used as the kernel function for effective SVR blade pitch curve modeling. This approach is then compared with a binned pitch curve in the identification of operational anomalies. The paper will outline the advantages and limitations of these techniques.

## Keywords

Condition monitoring Support vector regression Performance monitoring Performance curves Wind turbines## Introduction

Increased demand for clean energy has led to the impressive expansion of global wind power installed capacity over the past decade. The newly installed wind turbines requires less maintenance cost but as it get old and out of warranty, then turbines maintenance cost increases significantly. Author of [1] pointed that of the total of 433 GW of wind capacity in 2015, the bulk of this was out of warranty which suggest a massive requirement for operation and maintenance (O&M). It is also well known that such costs are considerably higher for offshore wind turbines and in Ref. [1] it was found that improvement of O&M practice could lead to a reduction of 21% and 11% of the life-cycle costs of offshore and onshore wind farms, respectively. Furthermore, it is expected that the global wind O&M market will reach 20.6 billion US dollars by 2023. With the increase in age of wind turbines and the move to less accessible offshore sites, the O&M cost is expected to grow significantly, which reinforces the drive towards condition-based maintenance [2]. It is imperative to detect failures at an early stage to minimize downtime and maximize productivity, and condition-based maintenance has a crucial role to play in this.

Wind farms equipped with supervisory control and data acquisition (SCADA) systems provide data essential for reliable performance optimization [3]. Performance monitoring based on the available SCADA data is also a cost-effective approach to turbine condition appraisal, as confirmed by various literature reviews [4, 5] that highlight the feasibility of identifying turbine health status using SCADA data, and the vast potential of further enhancing the health monitoring function through sophisticated data analysis. SCADA-based monitoring of the condition of internal components of a wind turbine can be used to optimize maintenance activities and thus reduce O&M costs and increase reliability and production time; see [6, 7, 8, 9].

The power curve is widely used to assess the performance of a wind turbine; it signifies the nonlinear relationship between power production and hub height wind speed [10]. IEC-61400-12-2 [11] prescribes the method known as ‘binning’ to calculate the power curve. The “method of bins” is a data reduction technique used to normalize the data to construct the measured power curve. This binned power curve includes the effect of the site turbulence and all other effects reflecting onsite conditions [12].

Nonparametric models are data driven, and their structure is not specified a priori but is obtained exclusively from the data [13]. Commonly used nonparametric models in wind turbine condition monitoring are support vector machine (SVM), copulas, Gaussian process (GP) and other data derived models; see [13, 14, 15]. SCADA data record a large number of measurements which make nonparametric models appropriate.

In the last decade, support vector machine (SVM), a novel and potent machine learning technique, has been successfully used for classification- and regression-related problems. Support vector machine (SVM) based on the applications is categorized into support vector classification (SVC) and support vector regression (SVR). The SVM method uses a technique called ‘kernel trick’ to solve linear and nonlinear classification-related problems where its ability to deal with high dimensional data for a relatively small training set is satisfactory [16]. This allows replacement of the inner product (<*x*, *y*>) in an algorithm with a kernel (*k* (*x*, *y*)) and this approach is particularly valuable in a condition where it is more convenient to compute the kernel than the feature vector itself. Comparative studies [17] show that generalization of SVM to complex models is better than that for artificial neural network (ANN), though it suffers from a more extended training time for large datasets. To deal with this, the least squares support vector machine (LSSVM) approach proposed transforming complex quadratic programming into a linear problem; see [18]. SVM has demonstrated satisfactory performance on regression and time-series prediction by solving the nonlinear relationship efficiently and stable across a range of applications; see examples [19, 20, 21]. The SVM model has been reported for short-term wind speed forecasting and yielded accurate results; see [22]. Comparative performance of SVM and multilayer perceptron (MLP) neural networks for wind speed prediction have been studied in [23], and the results suggest that the SVM approach outperforms the MLP model with respect to the root mean squared error (RMSE). Furthermore, SVM and neural network developed for short-term wind forecasting [24] indicate that SVM performance is superior. Li et al. [25] proposed a model based on SVM classification to diagnose gearbox faults with promising results. In another paper [26], an SCADA-based SVM model was constructed for diagnosing and predicting wind turbine faults.

Recently, the wind turbine condition monitoring studies have mostly focused on the power curve for evaluating performance. However, this cannot reflect the complete turbine operation since the operational behavior of the wind turbines is profoundly influenced by a parameter such as a rotor power, torque, and pitch angle. Valid assessments of these parameters improve the power performance of a wind turbine. In this study, the blade pitch angle impact on wind turbine performance is analyzed using the blade pitch curve that reveals the nonlinear relationship between pitch angle and the hub height wind speed that can be useful for analyzing wind turbine performance and the detection of faults.

This paper proposed a novel support vector regression (SVR) approach to estimate wind turbine blade pitch curve and its application in anomaly detection for condition monitoring. The binning method is a benchmark data reduction approach for the wind industries, but its application is generally limited to the power curve. In this study, the binning method is applied to calculate the blade pitch curve. Finally, a comparative analysis of the binned blade pitch curve and support vector regression blade pitch curve is undertaken regarding fitting uncertainty and identifies the advantages and disadvantages of the SVR model.

This paper is structured as follows: The introduction is the first section. The next section describes the wind turbine performance curves and air density corrections. The following section describes the SCADA dataset and its pre-processing. The next section outlines the methodologies and this section is further divided into subsections explaining support vector regression (SVR) and the binning approach to wind turbine blade pitch curve modeling. The next section presents the comparative analysis of proposed models and the last section concludes the paper.

## Wind turbine performance curves

*ρ*is the air density (kg/m

^{3}),

*A*is the swept area (m

^{2}),

*C*

_{p}is the power coefficient of wind turbine and

*v*is the hub height wind speed (m/s). The tip speed ratio (

*λ*) and pitch angle (

*β*) affect the power coefficient and thus affect the power production of a wind turbine.

*V*

_{C}is calculated using Eqs. (2) and (3) as shown below

*V*

_{C}and

*V*

_{M}are the corrected and measured wind speed in m/s and the corrected air density is calculated by Eq. (2) where

*B*is atmospheric pressure in mbar and

*T*the temperature in Kelvin for which 10 min average values obtained from SCADA data are used. The corrected wind speed (

*V*

_{C}) from Eq. (3) is then used to calculate the power curve normally by binning.

## SCADA data for wind turbine performance curves

Supervisory control and data acquisition (SCADA) systems record the operational status of wind turbines and are essential for reliable performance optimization. Performance monitoring based on available SCADA data is also a cost-effective approach to turbine condition appraisal, as confirmed by various literature reviews that highlight the feasibility of identifying turbine health status using SCADA data, and the high potential of further enhancing the health monitoring function through sophisticated data analysis.

SCADA dataset description

Start timestamp | End timestamp | Measured dataset | Filtered dataset |
---|---|---|---|

1/7/2012 00:00 AM | 31/7/2012 23:50 PM | 4464 | 626 |

## Methodologies to be compared

The two approaches, namely binning and support vector regression used to build effective blade pitch angle curves for wind turbine condition monitoring, are described as follows.

### Support vector regression-based blade pitch curve

*n*-by-

*n*matrix which contains elements;

*g*

_{ij}=

*G*(

*x*

_{i},

*x*

_{j}), where \(x _{i} ,x_{j}\) are the training SCADA data points. This Lagrange dual formulation complements the nonlinear system and hence used in this study. A nonlinear SVR calculates the optimal function

*f*(

*x*) in the transformed predictor space where the SVR looks for the coefficient that minimizes the Lagrangian function using the dual formula [32, 33]:

*f*(

*x*) used to fit the SVR model for blade pitch curve is given by

*ɛ*-SVR due to its scarcity representation capability [34]. The ε-insensitive loss function is used to build the objective function of the

*ɛ*SVR. The v-SVR is another type of regression where v parameters are used to control the number of support vectors; see [35, 36]. The Karush–Kuhn–Tucker (KKT) conditions play an important role in dealing with constrained optimization and using KKT conditions [37]; of the quadratic programming, only a certain number of the coefficients (

*α*

_{n}−

*α*

_{ n}

^{*}) will assume nonzero values. The KKT complementarity conditions are optimization constraints required to obtain optimal solutions, and for nonlinear SVM regression these conditions are

Parameter C determines the trade off between the model complexity (flatness) and the degree to which deviations larger than *ɛ* are tolerated in optimization formulation [31]. Parameter *ɛ* controls the width of the *ɛ*-insensitive zone, used to fit the training data sets. It also affects the number of support vectors and hence is important for an effective blade pitch curve SVR model. In short, both *C* and *ɛ* affect SVR model performance and hence it is necessary to find optimal values for these parameters using appropriate optimization techniques. The calculation of *ɛ* and *C* is based on the nature of input datasets and choice of kernel. In this study, a Gaussian kernel was used and *C* and *ɛ* values were calculated as iqr(*Y*)/13.49 where iqr(*Y*) is the interquartile range of the response variable *Y* [33, 34]. The 13.349 is a rescaling factor (that quantifies the statistical dispersion in a set of numerical data) that reflects the change from interquartile range to standard deviation. The calculated values of box constraint and epsilon are used in the wind turbine blade pitch curve modelling.

*α*

_{n}and

*α*

_{ n}

^{*}are the nonnegative multipliers for each observation

*x*

_{n}. The obtained biased value for this study is 1.70 which is added into the SVR model to predict the blade pitch curve of wind turbine accurately.

*γ*is the kernel scale for given points

*x*and

*x*

^{′}.

The Gaussian kernel is also popularly known by radial basis function (RBF) kernel and widely used. For example, the authors of [38] demonstrated that the use of SVR in hydrological modeling and they highlighted the excellent performance of the RBF.

The cross-validation of five folds is used to find the best value for kernel scale and to prevent overfitting [35]. The SCADA datasets described in “SCADA data for wind turbine performance curves” were randomly shuffled and split into training and testing datasets for training and SVR model validations purposes, respectively.

### Binned based blade pitch curve

*V*

_{i}is the normalized and averaged wind speed in bin

*i*,

*V*

_{n,i,j}is the normalized wind speed of data sets

*j*in bin

*i*,

*B*

_{i}is the normalized and averaged pitch angle in bin

*i*,

*B*

_{n,i,j}is the normalized pitch angle of data set

*j*in bin

*i*, and

*N*

_{i}is the number of 10 min average data sets in bin

*i*.

## Comparative analysis of binned pitch curve and SVR pitch curve

## Conclusion and discussion

This paper has proposed an SVM-based regression model for estimating the blade pitch angle curve. The estimated SVR pitch curve follows the standard variations, though due to the lack of data points in above rated wind speed, its accuracy suffers. This highlights how the quality and quantity of data points significantly affects the SVR model prediction accuracy. SVR is then compared with the conventional approach based on a binned pitch curve together with individual bin probability distributions to identify operational anomalies. This comparative study yielded significant results. The SVR blade pitch curve closely follows the binned pitch curve, but above rated wind speed, there are fewer SCADA data values available and, as a result, the SVR curve is less well determined with some mismatch with the binned pitch curve. The major issue associated with wind turbine condition monitoring is to detect a fault or failure as soon as possible and with limited computational time and processing power so that catastrophic damage due to failure can be prevented with a cost-effective approach. The comparative analysis illustrates the strengths and weaknesses of these techniques in context to anomaly detection and model uncertainty. This should support a wind farm operator in selecting the best method for wind turbine condition monitoring.

The future work is to develop and appropriate uncertainty analysis for the SVR blade pitch curve and then use it for developing a practical fault detection SVR algorithm.

## Notes

### Acknowledgements

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie Grant Agreement No 642108.

## References

- 1.Sheng, S.: Prognostics and Health Management of Wind Turbines: Current Status and Future Opportunities. National Renewable Energy Laboratory, Golden (2015).
**(NREL/PR-5000–65605)**Google Scholar - 2.Schlechtingen, M., Santos, I.F., Achiche, S.: Wind turbine condition monitoring based on SCADA data using normal behavior models. Part 1: system description. Appl. Soft Comput.
**13**, 259–270 (2013)CrossRefGoogle Scholar - 3.Kabir, M.J., Oo, A.M.T., Rabbani, M.: A brief review on offshore wind turbine fault detection and recent development in condition monitoring based maintenance system. In: Australasian Universities Power Engineering Conference (AUPEC), Wollongong, SW, pp. 1–7. https://doi.org/10.1109/aupec.2015.7324871(2015
- 4.Tautz-Weinert, J., Watson, S.J.: Using SCADA data for wind turbine condition monitoring—a review. IET Renew. Power Gener.
**11**(4), 382–394 (2017). https://doi.org/10.1049/iet-rpg.2016.0248 CrossRefGoogle Scholar - 5.Wang, K.S., Sharma, V.S., Zhang, Z.Y.: SCADA data based condition monitoring of wind turbines. Adv. Manuf.
**2**, 61 (2014). https://doi.org/10.1007/s40436-014-0067-0 CrossRefGoogle Scholar - 6.Gómez Muñoz, C.Q., García Marquez, F.P., Liang, C., Maria, K., Abbas, M., Mayorkinos, P.: A new condition monitoring approach for maintenance management in concentrate solar plants. In: Proceedings of the 9th international conference on management science and engineering management. Springer, Berlin, pp. 999–1008 (2015)Google Scholar
- 7.Márquez, F.P.G., Pedregal, D.J., Roberts, C.: New methods for the condition monitoring of level crossings. Int. J. Syst. Sci.
**46**, 878–884 (2015)CrossRefGoogle Scholar - 8.Feng, Y., Qiu, Y., Crabtree, C.J., Long, H., Tavner, P.J.: Monitoring wind turbine gearboxes. Wind Energy.
**16**, 728–740 (2012). https://doi.org/10.1002/we.1521 CrossRefGoogle Scholar - 9.Zaher, A., McArthur, S.D.J.: Online wind turbine fault detection through automated SCADA data analysis. Wind Energy.
**12**, 574–597 (2009)CrossRefGoogle Scholar - 10.Burton, T., Sharpe, D., Jenkind, N., Bossanyi, E.: Wind energy handbook. John Wiley & Sons, Chichester, UK (2001)CrossRefGoogle Scholar
- 11.Wind Turbines—Part 12-1.: Power Performance Measurements of Electricity Producing Wind Turbines, British Standard, IEC 61400-12-1 (2006)Google Scholar
- 12.Trivellato, F., Battisti, L., Miori, G.: The ideal power curve of small wind turbines from field data. J. Wind Eng. Ind. Aerodyn.
**107–108**, 263–273 (2012)CrossRefGoogle Scholar - 13.Lydia, M., Kumar, S.S., Selvakumar, A.I., Kumar, G.E.P.: A comprehensive review on wind turbine power curve modeling techniques. Renew. Sustain. Energy Rev.
**30**, 452–460 (2014)CrossRefGoogle Scholar - 14.Pandit, R., Infield, D.: Gaussian process operational curves for wind turbine condition monitoring. Energies.
**11**(7), 1631 (2018). https://doi.org/10.3390/en11071631 CrossRefGoogle Scholar - 15.Pandit, R., Infield, D.: Performance Assessment of a Wind Turbine Using SCADA based Gaussian Process Model. Int. J. Progn. Health Manag.
**9**(023), 8 (2018)Google Scholar - 16.Belousov, A., Verzakov, S.A., Von Frese, J.: A flexible classification approach with optimal generalisation performance; support vector machines. Chemom. Intell. Lab Syst.
**64**, 15–25 (2002)CrossRefGoogle Scholar - 17.Sideratos, G., Hatziargyriou, N.: Wind power forecasting focused on extreme power system events. IEEE Trans. Sustain. Energy
**3**(3), 445–454 (2012)CrossRefGoogle Scholar - 18.Yuan, X.H., Chen, C., Yuan, Y.B., Huang, Y.H., Tan, Q.X.: Short-term wind power prediction based on LSSVM–GSA model. Energy Convers Manag.
**101**, 393–401 (2015)CrossRefGoogle Scholar - 19.Hong, W.: Rainfall forecasting by technological machine learning models. Appl. Math. Comput.
**200**, 41–57 (2008)MathSciNetzbMATHGoogle Scholar - 20.Trafalis, B.T., Adrianto, I., Richman, M.B.: Active learning with support vector machines for tornado prediction. Lect. Notes Comput. Sci.
**4487**, 1130–1137 (2007)CrossRefGoogle Scholar - 21.Lin, G.F., Chen, G.R., Huang, P.-Y., Chou, Y.C.: Support vector machine-based models for hourly reservoir inflow forecasting during typhoon-warning periods. J Hydrol.
**372**, 17–29 (2009)CrossRefGoogle Scholar - 22.Zhou, J., Shi, J., Li, G.: Fine tuning support vector machines for short-term wind speed forecasting. Energy Convers. Manag.
**52**, 1990–1998 (2011)CrossRefGoogle Scholar - 23.Mohandes, M.A., Halawani, T.O., Rehman, S., Hussain, A.A.: Support vector machines for wind speed prediction. Renew. Energy.
**29**(6), 939–947 (2004)CrossRefGoogle Scholar - 24.Sreelakshmi, K., Kumar, P.R.: Performance evaluation of short-term wind speed prediction techniques. Int. J. Comput. Sci. Netw. Secur.
**8**, 162–169 (2008)Google Scholar - 25.Li, C., et al.: Multimodal deep support vector classification with homologous features and its application to gearbox fault diagnosis. Neurocomputing.
**168**, 119–127 (2015)CrossRefGoogle Scholar - 26.Leahy, K., Hu, R.L., Konstantakopoulos, I.C., Spanos, C.J., Agogino, A.M., O’Sullivan, D.T. J.: Diagnosing and predicting wind turbine faults from SCADA data using support vector machines. Int. J. Progn. Health Manag. ISSN 2153-2648, 006 (2018)Google Scholar
- 27.Ouyang, T., Kusiak, A., He, Y.: Modeling wind-turbine power curve: a data partitioning and mining approach. Renew Energy.
**102**, 1–8 (2017)CrossRefGoogle Scholar - 28.Kim, K., Parthasarathy, G., Uluyol, O., et al.: Use of SCADA data for failure detection in wind turbines. In: Conference Paper, NREL/CP-5000-51653 (2011)Google Scholar
- 29.Schlechtingen, M., Santos, I.F.: Comparative analysis of neural network and regression based condition monitoring approaches for wind turbine fault detection. Mech. Syst. Signal Process.
**25**(5), 1849–1875 (2011)CrossRefGoogle Scholar - 30.Boser, B.E., Guyon, I. M., Vapnik, V. N.: A training algorithm for optimal margin classifiers. In: Proceedings of the fifth annual workshop on Computational learning theory—COLT ‘92, p. 144. ISBN 089791497X (1992). https://doi.org/10.1145/130385.130401
- 31.Cherkassky, V., Ma, Y.: Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw.
**17**, 113–126 (2004)CrossRefGoogle Scholar - 32.Fan, R.E., Chen, P.H., Lin, C.J.: A study on SMO-type decomposition methods for support vector machines. IEEE Trans. Neural Netw.
**17**, 893–908 (2006)CrossRefGoogle Scholar - 33.Mathworks (2017) Support vector machine toolbox. https://uk.mathworks.com/help/stats/supportvector-machine-regression.html. Accessed Apr 2018
- 34.Zeng, J., Qiao, W.: Support Vector Machine-Based Short-Term Wind Power Forecasting. In: Proceedings of the IEEE PES Power System Conference and Exposition, Phoenix, March 20–23, ISBN: 978-1-61284-788-7 (2011)Google Scholar
- 35.Chang, C.-C., Lin, C.-J.: Training v-support vector regression: theory and algorithms. Neural Comput.
**14**(8), 1959–1977 (2002)CrossRefGoogle Scholar - 36.Bo, Q., Wang, X., Liu, K.: Minimum frequency prediction of power system after disturbance based on the v-support vector regression. In: International Conference on Power System Technology, Chengdu, pp. 614–619 (2014). https://doi.org/10.1109/powercon.2014.6993789
- 37.Miller, K.R., Vapnik, V.: Using Support Vector Machine for Time Series Prediction, pp. 243–253. MIT, Cambridge (1999)Google Scholar
- 38.Ch, S., Anand, N., Panigrahi, B.K., Mathur, S.: Streamflow forecasting by SVM with quantum behaved particle swarm optimization. Neurocomputing.
**101**, 18–23 (2013)CrossRefGoogle Scholar - 39.Llombart, A., Watson, S.J., Llombart, D., Fandos, J.M.: Power curve characterization I: improving the bin method. In: International conference on renewable energies and power quality, Zaragoza (2005)Google Scholar

## Copyright information

**Open Access**This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.