Abstract
An accurate probability distribution model of wind speed is critical to assessing the reliability contribution of wind energy to power systems. Most current models are built using parametric density estimation (PDE) methods, which usually assume that the wind speed follows a known distribution (e.g. the Weibull or Normal distribution) and estimate the model parameters from historical data. This paper presents a kernel density estimation (KDE) method, a nonparametric way to estimate the probability density function (PDF) of wind speed. The method is data-driven: it makes no assumption about the form of the underlying wind speed distribution and can uncover the statistical information hidden in the historical data. The proposed method is compared with three parametric models using wind data from six sites. The results indicate that KDE outperforms PDE in terms of accuracy and flexibility in describing the long-term wind speed distributions at all sites. A sensitivity analysis with respect to kernel functions is presented, and the Gauss kernel function is found to give the best fit. Case studies on a standard IEEE reliability test system (IEEE-RTS) verify the applicability and effectiveness of the proposed model in evaluating the reliability performance of wind farms.
1 Introduction
The utilization of wind energy has been increasing around the world at an accelerating pace due to its inexhaustible nature, its environmental and social benefits, and public support and government incentives. However, generating capacity from wind farms behaves quite differently from that of conventional generating sources because of the fluctuating and intermittent nature of wind. To handle these features, wind speed is treated as a random variable that follows one of various distributions, such as Weibull, Rayleigh and Gauss [1–8]. Therefore, the assessment of wind farms and of the generating capacity of power systems incorporating wind energy [9–16] should be based on a wind speed distribution analysis.
The estimation of the wind speed distribution is an essential requirement in the reliability analysis of power systems with integrated wind power, since the wind energy available at a particular location is mainly determined by the probability distribution of wind speed. Therefore, a variety of PDFs have been proposed in the literature to describe wind speed distributions. Currently, the widely adopted PDFs are unimodal types [1–8], including the Weibull, Rayleigh, Gauss, Gamma and lognormal functions. In addition, some mixtures of simple unimodal distributions [17–19], such as the two-component mixture Weibull function (Weibull–Weibull) and the singly truncated normal Weibull mixture function (Normal–Weibull), have recently been applied to wind energy analysis and have been shown to be more effective than unimodal types for wind regimes with a bimodal distribution. Detailed reviews on modeling the probability distributions of wind speed in wind energy analysis can be found in Refs. [20–28].
The wind speed models above share a common characteristic: they assume that wind speed can be described by a known probability density function (PDF) and then estimate the function parameters from the historical wind speed data. These assumptions simplify the wind speed model and make analysis and calculation easy. However, when the assumed wind distributions do not match the real ones, the errors in the results can be significant and lead to wrong conclusions.
Nonparametric density estimation (NPDE) methods offer a solution to these problems. These methods need neither prior knowledge of the wind speed distribution nor any assumption about the probability distribution, and are thus suitable for analyzing a feature space of any structure. KDE is the most widely used NPDE technique in data analysis and has applications in geography [29], ecology [30] and other fields.
This paper presents a technique for modeling the long-term wind speed distribution using KDE and applies it to the reliability assessment of power systems containing wind energy. The effects of adding wind capacity to a conventional generating system are illustrated using the IEEE-RTS, which consists of 32 traditional generating units with a total capacity of 3405 MW and a peak load of 2850 MW. The generating unit ratings and reliability parameters are given in Ref. [31]. The loss of load expectation (LOLE) and loss of energy expectation (LOEE) indices are used to assess the risk in this study.
2 Histogram estimation on wind speed probability density
The histogram is a simple, elementary NPDE method. Its basic principle is to divide the sample space into several subspaces and estimate the density from the number of samples in each subspace. Let V = (v _{1},…,v _{ i },…,v _{ n }) represent the wind speed sample. Divide the sample space into m subspaces, the j ^{th} of which is \(B_{j} = [x_{0} + (j - 1)h, \, x_{0} + jh] \quad \left( {j = 1,{ 2}, \ldots ,m} \right)\), where x _{0} is the starting point and h is the subspace width. The PDF given by the histogram estimation at point v can be expressed as
\[\hat{f}(v) = \frac{1}{nh}\sum\limits_{i = 1}^{n} {\sum\limits_{j = 1}^{m} {I(v_{i} \in B_{j} )I(v \in B_{j} )} } \tag{1}\]
where I(·) is the indicator function, which is 1 if its argument holds (for example, v ∈ B _{ j }) and 0 otherwise.
The histogram estimation is simple to compute, but it has several drawbacks: 1) the estimate is accurate near the center of subspace B _{ j } but poor near its edges; 2) it depends strongly on the starting point x _{0} and the smoothing parameter h; 3) it requires a large amount of data for a high-dimensional feature space. These disadvantages make histogram estimation suitable only for density estimation in low dimensions.
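The dependence on the starting point x _{0} can be illustrated with a minimal Python sketch. The function name `histogram_density`, the sample values and the choice of half-open bins [x _{0} + (j − 1)h, x _{0} + jh) are assumptions made for illustration only and are not taken from the study.

```python
import math

def histogram_density(samples, v, x0, h):
    """Histogram density estimate at v, using bins [x0 + (j-1)h, x0 + jh)."""
    # Bin index of the query point
    query_bin = math.floor((v - x0) / h)
    # Count the samples that fall in the same bin as the query point
    count = sum(1 for s in samples if math.floor((s - x0) / h) == query_bin)
    # Normalise by n*h so that the estimate integrates to one
    return count / (len(samples) * h)

# Hypothetical wind speeds in m/s, for illustration only
speeds = [2.1, 2.6, 3.4, 3.9, 4.2, 5.0, 6.8, 7.5]
est_a = histogram_density(speeds, v=3.5, x0=0.0, h=2.0)  # query bin is [2, 4)
est_b = histogram_density(speeds, v=3.5, x0=1.0, h=2.0)  # query bin is [3, 5)
print(est_a, est_b)
```

With the same data and the same width h, shifting x _{0} from 0 to 1 changes the estimate at v = 3.5 from 0.25 to 0.1875, which is the sensitivity described in drawback 2).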
To overcome these problems, Rosenblatt [32] and Parzen [33] improved the histogram estimation method in two ways: first, the indicator function in the histogram is replaced by a smooth kernel function; second, each estimation interval is centered on a sample observation. These improvements lead to the method commonly referred to as KDE.
3 KDE on wind speed probability density
3.1 Basic principle of KDE
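Let (v _{1}, v _{2},…, v _{ n }) represent a sample of the wind speed series. Its underlying PDF can be estimated by the following kernel density function (KDF):
\[\hat{f}(v) = \frac{1}{nh}\sum\limits_{i = 1}^{n} {K\left( {\frac{{v - v_{i} }}{h}} \right)} \tag{2}\]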
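where n is the sample size, h is the bandwidth and K(·) is a kernel function, which satisfies the usual constraints:
\[K(u) \geqslant 0, \quad \int {K(u){\text{d}}u} = 1, \quad \int {uK(u){\text{d}}u} = 0, \quad \int {u^{2} K(u){\text{d}}u} < \infty \tag{3}\]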
When the sample size is sufficiently large, \(\hat{f}\left( v \right)\) converges to f(v) with probability one.
It can be observed from Eq. (2) that the performance of KDE depends on the kernel function and the bandwidth. Many kinds of kernel functions exist. Prakasa Rao [34] indicates that KDE is insensitive to the choice of kernel function when n is sufficiently large. Therefore, the kernel is generally fixed in advance to one standard type. Table 1 shows the commonly used kernel functions.
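A kernel density estimate of this kind can be sketched in a few lines of Python. The sketch below assumes the standard form of the estimator, (1/(nh)) Σ K((v − v _{ i })/h), with a Gauss kernel; the function names, the sample values and the bandwidth h = 1.0 are hypothetical and chosen only for illustration.

```python
import math

def gauss_kernel(u):
    """Standard normal density, used here as the kernel K(u)."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def kde(samples, v, h, kernel=gauss_kernel):
    """Kernel density estimate at v: (1 / (n h)) * sum_i K((v - v_i) / h)."""
    n = len(samples)
    return sum(kernel((v - s) / h) for s in samples) / (n * h)

# Hypothetical wind speeds in m/s and an assumed bandwidth
speeds = [2.1, 2.6, 3.4, 3.9, 4.2, 5.0, 6.8, 7.5]
h = 1.0

# Riemann-sum check that the estimate integrates to approximately one
step = 0.01
grid = [-10.0 + step * k for k in range(3001)]  # covers -10 to 20 m/s
area = sum(kde(speeds, v, h) for v in grid) * step
print(round(area, 4))
```

Because every sample contributes a smooth bump centered on itself, the estimate has no bin edges and no starting point x _{0}; only the kernel and the bandwidth h remain to be chosen. Note that a Gauss kernel assigns a small amount of probability to negative speeds near v = 0, which a practical implementation may need to handle.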
3.2 Selection of bandwidth
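The bandwidth has a relatively large impact on the accuracy of KDE, so its selection is key to an accurate estimate of the wind speed distribution. The bandwidth can be chosen by measuring the estimation error between the underlying function f(v) and its estimate \(\hat{f}(v)\). One commonly used error measure is the mean integrated squared error (MISE), which is expressed as
\[{\text{MISE}}(\hat{f}) = E\int {[\hat{f}(v) - f(v)]^{2} {\text{d}}v} = \frac{R(K)}{nh} + \frac{1}{4}h^{4} \mu_{2} (K)^{2} R(f^{\prime\prime}) + o\left( {\frac{1}{nh} + h^{4} } \right) \tag{4}\]
where
\[R(g) = \int {g(v)^{2} {\text{d}}v} , \quad \mu_{2} (K) = \int {u^{2} K(u){\text{d}}u}\]
Omitting the higher-order infinitesimal in Eq. (4), an asymptotic MISE (AMISE) is obtained:
\[{\text{AMISE}}(\hat{f}) = \frac{R(K)}{nh} + \frac{1}{4}h^{4} \mu_{2} (K)^{2} R(f^{\prime\prime}) \tag{5}\]
The optimal bandwidth is obtained by differentiating the AMISE with respect to h and setting the derivative to zero:
\[h_{\text{AMISE}} = \left[ {\frac{R(K)}{{\mu_{2} (K)^{2} R(f^{\prime\prime})n}}} \right]^{1/5} \tag{6}\]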
It can be seen from Eq. (6) that the expression involves the second derivative f″(v) of the unknown function f(v), so f″(v) must be estimated before the smoothing parameter can be calculated. Silverman [35] proposes an empirical method for selecting the smoothing parameter that takes the normal density as the reference distribution for the unknown PDF f(v). However, this can lead to oversmoothing when the underlying distribution is asymmetric or multimodal. In these cases, more sophisticated selection methods such as direct plug-in (DPI) and cross-validation (CV) should be adopted. The DPI technique is conceptually simple, has theoretical justification and performs well empirically. Its key step is finding an estimate of the density functional R(f″(v)) in Eq. (6). Details of this method can be found in Ref. [36].
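The normal-reference idea can be made concrete with a short Python sketch. It assumes the standard AMISE-optimal form h = [R(K)/(μ _{2}(K)² R(f″) n)]^{1/5} and a Gauss kernel, for which R(K) = 1/(2√π) and μ _{2}(K) = 1; if f is taken to be normal with standard deviation σ, then R(f″) = 3/(8√π σ⁵) and the bandwidth reduces to (4/(3n))^{1/5} σ ≈ 1.06 σ n^{−1/5}. The function names and sample values are hypothetical, and this simple rule is not the DPI selector used in the study.

```python
import math
import statistics

def amise_bandwidth(r_k, mu2_k, r_f2, n):
    """General AMISE-optimal bandwidth [R(K) / (mu2(K)^2 R(f'') n)]^(1/5)."""
    return (r_k / (mu2_k ** 2 * r_f2 * n)) ** 0.2

def normal_reference_bandwidth(samples):
    """Closed form for a Gauss kernel when f is assumed to be normal."""
    n = len(samples)
    sigma = statistics.stdev(samples)  # sample standard deviation
    return (4.0 / (3.0 * n)) ** 0.2 * sigma

# Hypothetical wind speeds in m/s, for illustration only
speeds = [2.1, 2.6, 3.4, 3.9, 4.2, 5.0, 6.8, 7.5]
sigma = statistics.stdev(speeds)

# Gauss kernel constants and the normal-reference value of R(f'')
r_k = 1.0 / (2.0 * math.sqrt(math.pi))
r_f2 = 3.0 / (8.0 * math.sqrt(math.pi) * sigma ** 5)

h_general = amise_bandwidth(r_k, 1.0, r_f2, len(speeds))
h_rule = normal_reference_bandwidth(speeds)
print(h_general, h_rule)
```

Both routes give the same bandwidth, which shows that the rule of thumb is simply the AMISE-optimal formula with a normal density substituted for the unknown f. A plug-in selector such as DPI replaces that substitution with a data-based estimate of R(f″).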
4 Wind speed sampling
An appropriate random simulation of wind speed is required to assess the reliability of a power system containing wind energy. The KDF can be used to simulate the wind speed. However, the expression of the KDF is complicated, which makes direct sampling difficult. This paper therefore uses an acceptance-rejection sampling technique to simulate wind speed [37]. The basic idea is to generate wind speed data from a proposal PDF g(v;α) instead of sampling directly from the target distribution f(v). For the technique to be applicable, f(v) should satisfy \(f(v) \leqslant Cg(v;\alpha)\), where C is a bias factor with \(C \geqslant 1\).
The sampling process is carried out in the following steps:
Step 1 Simulate a random number u from the uniform distribution U(0,1);
Step 2 Simulate a random value v from the proposal PDF g(v;α);
Step 3 Check whether u < f(v)/(C × g(v;α)). If this holds, accept v as a sample from f(v); if not, reject v and repeat the sampling steps.
The sampling efficiency of the proposed technique is inversely proportional to the bias factor C, so C should be as small as possible. An unconstrained nonlinear optimization method is used to select C and α, and the objective function is modeled as:
The simulated wind speed data generated through the above procedure follow the model in Eq. (2).
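The acceptance-rejection steps described in this section can be sketched in Python as follows. To keep the example self-checking, the target and proposal are toy choices rather than the KDF and its proposal distribution: the target is f(v) = 2v on [0, 1], the proposal g is the uniform density on [0, 1], and C = 2 so that f(v) ≤ C g(v) everywhere. The function name `rejection_sample` and all parameter names are hypothetical.

```python
import random

def rejection_sample(target_pdf, proposal_pdf, proposal_draw, c, n, rng):
    """Draw n samples from target_pdf, assuming target_pdf(v) <= c * proposal_pdf(v)."""
    samples = []
    while len(samples) < n:
        u = rng.random()        # Step 1: u ~ U(0, 1)
        v = proposal_draw(rng)  # Step 2: v drawn from the proposal g
        # Step 3: accept v with probability f(v) / (C * g(v))
        if u < target_pdf(v) / (c * proposal_pdf(v)):
            samples.append(v)
    return samples

# Toy target f(v) = 2v on [0, 1], uniform proposal, bias factor C = 2
rng = random.Random(0)
draws = rejection_sample(
    target_pdf=lambda v: 2.0 * v,
    proposal_pdf=lambda v: 1.0,
    proposal_draw=lambda r: r.random(),
    c=2.0,
    n=20000,
    rng=rng,
)
mean = sum(draws) / len(draws)
print(round(mean, 3))  # the exact mean of the target is 2/3
```

On average one candidate in C is accepted, which is why a smaller bias factor gives a more efficient sampler. For wind speed, `target_pdf` would be the fitted KDF and the proposal would need positive density wherever the KDF is positive.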
5 Case studies for verification of KDE in modeling wind speed distribution
5.1 Information of wind sites and wind data
The hourly wind speed data at six wind sites in the United States (designated A, B, C) and New Zealand (designated D, E, F) over five years (from Jan. 1, 2007 to Dec. 31, 2011) were used to illustrate the accuracy and flexibility of the proposed model. The wind speeds at the six sites were scaled up to a hub height of 80 m. The geographical information and basic statistics of the six sites are shown in Table 2.
5.2 Accuracy judgment criteria
To show how well a theoretical probability function matches the observed data, the statistic \(R_{a}^{2}\) [17] is used as the judgment criterion. The higher \(R_{a}^{2}\) is, the better the fit. The statistic \(R_{a}^{2}\) is given by
where
N is the total number of intervals; s is the number of parameters in the model; p _{ i } and \(\hat{p}_{i}\) are the probability obtained with the sample data and probability model at the i ^{th} interval; \(\bar{p}\) is the mean of p _{ i }.
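A goodness-of-fit statistic of this type can be computed as in the Python sketch below. It assumes one common textbook form of the adjusted coefficient of determination, R² = 1 − Σ(p _{ i } − p̂ _{ i })² / Σ(p _{ i } − p̄)² followed by the correction 1 − (1 − R²)(N − 1)/(N − s − 1); the degrees-of-freedom term used in Ref. [17] may differ. The function name and the interval probabilities are hypothetical.

```python
def adjusted_r2(p_obs, p_model, s):
    """Adjusted R^2 between observed and modelled interval probabilities.

    Assumes the textbook correction (N - 1) / (N - s - 1), where N is the
    number of intervals and s is the number of model parameters.
    """
    n = len(p_obs)
    p_bar = sum(p_obs) / n
    ss_res = sum((p - q) ** 2 for p, q in zip(p_obs, p_model))
    ss_tot = sum((p - p_bar) ** 2 for p in p_obs)
    r2 = 1.0 - ss_res / ss_tot
    return 1.0 - (1.0 - r2) * (n - 1) / (n - s - 1)

# Hypothetical probabilities over five wind speed intervals
p_obs = [0.05, 0.20, 0.35, 0.25, 0.15]
p_fit = [0.06, 0.19, 0.33, 0.27, 0.15]

perfect = adjusted_r2(p_obs, p_obs, s=1)
rough = adjusted_r2(p_obs, p_fit, s=1)
print(perfect, round(rough, 4))
```

A model that reproduces the observed probabilities exactly scores 1, and the penalty for a given misfit grows with the number of parameters s, so a flexible model is not rewarded merely for having more parameters.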
5.3 Wind speed modeling
The Gauss kernel is applied as the weighting function in this study, and the bandwidths are calculated using the DPI technique. Figure 1 illustrates the PDF plots of wind speed for the six sites using the histogram estimation, KDE, the Weibull model, the Normal model and the Rayleigh model.
These figures demonstrate that the wind speed at different sites can have widely varying distribution shapes. The KDE method agrees well with the probability distribution characteristics at every site, whereas the Weibull, Normal and Rayleigh models show a high goodness of fit only at certain sites, which indicates the flexibility of the KDE method in modeling wind speed distributions.
The values of the statistic \(R_{a}^{2}\) obtained by KDE and by the Weibull, Normal and Rayleigh models are shown in Table 3. KDE presents the best fit to the sample data at all six sites, which again verifies its flexibility and accuracy. Among the three parametric models, the Weibull model provides the better fit for sites A, B, C, D and E, while the Normal model does so for site F. This indicates that it is unreasonable to use a single fixed parametric function to model the wind speed distribution.
5.4 KDF investigation
The influence of the kernel function on KDE is examined in this part. The four kernel functions introduced above are each used as the weight function in KDE for every site. The values of the statistic \(R_{a}^{2}\) obtained by KDE with the different kernel functions at the six sites are shown in Table 4.
It can be seen from Table 4 that the values of \(R_{a}^{2}\) obtained by KDE with different kernel functions are similar, which indicates that the kernel function has only a minor impact on KDE. Among the four kernel functions, the Gauss kernel always outperforms the others. Thus, the Gauss kernel can be used as the weight function in KDE for wind speed.
6 Application of KDE in generating system reliability evaluation
6.1 Wind energy conversion model
Wind energy is converted into power by wind turbine generators (WTGs). The power output characteristics of a WTG are quite different from those of conventional generating units: there is a nonlinear relationship between the power output and the wind speed. The relation can be described using the operational parameters of the WTG. The commonly used parameters are the cut-in wind speed, rated wind speed, and cut-out wind speed. The power output [38] can be obtained from the simulated wind speed v using Eq. (10):
\[P(v) = \begin{cases} 0, & 0 \leqslant v < v_{ci} \\ P_{r} (A + Bv + Cv^{2} ), & v_{ci} \leqslant v < v_{r} \\ P_{r} , & v_{r} \leqslant v \leqslant v_{co} \\ 0, & v > v_{co} \end{cases} \tag{10}\]
where P _{ r }, v _{ ci }, v _{ r }, and v _{ co } are the rated power output, the cutin wind speed, rated wind speed, and cutout wind speed of the WTG, respectively. The constants A, B, and C are determined by v _{ ci }, v _{ r }, and v _{ co } as expressed in Ref. [23].
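The piecewise power curve can be sketched in Python as below. The expressions for A, B and C are the commonly cited ones associated with Giorsetto and Utsurogi [38]; they are written out here as an assumption, since the text defers to a reference for them, and in this form they depend only on v _{ ci } and v _{ r }. The function names and the 2 MW rating are hypothetical; the wind speed parameters are the ones used in the case study (3.0, 14.5 and 20.0 m/s).

```python
def power_curve_constants(v_ci, v_r):
    """Constants A, B, C of the quadratic section (assumed standard form)."""
    k = ((v_ci + v_r) / (2.0 * v_r)) ** 3
    d = (v_ci - v_r) ** 2
    a = (v_ci * (v_ci + v_r) - 4.0 * v_ci * v_r * k) / d
    b = (4.0 * (v_ci + v_r) * k - (3.0 * v_ci + v_r)) / d
    c = (2.0 - 4.0 * k) / d
    return a, b, c

def wtg_power(v, p_r, v_ci, v_r, v_co):
    """WTG output for wind speed v, following the piecewise power curve."""
    if v < v_ci or v > v_co:
        return 0.0  # below cut-in or above cut-out: no output
    if v < v_r:
        a, b, c = power_curve_constants(v_ci, v_r)
        return p_r * (a + b * v + c * v * v)  # nonlinear section
    return p_r  # between rated and cut-out speed: rated output

# Hypothetical 2 MW unit with the wind speed parameters of the case study
p_r, v_ci, v_r, v_co = 2.0, 3.0, 14.5, 20.0
for v in (2.0, 3.0, 9.0, 14.5, 25.0):
    print(v, round(wtg_power(v, p_r, v_ci, v_r, v_co), 3))
```

With these constants the quadratic section is continuous with the rest of the curve: it evaluates to zero at the cut-in speed and to P _{ r } at the rated speed.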
6.2 Reliability assessment of power systems incorporating wind power
A wind farm with a total capacity of 400 MW is assumed to be located at the six sites and added to the IEEE-RTS. The v _{ ci }, v _{ r }, and v _{ co } of each WTG in the wind farms are 3.0, 14.5 and 20.0 m/s, respectively. Studies in Ref. [9] show that changes in the forced outage rate (FOR) of the WTG do not have a significant impact on the calculated system reliability indices. It is, therefore, assumed in this paper that the wind farm consists of identical WTGs with zero forced outage rates.
The reliability analysis of the system is conducted using the state-sampling simulation method combined with the measured wind data, KDE, the Weibull model, the Normal model and the Rayleigh model, respectively. The Weibull PDF is selected as the proposal distribution for the KDF in the acceptance-rejection sampling technique. Tables 5 and 6 show the reliability evaluation results of the different models.
It can be seen from Tables 5 and 6 that the reliability indices calculated by the proposed model are closer to those calculated from the measured data. The average absolute errors of the proposed model for the two indices are 0.22% and 0.39%, respectively, whereas those of the Weibull, Normal and Rayleigh models are 3.24%, 6.25% and 5.06% for LOLE, and 3.50%, 7.19% and 5.30% for LOEE, respectively. The investigation indicates that the proposed model is more accurate than the parametric models and can be directly applied in the reliability evaluation of power systems containing wind energy.
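The state-sampling idea behind these indices can be illustrated with a heavily simplified Python sketch. It is not the study's simulation: the load is held constant, the system has only two hypothetical 100 MW units with a FOR of 0.1 each, and the wind farm output is supplied by a user-defined function. All names and numbers are assumptions made for illustration.

```python
import random

def estimate_lolp(unit_caps, unit_fors, load, wind_power_draw, n_trials, rng):
    """Fraction of sampled system states in which capacity falls short of load."""
    shortfalls = 0
    for _ in range(n_trials):
        # Each conventional unit is available with probability 1 - FOR
        conventional = sum(
            cap for cap, q in zip(unit_caps, unit_fors) if rng.random() >= q
        )
        wind = wind_power_draw(rng)  # sampled wind farm output in MW
        if conventional + wind < load:
            shortfalls += 1
    return shortfalls / n_trials

rng = random.Random(1)
caps, fors, load = [100.0, 100.0], [0.1, 0.1], 150.0

# Without wind: load is lost whenever at least one unit is out (1 - 0.9^2 = 0.19)
lolp_no_wind = estimate_lolp(caps, fors, load, lambda r: 0.0, 20000, rng)
# With a constant 50 MW of wind: load is lost only if both units are out (0.01)
lolp_wind = estimate_lolp(caps, fors, load, lambda r: 50.0, 20000, rng)
print(lolp_no_wind, lolp_wind)
```

In a fuller study, `wind_power_draw` would sample a wind speed from the chosen distribution model and convert it to power with the WTG power curve, the load would follow a load model, and the loss-of-load probability would be scaled by the length of the study period to give an expectation index such as LOLE.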
7 Conclusion
This paper presents a wind speed distribution model based on KDE. The model is data-driven, makes no assumptions about the form of the wind speed distribution and is flexible enough to fit any wind regime. Five years of measured wind speed data from six sites were used to examine the proposed model and the Weibull, Normal and Rayleigh models. The statistic \(R_{a}^{2}\) is used as an index to measure how well the models fit the actual wind speed data.
The accuracy and applicability of the proposed method are verified using the wind data at the six sites. The results indicate that KDE can describe the wind speed distributions with high accuracy and excellent robustness. The fitting statistics \(R_{a}^{2}\) for the six sites are all above 0.99. The average absolute errors of the loss of load expectation (LOLE) and loss of energy expectation (LOEE) are 0.22% and 0.39%, respectively. The influence of the kernel function on KDE is also investigated. The result shows that the Gauss kernel always performs better than the others for wind speed data.
Although KDE is a good NPDE method for estimating the wind speed distribution, it also has several limitations. For example, its computational complexity makes the computation tedious, so the traditional KDE method can only deal with small-scale, low-dimensional data sets. Therefore, our next research direction is to solve or mitigate these problems and to build a multidimensional wind speed model based on KDE.
References
Celik AN (2004) A statistical analysis of wind power density based on the Weibull and Rayleigh models at the southern region of Turkey. Renew Energy 29(4):593–604
Ucar A, Balo F (2009) Evaluation of wind energy potential and electricity generation at six locations in Turkey. Appl Energy 86(10):1864–1872
Bekele G, Palm B (2009) Wind energy potential assessment at four typical locations in Ethiopia. Appl Energy 86(3):388–396
Chang TP (2011) Performance comparison of six numerical methods in estimating Weibull parameters for wind energy application. Appl Energy 88(1):272–282
Garcia A, Torres JL, Prieto E, De Francisco A (1998) Fitting wind speed distributions: a case study. Sol Energy 62(2):139–144
Ucar A, Balo F (2009) Evaluation of wind energy potential and electricity generation at six locations in Turkey. Appl Energy 86(10):1864–1872
Balouktsis A, Chassapis D, Karapantsios TD (2002) A nomogram method for estimating the energy produced by wind turbine generators. Sol Energy 72(3):251–259
Seguro JV, Lambert TW (2000) Modern estimation of the parameters of the Weibull wind speed distribution for wind energy analysis. J Wind Eng Ind Aerodyn 85(1):75–84
Wang P, Gao Z, Bertling L (2012) Operational adequacy studies of power systems with wind farms and energy storages. IEEE Trans Power Syst 27(4):2377–2384
Billinton R, Karki R, Gao Y, Huang D, Hu P, Wangdee W (2012) Adequacy assessment considerations in wind integrated power systems. IEEE Trans Power Syst 27(4):2297–2305
Wu CX, Chung CY, Wen FS, Du DY (2014) Reliability/cost evaluation with PEV and wind generation system. IEEE Trans Sustain Energy 5(1):273–281
Gang L, Jinfu C, Defu C, Dongyuan S, Xianzhong D (2013) Probabilistic assessment of available transfer capability considering spatial correlation in wind power integrated system. IET Proc Gener Transm Distrib 71(2):1527–1535
Zhang Y, Chowdhury AA, Koval DO (2011) Probabilistic wind energy modeling in electric generation system reliability assessment. IEEE Trans Ind Appl 47(3):1507–1514
Dobakhshari AS, Fotuhi-Firuzabad M (2009) A reliability model of large wind farms for power system adequacy studies. IEEE Trans Energy Convers 24(3):792–801
Billinton R, Huang D (2010) Wind power modeling and the determination of capacity credit in an electric power system. Proc I Mech Eng Part O J Risk Reliab 224(1):1–9
Wang P, Gao Z, Bertling L (2012) Operational adequacy studies of power systems with wind farms and energy storages. IEEE Trans Power Syst 27(4):2377–2384
Jaramillo OA, Borja MA (2004) Wind speed analysis in La Ventosa, Mexico: a bimodal probability distribution case. Renew Energy 29(10):1613–1630
Carta JA, Mentado D (2007) A continuous bivariate model for wind power density and wind turbine energy output estimations. Energy Convers Manag 48(2):420–432
Carta JA, Ramirez P (2007) Analysis of two-component mixture Weibull statistics for estimation of wind speed distributions. Renew Energy 32(3):518–531
Carta JA, Ramirez P, Velázquez S (2008) Influence of the level of fit of a density probability function to wind-speed data on the WECS mean power output estimation. Energy Convers Manag 49(10):2647–2655
Carta JA, Ramirez P, Velázquez S (2009) A review of wind speed probability distributions used in wind energy analysis: case studies in the Canary Islands. Renew Sustain Energy Rev 13(5):933–955
Chang TP (2011) Estimation of wind energy potential using different probability density functions. Appl Energy 88(5):1848–1856
Jangamshetti SH, Rau VG (1999) Site matching of wind turbine generators: a case study. IEEE Trans Energy Convers 14(4):1537–1543
Xie K, Billinton R (2011) Energy and reliability benefits of wind energy conversion systems. Renew Energy 36(7):1983–1988
Mulugetta Y, Drake F (1996) Assessment of solar and wind energy resources in Ethiopia—II: wind energy. Sol Energy 57(4):323–334
Jangamshetti SH, Rau VG (2001) Optimum siting of wind turbine generators. IEEE Trans Energy Convers 16(1):8–13
Meng Z, Xue F, Li X (2013) Wind speed equalization-based incoming wind classification by aggregating DFIGs. J Mod Power Syst Clean Energy 1(1):42–48. doi:10.1007/s40565-013-0007-1
Ding Y, Cheng L, Zhang Y et al (2014) Operational reliability evaluation of restructured power systems with wind power penetration utilizing reliability network equivalent and time-sequential simulation approaches. J Mod Power Syst Clean Energy 2(4):329–340. doi:10.1007/s40565-014-0077-8
Shi X (2010) Selection of bandwidth type and adjustment side in kernel density estimation over inhomogeneous backgrounds. Int J Geogr Inf Sci 24(5):643–660
Markus S, Michael RL, Robert AA (2010) Density estimation of plankton size spectra: a reanalysis of IronEx II data. J Plankton Res 32(8):1167–1184
Reliability test system task force of the application of probability methods subcommittee (1979) A reliability test system. IEEE Trans Power Appar Syst 98(6):2047–2054
Rosenblatt M (1956) Remarks on some nonparametric estimates of a density function. Ann Math Stat 27(3):832–837
Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076
Prakasa Rao BLS (1983) Nonparametric function estimation. Academic Press, New York
Silverman BW (1986) Density estimation for statistics and data analysis. Chapman and Hall, London
Wand MP, Jones MC (1995) Kernel smoothing. Chapman and Hall, London
Dubi A (2000) Monte Carlo applications in system engineering. Wiley, New York
Giorsetto P, Utsurogi KF (1983) Development of a new procedure for reliability modeling of wind turbine generators. IEEE Trans Power Appar Syst PAS-102(1):134–143
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China (No. 51307185), Natural Science Foundation Project of CQ CSTC (No. cstc2012jjA90004) and the Fundamental Research Funds for the Central Universities (No. CDJPY12150002).
Additional information
CrossCheck date: 5 May 2015
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
HU, B., LI, Y., YANG, H. et al. Wind speed model based on kernel density estimation and its application in reliability assessment of generating systems. J. Mod. Power Syst. Clean Energy 5, 220–227 (2017). https://doi.org/10.1007/s40565-015-0172-5
DOI: https://doi.org/10.1007/s40565-015-0172-5