Gaussian mixture model for extreme  wind turbulence estimation

Zhang, Xiaodong; Natarajan, Anand

doi:https://doi.org/10.5194/wes-7-2135-2022

Articles | Volume 7, issue 5

https://doi.org/10.5194/wes-7-2135-2022

Articles | Volume 7, issue 5

Research article

26 Oct 2022

Research article |

| 26 Oct 2022

Gaussian mixture model for extreme wind turbulence estimation

Xiaodong Zhang and Anand Natarajan

Abstract

Uncertainty quantification is necessary in wind turbine design due to the random nature of the environmental inputs, through which the uncertainty of structural loads and response under specific situations can be quantified. Specifically, wind turbulence (described by the standard deviation of the longitudinal wind speed over a 10 min time duration) has a significant impact on the extreme and fatigue design envelope of the wind turbine. The wind parameters (mean and standard deviation of longitudinal wind speed over 10 min time duration) are not independent stochastic variables, and structural reliability analysis or uncertainty quantification therefore requires these wind parameters to be correlated stochastic parameters. An accurate probabilistic model should be established to model the correlation among wind parameters. Compared to univariate distributions, theoretical multivariate distributions are limited and not flexible enough to model the wind parameters from different sites or direction sectors. Copula-based models are often used for correlation description, but existing parametric copulas may not model the correlation among wind parameters well, due to limitations of the copula structures. The Gaussian mixture model is widely applied for density estimation and clustering in many domains, but limited studies have been conducted in wind energy and few have used it for density estimation of wind parameters. In this paper, the Gaussian mixture model is used to model the joint distribution of mean and standard deviation of longitudinal wind speed over 10 min time duration, which is calculated from 15 years of wind measurement time series data. As a comparison, the Nataf transformation (Gaussian copula) and Gumbel copula are compared with the Gaussian mixture model in terms of the estimated marginal distributions and conditional distributions. The Gaussian mixture model is then adopted to estimate the extreme wind turbulence (wind parameters for extreme load), which could be taken as an input to design loads used in the ultimate design limit state of turbine structures. The wind parameter contour associated with a 50-year return period computed from the Gaussian mixture model is compared with what is used in the design of wind turbines as given in IEC 61400-1. The Gaussian mixture model is able to model the joint distribution of wind parameters well, where the estimated tail distributions of both the marginal distributions and conditional distribution have good accuracy, and it is a good candidate for extreme turbulence estimation.

Download & links

Article (PDF, 7323 KB)

Download & links

Received: 25 Nov 2021 – Discussion started: 04 Jan 2022 – Revised: 27 Jul 2022 – Accepted: 09 Oct 2022 – Published: 26 Oct 2022

1 Introduction

Wind turbulence is characterized by the turbulence kinetic energy, its dissipation rate, and the length scale. This is modeled using three-dimensional anisotropic spectra that capture the auto-correlation and cross-correlation of the spatio-temporal wind speed variation, such as through the Mann model (Mann, 1994). Such models assume the wind turbulence is a Gaussian process, whereby several frequencies of wind velocity variations may occur, resulting in different wind velocities distributed as a function of time and space. Usually, the wind turbulence for wind turbine design is specified over a 10 min time window and the stochastic process is assumed to be stationary. The occurrence of extreme turbulence can then be categorized based on its return period. In wind turbine design, the wind turbulence with a 50-year return period is used in ultimate limit state analysis (IEC, 2019).

Many uncertainties exist in the evaluation of the design loads of wind turbine components. The IEC 61400-1 standard lists several load cases of the relevance of ultimate limit state analysis, wherein the load cases under normal operation usually require a partial safety factor (PSF) of 1.35 applied to the characteristic loads. Such PSFs are determined by quantifying the uncertainties in the load evaluation and the underlying distributions of the relevant inputs. An important load case towards determining ultimate design loads on wind turbine structures is the design load case (DLC) 1.3, in which the turbine is under normal operation under 50-year extreme wind turbulence. While relationships to evaluate the extreme turbulence level are provided in IEC 61400-1, there has been much debate on its accuracy and quantification, with edition-3 of IEC 61400-1 specifying a lognormal distribution for turbulence and edition-4 specifying it as a Weibull distribution. Several studies (Dimitrov et al., 2017; Abdallah et al., 2016) have proposed different models for extreme wind turbulence based on site measurements, and a large uncertainty can be seen in determining the long-term behavior of wind turbulence. Mathematically, an issue with the modeling of wind turbulence has been that the IEC 61400-1 standard and the literature are mainly focused on the probability distribution of the standard deviation of the wind speed (σ_u) conditional on the mean of the longitudinal wind speed over a 10 min time duration (u), whereas it is required that the joint distribution of σ_u and u is properly modeled.

A joint distribution model could be used for modeling multivariate random variables and generating random samples. Theoretical bivariate distributions are limited and not flexible enough. Monahan (2018) modeled the joint probability distribution of wind speeds at different locations using bivariate Rice distribution and bivariate Weibull distribution. The joint distribution of random variables could also be described by the univariate marginal distribution functions and a copula. A copula is a multivariate cumulative distribution function, where the marginal distribution follows a uniform distribution on the interval [0, 1]. Copulas are used for modeling the dependency among the random variables. Several families of copulas have been proposed in the literature, e.g., Gaussian copula (Nataf transformation (Xiao, 2014)) and Archimedean copulas (Bouyé et al., 2011). Using marginal distributions and copula to model the multivariate distributions is feasible, but the marginal distributions should be flexible enough to represent the wind inflow under varying environmental conditions, and the tail of the fitted distribution should be well representative of the actual inflow behavior. The copula structures should also be flexible enough to model different correlation structures. It is not clear which copula model (Abdallah, 2015) to choose to determine the joint distribution given marginal distributions.

To model extreme turbulence well, both the main body and the tail of the joint probability distribution of σ_u and u should be accurately represented. The Gaussian mixture model (GMM) is broadly used for clustering tasks (Zhang et al., 2021). GMM is a flexible model which can also perform density estimation on multivariate data with different marginal distributions and correlation structures. It is widely applied to different fields of study, e.g., speech and audio processing (Reynolds and Rose, 1995), image classification (Permuter et al., 2003), density estimation of microarray data in bioinformatics (Steinhoff et al., 2003), cancer classification (Prabakaran et al., 2019), and finance (Miyazaki et al., 2014). GMM is less commonly applied in wind energy compared to other domains, although Chang et al. (2017) used a GMM-based neural network for short-term wind power forecast, Cui et al. (2018) used GMM for fitting the probability distribution of wind power ramping features, Zhang et al. (2019) used GMM for wind turbine power dispatching, Li et al. (2020) used GMM for electrical loads forecast, and Srbinovski et al. (2021) used GMM for modeling the site-specific wind turbine power curves. GMM has been rarely adopted for wind parameter modeling, although Wahbah et al. (2018) used univariate GMM for wind speed probability density estimation, where the joint distribution of wind speed with other parameters was not investigated. Scarce published literature uses GMM for density estimation of wind inflow parameters and GMM has not been used for modeling the joint distribution of u and σ_u.

In this paper, GMM is used for modeling the joint distribution of the wind parameters u and σ_u. GMM is firstly used for density estimation of a random sample from a theoretical bivariate t distribution. It is then used for modeling the wind parameters from both offshore and onshore sectors. GMM is benchmarked to the measurement data by comparing the marginal distributions and the conditional distributions. The wind parameter contour with a 50-year return period is also computed from a GMM model with IFORM analysis (Winterstein et al., 1993). For the wind parameters from the offshore sector, Gaussian copula (Nataf transformation) and Gumbel copula are also compared.

2 Gaussian mixture model

GMM (McLachlan et al., 2019) is a mixture of several weighted Gaussian distributions and has been used for cluster analysis (Janouek et al., 2015) and density estimation (Steinhoff et al., 2003). GMM could be used for hard clustering and soft clustering of data. For hard clustering, each observation is assigned to the component returning the highest posterior probability, where each observation is assigned to exactly one cluster. Soft clustering, as opposed to hard clustering, assigns each observation to more than one cluster and each observation is assigned a responsibility (relative density). In terms of density estimation, the GMM is useful for multivariate distribution representations with multiple modes, but this does not prevent it from also being used for single-mode distributions. GMM is a linear combination of multivariate Gaussian distribution components, where each component is defined by its mean and covariance. Even though a weighted sum of Gaussian random variables is a Gaussian random variable, a weighted Gaussian distribution is not necessarily Gaussian. When there are more than two components for GMM, it is multi-modal and the distribution is not Gaussian. The probability distribution function (pdf) of a d-dimensional multivariate Gaussian is

\begin{matrix} (1) & N (x | μ, Σ) = \frac{1}{\sqrt{| Σ | (2 π)^{d}}} \exp (- \frac{1}{2} (x - μ) Σ^{- 1} {(x - μ)}^{T}), \end{matrix}

where μ is the 1-by-d mean vector and Σ is the d-by-d covariance matrix. The pdf of GMM is

\begin{matrix} (2) & p (x) = \sum_{j = 1}^{k} π_{j} N (x | μ_{j}, Σ_{j}), \end{matrix}

where k is the number of components, which is a hyper-parameter, and π_j is the component coefficient (weight) and follows

\begin{matrix} (3) & \sum_{j = 1}^{k} π_{j} = 1 0 \leq π_{j} \leq 1 . \end{matrix}

Some information criteria are proposed in the literature (Akaike, 1998; Schwarz, 1978) to determine k, where k is selected as a balance of overfitting and underfitting. Nevertheless, when the sample size is too large, the criteria are not effective and further research is required. To use GMM for density estimation and also for random sample generation, the model parameters ${π_{j}, μ_{j}, Σ_{j}$ , j=1, 2, …, k} should be estimated from the data sample {x_n, n=1, 2, …, N}, where N is the sample size. The initial model parameters are calculated from the clusters evaluated by the k-means clustering algorithm (Arthur and Vassilvitskii, 2006), and optimized by the expectation-maximization (EM) algorithm (McLachlan et al., 2019) as follows:

Assign the N observations to the k clusters using the k-means clustering algorithm. Compute μ_j, Σ_j, and π_j from the observations within each cluster.

The k-means clustering assigns N observations to k clusters, which are defined by the centroids. Each data point x_n with the closest centroid is assigned to the corresponding cluster. The centroids are recalculated and the data points are reassigned until the clusters do not change or the maximum iteration number is met. This is a hard clustering, and within each component, the μ_j and Σ_j are calculated, and the π_j is calculated as the number of data points in the current cluster divided by N.
Expectation-maximization (EM) algorithm: the model parameters ${π_{j}, μ_{j}, Σ_{j}, j = 1$ , 2, …, k} are found by an iterative EM algorithm (Dempster et al., 1977) to have a maximum likelihood estimation.
- a.
  E step: evaluate the responsibilities using the current model parameters. The responsibility γ_j(x_n) is the probability that component j takes for explaining the observation x_n, which is calculated as
  $\begin{matrix} (4) & γ_{j} (x_{n}) = \frac{π_{j} N (x_{n} | μ_{j}, Σ_{j})}{\sum_{i = 1}^{k} π_{i} N (x_{n} | μ_{i}, Σ_{i})} . \end{matrix}$
- b.
  M step: update the model parameters using the responsibilities from E step. The mean for component j is calculated as
  $\begin{matrix} (5) & μ_{j} = \frac{\sum_{n = 1}^{N} γ_{j} (x_{n}) x_{n}}{\sum_{n = 1}^{N} γ_{j} (x_{n})} . \end{matrix}$
  The covariance for component j is calculated as
  $\begin{matrix} (6) & Σ_{j} = \frac{\sum_{n = 1}^{N} γ_{j} (x_{n}) (x_{n} - μ_{j}) {(x_{n} - μ_{j})}^{T}}{\sum_{n = 1}^{N} γ_{j} (x_{n})} \end{matrix}$
  and the j component coefficient is calculated as
  $\begin{matrix} (7) & π_{j} = \frac{1}{N} \sum_{n = 1}^{N} γ_{j} (x_{n}) . \end{matrix}$
Repeat step 2 until the model parameters converge or the maximum number of iterations is met.

3 Results

GMM is proposed to model the joint distribution of u and σ_u, where the estimation error is small at both the main body pdf and the tail distribution. To verify the use of GMM, it is firstly used to recover the multivariate t distribution from a t distribution random sample. The flexibility of GMM (especially for modeling non-Gaussian joint distribution) and the demonstration of the procedure of using GMM for density estimation is detailed. To sample from the fitted joint distribution is very important, as many reliability analysis and uncertainty quantification applications require random samples as inputs. The random samples from GMM are compared with the random sample from the t distribution and wind parameters. To compute the number of components k, the value is increased from 1 until the estimated density function converges.

Table 1Initial GMM parameters.

Download Print Version | Download XLSX

Table 2Final GMM parameters.

Download Print Version | Download XLSX

Using copulas to develop non-Gaussian joint distributions of u and σ_u is initially attempted. A joint probability distribution of u and σ_u is then modeled by GMM. For estimating the extreme turbulence (wind parameter contour with 50-year return period), the accuracy of the tail distribution is important. The probability of exceedance of σ_u conditional on u from GMM is thus compared with the measurement data. To further examine the flexibility of GMM, the wind measurement data from both the offshore and onshore sectors are investigated and the 50-year wind parameter contours are compared.

3.1 Multivariate t distribution

The pdf of the d-dimensional multivariate Student's t distribution is

\begin{matrix} (8) & f (x, Σ, v) = \frac{1}{| Σ |^{1 / 2}} \frac{1}{\sqrt{(v π)^{d}}} \frac{Γ ((v + d) / 2)}{Γ (v / 2)} (1 + \frac{x^{'} Σ^{- 1} x}{v}), \end{matrix}

where Σ is a correlation matrix with a correlation coefficient 0.6 and v=5 is the degrees of freedom. The multivariate Student's t distribution generalizes the univariate Student's t distribution, and its marginal distributions all have univariate Student's t distribution. The marginal distributions of multivariate Student's t distribution have fatter tails than the normal distribution. A random sample with size 10⁵ is generated from the bivariate t distribution, and GMM is used to fit the bivariate t distribution.

The estimated density function converges when the number of components k=4 and, therefore, the k-means clustering algorithm is used to cluster the data points into k=4 components. The mean, covariance, and the component coefficient (sample size at each component divided by the total sample size) calculated from each component are taken as initial parameters for GMM, which are shown in Table 1. The four clusters are plotted in Fig. 1, where the means are plotted in circles.

https://wes.copernicus.org/articles/7/2135/2022/wes-7-2135-2022-f01

Figure 1The k-means clustering of the t distribution sample.

Gaussian mixture model for extreme wind turbulence estimation

3.1 Multivariate t distribution

3.2 Wind measurements

3.3 GMM-based estimation of wind parameters for the offshore sector

3.4 GMM-based estimation of wind parameters for the onshore sectors