Parameterization of wind evolution using lidar

Chen, Yiyin; Schlipf, David; Cheng, Po Wen

doi:https://doi.org/10.5194/wes-6-61-2021

Articles | Volume 6, issue 1

https://doi.org/10.5194/wes-6-61-2021

Articles | Volume 6, issue 1

Research article

12 Jan 2021

Research article |

| 12 Jan 2021

Parameterization of wind evolution using lidar

Yiyin Chen, David Schlipf, and Po Wen Cheng

Abstract

Wind evolution, i.e., the evolution of turbulence structures over time, has become an increasingly interesting topic in recent years, mainly due to the development of lidar-assisted wind turbine control, which requires accurate prediction of wind evolution to avoid unnecessary or even harmful control actions. Moreover, 4D stochastic wind field simulations can be made possible by integrating wind evolution into standard 3D simulations to provide a more realistic simulation environment for this control concept. Motivated by these factors, this research aims to investigate the potential of Gaussian process regression in the parameterization of wind evolution. Wind evolution is commonly quantified using magnitude-squared coherence of wind speed and is estimated with lidar data measured by two nacelle-mounted lidars in this research. A two-parameter wind evolution model modified from a previous study is used to model the estimated coherence. A statistical analysis is done for the wind evolution model parameters determined from the estimated coherence to provide some insights into the characteristics of wind evolution. Gaussian process regression models are trained with the wind evolution model parameters and different combinations of wind-field-related variables acquired from the lidars and a meteorological mast. The automatic relevance determination squared exponential kernel function is applied to select suitable variables for the models. The performance of the Gaussian process regression models is analyzed with respect to different variable combinations, and the selected variables are discussed to shed light on the correlation between wind evolution and these variables.

Download & links

Article (PDF, 7224 KB)

Download & links

Received: 11 Feb 2020 – Discussion started: 17 Mar 2020 – Revised: 05 Oct 2020 – Accepted: 28 Oct 2020 – Published: 12 Jan 2021

1 Introduction

Wind evolution refers to the physical phenomenon of turbulence structures (eddies) changing over time and is defined, in this study, as magnitude-squared coherence dependent on evolution time. Magnitude-squared coherence (hereafter referred to as coherence) is a common statistical measure of turbulence structure properties (see, e.g., Panofsky and McCormick, 1954; Davenport, 1961; Panofsky et al., 1974). In general, coherence describes the correlation between spectral components of two signals or data sets, taking values between zero, for no correlation, to unity, for perfect correlation. Because turbulent eddies are advected by the mean flow while evolving, the longitudinal coherence, i.e., coherence of turbulent velocity at locations separated in the mean direction of the flow, is used to measure wind evolution in practice (see, e.g., Schlipf et al., 2015; Simley and Pao, 2015). And when estimating the coherence, the data measured at the downstream location should be shifted by the travel time, corresponding to the evolution time, to match the data measured at the upstream location. Taylor's 1938 hypothesis is a special case that assumes all turbulent motions remain unchanged, while eddies move with the mean flow. In other words, it assumes no wind evolution, which means the coherence is unity for all frequencies. The validity of Taylor's 1938 hypothesis was researched in some studies (see, e.g., Willis and Deardorff, 1976; Schlipf et al., 2011), and this hypothesis is widely used in data analysis and wind field modeling for the sake of simplification (see, e.g., Kelberlau and Mann, 2019; Veers, 1988).

The research on wind evolution dates back to the 1970s. Pielke and Panofsky (1970) attempted to generalize some of the mathematical descriptions for horizontal variation of turbulence characteristics. The final goal at that time was to figure out an empirical model of the 4D (space–time) structure of turbulence. In Pielke and Panofsky's 1970 work, the coherence model suggested by Davenport (1961) to describe the correlation between horizontal wind components at different heights, also known as Davenport's geometric similarity, was extended into other wind components and separation directions. Pielke and Panofsky’s 1970 model also followed Davenport's idea to approximate the coherence with a simple exponential function using a single decay parameter. The decay parameters were assumed to be constants. After that, Ropelewski et al. (1973) systematically studied the coherence for streamwise and cross-stream wind components with horizontal separations. Based on their theoretical discussion, the decay parameter for longitudinal separation is supposed to be a function of turbulence intensity, which is a function of roughness length and the Richardson number (a measure of atmospheric stability) (Lumley and Panofsky, 1964). Extending the study, Panofsky and Mizuno (1975) found that the relationships between coherence and other parameters were rather complicated. A model for the decay parameter was proposed based on its empirical properties. This decay parameter model involves turbulence intensity accounting for the influence of terrain roughness, standard deviation of the lateral wind component, lateral integral length scale of the longitudinal wind component (which shows a relationship with the Richardson number), separation of two observations, and angle between the wind direction and the measurement line. This model can be regarded as the first parameterization of Pielke and Panofsky's 1970 model. However, the model was developed using only very few observations taken on meteorological towers, and the dependence of coherence on separation and atmospheric stability was not thoroughly researched in that study.

It is worth mentioning that the longitudinal coherence differs from the lateral and vertical coherence because the former is coupled with time-dependent variations in turbulence, while the latter measures the decay of correlation due to spatial separations in their respective directions. However, in the above-mentioned studies the longitudinal coherence was not clearly distinguished. Kristensen (1979) proposed that the longitudinal coherence should behave differently and deduced an alternative expression, for which we refer to Kristensen's 1979 model. This model assumes that the coherence can be modeled with the probability that an eddy observed at the first point can also be observed at the second point, given that the eddy has not completely faded out during the travel time and the eddy has been taken towards the second point.

Wind evolution has become interesting again because of the new concept of lidar-assisted wind turbine control (see, e.g., Schlipf, 2015; Simley, 2015; Simley et al., 2018). Lidar – more specifically, Doppler wind lidar – is a remote sensing technology which can be used to measure wind speed in a certain spatial range (Weitkamp, 2005). The main idea of lidar-assisted wind turbine control is to enable a feedforward control of wind turbines by using a nacelle-mounted lidar to measure the approaching wind field at some distance upwind. The control system should react only to the changes in the wind field which can be predicted accurately to avoid harmful and unnecessary control actions. This is made possible by applying an adaptive filter to remove the uncorrelated part of the lidar signal. An accurate prediction of the wind evolution will thus benefit the filter design. Moreover, the application of Taylor's 1938 hypothesis in the wind field simulation is no longer appropriate for modeling the lidar-assisted control system. To solve this problem, different approaches (see, e.g., Bossanyi, 2013; Laks et al., 2013) have been proposed to integrate the wind evolution model within the wind field simulation method of Veers (1988) to make it possible to simulate a 4D wind field.

Some attempts were made to further promote the modeling of wind evolution. Schlipf et al. (2015) suggested an approach to determine the decay parameter in Pielke and Panofsky's 1970 model with data measured by a nacelle-mounted lidar, taking into account the influence of lidar measurement on coherence. However, the limitation of this study is that only four 1 h data blocks were examined. Simley and Pao (2015) attempted to validate the models of Pielke and Panofsky 1970 and Kristensen 1979 with data from large-eddy simulation (LES) wind fields but found that neither model can always correctly model the coherence as frequency approaches zero. To improve this issue, Simley and Pao (2015) tried to apply the coherence model for transverse and vertical separations suggested by Thresher et al. (1981) to the longitudinal coherence. This model has a form similar to Pielke and Panofsky's 1970 model but includes an additional parameter to allow coherence less than unity at a very low frequency. Davoust and von Terzi (2016) examined Simley and Pao's 2015 model with data from nacelle-mounted lidars on three sites. To enable a direct comparison with Simley and Pao's 2015 work, a correction method was applied to compensate the influence of lidar measurement on coherence. However, the linear dependence of the decay parameter on turbulence intensity suggested by Simley and Pao (2015) was not clearly observed. The relationship between the offset parameter and integral length scale shows a good match with that suggested in Simley and Pao's 2015 work, but the agreement decreases after the correction of coherence. At the same time, de Maré and Mann (2016) developed a 4D model to describe the space–time structure of turbulence by combining the Mann 1994 spectral velocity tensor and Kristensen's 1979 longitudinal-coherence model.

Motivated by the above-mentioned research, this study aims to achieve parameterization models for a wind evolution model modified from Simley and Pao's 2015 model. In addition, it is desired to gain some insights into the complex relationships between wind evolution and wind-field-related variables such as wind statistics, atmospheric stability, and relative positions of measurement points. For these purposes, a previous study (Chen, 2019) was done to explore different supervised machine learning algorithms on a simple level, including stepwise linear regression (see, e.g., Hocking, 1976), regression tree (see, e.g., Breiman et al., 1984), support vector regression (see, e.g., Vapnik, 1995), and Gaussian process regression (see, e.g., Rasmussen and Williams, 2006). It was found that Gaussian process regression, overall, performs the best for prediction of wind evolution model parameters, and thus its potential is further analyzed in this study with more extensive data.

This research is mainly done using lidar measurement because lidar can provide large amounts of spatially separated measuring points simultaneously, which is of great advantage for studying the dependence of wind evolution on separation in comparison to data from a meteorological tower. Lidar data from two measurement campaigns undertaken in different terrain types are available. In one of the measurement campaigns, data taken on a meteorological tower are also involved in the analysis to provide a comparison.

The present paper is organized as follows: Sect. 2 briefly explains the theoretical basis of wind evolution and its prediction concept as well as the principles of the methods applied in this work; Sect. 3 introduces the measurement campaigns and the data processing; Sect. 4 presents the results of the statistical analysis of the wind evolution model parameters; Sect. 5 illustrates the process of model training and the evaluation of the parameterization models; and Sect. 6 summarizes the results and gives the conclusions and an outlook.

2 Methodology

This section first explains the mathematical expression of wind evolution in Sect. 2.1. Then, our concept of wind evolution prediction and a corresponding workflow are presented in Sect. 2.2. After that, the wind evolution model applied in this work is introduced in Sect. 2.3. Finally, the details of the workflow are introduced and discussed in Sect. 2.4–2.7.

2.1 Wind evolution

As mentioned in the Introduction, wind evolution is mathematically defined as the magnitude-squared coherence between two wind speed signals i and j measured at two points separated in the longitudinal direction, with i for the signal measured at the upstream point and j at the downstream point:

\begin{matrix} (1) & γ_{i j}^{2} (f) = \frac{| S_{i j} (f) |^{2}}{S_{i i} (f) S_{j j} (f)}, \end{matrix}

where S_ii(f) and S_jj(f) represent the power spectral densities (PSDs) of signals i and j, respectively, and S_ij(f) represents the cross-spectral density between i and j. It must be emphasized that the coherence corresponds to a lagged correlation, which means the signal j should be shifted by the travel time Δt after which the signal i is expected to arrive at the downstream point for calculation of the coherence.

2.2 Concept and workflow

A supervised learning algorithm aims to find the mapping function from predictors (i.e., input variables) to a target (i.e output variable) through known data about the predictors and the target without relying on a predefined equation as a model. The key to using supervised learning is to identify suitable predictors and targets, which is in fact a process of abstracting and condensing information.

In this study, we aim to develop a predictive model for wind evolution of the longitudinal wind component. It is worth noting the different meanings of wind evolution and wind evolution model. Wind evolution, i.e., the coherence estimated from measured data in practice, is not predictable because the estimated coherence consists of approximately infinite data points. Therefore, a model with a limited number of parameters is needed to approximate the estimated coherence; this is a wind evolution model. From the perspective of machine learning, using a wind evolution model is essentially condensing the information in the estimated coherence into several model parameters which are predictable. These model parameters are targets of predictive models, and thus the predictive model is deemed a parameterization model in this study.

Wind-field-related variables such as wind statistics, atmospheric stability, and relative positions of measurement points are considered as potential predictors, based on the theoretical and experimental studies mentioned in the Introduction. A discussion about the potential predictors is provided in Sect. 2.5. Further analysis needs to be done to determine which of the potential predictors should be selected for model training, i.e., feature selection. The principle of feature selection is to figure out which variables provide the best predictive power (accounting for most of the variation in the target values), and, ideally, these variables should be independent of each other to prevent overfitting in model training. To investigate the necessary predictors under different data availability, different combinations of predictors are discussed in Sect. 5.

Figure 1 illustrates our concept and workflow of wind evolution prediction. For model training, the essential steps are the determination of observed values of predictors and targets from measured data and training parameterization models using a machine learning algorithm, more specifically: (1) to estimate the coherence using lidar data; (2) to determine the observed target values, i.e., the wind evolution model parameters, by fitting the estimated coherence to a wind evolution model; (3) to calculate observed predictor values from measured data (mainly lidar data; sonic data could be used if available); and (4) to train parameterization models using a machine learning algorithm. The prediction process goes in the opposite direction: firstly, the wind evolution model parameters are predicted by the trained parameterization models using new predictor values calculated from new measured data, and then, the predicted coherence is reconstructed by the wind evolution model using the predicted model parameters.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f01

Figure 1Concept and workflow of wind evolution prediction. The workflow of model training is the following. (1) Estimation of coherence using lidar data. (2) Determination of wind evolution model parameters by fitting the estimated coherence to a wind evolution model. (3) Calculation of potential predictors from measured data (mainly lidar data; sonic data could be involved if available). (4) Training of parameterization models using a machine learning algorithm.

Download

To demonstrate our concept and workflow: Sect. 2.3 explains the wind evolution model used in this study; Sect. 2.4 discusses special issues regarding coherence estimation using lidar data; Sect. 2.5 discusses the potential predictors of the parameterization models; Sect. 2.6 and Sect. 2.7 briefly introduce the principle of Gaussian process regression (the machine learning algorithm applied in this study) and the method of model validation, respectively; Sect. 3.2 shows the fitting process of the estimated coherence in detail; and Sect. 5 demonstrates the training of parameterization models, predictor selection, and model validation in the respective subsections.

2.3 Wind evolution model

Following the theoretical considerations by Ropelewski et al. (1973), the coherence decreases exponentially with increasing evolution time Δt of the signal with respect to “eddy turnover time” τ

\begin{matrix} (2) & γ_{model}^{2} (f) = \exp (- C \cdot \frac{Δ t}{τ}) . \end{matrix}

The term C represents the decay behavior of the coherence depending on the time ratio. C could be a constant, a linear function, or a more complicated term. τ is a timescale associated with the characteristic eddy size λ and characteristic velocity of turbulence, which is approximated by the standard deviation of wind speed σ as follows:

\begin{matrix} (3) & τ \sim \frac{λ}{σ} . \end{matrix}

This expression implies that eddies are supposed to decay faster under strong turbulence. Given the same degree of turbulence, large eddies are supposed to take a longer time to decay. The eddy size λ is linked to the frequency of horizontal-wind-velocity fluctuations f and the flow mean wind speed U with the relation

\begin{matrix} (4) & λ \sim \frac{U}{f} . \end{matrix}

Combining Eqs. (2)–(4), the coherence model becomes

\begin{matrix} (5) & γ_{model}^{2} (f) = \exp (- C \cdot \frac{σ}{U} \cdot f \cdot Δ t) . \end{matrix}

This equation is essentially the same as the model proposed by Pielke and Panofsky (1970), except that, in their model, Δt is approximated by d∕U (d is separation) (Taylor, 1938; Willis and Deardorff, 1976), indicated as Δt_T.

Simley and Pao (2015) noted a limitation of this one-parameter model form: the intercept (coherence for 0 frequency) of the modeled coherence is forced to be unity, which is not always realistic. To overcome this issue, Simley and Pao (2015) introduced a second parameter in the coherence model, taking a model form similar to the coherence model for transverse and vertical separations suggested by Thresher et al. (1981):

\begin{matrix} (6) & γ_{model}^{2} (f, d) = \exp (- a^{'} \sqrt{{(\frac{f d}{U})}^{2} + (b^{'} d)^{2}}), \end{matrix}

where a^′ and b^′ are tuning parameters. A comparison between the fitting quality of a one-parameter model and a two-parameter model is given in Sect. 3.2 to confirm the necessity of using a two-parameter wind evolution model.

We have made two modifications to Simley and Pao's 2015 model. Firstly, d∕U is restored to the travel time Δt to avoid coupling the approximation of $Δ t = d / U$ in the wind evolution model, considering the effect of the wind turbine’s induction zone. In fitting the estimated coherence to the wind evolution model, Δt is determined by the time lag of the peak of the cross-correlation between two wind speed signals, indicated as Δt_M. Secondly, $a^{'} b^{'} d$ is replaced with b. The reasons for that are the following. (1) With the original form $a^{'} b^{'} d$ , $a^{'} b^{'}$ is essentially the fitted term (given that d is known) in the curve fitting. Thus, b^′ shows a strong dependence on a^′, which is generally undesirable for machine learning algorithms. (2) The form $a^{'} b^{'} d$ implies that this term is proportional to d, but we found that d is still an important predictor for b^′, indicating that the assumption of a linear relationship might be not proper. Therefore, we decided to directly use b to represent the intercept and take d as a predictor instead (see Sect. 2.5).

The modified wind evolution model is

\begin{matrix} (7) & γ_{model}^{2} (f) = \exp (- \sqrt{a^{2} \cdot (f \cdot Δ t)^{2} + b^{2}}), \end{matrix}

where the decay parameter a represents the decay effect of coherence and the offset parameter b is used to adjust the intercept (coherence for 0 frequency) of the modeled coherence curve. The intercept equals exp( $- | b |$ ). Both parameters are dimensionless. The term f⋅Δt is dimensionless and thus is defined as dimensionless frequency f_dless. In the end, our wind evolution model is defined as

\begin{matrix} (8) & γ_{model}^{2} (f_{dless}) = \exp (- \sqrt{a^{2} \cdot f_{dless}^{2} + b^{2}}) . \end{matrix}

In some studies (see, e.g., Schlipf et al., 2015), the wind evolution model is defined as a function of wavenumber k, with $k = 2 π f / U$ . The relationship between k and f_dless is $k = 2 π f_{dless} / d$ , applying the approximation of $Δ t = d / U$ . To give an intuitive impression of the wind evolution model, Fig. 2 shows the theoretical curves calculated with different values of a and b as examples.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f02

Figure 2Impact of the model parameters a and b on the wind evolution model. (a) b=0. (b) a=3.

Download

2.4 Estimating coherence using lidar data

In this work, the coherence is estimated with lidar data because lidar can provide more data with respect to different spatial separations. This is not easy to obtain when using meteorological towers because multiple towers would be needed, and only when the wind direction is aligned with the tower locations would the data be usable. Further, the prediction of the coherence is mainly expected to be applied when coupled with the deployment of a lidar, e.g., in lidar-assisted wind turbine control.

A Doppler wind lidar is a remote sensing device that measures wind speed based on the optical Doppler effect. Lidar emits laser pulses and detects the Doppler shift in backscattered light from aerosol particles in the atmosphere that are entrained with the wind. The Doppler shift is proportional to the line-of-sight wind speed, i.e., the wind speed projected onto the laser beam and thus can be used to estimate the line-of-sight wind speed. The measurement principle of Doppler wind lidar is explained in many publications (e.g., Weitkamp, 2005; Peña et al., 2013; Liu et al., 2019) and thus is not introduced here in detail.

However, it must be emphasized that the coherence estimated with lidar data deviates from that estimated with data taken from ultrasonic anemometers. The reasons for that are the following. (1) The sampling rate of lidars is generally much lower than that of ultrasonic anemometers, and thus lidars cannot measure high-frequency fluctuations in wind speed. (2) The measuring volume of lidars is generally much longer than that of ultrasonic anemometers because of its measurement principle, and thus for lidars, the spatial-averaging effect within the measuring volume needs to be considered. (3) Lidars can only measure the wind speed projected onto the emitted laser beams, i.e., the line-of-sight wind speed. The influence of these three aspects is discussed in the following, specifically considering lidar in the staring mode.

Low sampling rate of lidar. According to the Nyquist–Shannon sampling theorem (Shannon, 1949), the upper frequency limit of a signal transformed from the time domain into the frequency domain is half of the sampling frequency. As long as the lidar sampling rate is sufficiently high to acquire a complete coherence curve covering the range from the highest coherence (e.g., 0.9–1.0) to the lowest coherence (e.g., 0–0.1), it would probably not have a large impact on studying the coherence. To obtain as high a sampling rate as possible, it is decided to select staring-mode data to calculate the coherence. Use of the staring mode generally means that the lidar measures the wind speed with a single laser beam pointing in a fixed direction. Specifically in this work, the laser beam points horizontally upstream of the wind turbine.

Spatial-averaging effect of lidar. Consider a pulsed lidar (only pulsed lidars are involved in this work). The spatial-averaging effect can be modeled with a moving average weighted by a Gaussian-like shape function (see, e.g., Carious, 2013) or a triangular function (see, e.g., Sathe and Mann, 2012) centered at a measurement point. Following Carious (2013), the weighting function w(x) is an even function centered at every measurement point along the laser beam. The lidar-measured wind speed at the measurement point x₀ for any instant can be modeled with

\begin{matrix} (9) & u_{l} (x_{0}) = \int_{- \infty}^{\infty} w (x_{0} - x) u_{p} (x) d x = (w * u_{p}) (x_{0}), \end{matrix}

where u_p(x) is a wind speed function of spatial points on the x axis aligned with the lidar's laser beam. According to the convolution theorem (Oppenheim et al., 1997), the following relationship is valid for the Fourier transformation between space and the wavenumber domain

\begin{matrix} (10) & F {u_{l}} = F {w * u_{p}} = F {w} \cdot F {u_{p}}, \end{matrix}

where ℱ{ } is the Fourier transform operator.

Following Eq. (1), the coherence estimated with lidar data, indicated with the subscript “l'’, is

\begin{matrix} (11) & γ_{i j, l}^{2} (f) = \frac{| S_{i j, l} (f) |^{2}}{S_{i i, l} (f) \cdot S_{j j, l} (f)}, \end{matrix}

where S_ii,l(f) and S_jj,l(f) are the auto-spectrum at the point i and j, respectively; S_ij,l(f) is the cross-spectrum between i and j; and f is the frequency in Hz. They are all estimated from lidar data. The auto-spectrum is

\begin{matrix} (12) & S_{i i, l} (f) = F {u_{i, l} (t)} \cdot F^{*} {u_{i, l} (t)}, \end{matrix}

where u_i,l(t) is the time series of the wind speed at i, and the symbol ^* means conjugate. And the cross-spectrum is

\begin{matrix} (13) & S_{i j, l} (f) = F {u_{i, l} (t)} \cdot F^{*} {u_{j, l} (t)} . \end{matrix}

Assume that the laser beam is aligned with the wind direction and Taylor's 1938 hypothesis applies within the measurement volume and that Eq. (10) is also valid for the Fourier transformation between the time and frequency domains. Taylor's 1938 hypothesis is considered valid within the measurement volume because, in principle, wind evolution depends on the evolution time of turbulence (see Eq. 2), and the measurement volume corresponds to a temporal length on the order of magnitude of 10⁻⁷ s (typical length of a laser pulse). Now, Eq. (11) can be written as (with t and f omitted for clarity)

\begin{matrix} (14) & \begin{aligned} γ_{i j, l}^{2} & = \frac{| F {u_{i, l}} \cdot F^{*} {u_{j, l}} |^{2}}{F {u_{i, l}} \cdot F^{*} {u_{i, l}} \cdot F {u_{j, l}} \cdot F^{*} {u_{j, l}}} \\ = \frac{| F {w} \cdot F {u_{i, p}} \cdot F^{*} {w} \cdot F^{*} {u_{j, p}} |^{2}}{F {w} \cdot F {u_{i, p}} \cdot F^{*} {w} \cdot F^{*} {u_{i, p}} \cdot F {w} \cdot F {u_{j, p}} \cdot F^{*} {w} \cdot F^{*} {u_{j, p}}} . \end{aligned} \end{matrix}

Because the function w(x) is real and even, according to the conjugate symmetry of the Fourier transformation (Oppenheim et al., 1997), $F {w} = F^{*} {w}$ and ℱ{w} is real and even as well. As a result, all instances of ℱ{w} in the denominator and the numerator are canceled out. And thus Eq. (14) becomes:

\begin{matrix} (15) & γ_{i j, l}^{2} = \frac{| F {u_{i, p}} \cdot F^{*} {u_{j, p}} |^{2}}{F {u_{i, p}} \cdot F^{*} {u_{i, p}} \cdot F {u_{j, p}} \cdot F^{*} {u_{j, p}}} = γ_{i j, p}^{2} . \end{matrix}

This means that the spatial-averaging effect does not influence the coherence under the above-mentioned ideal assumptions.

Misalignment of wind direction and lidar measurement. The above derivation is based on an important assumption that the laser beam is aligned with the wind direction. This will not always be fulfilled in reality, even for a nacelle-mounted lidar operating in the staring mode. Figure 3 shows a misalignment between wind direction and lidar measurement direction, at an angle α. The coherence of the line-of-sight wind speed is $γ_{12}^{2}$ , which is no longer the longitudinal coherence but the horizontal coherence as defined by Panofsky and Mizuno (1975). $γ_{13}^{2}$ and $γ_{23}^{2}$ are the longitudinal and lateral coherence, respectively.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f03

Figure 3Misalignment of wind direction and lidar measurement. α is the misalignment angle. $γ_{12}^{2}$ is the coherence of the line-of-sight wind speed. $γ_{13}^{2}$ and $γ_{23}^{2}$ are the longitudinal and lateral coherence, respectively.

Download

Schlipf et al. (2015) suggested a model for the horizontal coherence (magnitude coherence) based on the assumption of point measurement for simplification

\begin{matrix} (16) & γ_{i j, losP} = \frac{\cos^{2} (α) γ_{i j, ux} γ_{i j, uy} S_{i i, u}}{\cos^{2} (α) S_{i i, u} + \sin^{2} (α) S_{i i, v}}, \end{matrix}

where γ_ij,losP is the horizontal coherence of line-of-sight wind speed point measurements, γ_ij,ux and γ_ij,uy are the longitudinal and lateral coherence of the longitudinal wind component, and S_ii,u and S_ii,v are the auto-spectra of the longitudinal and lateral wind components. Based on this equation, determining the longitudinal coherence γ_ij,ux is possible only given a specific turbulence model (knowing S_ii,u, S_ii,v, and γ_ij,uy) and knowing the misalignment angle α. Moreover, the above-discussed spatial-averaging effect must be coupled to the horizontal coherence, considering that the lateral coherence for the point at x depends on the lateral separation Δy associated with its distance from the center point of the range gate x₀, i.e., $Δ y = \cos (α) (| x - x_{0} |)$ . Therefore, the longitudinal coherence is implicitly included in the integration of horizontal coherence weighted by the range-weighting function of lidars.

In this study, we decide to develop a parameterization model based on horizontal coherence for the following reasons. Firstly, consider the case for a nacelle-mounted lidar. The misalignment of the lidar measurement means that the wind turbine is misaligned as well. In this case, it makes sense to predict the corresponding horizontal coherence. Secondly, a standalone parameterization model, independent of any turbulence model, is desired for more flexibility in application. Thirdly, determining the parameters in an implicit wind evolution model is complicated when using measured data. And it is necessary to acquire the misalignment angle α, which is not always possible in application, especially when lidar is the only data source, though deployment of lidars with multiple beams might help in this case. Moreover, the requirement for the accuracy of α is very high because α is included in the most basic step – fitting the estimated coherence to the wind evolution model. The uncertainties contained in α will propagate through the whole model and affect the further analysis radically. Since the prediction concept needs to be applicable under different data availabilities, it is not desired to make the fitting process depend so critically on a variable whose availability and accuracy are not always guaranteed. It is thus helpful to consider α as a predictor (see Sect. 2.5) to account for variations in the horizontal coherence caused by the direction misalignment. The benefit of doing so is to make α more standalone and to prevent its errors from affecting everything else, while reasonably taking its influences into account. In addition, Gaussian process regression inherently assumes imperfect training data (containing noisy terms; see Sect. 2.6), so it is better to keep uncertainties in predictors.

Certainly, if the direction misalignment is available and sufficiently accurate in a given application scenario, the prediction concept can be easily adjusted by changing the wind evolution model to which the estimated coherence is supposed to fit.

2.5 Potential predictors

In the literature reviewed in the Introduction, the variables considered relevant to wind evolution are as listed below:

Ropelewski et al. (1973): turbulence intensity (a function of roughness length and the Richardson number; Lumley and Panofsky, 1964);
Panofsky and Mizuno (1975): mean wind speed, turbulence intensity, standard deviation of the lateral wind component, lateral integral length scale of the longitudinal wind component, longitudinal separation, and the angle between the wind direction and the measurement line (if misalignment exists);
Kristensen (1979): turbulence intensity, longitudinal integral length scale of the longitudinal wind component, and longitudinal separation;
Simley and Pao (2015): turbulence intensity, longitudinal integral length scale of the longitudinal wind component, and longitudinal separation.

The above-mentioned variables can be categorized into three groups: wind statistics, atmospheric stability, and relative positions of measurement points. We follow this train of thought to discuss potential predictors of the parameterization models. It is worth mentioning, in advance, that not all of these predictors will be used in the final models. Useful features will be selected using the automatic relevance determination squared exponential kernel function (Duvenaud, 2014). The goal of this initial step is to collect all possible predictors, even though some of them will turn out to be redundant and can be converted to each other.

Wind statistics. Following prior research, turbulence intensity I_T is considered as a predictor. The turbulence intensity is defined as

\begin{matrix} (17) & I_{T} = \frac{σ}{U} . \end{matrix}

In addition, mean wind speed U and its standard deviation σ are also included because they are the fundamental variables of turbulence intensity. Apparently, I_T and σ are equivalent (given U), so only one of them will be selected according to the result of feature selection.

Moreover, integral length scale L is considered as a predictor and approximated with (Pope, 2000; Simley and Pao, 2015)

\begin{matrix} (18) & L = U \cdot \int_{0}^{\infty} ρ (s) d s = U \cdot T, \end{matrix}

where ρ(s) is the autocorrelation function. Indeed, integrating the autocorrelation gives the integral timescale T. The approximation of L is essentially based on assuming the turbulent eddies advected by the mean flow at U. Please note that this is not necessarily equivalent to assuming “frozen” turbulence. Turbulent eddies can evolve when preserving the same mean wind speed and statistical properties (including autocorrelation). The multiplication of U can be understood as translating the integration domain from time lag s to spatial separation by approximating the spatial separation with U⋅s. This approximation might contain uncertainties, but we have no alternatives for calculation of L from measured data. The integration of autocorrelation is computed up to the first zero-crossing location instead of infinity in practice (Simley and Pao, 2015). Considering the correlation between L and T shown in Eq. (18), T is also considered as a predictor, and thus L and T constitute another pair of redundant predictors from which only one will be selected.

Besides the variables already considered in prior studies, it is interesting to explore whether high-order wind statistics such as skewness and kurtosis of wind speed could play a role in wind evolution prediction. Skewness (i.e., the third standardized central moment) and kurtosis (i.e., the fourth standardized central moment) are measures of the asymmetry and flatness of the wind speed distribution, respectively. The sample skewness G₁, with bias correction, is defined as (Joanes and Gill, 1998)

\begin{matrix} (19) & G_{1} = \frac{\sqrt{n (n - 1)}}{n - 2} \cdot \frac{\frac{1}{n} \sum_{i = 1}^{n} u_{i}^{3}}{{(\frac{1}{n} \sum_{i = 1}^{n} u_{i}^{2})}^{3 / 2}}, \end{matrix}

and the sample kurtosis G₂ (not subtracting 3), with bias correction, is defined as (Joanes and Gill, 1998)

\begin{matrix} (20) & \begin{aligned} G_{2} & = \frac{n - 1}{(n - 2) (n - 3)} \\ \cdot [(n + 1) \cdot \frac{\frac{1}{n} \sum_{i = 1}^{n} u_{i}^{4}}{{(\frac{1}{n} \sum_{i = 1}^{n} u_{i}^{2})}^{2}} - 3 (n - 1)] + 3, \end{aligned} \end{matrix}

where u_i is wind speed fluctuations and n is the number of data points. According to Lenschow et al. (1994), statistical moments estimated using time series data with limited length show a systematic deviation from the true moments and also contain random errors. Both are decreasing functions of the averaging time. Compared to the sample standard deviation, the sample skewness and kurtosis would probably contain larger uncertainties. Nevertheless, we still want to test, on a simple level, whether these two high-order wind statistics could be useful for prediction.

Atmospheric stability. The atmospheric stability represents a global effect of the surface layer in the boundary layer on a wind field. It is believed to affect wind evolution, being an influence factor on turbulence stability (Ropelewski et al., 1973; Lumley and Panofsky, 1964). A dimensionless height ζ, built with Obukhov length L_MO (Obukhov, 1971), is considered as a predictor (Businger et al., 1971)

\begin{matrix} (21) & ζ = \frac{z}{L_{MO}} = - \frac{κ g \overline{w^{'} θ_{v}^{'}} z}{\overline{θ} u_{*}^{3}}, \end{matrix}

where κ is the von Kármán constant, g is gravitational acceleration, z is the measurement height, $\overline{θ}$ is the mean potential temperature, u_* is the friction velocity, and $\overline{w^{'} θ_{v}^{'}}$ is the covariance of vertical velocity perturbations and virtual potential temperature.

Relative positions of measurement points. Based on our modifications to Simley and Pao's 2015 model (see Sect. 2.3), measurement separation d has been removed from the wind evolution model and is now considered as a predictor. As discussed in Sect. 2.4, the misalignment angle α is not involved in fitting the wind evolution model but is considered as a predictor to account for the influence of the lateral coherence on the horizontal coherence. In fact, d is associated with two different effects. On the one hand, d corresponds to travel time or, rather, to evolution time Δt, which is believed to play an important role in wind evolution. On the other hand, d together with α account for the decay of the lateral coherence. The travel time determined with the maximum cross-correlation Δt_M is a more accurate variable. However, considering that calculating Δt_M might not always be feasible due to its computational complexity, the travel time approximated using Taylor's 1938 translation hypothesis Δt_T is included as well.

The notations of the above-mentioned potential predictors are summarized in Table 1. These variables are derived from both lidar data and data measured with ultrasonic anemometers (hereafter referred to as sonic data) according to their availability in each measurement campaign. The measurement instrument is indicated with a subscript: “l” for lidar and “s” for sonic (i.e., ultrasonic anemometer). For example, U_l represents the mean wind speed calculated from lidar data. Regarding sonic data, it is more reasonable for the analysis of wind evolution to use a wind coordinate system with the x axis aligned to the mean wind direction instead of the meteorological coordinate system. The mean wind direction is determined with the mean wind direction for each data block. The high-resolution longitudinal (indicated with the subscript “x”) and lateral (indicated with the subscript “y”) wind speeds are obtained by projecting the high-resolution wind components measured with ultrasonic anemometers on the wind coordinate system. Then, the above-mentioned variables are derived from the data based on the wind coordinate system. For example, U_x,s represents the mean wind speed calculated from the longitudinal wind component measured with ultrasonic anemometers.

Table 1Notations of potential predictors.

Download Print Version | Download XLSX

2.6 Gaussian process regression

This section briefly introduces the principle of the Gaussian process regression (GPR) and the hyperparameters that modify the behavior of a GPR model. The model training is done using the MATLAB Statistics and Machine Learning Toolbox¹.

The principle of GPR. Consider making a regression model from some data. A very intuitive approach is to fit certain functions, e.g., linear or polynominal. However, this requires an initial guess about the functional relationship(s) behind the data, which is very difficult in this case because the wind evolution model parameters do not indicate any clear dependence on the potential predictors. The reasons for that could be multiple: (1) the data could be noisy; and (2) the dependence could exist in multidimensional space not observable in a single dimension, etc. Under this circumstance, GPR turns out to be a good choice because it is non-parametric probabilistic model, which means the model is not a specific function, but a probability distribution over functions. The principle underlying GPR is Bayesian inference. The prior distribution over functions, which can be understood as a guess about what kinds of function could be present without knowing the data, is specified by a particular Gaussian process (GP) which favors smooth functions. In the training process, as adding the data, the probabilities associated with the functions which do not agree with the observations will be decreased, which gives the posterior distribution over the functions (Rasmussen and Williams, 2006).

Hyperparameters of GPR. The behavior of a GPR model is defined by its hyperparameters. To introduce the hyperparameters, a basic explanation is given following Rasmussen and Williams (2006). Please note that the complete deduction is not displayed here because it is beyond the scope of this paper. For further details, please refer to chap. 2 of Rasmussen and Williams' (Rasmussen and Williams, 2006) book.

GPR is based on Bayesian inference. First, consider a single observation. The Bayesian linear regression model with Gaussian noise is defined as

\begin{matrix} (22) & f (x) = ϕ (x)^{⊤} w, y = f (x) + ε, \end{matrix}

where x is an input vector containing D different predictors of a single observation, ϕ(x) is the function which maps the input vector onto a higher dimensional space where the Bayesian linear model is applicable, w is a vector of weights of the linear model, f(x) is the function value, y is the observed target value, and ε is independent identically distributed Gaussian noise with zero mean and variance $σ_{n}^{2}$

\begin{matrix} (23) & ε \sim N (0, σ_{n}^{2}) . \end{matrix}

The Bayesian linear model is a GP given that the prior distribution of w is normally distributed with zero mean. Since a GP is fully specified by its mean and covariance, the Bayesian linear model is written as

\begin{matrix} (24) & f (X) \sim G P (0, cov (f (X))), \end{matrix}

where X is the aggregation of all input vectors of n observations. This is the prior distribution over functions. The presence of ε shows another advantage of GPR, viz. that it is able to inherently assume noisy observations and take this effect into account in the model. σ_n is one of the hyperparameters.

It is common, but not necessary, to assume GPs with a zero mean function. The mean function can be modeled with a set of basis functions h(x) and a corresponding coefficient vector β. So, GPs with a non-zero mean function can be assumed as

\begin{matrix} (25) & g (x) = f (x) + h (x)^{⊤} β . \end{matrix}

The basis function is one of the hyperparameters. MATLAB provides four types of basis function: zero (assuming no basis function), constant, linear, and pure quadratic. The coefficient vector β can also be understood as the weight vector of h(x). But we have defined w as a weight vector in Eq. (22), we want to avoid using the same word here in case reader might confuse these two different processes. β is estimated from training data.

The covariance of the function values is not specified explicitly but estimated using a kernel function

\begin{matrix} (26) & cov (f (X)) = K (X, X), \end{matrix}

which is the so-called kernel trick. There are two types of kernel functions: one is kernel functions with the same characteristic length scale for all predictors; the other has separate characteristic length scales. The latter are called automatic relevance determination kernel functions and can be used to select predictors. The kernel function and its characteristic length scale(s) are hyperparameters of the GPR model.

In this work, automatic relevance determination squared exponential kernel function (ARD-SE kernel) (Duvenaud, 2014) is applied. The ARD-SE kernel function is basically a squared exponential kernel function (SE kernel) with a separate characteristic length scale σ_m for each predictor m (m is the index of predictors). For any pairs of observations i,j, the ARD-SE kernel function is defined as

\begin{matrix} (27) & K (x_{i}, x_{j}) = σ_{f}^{2} \exp [- \frac{1}{2} \sum_{m = 1}^{D} \frac{(x_{i m} - x_{j m})^{2}}{σ_{m}^{2}}], \end{matrix}

where $σ_{f}^{2}$ denotes the signal variance, which determines the variation of function values from their mean. In the context of machine learning, the characteristic length scale σ_m is not a “length” in the physical sense; it is a characteristic magnitude for the predictor m which implies the sensitivity of the function being modeled to the predictor m. A relatively large σ_m indicates a relatively small variation along the corresponding dimensions in the function, which means these predictors are less relevant than the others (Duvenaud, 2014).

In the end, the key predictive equation for GPR can be derived by conditioning the joint Gaussian prior distribution on the observations, and it is normally distributed.

\begin{matrix} (28) & f_{*} | X, y, X_{*} \sim N ({\overline{f}}_{*}, cov (f_{*})), \end{matrix}

where X_* denotes new input data used in the prediction. f_* represents f(X_*) for convenience, which is the predicted function value.

To summarize, the hyperparameters defining a GPR model are the basis function h(x), the noise standard deviation of the Gaussian process model σ_n, the kernel function K(x_i,x_j), the standard deviation of the function values σ_f, and the characteristic length scale in the kernel function σ_m. These hyperparameters can be tuned in the training process to achieve a better model.

2.7 Model validation

The trained model is evaluated with a k-fold cross-validation in which the data are divided into k disjoint, equally sized subsets. The model validation is done with one subset (also called in-fold observations), and the training is done with the remaining (k−1) subsets (also called out-of-fold observations). This procedure is repeated k times, each time with a different subset for validation. The predicted target values and the goodness-of-fit measures of the regression models are computed for in-fold observations using a model trained on out-of-fold observations.

Theoretically, k can be any integer between 2 and the number of observations (a special case called “leave-one-out” cross-validation). When k is very small, the sample size of training data ( $\frac{k - 1}{k}$ of the total observations) could be insufficiently large. However, considering that the training process must be repeated k times, it would take a very long time when k is very large. As a compromise between these two factors, k is commonly set to 5–10 in machine learning. In this study, 5-fold cross-validation is applied.

The model performance is evaluated with two goodness-of-fit measures: root mean square error (RMSE),

\begin{matrix} (29) & RMSE = \sqrt{\frac{1}{N} \sum_{i}^{N} (y_{i} - y_{pred, i})^{2}}, \end{matrix}

and the coefficient of determination (R²),

\begin{matrix} (30) & R^{2} = 1 - \frac{\sum_{i}^{N} (y_{i} - y_{pred, i})^{2}}{\sum_{i} (y_{i} - \overline{y})^{2}}, \end{matrix}

where y and y_pred denote the observed and predicted target values, respectively; $\overline{y}$ denotes the average of the observed target values; and N denotes the number of observations. It is worth mentioning that, according to this definition, R² can be understood as taking the prediction with the mean value of the observations as a reference by which to evaluate the model performance. In this case, R² ranges from −∞ to one, for perfect prediction. R² equals zero if the prediction is made simply with the mean value of the observations. The higher R² is, the better the model performs. A negative value of R² indicates that the selected model performs even worse than prediction using just the mean value of the observations.

3 Data processing

This section first introduces the data sources in Sect. 3.1 and then explains the procedure for the determination of the wind evolution model parameters in Sect. 3.2.

3.1 Data source

This study involves measured data from two research projects. The reasons for using two different data sources are, on the one hand, to find commonality between two different measurements and avoid accidental conclusions and, on the other hand, to study whether there are differences or what kind of differences in the wind evolution can be observed. The relevant research projects as well as the measurement campaigns are (briefly) as follows.

Lidar Complex. The research project Lidar Complex was funded by the German Federal Ministry for Economic Affairs and Energy (BMWi). In this project, a lidar measurement campaign was carried out in Grevesmühlen, Germany. The measurement site is basically flat, mainly farmland with hedges and a few large trees. More details about the measurement campaign can be found in Schlipf et al. (2015). The lidar deployed in this measurement campaign was the SWE (Stuttgart Wind Energy) Scanner 1.0, which was adapted from a WindCube V1 from Leosphere (Schlipf et al., 2015). This lidar has five measurement range gates focusing at distances of 54.5, 81.75, 109, 136.25, and 163.5 m, respectively. The full width at half maximum (FWHM) of the measurement range gates is 30 m (Carious, 2013). The lidar was installed on the nacelle of a wind turbine (rotor diameter of 109 m) at 95 m. In addition, a meteorological mast is located 295 m southwest of the wind turbine; data from an ultrasonic anemometer installed at 93 m on the meteorological mast are also involved in this study. SCADA (supervisory control and data acquisition) data of the wind turbine are also available. Recorded yaw positions are used to estimate the misalignment angle α, assuming that the mean wind direction at the turbine can be approximated with the mean wind direction measured on the meteorological mast.

ParkCast. The ParkCast² project is an ongoing project funded by the German Federal Ministry for Economic Affairs and Energy (BMWi). While this paper is in preparation, a lidar measurement campaign is being conducted on the offshore wind farm “alpha ventus”³. Two long-range lidars (StreamlineXR) have been deployed in the measurement campaign. The data used here are from the lidar installed on the nacelle of wind turbine AV4 (rotor diameter of 126 m) at 92 m, measuring the inflow. The measurement distances were set to 30–990 m with an increment of 60 m. The FWHM of the measurement range gates is 60 m. Unfortunately, neither data from the meteorological mast on FINO1⁴ nor SCADA data of AV4 for the observed period were available when the analysis was done. Therefore, the misalignment angle α is not available for ParkCast.

Compared to ultrasonic anemometers, lidar systems have much lower sampling rates. To obtain the highest possible sampling rate, we select the measurement periods where the staring mode was used, for both campaigns.

Essential information about the measurements is summarized in Table 2. Figure A1 gives an overview of the wind statistics of these two selected measurement periods by illustrating the relative-frequency distribution of lidar-measured wind speed and turbulence intensity. For brevity, “Lidar Complex” and “ParkCast” are used to refer to the selected measurements throughout the paper.

Table 2Summary of measurement setups.

^* After lidar data filtering, data pairing, and outlier filtering. For details see Sect. 3.2.

Download Print Version | Download XLSX

3.2 Determination of wind evolution model parameters

To obtain the wind evolution model parameters a and b, the wind evolution is estimated with lidar data and then fitted to the wind evolution model (Eq. 8). The processing procedure is described as follows.

Step 1: filtering of the lidar data. The lidar data from Lidar Complex are filtered according to the carrier-to-noise ratio (CNR) of the lidar signals (CNR filter). The valid range of the CNR filter is −24 to −5 dB, determined from the plot of CNR values and wind speed.

A CNR filter is not, however, suitable for lidar data from ParkCast because, for a long-range lidar, the backscattered signals from distant range gates could be very weak, and thus the CNR values could be low even when the measured wind speed is plausible. Würth et al. (2018) suggested an approach to filter the data based on the value range (range filter) and the standard deviation (standard-deviation filter) within a certain number of adjacent data points defined as a window, which can keep more valid data than a CNR filter. A range filter detects the maximum value difference within a window and filters the data points for which the maximum value difference exceeds a threshold. A standard-deviation filter calculates the standard deviation within a window and filters the data points for which the standard deviation exceeds a threshold. Both filters are applied to check the line-of-sight wind speed with thresholds of 6 m s⁻¹ and 3 m s⁻¹, respectively. The window size is set to three data points.

Step 2: estimation of coherence. The lidar data are divided into 30 min blocks. This is consistent with the commonly used period for calculating the Obukhov length. Only the data blocks with more than 80 % valid data points are used to estimate the coherence. The missing values are estimated by shape-preserving piecewise cubic interpolation (Fritsch and Carlson, 1980). The missing end values are each replaced with their nearest value. Data measured at different range gates (i.e., measurement distances) are paired in the way shown in Fig. 4 to obtain as many samples (i.e., data blocks) as possible. The pairing has $(\begin{matrix} N \\ 2 \end{matrix})$ possibilities (N is the number of the lidar range gates). The travel time of the wind field is approximated with the time lag at the maximum of the cross-correlation Δt_M between these two wind speed signals. The upstream point is always regarded as the reference point. The data measured at the downstream point are shifted by Δt_M to match the reference wind speed data. The magnitude-squared coherence is estimated using Welch’s overlapped averaged periodogram method using a Hamming window, 24 segments, and 50 % overlap. The data of the reference point are used to calculate lidar-measured wind statistics.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f04

Figure 4Pairing of different range gates for estimating coherence for Lidar Complex, given as an example.

Download

Step 3: fitting to the wind evolution model. Before fitting the model, we must consider two issues that might introduce noise into the coherence estimate. Firstly, because both lidars are installed on the nacelle of a wind turbine which is actually in motion; the focus points of the laser beams are moving as well. This motion causes excitation at certain frequencies in the estimated coherence. Figure A2 shows a comparison between an example coherence curve and the power spectral density (PSD) of the fore-aft and in-plane tower top acceleration of Lidar Complex. The excitation in the coherence conforms to that in both PSDs and occurs mainly at frequencies above 0.2 Hz. To avoid negative effects on the fitting quality caused by this excitation, the cutoff frequency is hence set at 0.2 Hz, and the coherence is fitted only up to this cutoff frequency.

Secondly, according to Schlipf (2015), critical wavenumbers where the lidar signals would be only determined by noises must be checked. The critical wavenumbers are 2π∕W_L (W_L is the full width at half maximum of the range gate) and its harmonics. As mentioned in Sect. 2.3, the relationship between wavenumber k and dimensionless frequency f_dless is $f_{dless} = k d / 2 π$ . Thus, the smallest critical value of f_dless is d∕W_L. Considering Lidar Complex as an example, W_L = 30 m and d = 27.25 m for the smallest separation, which is the most critical case. $d / W_{L} \approx 0.91$ , which is already located in the filtered part (see the grey area in Fig. 5a).

The fitting is done by a nonlinear least-squares method using the Levenberg–Marquardt algorithm (Levenberg, 1944; Marquardt, 1963; Moré, 1978). Only the data blocks with R²>0.8 are considered as valid samples.

Step 4: outlier filtering. The final filtering was done by checking the value distribution of every relevant variable to omit outliers. It is emphasized that outliers are not necessarily false data. In some cases, the outlier is from a value range in which not enough samples were collected. It is very important to filter outliers properly because it is difficult for a regression model to capture the relationship for those value ranges with too few samples. Because the distributions of the variables all have a long right tail, the outliers are chosen as all data exceeding the 99th percentile of the data.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f05

Figure 5(a–d) Example plots of the estimated coherence between the lidar wind speeds measured at different range gates and the corresponding fitted curves. The separations between the corresponding range gates are 27.25, 54.5, 81.75, and 109 m, respectively. The shaded areas indicate the data filtered by the cutoff frequency of 0.2 Hz. (e) Time series of the lidar wind speed. The mean lidar wind speed U_l ranges from 7.3 to 7.7 m s⁻¹, and the lidar-measured turbulence intensity I_T,l ranges from 0.10 to 0.12, for different range gates. Date: 7 December 2013. Data source: Lidar Complex.

Download

Figure 5 is an example plot of the data block from 7 December 2013 at 12:00–12:30 from Lidar Complex. This data block is selected here for two reasons: data integrity and representative wind statistics. In this data block, the lidar-measured mean wind speed is 7.3–7.7 m s⁻¹, and the lidar-measured turbulence intensity is 0.10–0.12, for different range gates. These values appeared frequently in the selected period according to Fig. A1. Hence, this data block is regarded as a representative case-study example for Lidar Complex and is referred to throughout the paper. The figure illustrates the estimated coherence between different range gates and the corresponding fitted curves. The shaded areas show that the selected cutoff frequency of 0.2 Hz is reasonable for this case. A similar plot from ParkCast is found in Fig. A3. Because the sampling rate of ParkCast is lower, the excitation by the nacelle’s movement is not observed in the coherence, and thus no cutoff frequency was set for ParkCast data.

In Fig. 5c and d, the intercept of the coherence is much lower than 1, even though the separation is not very large. This confirms the necessity of choosing a wind evolution model which is able to define different offset values depending on the conditions. Indeed, compared with the fitting quality of Pielke and Panofsky’s model which contains merely a single parameter – the decay parameter a – the fitting quality of the wind evolution model (Eq. 8) is overall better (see Fig. A4). The value of R² for the fitting of Eq. (8) is almost always higher than for the fitting of Pielke and Panofsky’s 1970 model. The wind evolution model used in this work (Eq. 8) is thus proven able to model the coherence better.

4 Statistical analysis of wind evolution

This section presents a statistical analysis of wind evolution, including the distributions of the wind evolution model parameters (Sect. 4.1) and their dependence on measurement separation (Sect. 4.2).

4.1 Distribution of the wind evolution model parameters

To study the overall characteristics of wind evolution, the distributions of the wind evolution model parameters for both measurements are displayed in Fig. 6.

As listed in Table 2, there are two main differences between the lidar settings in both measurements, sampling rate and measurement range, which might affect the distributions of the wind evolution parameters. To enhance the comparability of both distributions, two special post-processings are executed correspondingly. Firstly, because the lidar sampling rate of Lidar Complex is approximately 3 times that of ParkCast, an artificial data set is made for Lidar Complex by averaging every three data points of the original lidar data to simulate measurement at a sampling rate similar to that of ParkCast so that the distributions of both measurements can be compared. The fitted probability density function (PDF) of the wind evolution model parameters determined with this data set is plotted as yellow dashed lines in Fig. 6a and b. The comparison between the fitted PDF of the original data and that of the data with a reduced sampling rate indicates that the lidar sampling rate only very slightly affects the wind evolution model parameters or, perhaps more accurately, the estimated coherence. Hence, the different sampling rates do not account for the differences between the cases observed in Fig. 6. Secondly, because of the limited measurement range of Lidar Complex, the maximum separation between two range gates reaches only 109 m, while that of ParkCast reaches more than 700 m. To make them comparable, Fig. 6c and d show only the wind evolution parameters calculated from the coherence with separation below 120 m of ParkCast.

Apart from that, the measurements were carried out in different environments (onshore and offshore) and at different times of the year (which impacts atmospheric stability) and have different wind speed and turbulence intensity distributions (see Fig. A1). Despite these differences, the distributions of the wind evolution model parameters do have some common characteristics. First of all, the value ranges of both wind evolution model parameters for both measurements are similar; a ranges mostly from 0 to 6, and b ranges from 0 to 0.5. Values out of these ranges are less likely to happen, according to the measurements. Second, the values of a and b are found to follow an inverse Gaussian distribution and a Gamma distribution, respectively. These two PDFs are determined by fitting the histograms to all the PDFs supported by the MATLAB Statistics and Machine Learning Toolbox and searching for the one with the maximum likelihood. This is done using a tool called “fitmethis”⁵.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f06

Figure 6Distribution of wind evolution model parameters. (a, b) Lidar Complex. (c, d) ParkCast. The curves show the corresponding fitted probability density function.

Download

The corresponding fitted parameters of the PDFs (orange curves) are displayed in Table 3. It is interesting to observe that the peak of the probability density is located around a=1.8 for the onshore Lidar Complex, while it is around a=0.8 for the offshore ParkCast. Moreover, the medians of a are approximately 2.0 and 1.5 for Lidar Complex and ParkCast, respectively. The mean (see μ in Table 3) and median of a as well as its value of the peak location of the PDF of Lidar Complex are all higher than that of ParkCast. This indicates that the coherence under similar separation generally decays faster in an onshore location than an offshore location. In terms of b, most of the values are near 0, and values higher than 0.1 are not often observed. Therefore, the y axes in Fig. 6b and d are plotted logarithmically to make the higher-value part of b visible. However, b shows no significant difference between the two cases observed in the figure.

It is not yet possible to explain the physical relationship between the wind evolution model parameters and the above-mentioned PDFs and the physical meaning of the corresponding PDF parameters. To verify whether the above-discussed phenomena commonly occur in wind evolution, further research involving more different measurement campaigns is necessary. At this point, a hypothesis is made that the values of a and b might follow an inverse Gaussian distribution and a Gamma distribution, respectively. The corresponding PDF parameters might depend on the terrain types, on the one hand. It is not clear if the roughness length would be a suitable parameter to quantify the influence of the terrain type on the value distribution of wind evolution model parameters. To figure out a concrete relationship between the PDF parameters and the terrain types, again, it is necessary to involve more measured data gathered from different terrain types. On the other hand, unfortunately, it is not yet possible to estimate to what extent the atmospheric stability would affect the distribution of the wind evolution model parameters because there was no sonic data available for ParkCast to inform the associated investigation until this work was finished.

Table 3Parameters of the fitted probability density functions.

Note: the notations $μ, λ, k, θ$ in the table are independent of the other notations in the article.

Download Print Version | Download XLSX

4.2 Dependence of the wind evolution model parameters on measurement separation

Figure 7 shows the fitted curves of the estimated coherence of all pairings of the above-mentioned Lidar Complex case-study example. Each color indicates a particular range gate, while each marker indicates a particular measurement separation. The figure shows a very clear dependence of the fitted-curve form on the measurement separation – the curves with the same marker overlap despite having different range gates. This confirms that the coherence depends on the separation of the measurement points but not on their positions, even in the wind turbine’s induction zone (defined as within 2.5 rotor diameters on the inflow side of the wind turbine). Since the curve offset is related only to the offset parameter b, obviously, b must strongly depend on the measurement separation. In addition, that all the fitted curves of the coherence are grouped together suggests it is reasonable to model the wind evolution based on the dimensionless frequency. Similar conclusions can be drawn from the example plot of ParkCast (see Fig. A5), which proves that these conclusions are not accidental.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f07

Figure 7Fitted curves of the estimated coherence between the lidar wind speeds measured at different range gates. The range gates R₁ to R₅ are located at 54.5, 81.75, 109, 136.25, and 163.5 m, respectively. 1D= 27.25 m. The mean lidar wind speed U_l ranges from 7.3 to 7.7 m s⁻¹, and the lidar-measured turbulence intensity I_T,l ranges from 0.10 to 0.12, for different range gates. Date and time: 7 December 2013 at 12:00–12:30. Data source: Lidar Complex.

Download

To further study the dependence of the wind evolution model parameters on the measurement separation, the box plots of the wind evolution parameters, grouped by the measurement separations, are given in Fig. 8. Although the ranges of the measurement separation from the two measurement campaigns are very different, the box plots still show similar trends. The decay parameter a shows a decreasing trend with increasing measurement separation. This decreasing trend of a gradually stops at a separation of about 300 m, as observed in Fig. 8c. The offset parameter b shows an increasing trend with separation. An increase in b implies a decreased offset of the coherence curve. This is consistent with the phenomena observed in Fig. 7 and Fig. A5.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f08

Figure 8Box plots of the wind evolution model parameters grouped by the measurement separations d. (a, b) Lidar Complex. (c, d) ParkCast. The bottom and top of the boxes indicate the first (25th percentile) and third (75th percentile) quartiles. The lower and upper whiskers show the 5th and 95th percentiles. The red line in the middle indicates the median value. Minimum sample size is 50.

Download

The decay of coherence is supposed to result from the evolution of turbulence eddies depending on travel time. The dependence of the decay parameter a on the measurement separation, or rather the travel distances, actually reveals the dependence of a on the travel time. Figure 9 shows the correlation between a and the travel time approximated by Δt_M of ParkCast. The fitted curve represents a negative correlation trend between them. This implies that the decay rate of the coherence decreases with increasing travel time. The nonlinear least-squares fitting is done using the Levenberg–Marquardt algorithm (Levenberg, 1944; Marquardt, 1963; Moré, 1978).

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f09

Figure 9Correlation between the decay parameter a and the travel time approximated by Δt_M. Data source: ParkCast.

Download

5 Parameterization model

This section first presents the training procedure of GPR models with the application of the ARD-SE kernel to select the suitable predictors in Sect. 5.1. Following that is a discussion of the selected predictors in Sect. 5.2 and an evaluation of the model performance in Sect. 5.3.

5.1 Model training

The initial settings for GPR model training are listed in Table 4. The setting of “exact GPR” means that a standard GPR is applied in the fitting and prediction process; otherwise GPR can be approximated using different methods to reduce the computation time for large amounts of training data. The initial values of σ_n, σ_f, and σ_m listed in the table are just used to initiate the training process, and their final values will be estimated from the training data by the GPR algorithm. The training data are standardized by centering and scaling the data of each predictor by its mean and standard deviation, respectively, which gives the standard scores (also called z scores) (Kreyszig, 1979; Mendenhall and Sincich, 2007) of the predictor data.

Table 4Initial settings of GPR model training.

Download Print Version | Download XLSX

Training the model is a two-step process. In the first step, all the potential predictors are included in a preliminary training to determine the characteristic length scale σ_m for each predictor (see Eq. 27). Figure 10 illustrates a comparison among the $\log (σ_{m}^{- 2})$ of all potential predictors. As explained in Sect. 2.6, the larger $\log (σ_{m}^{- 2})$ is, the more important and useful the corresponding predictor is for a GPR model, and thus this predictor should be selected. In the second step, new GPR models are trained only with the selected predictors, applying a 5-fold cross-validation to evaluate the model performance, using RMSE (see Eq. 29) and R² (see Eq. 30) as criteria.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f10

Figure 10Comparison of the relative importance of predictors. (a) ParkCast, lidar data. (b) Lidar Complex, lidar data. (c) Lidar Complex, sonic data. $\log (σ_{m}^{- 2}) = - \infty$ is not displayed.

Download

Table 5 displays the predictors selected according to different lower limits of $\log (σ_{m}^{- 2})$ under different measurement campaigns (Lidar Complex or ParkCast), different data availability (whether sonic data are available), and different targets (a or b). R² and the RMSE of the 5-fold cross-validation for the model trained with the respective combination of predictors are shown in the table as well.

Table 5Summary of the predictors selected according to different lower limits of $\log (σ_{m}^{- 2})$ under different measurement campaigns, different data availability, and different targets. R² and RMSE are obtained from a 5-fold cross-validation of the model trained with the respective combination of predictors. The bold text indicates the recommended predictor combinations.

Notes: cases 4–6 and cases 9–12 are selected for comparison with case 1 and case 7 for different purposes, respectively. Case 4 and case 9 are selected for examination of the effect of introducing G₁ and G₂ as predictors. Case 5 and case 10 are selected for examination of the different effects of Δt_M and Δt_T. Case 6 and case 11 are selected for examination of the effect of having α available. Case 12 is selected for examination of the effect of introducing a as a predictor for b.

Download Print Version | Download XLSX

In general, the more relevant predictors are involved in the model, the more accurate predictions the model can make. However, using more predictors entails a larger training data set and thus a longer model training time. On the other hand, it might also reduce the applicability of the model because predictions can only be made when all predictors are consistently available and reliable. The trade-off between these factors must be considered in predictor selection, and it is aimed to achieve relatively high model performance with as few predictors as possible. The bold text in Table 5 indicates the recommended predictor combinations for each situation based on these considerations. The predictors with $\log (σ_{m}^{- 2}) > - 2$ are generally essential for the model.

Let us take the situation of using lidar data from Lidar Complex to predict a as an example to explain the process of predictor selection (see Fig. 10a top and the first block in Table 5). Firstly, since $\log (σ_{m}^{- 2}$ ) of I_T,l and T_l are much smaller than the others, it is not necessary to consider these two predictors, and the lower limit of $\log (σ_{m}^{- 2})$ can be initially set to −4 (see Table 5: case 1). Then, try to increase the lower limit of $\log (σ_{m}^{- 2})$ step by step, e.g., first to −2 (see Table 5: case 2) and then to 0 (see Table 5: case 3), to further reduce the number of predictors. The resulting models are evaluated to determine whether it is appropriate to remove these predictors. For example, the comparison between case 1 and case 2 shows that removing d almost does not affect the model performance in this situation, with R² decreasing only slightly from 0.70 to 0.69. However, further abandonment of Δt_M significantly reduces the prediction accuracy, reducing R² more substantially from 0.69 in case 2 to 0.59 in case 3. Therefore, it is no longer proper to remove further predictors, and the predictor combination in case 2 is recommended.

5.2 Discussion of selected predictors

Feature selection is not only a tool to select suitable predictors for a machine learning model but also could shed some light on intrinsic relationships among data. Here are some discussions about the selected predictors to provide some insights into possible correlations between wind evolution and these predictors.

Selection between two related variables. In the preliminary training, two pairs of related variables are intentionally involved at the same time: σ and I_T and T and L. It is only necessary to select one of the two related variables (if determined to be relevant) because they can be converted into each other (given U). In terms of σ and I_T, it is surprising to notice that the GPR models show a preference for σ rather than I_T, although I_T is more commonly used in data analysis and simulation in wind energy. The only exception is the situation of using sonic data from Lidar Complex to predict b (cases 16–18). It is possible that GPR generally tends to select fundamental variables (directly calculated from measured data) instead of derived variables (calculated from other variables). However, the selection becomes complicated for T and L. In some situations, L is clearly more preferred, e.g., $\log (σ_{m}^{- 2})$ of L is obviously higher than $\log (σ_{m}^{- 2})$ of T in Fig. 10a top and b. In the other situations, $\log (σ_{m}^{- 2})$ of L and $\log (σ_{m}^{- 2})$ of T show similar values. For consistency, we decided to select L for all cases whenever L is determined to be relevant.

Introducing higher-order wind statistics as predictors. So far, skewness G₁ and kurtosis G₂ of wind speed have not been considered in wind evolution research. However, it is worth noting that both are selected as predictors in all cases except case 15, despite different measurement sites and devices. Case 4 and case 9 are aimed at examining the effects of G₁ and G₂ on the prediction of a and b, respectively, with G₁ and G₂ removed in comparison to cases 1 and 7. Case 4 and case 9 show much worse prediction accuracy, with R²=0.53 in case 4 compared to R²=0.70 in case 1 and R²=0.46 in case 9 compared to R²=0.70 in case 7. This comparison confirms that G₁ and G₂ are essential for predicting wind evolution when using lidar data and introducing G₁ and G₂ as predictors can significantly improve the models, despite uncertainties contained in their estimated values from measured data (see Sect. 2.5). This implies that G₁ and G₂ might contain additional information which could distinguish different states of turbulence given a particular mean wind speed and turbulence intensity, and this “different state” might be relevant to wind evolution.

Different approximations of travel time. Δt_M and Δt_T are two different approximations of travel time. Although Δt_M is expected to be more predictive than Δt_T, Δt_T is still involved in the model training because, in application, it is easier to calculate Δt_T than Δt_M. Cases 5 and 10 are selected to compare with cases 1 and 7, respectively, to examine the different effects of Δt_T and Δt_M on the GPR models. The respective values of R² show that replacing Δt_M with Δt_T only slightly decreases the prediction accuracy. Therefore, for a simpler calculation of travel time, Δt_T can be used as a predictor instead.

Effect of misalignment angle. As discussed in Sect. 2.5, misalignment angle α is supposed to be an important predictor for the prediction of the horizontal coherence. In cases 13–15, where sonic data from Lidar Complex are used to predict a, α shows a high relevance with $\log (σ_{m}^{- 2}) > 0$ . However, for the prediction of b using sonic data (cases 16–18), removing α from predictors does not influence the prediction accuracy much, especially when comparing case 16 and case 17, with R²=0.80 and R²=0.78, respectively. These results indicate that α is essential for the prediction of a but not relevant for predicting b.

In addition, α is introduced in the prediction using lidar data (case 6 and case 11) as well to examine its effect, although α is actually not available when only using a lidar in the staring mode. As mentioned in Sect. 3.1, α is approximated by the deviation between the yaw position of the turbine and mean wind direction taken on the meteorological mast. Cases 6 and 11 both show better prediction accuracy than cases 1 and 7, with R²=0.76 and R²=0.81, respectively, despite the uncertainties in the approximation of α. This means that if α were available, the prediction accuracy of the models trained with lidar data could be further improved. As mentioned earlier, α could be made available, e.g., by deploying a multi-beam lidar.

Introducing one of the targets as a predictor for the other. According to the wind evolution model (Eq. 8), a and b jointly determine the shape and the position of the modeled coherence, and thus they have a certain correlation with each other. Introducing one of them as a predictor for the other may improve its prediction accuracy. Case 12, with R²=0.74, compared to R²=0.70 in Case 7, confirms that introducing a as a predictor for b can help with the prediction of b. This means it could be a good idea to predict the wind evolution model parameters successively rather than in parallel. This concept is not yet fully studied in this work, and thus case 12 is not presented as a recommendation. To prove its applicability, it is necessary to investigate which wind evolution model parameter should be first predicted and how the prediction uncertainty in the first parameter would propagate to the second.

Prediction using sonic data. Additional research on using sonic data as predictors aims to provide some insights into whether it is worth involving sonic data in wind evolution prediction when available. When comparing the model performance of using lidar data and sonic data from Lidar Complex, case 13 – the best case of using sonic data to predict a – shows a higher prediction accuracy (R²=0.83 ) than case 6 – the best case of using lidar data given α available (R²=0.76). However, case 13 needs many more predictors than case 6, whereas case 14 and case 15, with fewer predictors, do not show any advantage in prediction accuracy. For predicting b, case 16 – the best case of using sonic data (R²=0.80) – does not outperform case 11 – the best case of using lidar data given α available (R²=0.81). It must be emphasized that the ultrasonic anemometer is installed on a meteorological mast located 295 m away from the lidar. There must be a deviation between the sonic data and the true values in the wind field where the coherence is estimated, which reduces the prediction accuracy when using sonic data. Figure 11 illustrates a comparison between the model performance of the recommended cases of using lidar data and sonic data.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f11

Figure 11Comparison of prediction performance of models using lidar data and sonic data from Lidar Complex. (a, b) Lidar data. (c, d) Sonic data. The subscript “pred” indicates the predicted values.

Download

Interestingly, case 15 can achieve the same predictive accuracy as case 1, with only three predictors: mean wind speed U_x,s, standard deviation of the vertical wind component σ_z,s, and the misalignment angle α. In fact, σ_z,s is determined to be the most important predictor by the ARD-SE kernel, having the maximum value of $\log (σ_{m}^{- 2})$ . This might imply a possible correlation between wind evolution and vertical convection.

Influence of atmospheric stability. We initially intended to study the influence of atmospheric stability using a dimensionless height ζ as the stability parameter (see Sect. 2.5). However, very surprisingly, ζ is not selected as a relevant predictor in any cases, and $\log (σ_{m}^{- 2})$ is quite small compared to the others (see Fig. 10c). In the end, we found that the stability happens to be mostly neutral during the chosen measurement in Lidar Complex. This could be the reason for ζ not being selected as a predictor. Therefore, it is not possible to analyze the influence of atmospheric stability on wind evolution in this study.

5.3 Model evaluation

As shown in Table 5, R² of all recommended cases ranges from 0.67 to 0.83. These results are much better than that of the preliminary study (Chen, 2019); in particular, the prediction accuracy of the offset parameter b has been significantly improved. This is mainly owing to the use of the ARD-SE kernel, which can help to select predictors reasonably and give different weights to predictors according to their relevant importance for the prediction, whereas kernel functions with a common length scale for predictors were applied in the preliminary study.

The prediction errors of a and b are quantified with the respective RMSE between their predicted and observed values. But in fact, the shape and position of the predicted coherence determined by both parameters together is the final prediction goal. And the corresponding prediction errors will eventually appear as the deviation between the predicted curve and its estimated curve due to the prediction errors of a and b.

To intuitively display how the prediction errors affect the shape and the position of the predicted coherence in the frequency domain, Fig. 12 shows the predicted coherence and the corresponding 95 % confidence interval for the example case from Lidar Complex. For the example prediction with lidar data in Fig. 12a, the prediction of a and b is made by the GPR models in cases 6 and 11, respectively. And for the example prediction with sonic data in Fig. 12b, the prediction of a and b is made by the GPR models in cases 13 and 17, respectively. The predicted coherence and the 95 % confidence interval are reconstructed by putting the predicted values of a and b and their lower and upper bounds of the 95 % confidence interval into the wind evolution model (Eq. 8). It can be observed that the prediction is very good for this example because the predicted coherence is almost overlapped with the one estimated from the measured data, and the 95 % confidence interval is quite narrow.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f12

Figure 12Example predicted coherence with 95 % confidence interval for two different measurement separations of Lidar Complex. (a) Prediction with lidar data: a for case 6 and b for case 11. (b) Prediction with sonic data: a for case 13 and b for case 17. The shaded areas indicate the RMSE interval. The input predictor data and the estimated coherence are from the case-study example of Lidar Complex: 7 December 2013 at 12:00–12:30. The mean lidar wind speed U_l ranges from 7.3 to 7.7 m s⁻¹, and the lidar-measured turbulence intensity I_T,l ranges from 0.10 to 0.12, for different range gates.

Download

To show the prediction errors in a more general sense, the RMSE interval is additionally indicated as shaded areas in Fig. 12. The lower and upper bounds of the RMSE interval are determined with

\begin{matrix} (31) & \begin{aligned} γ_{model, lb}^{2} (f_{dless}) \\ = \exp [- \sqrt{(a_{pred} + Δ a)^{2} \cdot f_{dless}^{2} + (b_{pred} + Δ b)^{2}}] \end{aligned} \end{matrix}

and

\begin{matrix} (32) & \begin{aligned} γ_{model, ub}^{2} (f_{dless}) \\ = \exp [- \sqrt{(a_{pred} - Δ a)^{2} \cdot f_{dless}^{2} + (b_{pred} - Δ b)^{2}}], \end{aligned} \end{matrix}

respectively, where a_pred and b_pred are the predicted values of a and b and Δa and Δb are the respective RMSE. The narrow RMSE interval shows that the GPR models perform overall well in the prediction of wind evolution.

Moreover, it is important to check if the prediction errors of the models are relevant to the values of the predictors. Taking the models trained with the lidar data from Lidar Complex (case 6 and case 11) as an example, Figs. 13–16 show the box plots of the prediction errors, defined as the deviation between the predicted and the observed values of targets, with respect to the values of the predictors. The histograms of the predictor values are plotted below the box plots correspondingly. The x axes of the box plots correspond to the upper bound of the respective bin in the histograms. For example, in Fig. 13a, the first box labeled with “4” means it is plotted with the prediction errors of the samples attributed to the mean wind speed range of 3–4 m s⁻¹. To avoid accidental conclusions, there is a minimum sample size requirement of 50 for the box plots.

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f13

Figure 13Prediction errors of a (case 6) and b (case 11) from Lidar Complex with respect to the values of predictors. (a) Lidar-measured mean wind speed U_l. (b) Standard deviation of lidar-measured wind speed σ_l. n is the sample size. The bottom and top of the boxes indicate the first and the third quartiles, i.e., 25th and 75th percentile, respectively. The lower and upper whiskers show the 5th and 95th percentiles. The red line and the yellow cross in the middle indicate the median and mean value, respectively.

Download

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f14

Figure 14Prediction errors of a (case 6) and b (case 11) from Lidar Complex with respect to the values of predictors. (a) Skewness of lidar-measured wind speed G_1,l. (b) Kurtosis of lidar-measured wind speed G_2,l. n is the sample size. The bottom and top of the boxes indicate the first and the third quartiles, i.e., 25th and 75th percentile, respectively. The lower and upper whiskers show the 5th and 95th percentiles. The red line and the yellow cross in the middle indicate the median and mean value, respectively.

Download

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f15

Figure 15Prediction errors of a (case 6) and b (case 11) from Lidar Complex with respect to the values of predictors. (a) Integral length scale of lidar-measured wind speed L_l. (b) Time lag determined by the peak of maximum cross-correlation Δt_M. n is the sample size. The bottom and top of the boxes indicate the first and the third quartiles, i.e., 25th and 75th percentile, respectively. The lower and upper whiskers show the 5th and 95th percentiles. The red line and the yellow cross in the middle indicate the median and mean value, respectively.

Download

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f16

Figure 16Prediction errors of a (case 6) and b (case 11) from Lidar Complex with respect to the values of predictors. (a) Measurement separation d. (b) Misalignment angle of wind direction and lidar measurement α. n is the sample size. The bottom and top of the boxes indicate the first and the third quartiles, i.e., 25th and 75th percentile, respectively. The lower and upper whiskers show the 5th and 95th percentiles. The red line and the yellow cross in the middle indicate the median and mean value, respectively.

Download

The box plots indicate data within the first and the third quartiles (i.e., 25th and 75th percentile) and represent the main part of the data, whereas whiskers show the tails of the distributions of the data indicating extreme values. In Figs. 13–16, it can be observed that the boxes of the prediction errors of a and b are all quite narrow and centered around 0, indicating small prediction errors for the majority of samples. The fact that the boxes are centered around 0, as well as the median and mean values (indicated as red lines and yellow crosses, respectively), means that there is no systematic error with respect to predictor values. In the box plots for the prediction errors of a, the ranges of boxes and whiskers do not show obvious relevance to predictor values except for small travel time and measurement separation. The large range of the box and whiskers of the first box in Fig. 15b (and that of the first box in Fig. 16a) implies that the prediction of a is likely more uncertain for small travel time and measurement separation (both are related to some extent). The ranges of boxes and whiskers of the prediction errors of b show some relevance to the values of standard deviation, skewness, travel time, and measurement separation. In Fig. 13b, a clear trend can be observed: the ranges of the boxes and whiskers decrease with the values of standard deviation, indicating that the prediction of b might be better for high turbulence. A similar trend can be observed in Fig. 14a, meaning that the prediction of b might be better under the circumstance of negative skewness (longer left tail) than that of positive skewness (longer right tail). In Fig. 15b and Fig. 16a, the ranges of boxes and whiskers get larger with travel time and measurement separation, implying that the prediction errors of b increase with travel time and measurement separation.

It is worth emphasizing that the performance of any regression model can be only as good as the quality of the training data. No choice of regression model can eliminate noise from the training data. And the noisier the training data are, the more uncertainties the prediction of the regression model will contain. A good data source is always essential for training a good regression model.

6 Conclusions and outlook

This paper aims to investigate the potential of Gaussian process regression (GPR) in the parameterization of wind evolution. This research has been motivated by the need for lidar-assisted wind turbine control for accurate models to predict wind evolution, in order to avoid harmful and unnecessary control actions. In addition, the commonly used 3D stochastic wind field simulation method can be extended to 4D by integrating wind evolution to provide a more realistic simulation environment for this control concept.

In this research, data from two nacelle-mounted lidars in both onshore and offshore locations were used to estimate wind evolution. The estimated wind evolution was fitted to a two-parameter wind evolution model, modified from a model suggested in the literature. To shed light on some characteristics of wind evolution, a statistical analysis was done for the wind evolution model parameters.

In the statistical analysis, the distributions of the wind evolution model parameters of both measurements show some common characteristics, despite different wind-field-related variables and settings of the measurements. The value ranges of both wind evolution parameters a (i.e., the decay parameter) and b (i.e., the offset parameter) are very similar in both measurements. The distributions of a and the b seem to follow an inverse Gaussian distribution and a Gamma distribution, respectively. The fitted parameters of the probability density functions are different in both measurements. We hypothesize that the parameters of the probability density functions might depend on the terrain type. Moreover, a strong dependence of wind evolution model parameters was observed on measurement separations. The decay parameter a shows a decreasing trend with increasing measurement separation, while the offset parameter b shows an increasing trend with increasing measurement separation.

An investigation was done to explore the potential of using GPR to achieve parameterization models for wind evolution. GPR models were trained with the wind evolution model parameters (i.e., targets) and some wind-field-related variables (i.e., predictors) acquired from the lidars and a meteorological mast. The automatic relevance determination squared exponential kernel was applied to evaluate the relative importance of different predictors and to select the essential predictors for the models under different data availabilities. The performance of the GPR models was evaluated with the coefficient of determination R² and root mean square error (RMSE) using a 5-fold cross-validation. The R² of the models in the recommended cases for both targets, under different measurement campaigns and different data availabilities, ranges from 0.67 to 0.83.

A comparison between the models trained with different predictor combinations provides some interesting insights. (1) GPR models show preference to a fundamental variable than a derived variable when selecting between two related variables. (2) Introducing higher-order wind statistics (i.e., skewness and kurtosis) as predictors can improve the models. (3) When using travel time as a predictor, the approximation determined with the maximum cross-correlation is slightly preferred than Taylor's translation hypothesis, but the latter could still be an option for the sake of simplification. (4) Introducing one of the targets as a predictor for the other can also improve the models, but further research needs to be done to understand the propagation of the uncertainties introduced by the first predicted target. (5) Considering the misalignment angle as a predictor can properly account for its influence on the horizontal coherence. (6) Prediction using sonic data (not measured nearby) does not show any advantages given that it requires many more predictors to exceed the prediction using lidar data.

The predicted coherence is obtained by putting the two predicted parameters into the wind evolution model. To intuitively display how the prediction errors of a and b affect the shape and the position of the predicted coherence in the frequency domain, the predicted coherence and its 95 % confidence interval were visualized for a representative case-study example. The predicted coherence matches the coherence estimated from data very well, and the 95 % confidence interval is relatively narrow. In addition, the RMSE interval was also demonstrated to show the impact of the RMSE of a and b in a more general sense. The RMSE interval turns out to be quite narrow, indicating an overall good model performance. Furthermore, the prediction errors of a and b were analyzed with respect to the values of each predictor, shown as box plots. The results show that, for both a and b, there is no systematic error with respect to predictor values. The prediction of a seems to be less accurate for small travel time and measurement separation. The prediction errors of b show some relevance to the values of standard deviation and skewness of wind speed, travel time, and measurement separation.

There is still space to improve the performance of the parameterization model. Since the performance of any regression model can be only as good as the quality of the training data, reducing the uncertainty in the training data or increasing the data amount could improve the model performance. For example, methods to improve the estimation of the coherence and the wind statistics from lidar data are desirable. Moreover, the predictors discussed above do not cover all possibilities. Introducing new proper predictors could hence also improve the model performance. In fact, the model concept is very flexible. Any improvement of any part of the workflow can be easily integrated.

In the future, besides the ideas mentioned above, it would be interesting to involve more measurement data, especially from different terrain types, to further investigate whether the wind evolution characteristics found here occur commonly, and what physical principles stand behind them. Another question that needs answering is whether it is possible to achieve a generally applicable parameterization model and how to do so. Moreover, considering that the computational time of the model training could be an important issue for some applications, e.g., real-time model training, it is worth comparing GPR with some alternative algorithms to develop insight into the trade-off between computation time and the prediction accuracy. Furthermore, considering the application of the parameterization model using real-time measurement data as predictors, an additional model will be needed to determine whether the current data meet the quality requirements to be input into the parameterization model.

Last but not least, as mentioned above, our model concept is very flexible, and its methodology can be applied in different situations. For example, for other lidar trajectories or even other measurement devices, the model concept can be modified by replacing the coherence estimation method. The wind evolution model and the regression model can also be changed. Basically, one can achieve a parameterization model to meet various specific requirements by following the concept and the methodology presented in this paper.

Appendix A

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f17

Figure A1Distribution of lidar-measured mean wind speed and turbulence intensity for the selected period. (a, c) Lidar Complex, from the range gate at 163.5 m. (b, d) ParkCast, from the range gate at 150 m.

Download

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f18

Figure A2(a) Example coherence: U_l=11.7 m s⁻¹, d=81.75 m, R²=0.95. (b) and (c) PSDs of the fore-aft and in-plane tower top acceleration, respectively. The x axis is logarithmic. Date and time: 7 December 2013 at 12:00–12:30. Data source: Lidar Complex. Because of data protection it is not allowed to show any values concerning the turbine properties.

Download

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f19

Figure A3(a–d) Example plots of the estimated coherence between the selected range gates R and the corresponding fitted curves. The corresponding measurement distances are 120, 240, 360, and 480 m, respectively. (e) Time series of the lidar wind speed. The mean lidar wind speed is 11.6 m s⁻¹. Date: 12 June 2019. Data source: ParkCast.

Download

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f20

Figure A4Comparison of the fitting quality (R²) of the two-parameter wind evolution model (Eq. 8) and that of the one-parameter wind evolution model (Eq. 5). The subscripts “1par” and “2par” indicate one-parameter and two-parameter, respectively.

Download

https://wes.copernicus.org/articles/6/61/2021/wes-6-61-2021-f21

Figure A5Fitted curves of the estimated coherence between the lidar wind speeds measured at different range gates. The range gates R₁ to R₅ are located at 150, 270, 390, 510, and 630 m, respectively. 1D= 120 m. The mean lidar wind speed U_l is 11.6 m s⁻¹. Date and time: 12 June 2019 at 22:30–23:00. Data source: ParkCast.

Download

Data availability

The data are not publicly available according to the confidentiality agreement.

Author contributions

YC conceived the concept, developed the model codes, processed the data, created the figures, conducted the analysis, and prepared the paper. DS and PWC provided general guidance and essential suggestions throughout the process. PWC supervised the research.

Competing interests

The authors declare that they have no conflict of interest.

Acknowledgements

The measured data used in this project were collected in the projects Lidar Complex (grant no. 0325519A) and ParkCast (grant no. 0324330A), which are funded by the German Federal Ministry for Economic Affairs and Energy (BMWi).

Financial support

This research is carried out within the framework of the Joint Graduate Research Training Group Windy Cities, supported by the Baden-Württemberg Ministry of Science, Research and Arts.

Review statement

This paper was edited by Jakob Mann and reviewed by Felix Kelberlau, Mark Kelly, and one anonymous referee.

References

Bossanyi, E.: Un-freezing the turbulence: Application to LiDAR-assisted wind turbine control, IET Renewable Power Generation, 7, 321–329, https://doi.org/10.1049/iet-rpg.2012.0260, 2013. a

Breiman, L., Friedman, J., Stone, C. J., and Olshen, R.: Classification and regression trees, CRC Press, Boca Raton, FL, 1984. a

Businger, J. A., Wyngaard, J. C., Izumi, Y., and Bradley, E. F.: Flux-Profile Relationships in the Atmospheric Surface Layer, J. Atmos. Sci., 28, 181–189, https://doi.org/10.1175/1520-0469(1971)028<0181:FPRITA>2.0.CO;2, 1971. a

Carious, J.-P.: Pulsed Lidars, in: Remote Sensing for Wind Energy, DTU Wind Energy-E-Report-0029(EN), chap. 5, DTU Wind Energy, Roskilde, Denmark, 104–121, 2013. a, b, c

Chen, Y.: Parameterization of wind evolution model using lidar measurement, Zenodo, https://doi.org/10.5281/zenodo.3366119, 2019. a, b

Davenport, A. G.: The spectrum of horizontal gustiness near the ground in high winds, Q. J. Roy. Meteorol. Soc., 87, 194–211, https://doi.org/10.1002/qj.49708737208, 1961. a, b

Davoust, S. and von Terzi, D.: Analysis of wind coherence in the longitudinal direction using turbine mounted lidar, J. Phys. Conf. Ser., 753, 072005, https://doi.org/10.1088/1742-6596/753/7/072005, 2016. a

de Maré, M. and Mann, J.: On the Space-Time Structure of Sheared Turbulence, Bound.-Lay. Meteorol., 160, 453–474, https://doi.org/10.1007/s10546-016-0143-z, 2016. a

Duvenaud, D.: Automatic model construction with Gaussian processes, Apollo – university of cambridge repository, University of Cambridge, Cambridge, UK, https://doi.org/10.17863/CAM.14087, 2014. a, b, c

Fritsch, F. N. and Carlson, R. E.: Monotone Piecewise Cubic Interpolation, SIAM J. Numer. Anal., 17, 238–246, https://doi.org/10.1137/0717021, 1980. a

Hocking, R. R.: The Analysis and Selection of Variables in Linear Regression, Biometrics, 32, 1–49, https://doi.org/10.2307/2529336, 1976. a

Joanes, D. N. and Gill, C. A.: Comparing measures of sample skewness and kurtosis, J. Roy. Stat. Soc. D-Stat., 47, 183–189, https://doi.org/10.1111/1467-9884.00122, 1998. a, b

Kelberlau, F. and Mann, J.: Better turbulence spectra from velocity–azimuth display scanning wind lidar, Atmos. Meas. Tech., 12, 1871–1888, https://doi.org/10.5194/amt-12-1871-2019, 2019. a

Kreyszig, E.: Advanced engineering mathematics, Wiley, New York and Chichester, 4th edn., 1979. a

Kristensen, L.: On longitudinal spectral coherence, Bound.-Lay. Meteorol., 16, 145–153, https://doi.org/10.1007/BF02350508, 1979. a, b, c, d, e

Laks, J., Simley, E., and Pao, L.: A spectral model for evaluating the effect of wind evolution on wind turbine preview control, in: 2013 American Control Conference, IEEE, 3673–3679, https://doi.org/10.1109/ACC.2013.6580400,2013. a

Lenschow, D. H., Mann, J., and Kristensen, L.: How Long Is Long Enough When Measuring Fluxes and Other Turbulence Statistics?, J. Atmos. Ocean. Techn., 11, 661–673, https://doi.org/10.1175/1520-0426(1994)011<0661:HLILEW>2.0.CO;2, 1994. a

Levenberg, K.: A method for the solution of certain non-linear problems in least squares, Q. Appl. Math., 2, 164–168, https://doi.org/10.1090/qam/10666, 1944. a, b

Liu, Z., Barlow, J. F., Chan, P.-W., Fung, J. C. H., Li, Y., Ren, C., Mak, H. W. L., and Ng, E.: A Review of Progress and Applications of Pulsed Doppler Wind LiDARs, Remote Sensing, 11, 2522, https://doi.org/10.3390/rs11212522, 2019. a

Lumley, J. L. and Panofsky, H. A.: The structure of atmospheric turbulence., Interscience Publishers, Wiley & Sons, New York, 1st edn., https://doi.org/10.1002/qj.49709138926, 1964. a, b, c

Mann, J.: The spatial structure of neutral atmospheric surface-layer turbulence, J. Fluid Mech., 273, 141, https://doi.org/10.1017/S0022112094001886, 1994. a

Marquardt, D. W.: An Algorithm for Least-Squares Estimation of Nonlinear Parameters, J. Soc. Ind. Appl. Math., 11, 431–441, https://doi.org/10.1137/0111030, 1963. a, b

Mendenhall, W. and Sincich, T.: Statistics for engineering and the sciences, Pearson Prentice Hall, Upper Saddle River, N.J., 5th edn., 2007. a

Moré, J. J.: The Levenberg-Marquardt Algorithm: Implementation and Theory, edited by: Watson, G. A., Numerical Analysis, 630, 105–116, https://doi.org/10.1007/BFb0067700, 1978. a, b

Obukhov, A. M.: Turbulence in an atmosphere with a non-uniform temperature, Bound.-Lay. Meteorol., 2, 7–29, https://doi.org/10.1007/BF00718085, 1971. a

Oppenheim, A. V., Willsky, A. S., and Nawab, S. H.: Signals & systems, Prentice-Hall signal processing series, Prentice Hall and London: Prentice-Hall International, Upper Saddle River, 2nd edn., 1997. a, b

Panofsky, H. A. and McCormick, R. A.: Properties of spectra of atmospheric turbulence at 100 metres, Q. J. Roy. Meteor. Soc., 80, 546–564, https://doi.org/10.1002/qj.49708034604, 1954. a

Panofsky, H. A. and Mizuno, T.: Horizontal coherence and Pasquill's beta, Bound.-Lay. Meteorol., 9, 247–256, https://doi.org/10.1007/BF00230769, 1975. a, b, c

Panofsky, H. A., Thomson, D. W., Sullivan, D. A., and Moravek, D. E.: Two-point velocity statistics over Lake Ontario, Bound.-Lay. Meteorol., 7, 309–321, https://doi.org/10.1007/BF00240834, 1974. a

Peña, A., Hasager, C., Lange, J., Anger, J., Badger, M., Bingöl, F., Bischoff, O., Cariou, J.-P., Dunne, F., Emeis, S., Harris, M., Hofsäss, M., Karagali, I., Laks, J., Larsen, S., Mann, J., Mikkelsen, T., Pao, L., Pitter, M., Rettenmeier, A., Sathe, A., Scanzani, F., Schlipf, D., Simley, E., Slinger, C., Wagner, R., and Würth, I.: Remote Sensing for Wind Energy, no. 0029(EN) in DTU Wind Energy E, DTU Wind Energy, Denmark, 2013. a

Pielke, R. A. and Panofsky, H. A.: Turbulence characteristics along several towers, Bound.-Lay. Meteorol., 1, 115–130, https://doi.org/10.1007/BF00185733, 1970. a, b, c, d, e, f, g, h, i

Pope, S. B.: Turbulent flows, Cambridge University Press, Cambridge and New York, 2000. a

Rasmussen, C. E. and Williams, C. K. I.: Gaussian processes for machine learning, Adaptive computation and machine learning, MIT, Cambridge, Mass. and London, 2006. a, b, c, d

Ropelewski, C. F., Tennekes, H., and Panofsky, H. A.: Horizontal coherence of wind fluctuations, Bound.-Lay. Meteorol., 5, 353–363, https://doi.org/10.1007/BF00155243, 1973. a, b, c, d

Sathe, A. and Mann, J.: Measurement of turbulence spectra using scanning pulsed wind lidars, J. Geophys. Res.-Atmos., 117, D01201, https://doi.org/10.1029/2011JD016786, 2012. a

Schlipf, D.: Lidar-assisted control concepts for wind turbines, Dissertation, University of Stuttgart, Stuttgart, 2015. a, b

Schlipf, D., Trabucchi, D., Bischoff, O., Hofsäß, M., Mann, J., Mikkelsen, T., Rettenmeier, A., Trujillo, J. J., and Kühn, M.: Testing of frozen turbulence hypothesis for wind turbine applications with a scanning LIDAR system, OPUS – Publication Server of the University of Stuttgart, https://doi.org/10.18419/opus-3915, 2011. a

Schlipf, D., Haizmann, F., Cosack, N., Siebers, T., and Cheng, P. W.: Detection of Wind Evolution and Lidar Trajectory Optimization for Lidar-Assisted Wind Turbine Control, Meteorol. Z., 24, 565–579, https://doi.org/10.1127/metz/2015/0634, 2015. a, b, c, d, e, f

Shannon, C. E.: Communication in the Presence of Noise, P. IRE, 37, 10–21, https://doi.org/10.1109/JRPROC.1949.232969, 1949. a

Simley, E.: Wind Speed Preview Measurement and Estimation for Feedforward Control of Wind Turbines, ProQuest Dissertations & Theses, PhD dissertation, University of Colorado, Ann Arbor, 2015. a

Simley, E. and Pao, L. Y.: A longitudinal spatial coherence model for wind evolution based on large-eddy simulation, in: 2015 American Control Conference (ACC), IEEE, 3708–3714, https://doi.org/10.1109/ACC.2015.7171906, 2015. a, b, c, d, e, f, g, h, i, j, k, l, m, n, o

Simley, E., Fürst, H., Haizmann, F., and Schlipf, D.: Optimizing Lidars for Wind Turbine Control Applications – Results from the IEA Wind Task 32 Workshop, Remote Sensing, 10, 863, https://doi.org/10.3390/rs10060863, 2018. a

Taylor, G. I.: The Spectrum of Turbulence, P. Roy. Soc. A-Math. Phy., 164, 476–490, https://doi.org/10.1098/rspa.1938.0032, 1938. a, b, c, d, e, f, g

Thresher, R. W., Holley, W. E., Smith, C. E., Jafarey, N., and Lin, S.-R.: Modeling the response of wind turbines to atmospheric turbulence, Tech. Rep. RL0/2227-81/2, Mechanical Engineering Department, Oregon State University, Corvallis, Oregon, 1981. a, b

Vapnik, V. N.: The Nature of Statistical Learning Theory, Springer New York, New York, NY, 1995. a

Veers, P. S.: Three-Dimensional Wind Simulation, Tech. Rep. SAND88-0152, Sandia National Laboratories, Albuquerque, New Mexico, 1988. a, b

Weitkamp, C.: Lidar: Range-Resolved Optical Remote Sensing of the Atmosphere, vol. 102, Springer-Verlag, New York, https://doi.org/10.1007/b106786, 2005. a, b

Willis, G. E. and Deardorff, J. W.: On the use of Taylor's translation hypothesis for diffusion in the mixed layer, Q. J. Roy. Meteorol. Soc., 102, 817–822, https://doi.org/10.1002/qj.49710243411, 1976. a, b

Würth, I., Ellinghaus, S., Wigger, M., Niemeier, M. J., Clifton, A., and Cheng, P. W.: Forecasting wind ramps: Can long-range lidar increase accuracy?, J. Phys. Conf. Ser., 1102, 012013, https://doi.org/10.1088/1742-6596/1102/1/012013, 2018. a

https://de.mathworks.com/products/statistics.html, last access: 18 June 2020

https://www.rave-offshore.de/en/parkcast.html, last access: 18 June 2020

https://www.alpha-ventus.de/english, last access: 18 June 2020

⁴

Forschungsplattform In Nord- und Ostsee Nr. 1 (Research Platform in the North and Baltic Seas No. 1); https://www.fino1.de/en/, last access: 18 June 2020

⁵

Francisco de Castro (2020); fitmethis (https://www.mathworks.com/matlabcentral/fileexchange/40167-fitmethis, available at: 13 January 2020), MATLAB Central File Exchange

Articles

Short summary

Wind evolution is currently of high interest, mainly due to the development of lidar-assisted wind turbine control (LAC). Moreover, 4D stochastic wind field simulations can be made possible by integrating wind evolution into 3D simulations to provide a more realistic simulation environment for LAC. Motivated by these factors, we investigate the potential of Gaussian process regression in the parameterization of a two-parameter wind evolution model using data of two nacelle-mounted lidars.