Improving wind farm flow models by learning from operational data

Schreiber, Johannes; Bottasso, Carlo L.; Salbert, Bastian; Campagnolo, Filippo

doi:https://doi.org/10.5194/wes-5-647-2020

Articles | Volume 5, issue 2

Research article

27 May 2020

Abstract

This paper describes a method to improve and correct an engineering wind farm flow model by using operational data. Wind farm models represent an approximation of reality and therefore often lack accuracy and suffer from unmodeled physical effects. It is shown here that, by surgically inserting error terms in the model equations and learning the associated parameters from operational data, the performance of a baseline model can be improved significantly. Compared to a purely data-driven approach, the resulting model encapsulates prior knowledge beyond that contained in the training data set, which has a number of advantages. To assure a wide applicability of the method – also including existing assets – learning here is purely driven by standard operational (SCADA) data. The proposed method is demonstrated first using a cluster of three scaled wind turbines operated in a boundary layer wind tunnel. Given that inflow, wakes, and operational conditions can be precisely measured in the repeatable and controllable environment of the wind tunnel, this first application serves the purpose of showing that the correct error terms can indeed be identified. Next, the method is applied to a real wind farm situated in a complex terrain environment. Here again learning from operational data is shown to improve the prediction capabilities of the baseline model.

Received: 20 Nov 2019 – Discussion started: 02 Dec 2019 – Revised: 31 Mar 2020 – Accepted: 20 Apr 2020 – Published: 27 May 2020

1 Introduction

Knowledge of the flow at the rotor disk of each wind turbine in a wind power plant enables several applications, including wind farm control, the provision of grid services, predictive maintenance, the estimation of life consumption, the feed-in to digital twins, and power forecasting, among others.

This paper describes a new method to improve a wind farm flow model directly from standard operational data. The main idea pursued here is to use an existing wind farm flow model to provide a baseline predictive capability; however, as all models contain approximations and may lack the description of some physical phenomena, the baseline model is improved (or “augmented”, which is the term used in this work) by adding parametric correction terms. In turn, these extra elements of the model are learned by using operational data. The correction terms capture effects that are typically not present in standard flow models (such as, for example, secondary steering, Fleming et al., 2018; or wind farm blockage, Bleeg et al., 2018) or that are highly dependent on a specific site or difficult to model upfront (such as, for example, nonuniform inflow caused by local orography and vegetation).

Various wind farm flow models have been developed and are described in the literature. Whereas direct numerical simulation (DNS) is still out of reach for practical applications due to its overwhelming computational cost, large-eddy simulation (LES) methods are now routinely used for the modeling of wind farm flows (Fleming et al., 2014; Breton et al., 2017). Although invaluable for the understanding of the behavior of the atmospheric boundary layer and of wakes, LES is however still very expensive, so that its use outside of some specialized applications is limited. To reduce cost, one can resort to lower-fidelity computational fluid dynamics (CFD) models (Boersma et al., 2019), or to the extraction of reduced-order models (ROMs) from higher-fidelity ones (Bastine et al., 2014). Instead of deriving models from first principles, another widely adopted approach is to use engineering models, which are expressed in the form of parametric analytical formulas with a limited number of degrees of freedom and hence a much reduced numerical complexity (Frandsen et al., 2006; Gebraad et al., 2014; Bastankhah and Porté-Agel, 2016). The present paper uses this last family of methods, although ideas similar to the ones developed here could also be applicable to higher-fidelity models.

Even though engineering models are constantly improved and refined (Fleming et al., 2018), they will most likely always exhibit only a limited accuracy in many practical applications, for example whenever an important role is played by effects such as orography, (seasonal) vegetation, spatial variability of the wind, sea state roughness, the erection of other neighboring wind turbines, the presence of obstacles, and others. In addition, low-fidelity models often lack some physics, e.g. the flow acceleration caused by wake and rotor blockage, secondary steering, or others. The idea pursued in this paper is then to take a rather pragmatic approach: based on the realization that it will always be difficult – if not altogether impossible – to include all effects and all physics in a model of limited numerical complexity, a given model is corrected by unknown parametric terms, which are then learned by using operational data.

The idea of improving an existing model based on measurements is hardly new, and it is actually an important topic in the areas of controls and system identification. For example, in the field of wind farm flows, a Kalman filtering approach has been proposed by Doekemeijer et al. (2017) to update model predictions based on lidar measurements. Here again the present paper takes a more pragmatic approach, and model updating is based exclusively on data provided by the standard supervisory control and data acquisition (SCADA) systems that are typically available on contemporary wind turbines. On the one hand this has the advantage that the proposed method is applicable to existing assets, as it does not necessitate extra sensors. On the other hand, given that stored SCADA data typically represent 10 min averages, this also implies that the models obtained by this technique are of a steady-state nature. Although unsteady effects in wind farms are clearly important, steady-state models are still very valuable and can support many of the applications listed above. In addition, nothing prevents the generalization of the proposed approach to unsteady flow models, assuming that the relevant higher-frequency data sets are available, which is already the subject of ongoing work from these authors.

The contemporary literature – and not only in the field of wind energy – indicates an increasing interest in data-driven approaches. Just to give one single example related to wake modeling, a purely data-driven approach has been recently described by Göçmen and Giebel (2018). However, the current enthusiasm for data should not make one forget that physics-based and analytical models are also extremely valuable because they often encapsulate significant knowledge on a given problem, often corroborated by long experience. In fact, purely data-driven approaches suffer from a number of limitations that descend directly from a very simple and inevitable fact: a model that is exclusively based on data can only know what is contained in the data set that was used to build it. Typically, this means that a very significant amount of data is necessary to obtain a model that is sufficiently general and accurate. Furthermore, the data have to cover the entire spectrum of operation of the system. This also means that the model might have very poor knowledge (and hence poor performance) for rare situations or conditions that take place at the boundaries of the operating envelope, where few if any data points might be available.

An alternative to the purely data-driven approach is presented in this work, where a reference baseline model is augmented with parametric error terms, which are then identified using data. The baseline model already includes prior knowledge based on physics, empirical observations, and experience. Therefore, even prior to the use of data, a minimum performance can be guaranteed. The model is augmented with parametric error terms, whose choice is driven by physics and the knowledge of the limitations of the baseline model. Once the errors are identified using operational data, their inspection can clarify the causes of discrepancy between model and measurements. Eventually, this can be used to improve the underlying baseline model. Furthermore, by looking at the magnitude of the identified errors, significant deviations from the baseline model can be flagged to highlight issues with the model itself, the data, or the training process.

Finally, it should be noted that the identification of the error terms can be combined with the tuning of the parameters of the baseline model. This addresses yet another problem: tuning the parameters of a model that lacks some physics may lead to unreasonable values for the parameters, as the model is “stretched” to represent phenomena that it does not contain. By the proposed hybrid approach, the simultaneous identification of the parameters of the baseline model together with the ones of the error terms eases this problem, as unmodeled phenomena can be captured by the model-augmenting terms, thereby reducing the chances of nonphysical tuning of the baseline parameters.

The baseline model parameters and the extra correction terms have a different functional form in the augmented governing equations. Hence, they should be distinguishable from each other, as they imply different effects on the model. However, as for many identification problems, it is in general not possible to guarantee that all unknown parameters are observable and noncollinear given a set of measurements and, hence, given a certain informational content. To address this problem, the method proposed by Bottasso et al. (2014a) is used here, where the original unknown parameters are recast into a new set of statistically uncorrelated variables by using the singular value decomposition (SVD) of the inverse Fisher information matrix. Once the problem has been solved in the space of the orthogonal uncorrelated parameters, the solution is mapped back onto the original physical space. This approach not only avoids the ill-posedness of the original problem, but also allows one to clarify which physical parameters are visible given a certain data set.
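The recasting into uncorrelated parameter combinations can be illustrated with a minimal sketch. It is not the implementation used in this work (the actual formulation is the subject of Sect. 2.3). It assumes that a symmetric Fisher information matrix is already available and uses its eigendecomposition, which for a symmetric positive semi-definite matrix gives the same directions as the SVD of its inverse, with variances equal to the reciprocal eigenvalues. The function names and the threshold `max_variance` are hypothetical.

```python
import numpy as np

def uncorrelated_directions(fisher, max_variance):
    """Split parameter space into identifiable and non-identifiable directions.

    fisher: symmetric Fisher information matrix (n x n), assumed given.
    max_variance: hypothetical threshold on the variance bound of each direction.
    """
    lam, vecs = np.linalg.eigh(fisher)            # fisher = vecs @ diag(lam) @ vecs.T
    variances = np.full_like(lam, np.inf)         # zero information -> unbounded variance
    positive = lam > 1e-12 * lam.max()
    variances[positive] = 1.0 / lam[positive]     # singular values of the inverse matrix
    identifiable = variances <= max_variance
    return vecs, variances, identifiable

def map_back(vecs, identifiable, theta):
    """Map identified uncorrelated parameters theta back to the physical parameters."""
    return vecs[:, identifiable] @ theta
```

For two perfectly collinear parameters, for instance, only their sum would be flagged as identifiable, while their difference would receive an unbounded variance and be left out of the estimation.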

The paper is organized as follows. First, the baseline model is introduced in Sect. 2.1, together with a detailed description of the proposed parametric corrections in Sect. 2.2. Next, the SVD-based parameter identification method is presented in Sect. 2.3. The approach is then applied in Sect. 3.1 to a cluster of scaled wind turbines operating in the atmospheric test section of the wind tunnel of the Politecnico di Milano (Bottasso et al., 2014b). The goal of this first application is to show that a correct identification of the error terms can be achieved. This is indeed possible in the controllable and repeatable conditions of a wind tunnel, where inflow and wake characteristics can be precisely measured, something that is hardly possible today in the field. Specifically, it is shown that the method can correctly learn the lack of uniformity of the wind tunnel inflow, which is akin to what happens in a real wind farm because of orographic effects. Similarly, it is shown that secondary steering, which is completely absent from the baseline model used here, can be learned by using turbine power measurements only. A more extended view on the wind tunnel results is reported in Appendix A. After having demonstrated the method in the known and controlled wind tunnel environment, a second application is developed in Sect. 3.2 that targets a real 43-turbine wind farm. Here results indicate that the augmented model has a markedly improved prediction capability when compared to the baseline one, thanks primarily to the identification of orographic effects on the inflow and the tuning of other model parameters. Finally, conclusions are drawn in Sect. 4.

2 Methods

2.1 Baseline wind farm flow model

The proposed method is applied here to the baseline wake model of Bastankhah and Porté-Agel (2016), implemented within the FLORIS framework (Doekemeijer and Storm, 2018). Given ambient wind conditions, steady-state velocities within a wind farm can be computed by this model, together with the corresponding operating states and power outputs of all its turbines. First, ambient conditions are estimated from un-waked machines operating in free stream, which are identified by the turbine yaw orientations and the wake model (Schreiber et al., 2018). Then, power and thrust of the upstream turbines are computed based on the turbine aerodynamic characteristics, regulation strategy, and alignment with the local wind direction. Next, the wakes shed by these turbines are calculated in terms of their trajectory and speed deficit. In turn, this yields the velocity at the rotor disks of the turbines immediately downstream. In the case of multiple wake impingements on a rotor, a combination model is used to superimpose multiple wake deficits. Similarly, an added turbulence model is used to estimate the turbulence intensity at a downstream turbine rotor disk, as this local ambient parameter affects the expansion of the wake. This process is repeated marching downstream throughout the wind farm until the last downstream turbine is reached.

In this work, the implementation uses the selfSimilar FLORIS velocity deficit model, the rans deflection model, the quadraticRotorVelocity wake combination model, and the crespoHernandez added turbulence model. The interested reader is referred to Bastankhah and Porté-Agel (2016), Crespo and Hernández (1996), and Doekemeijer et al. (2019) and references therein for detailed descriptions and derivations of these models.

Engineering wake models depend on a number of parameters, which should be tuned in order to obtain accurate predictions. For the specific model used in this work, these tunable factors are the wake parameters α, β, k_a, k_b, a_d, and b_d and the turbulence model parameters TI_a, TI_b, TI_c, and TI_d (Bastankhah and Porté-Agel, 2016).

In this work, the parameters are first set to an initial value, either taken from the literature or identified with ad hoc measurements; these initial values are held fixed throughout the analysis and not changed further. Corrections to the initial values are then expressed as

\begin{matrix} (1) & k = k^{*} + p_{k}, \end{matrix}

where k is a model parameter, k^∗ its initial value, and p_k the correction. Although this is not strictly necessary, this redundant notation helps highlight the changes to the nominal model parameters obtained by the proposed procedure.

2.2 Model augmentation

The engineering model described earlier is a rather simple approximation of the flow through a wind power plant; its fidelity to reality, and consequently its predictive accuracy, is therefore bound to be limited. Even for more sophisticated future models, it is difficult to imagine that all relevant physics will ever be precisely accounted for. But even if such a model existed, in practice one might simply not have all necessary detailed information on the relevant boundary and operating conditions that would be required. For example, one might not know with precision the conditions of the vegetation around and within a wind farm, with its effects on roughness and, hence, on the flow characteristics. In other words, it is safe to assume that all models are in error to some extent and probably always will be.

To address this problem, the model can be pragmatically augmented with correction terms. Here one could take two alternative approaches: either a generic all-encompassing error term is added to the model or “surgical” errors are introduced at ad hoc locations in the model to target specific presumed deficiencies. The first approach could be treated with a brute-force parametric modeling approach, for example by using a neural network. Here, the second approach was used, as it allows for more insight into the nature of the identified corrections. The specific parametric corrections used in the present paper are reviewed next. It is clear that these are only some of the many corrections that could be applied to the present baseline model, so that the following does not pretend to be a comprehensive treatment of the topic. Nonetheless, results indicate that some of these corrections are indeed significant and provide for a marked improvement of the baseline model.

Nonuniform inflow. The inflow to a wind farm can exhibit spatial variability, mostly because of orographic and local effects, especially in complex terrain conditions. For example, commercial wind resource assessment tools include topographic speedup ratios customarily computed by CFD models (Jacobsen, 2019). In contrast to this established practice, no direct or equivalent modeling of orographic effects is at present available in engineering wake models. Another reason for inflow variability may be due to wind farm blockage effects (Bleeg et al., 2018). Indeed, current wake models such as the one used here assume that upstream turbines affect downstream ones through their wakes but do not model the effects of downstream machines on the upstream ones. In a wind farm, depending on the wind direction and cross-wind location considered, the number and operating state of downstream turbines vary, which may induce a cross-wind speed variability in the inflow.

To capture some of these effects, the model ambient flow speed V_∞ is expressed here as a function of height above ground Z, cross-wind lateral position Y, and ambient wind direction Γ as
$\begin{matrix} (2) & \begin{aligned} V_{\infty} (Y, Z, Γ) = \\ (1 + f_{augm, speed} (Y, Γ, c_{speed}, p_{speed})) \\ V_{\infty, 0} {(\frac{Z}{z_{h}})}^{α_{vs}}, \end{aligned} \end{matrix}$
where V_∞,0 is the reference (baseline uncorrected) ambient flow speed and z_h the reference height of the vertically sheared flow with exponent α_vs. Function $f_{augm, speed} (Y, Γ, c_{speed}, p_{speed})$ is the speed correction term. This function is defined in the 2D space $Y \in [Y_{min}, Y_{max}]$ , $Γ \in [Γ_{min}, Γ_{max}]$ . For each value of the ambient wind direction Γ, Y is a lateral coordinate orthogonal to it that spans the width of the farm; hence, by selecting Γ_min and Γ_max a lateral inflow nonuniformity can be modeled for a given sector or the whole wind rose of directions. The (Y,Γ) space is discretized into rectangular cells with corner nodes $c_{speed} = [\dots; (Y_{i}, Γ_{i}); \dots]$ (for an example, see Fig. 16). The corresponding unknown error nodal values are stored in vector p_speed, and bilinear shape functions interpolate the error in each cell based on the nodal values at its corners. Equation (2) could be extended to also include a longitudinal wind-aligned coordinate, similarly to the localized speedup ratios of Jacobsen (2019), to model wind farm blockage effects.
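As an illustration of Eq. (2), the following minimal sketch evaluates the corrected ambient speed with bilinear shape functions. It assumes a regular tensor-product grid of nodes in (Y, Γ), with the nodal values of p_speed stored in a 2D array, and it clamps queries to the edge of the grid; these storage and clamping choices, as well as the function names, are assumptions and are not taken from the implementation used in this work.

```python
import numpy as np

def bilinear_correction(y, gamma, y_nodes, gamma_nodes, p_speed):
    """Interpolate the nodal errors p_speed[i, j], defined at (y_nodes[i], gamma_nodes[j])."""
    y = np.clip(y, y_nodes[0], y_nodes[-1])              # clamp to the grid (assumption)
    gamma = np.clip(gamma, gamma_nodes[0], gamma_nodes[-1])
    # Index of the lower-left corner node of the cell containing (y, gamma)
    i = int(np.clip(np.searchsorted(y_nodes, y) - 1, 0, len(y_nodes) - 2))
    j = int(np.clip(np.searchsorted(gamma_nodes, gamma) - 1, 0, len(gamma_nodes) - 2))
    # Local cell coordinates in [0, 1]
    s = (y - y_nodes[i]) / (y_nodes[i + 1] - y_nodes[i])
    t = (gamma - gamma_nodes[j]) / (gamma_nodes[j + 1] - gamma_nodes[j])
    return ((1 - s) * (1 - t) * p_speed[i, j]
            + s * (1 - t) * p_speed[i + 1, j]
            + (1 - s) * t * p_speed[i, j + 1]
            + s * t * p_speed[i + 1, j + 1])

def ambient_speed(y, z, gamma, v_inf0, z_h, alpha_vs, y_nodes, gamma_nodes, p_speed):
    """Corrected ambient speed: (1 + f_augm,speed) * V_inf,0 * (Z / z_h)**alpha_vs."""
    f_speed = bilinear_correction(y, gamma, y_nodes, gamma_nodes, p_speed)
    return (1.0 + f_speed) * v_inf0 * (z / z_h) ** alpha_vs
```

With all nodal values set to zero, the function returns the baseline power-law profile, so the correction only changes the inflow where the data support a nonzero error.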

Local orographic effects and blockage may also induce variability in the wind direction Γ. Similarly, the vertical shear exponent α_vs and turbulence intensity I may vary, for example on account of nonuniform roughness induced by vegetation or other obstacles. To include these effects in the farm flow model, the baseline quantities are augmented as
$\begin{matrix} (3a) & Γ (Y) = Γ_{ref} + Y f_{augm, dir} (Γ_{ref}, c_{dir}, p_{dir}), \\ (3b) & α_{vs} (Γ) = α_{vs, ref} + f_{augm, shear} (Γ, c_{shear}, p_{shear}), \\ (3c) & I (Γ) = I_{ref} + f_{augm, I} (Γ, c_{I}, p_{I}) . \end{matrix}$
In all these expressions, (⋅)_ref indicates a baseline reference quantity, while function $f_{augm, (\cdot)}$ is a correction term. This function is defined on the 1D space $Γ \in [Γ_{min}, Γ_{max}]$ , discretized with nodes $c_{(\cdot)} = [\dots; Γ_{i}; \dots]_{(\cdot)}$ , using linear shape functions to interpolate the corresponding nodal values p_(⋅). Here again, by selecting Γ_min and Γ_max, corrections can be applied to the whole wind rose or just to a sector.
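A minimal sketch of Eqs. (3a)–(3c) is given below. It relies on `numpy.interp` for the linear shape functions, which holds the end-node value constant outside [Γ_min, Γ_max]. The sketch evaluates the shear and turbulence corrections at the reference direction; this choice, the function names, and the (nodes, values) tuple layout are assumptions made for illustration only.

```python
import numpy as np

def linear_correction(gamma, nodes, values):
    """1D correction f_augm interpolated linearly between nodal values."""
    return float(np.interp(gamma, nodes, values))

def augmented_inflow(y, gamma_ref, alpha_ref, ti_ref, dir_grid, shear_grid, ti_grid):
    """Return (Gamma, alpha_vs, I) following Eqs. (3a)-(3c).

    Each *_grid argument is a (nodes, nodal_values) pair over the wind direction.
    The direction correction is a gradient per unit lateral distance Y.
    """
    gamma = gamma_ref + y * linear_correction(gamma_ref, *dir_grid)      # Eq. (3a)
    alpha_vs = alpha_ref + linear_correction(gamma_ref, *shear_grid)     # Eq. (3b)
    ti = ti_ref + linear_correction(gamma_ref, *ti_grid)                 # Eq. (3c)
    return gamma, alpha_vs, ti
```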
Secondary steering. By misaligning a wind turbine rotor with respect to the incoming flow direction, the rotor thrust force is tilted, thereby generating a cross-flow force that laterally deflects the wake. As shown with the help of numerical simulations by Fleming et al. (2018), this cross-flow force induces two counter-rotating vortices that, combining with the wake swirl induced by the rotor torque, lead to a curled wake shape. As observed experimentally by Wang et al. (2018), the effects of these vortices result in additional lateral flow speed components, which are not limited to the wake itself but also extend outside of it. By this phenomenon, the flow direction within and around a deflected wake is tilted with respect to the upstream undisturbed direction. Therefore, when a turbine is operating within or close to a deflected wake, its own wake undergoes a change of trajectory – termed secondary steering – induced by the locally modified wind direction. Although models of this phenomenon are being developed (Martínez-Tossas et al., 2019), they significantly increase the computational cost and are not yet available in standard implementations of engineering wake models such as the one used here.

The change of wind direction ΔΓ at a downstream turbine induced by secondary steering (indicated by the subscript ss) is modeled here as
$\begin{matrix} (4) & Δ Γ (y) = f_{augm, ss} (\tilde{y}, Γ_{init}, p_{ss}), \end{matrix}$
where f_augm,ss is the correction term and $\tilde{y} = Y - y_{wc}$ is the lateral distance to the wake centerline (see Fig. 1), defined in the baseline wind farm model as the locus of the points of minimum flow speed. According to the notation used in Eq. (6.12) of Bastankhah and Porté-Agel (2016), Γ_init indicates the initial wake direction of the closest upstream turbine. The correction term is expressed as the difference of two Gaussian functions and more precisely
$\begin{matrix} (5) & \begin{aligned} f_{augm, ss} (\tilde{y}, Γ_{init}, p_{ss}) = \\ Γ_{init} (p_{ss, 1} \exp (- 0.5 {(\frac{\tilde{y} + sgn (Γ_{init}) p_{ss, 3}}{p_{ss, 2}})}^{2}) \\ - p_{ss, 4} \exp (- 0.5 {(\frac{\tilde{y} + sgn (Γ_{init}) p_{ss, 6}}{p_{ss, 5}})}^{2})), \end{aligned} \end{matrix}$
where $p_{ss} = (p_{ss, 1}, p_{ss, 2}, p_{ss, 3}, p_{ss, 4}, p_{ss, 5}, p_{ss, 6})$ is the vector of free parameters: parameters 1 and 4 are related to the amplitude, 2 and 5 to the standard deviation, and 3 and 6 to the location of the correction functions. Since the Gaussian functions are not centered at the wake centerline and the effect of secondary steering is assumed to be symmetric with respect to the misalignment angle, the correction term also depends on the direction of wake deflection sgn(Γ_init).

This particular choice of the shape functions is motivated by the results shown in Fig. 8b of Wang et al. (2018). Indeed, LES simulations and measurements reveal the presence of a stronger lateral velocity component directed towards the wake on the leeward side of the wake itself, and of an opposite and weaker lateral component on the windward side. Such a distribution can be approximated by two Gaussian functions using Eq. (5).

Note that the change in local wind direction also leads to a slight lateral deflection of the nonuniform wind farm inflow introduced previously. More precisely, for a turbine that is located ΔX behind an upstream turbine, the nonuniform inflow expressed by Eq. (2) is evaluated at Y + ΔX sin(ΔΓ) instead of Y.
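The secondary steering correction of Eq. (5) and the lateral shift just described can be sketched as follows. Reading the roles of the parameters directly from Eq. (5), entries 1 and 4 scale the two Gaussians, entries 2 and 5 appear in the denominators (widths), and entries 3 and 6 shift the functions away from the wake centerline. Angles are assumed to be in radians, and the function names are hypothetical.

```python
import numpy as np

def secondary_steering(y_tilde, gamma_init, p_ss):
    """Change of wind direction, Eq. (5): difference of two shifted Gaussians."""
    p1, p2, p3, p4, p5, p6 = p_ss
    side = np.sign(gamma_init)          # direction of the wake deflection
    g1 = p1 * np.exp(-0.5 * ((y_tilde + side * p3) / p2) ** 2)
    g2 = p4 * np.exp(-0.5 * ((y_tilde + side * p6) / p5) ** 2)
    return gamma_init * (g1 - g2)

def shifted_lateral_position(y, delta_x, delta_gamma):
    """Lateral coordinate at which Eq. (2) is evaluated for a turbine delta_x downstream."""
    return y + delta_x * np.sin(delta_gamma)
```

By construction, mirroring both the lateral offset and the initial wake direction flips the sign of the correction, which reflects the symmetry assumed with respect to the misalignment angle.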

Figure 1a shows the hub height flow speed for two wind turbines modeled in FLORIS, with the turbine rotor disks being indicated with thick black lines. The wake centerlines and the undisturbed free-stream wind direction are indicated by black dotted and dashed lines, respectively. The upstream turbine is misaligned with respect to the incoming flow, and therefore its wake is deflected laterally. Using the baseline wake model, the downstream turbine wake develops along the free-stream wind direction. Panel (b) of the same figure shows the effects of the secondary steering correction term given by Eq. (5). The plot clearly shows that the downstream turbine wake path is affected by the locally changed wind direction.
Non-Gaussian wake and flow acceleration. Engineering wake models are based, among other hypotheses, on assumed shapes of the speed deficit. For example, the present baseline model assumes a Gaussian distribution of the speed deficit within the wake. Another assumption is that the flow outside the wake is undisturbed and equal to the free stream. However, these assumptions are not always exactly satisfied, as already observed by Xie and Archer (2017) and Martínez-Tossas et al. (2019), among others. For example, aisle jets are local accelerations of the flow outside of the wake, produced by local blocking in the neighborhood of an operating turbine. It has been reported that aisle jets can induce local flow speedups in excess of 10 % of the undisturbed inflow (Dörenkämper et al., 2015).

To account for such effects, the wake velocity V_wake of the baseline model is corrected as
$\begin{matrix} (6) & \begin{aligned} V_{wake} (d_{wc}) = \\ V_{wake, FLORIS} (d_{wc}) (1 + f_{augm, acc} (d_{wc}, c_{acc}, p_{acc})), \end{aligned} \end{matrix}$
where V_wake,FLORIS is the baseline Gaussian wake speed profile, d_wc is the absolute distance to the wake center (which, at hub height, is equivalent to $|\tilde{y}|$ ), and f_augm,acc represents the correction term, which – similarly to the previous corrections – is modeled with linear shape functions characterized by node locations c_acc (in terms of d_wc) and nodal values p_acc.
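To illustrate Eq. (6), the sketch below multiplies a baseline wake speed by a piecewise-linear correction in the distance to the wake center. The Gaussian profile is only a hypothetical stand-in for V_wake,FLORIS and does not reproduce the FLORIS model; the function names, and the clamping of the correction beyond the last node (the behavior of `numpy.interp`), are likewise assumptions.

```python
import numpy as np

def gaussian_wake_speed(d_wc, v_inf, deficit, sigma):
    """Hypothetical stand-in for the baseline profile V_wake,FLORIS (not the FLORIS code)."""
    return v_inf * (1.0 - deficit * np.exp(-0.5 * (d_wc / sigma) ** 2))

def corrected_wake_speed(d_wc, v_baseline, c_acc, p_acc):
    """Eq. (6): V_wake = V_wake,FLORIS * (1 + f_augm,acc(d_wc, c_acc, p_acc))."""
    f_acc = np.interp(d_wc, c_acc, p_acc)   # linear shape functions between nodes c_acc
    return v_baseline * (1.0 + f_acc)
```

A positive nodal value placed just outside the wake, with zeros elsewhere, would for example represent an aisle jet, whereas nonzero values near the centerline would reshape the deficit away from a pure Gaussian.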
Reduced power extraction due to nonuniform wind turbine inflow. Numerical simulations conducted in FAST (Jonkman and Jonkman, 2018) using its blade element momentum (BEM) implementation yielded a slight reduction in the rotor power coefficient for horizontally sheared flow, when compared to unsheared conditions with the same hub wind speed. Even though BEM can only give a rough indication for such an effect, a correction of the power coefficient of the baseline model is introduced here in the form
$\begin{matrix} (7) & C_{P} = C_{P, κ = 0} (1 + p_{κ} κ^{2}), \end{matrix}$
where $C_{P, κ = 0}$ is the nominal power coefficient, κ the equivalent horizontal linear shear coefficient on the rotor disk, and p_κ the free correction parameter. The linear shear κ is either due to a lack of lateral uniformity of the inflow or due to the impingement of a wake, and it is evaluated accordingly within the farm model.
Wind-speed-dependent power loss in yaw misalignment. The baseline formulation models the power extraction of a misaligned wind turbine using the cosine law $C_{P} (γ) = C_{P} \cos (γ)^{p_{P}}$ , where C_P is the power coefficient of the wind-aligned turbine, γ the misalignment angle with respect to the local flow direction, and p_P the power loss exponent. Different power loss exponents have been reported in the literature, ranging from the value of 1.4 found by Fleming et al. (2017) to 1.8 according to Schreiber et al. (2017), 1.9 for Gebraad et al. (2015), and all the way to the ideal value of 3 that is expected if only the rotor-orthogonal ambient flow component contributes to power extraction (Boersma et al., 2019). In addition, p_P might also depend on the regulation strategy used by the turbine controller. Here, the power coefficient in misaligned operation is augmented as
$\begin{matrix} (8) & C_{P} (γ) = C_{P} \cos {(γ - p_{P 0})}^{p_{P} + p_{P, a} (V - V_{rated}) + p_{P, b}}, \end{matrix}$
where C_P is the power coefficient of the flow-aligned turbine (possibly reduced by shear effects, as argued above), p_P0 is the misalignment angle at which the turbine produces maximum power, and V and V_rated are, respectively, the rotor effective and rated wind speeds. Finally, p_P is the baseline exponent, while p_P,a and p_P,b are free parameters that model a linear wind speed dependency of the cosine law.
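The two power coefficient corrections of Eqs. (7) and (8) can be combined as in the following minimal sketch. It treats p_P0 as the misalignment angle of maximum power, so that the cosine argument is γ − p_P0; this sign convention, the use of radians, and the function names are assumptions. The expression is only meaningful while the cosine remains positive, i.e. for |γ − p_P0| < 90°.

```python
import math

def cp_sheared(cp_nominal, kappa, p_kappa):
    """Eq. (7): power coefficient under horizontal linear shear kappa."""
    return cp_nominal * (1.0 + p_kappa * kappa ** 2)

def cp_misaligned(cp_aligned, gamma, v, v_rated, p_p, p_p0=0.0, p_pa=0.0, p_pb=0.0):
    """Eq. (8): cosine law with an offset angle and a wind-speed-dependent exponent."""
    exponent = p_p + p_pa * (v - v_rated) + p_pb
    return cp_aligned * math.cos(gamma - p_p0) ** exponent

# Usage: shear correction first, then misalignment (parameter values are illustrative only)
cp = cp_misaligned(cp_sheared(0.45, kappa=0.1, p_kappa=-0.5),
                   gamma=math.radians(20.0), v=8.0, v_rated=11.0, p_p=1.8)
```

With all correction parameters at zero, the function reduces to the baseline cosine law, which makes the identified values of p_P0, p_P,a, and p_P,b directly interpretable as departures from it.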

Figure 1. Effect of secondary steering on the trajectory of a downstream turbine. (a) Baseline wake model; (b) baseline model augmented with the empirical correction term of Eq. (5).
