Analysing uncertainties in offshore wind farm power output using measure–correlate–predict methodologies

Mifsud, Michael Denis; Sant, Tonio; Farrugia, Robert Nicholas

doi:https://doi.org/10.5194/wes-5-601-2020

Articles | Volume 5, issue 2

https://doi.org/10.5194/wes-5-601-2020

Special issue:

Wind Energy Science Conference 2019

https://doi.org/10.5194/wes-5-601-2020

Articles | Volume 5, issue 2

Research article

26 May 2020

Research article |

| 26 May 2020

Analysing uncertainties in offshore wind farm power output using measure–correlate–predict methodologies

Michael Denis Mifsud, Tonio Sant, and Robert Nicholas Farrugia

Abstract

This paper investigates the uncertainties resulting from different measure–correlate–predict (MCP) methods to project the power and energy yield from a wind farm. The analysis is based on a case study that utilises short-term data acquired from a lidar wind measurement system deployed at a coastal site in the northern part of the island of Malta and long-term measurements from the island's international airport. The wind speed at the candidate site is measured by means of a lidar system. The predicted power output for a hypothetical offshore wind farm from the various MCP methodologies is compared to the actual power output obtained directly from the input of lidar data to establish which MCP methodology best predicts the power generated.

The power output from the wind farm is predicted by inputting wind speed and direction derived from the different MCP methods into windPRO^® (https://www.emd.dk/windpro, last access: 8 May 2020). The predicted power is compared to the power output generated from the actual wind and direction data by using the normalised mean absolute error (NMAE) and the normalised mean-squared error (NMSE). This methodology will establish which combination of MCP methodology and wind farm configuration will have the least prediction error.

The best MCP methodology which combines prediction of wind speed and wind direction, together with the topology of the wind farm, is that using multiple linear regression (MLR). However, the study concludes that the other MCP methodologies cannot be discarded as it is always best to compare different combinations of MCP methodologies for wind speed and wind direction, together with different wake models and wind farm topologies.

Download & links

Article (PDF, 9623 KB)

Download & links

How to cite.

Received: 23 Nov 2019 – Discussion started: 02 Dec 2019 – Revised: 22 Feb 2020 – Accepted: 11 Apr 2020 – Published: 26 May 2020

1 Introduction

The measure–correlate–predict (MCP) methodology introduces uncertainty due to its inherent statistical nature. Recent developments have seen the introduction of new computational regression techniques such as artificial neural networks (ANNs) and machine learning, which include decision trees (DTs) and support vector regression (SVR). In a previous study, light detection and ranging (lidar) data were used to compare the results of the various regression methodologies at different lidar measurement heights (Mifsud et al., 2018), with the reference site being Malta International Airport (MIA), Luqa, and the candidate site being a coastal watch tower at Qalet Marku on the northern part of the island. This study uses the same wind data for the year 2016 to construct the MCP models. However, this time the prediction is carried out for both wind speed and wind direction. Wind speed and direction are then predicted for the period June–December 2015. This is done for the different MCP models. The predicted wind speed and wind direction time series are then fed into a wind farm model implemented in windPRO^® version 2.7 to model the overall energy yield, considering wake losses. The power output for various wind farm configurations is obtained for each methodology. As the lidar is sited on the roof of a coastal tower, at a height of 20 m above mean sea level, the wind data measured at a height of 80 m would be equivalent to a wind turbine (WT) hub height of 100 m above the sea surface.

The power output in each case is compared to that obtained when the actual wind data are fed to the wind farm model. Thus, the NMAE, the NMSE and the percentage error in the overall energy yield are compared for the various methodologies and wind farm topologies. This is therefore a study about the uncertainties introduced by the various statistical methods, which are then further complicated by the wind farm layout. It is innovative due to the use of an MCP methodology to predict both the wind speed and the wind direction. The following literature review describes different MCP methodologies, four of which are then used in the prediction of wind speed and wind direction. The wake models are also described. This is followed by a description of the methodology used in the study, together with a description of the hypothetical wind farm used as a basis for this study. Finally, the results are presented and discussed.

2 Literature review

The first MCP methods estimated the mean long-term annual wind speed (Carta et al., 2013). MCP methods later made use of simple linear regression (SLR) (Rogers et al., 2005a) to establish a relationship between hourly wind characteristics of the candidate and the reference sites. A multiple linear regression is a regression model that involves more than one regressor variable (Montgomery et al., 2006). The regression is carried out using concurrent wind speed and wind direction data at the reference and the candidate sites. The reference site is normally the closest meteorological station, e.g. airports, and the candidate site is the location chosen for the wind farm. When the model is created, hence establishing a relationship between the wind speed at both sites, the long-term wind data at the reference site can be used to predict the long-term wind speed at the candidate site. More recent models established non-linear-type relationships (Clive, 2004; Carta and Velazquez, 2011) by employing statistical learning (Hastie et al., 2009). Amongst these are algorithms such as artificial neural networks (ANNs) (Bilgili et al., 2007; Monfared et al., 2009) and the more recent machine-learning (ML) techniques, which include support vector regression (SVR) (Oztopal 2006; Zhao et al., 2010; Scholkopf and Smola, 2002; Alpaydin, 2010) and decision trees (DTs) (James et al., 2015; Alpaydin, 2010).

A study (Carta et al., 2013) reviewed many MCP methodologies. These included the method of ratios, first-order linear regression, higher-than-first-order linear methods, non-linear methods and probabilistic methods. The authors were also concerned with the uncertainties associated with MCP methodologies and argued that users of MCP methodologies have little information with which to determine the uncertainty of the methodology. One methodology to measure this uncertainty is to use the full set of data from the concurrent period to train the model and assess its quality.

Another study by Rogers compared four different MCP methodologies (Rogers et al., 2005a). These included a linear regression model, the distributions of ratios of the wind speeds at the two sites, an SVR model and another method based on the ratio of the standard deviations of the two data sets. The authors concluded that SVR gave the best results. In a different study, the same authors (Rogers et al., 2005b) also analysed the uncertainties introduced with the use of MCP techniques. They concluded that linear regression methodologies could seriously underestimate uncertainties due to serial correlation of data. Another study shows that a proper assessment of uncertainty is critical for judging the feasibility and risk of a potential wind farm development, and the authors describe the risk of oversimplifying and assuming uncertainties (Lackner et al., 2012).

A hybrid MCP method (Zhang et al., 2014), which involved adding different weights depending on the distance and elevation of the candidate site to the reference sites, was applied to the input of five MCP methodologies. The methods used consisted of the linear regression, variance ratio, Weibull scale, ANNs and SVR. The results were assessed in terms of metrics such as the mean-squared error and mean absolute error. Other authors (Perea et al., 2011) evaluated three methodologies. One method included a linear regression, which was derived from the bivariate normal joint distribution and the Weibull regression method. The other method was based on conditional probability density functions applied to the joint distributions of the reference and the candidate sites. The results from these two methodologies were in turn compared to SVR. Although the conclusion was that the SVR method predicted all the parameters very accurately, the probability density function based on the Weibull distribution was better in terms of prediction accuracy.

The ability of ANNs to recognise patterns in complex data sets means that they can also be used to correlate and predict wind speed and wind direction (Zhang et al., 2014). A neural network contains an input layer, one or more hidden layers of neurons and an output layer. A learning process updates the weights of the interconnections and biases between the neurons in the various layers. The Levenberg–Marquardt (Principe et al., 2000) algorithm may be used for this purpose. The regression is performed by means of feed-forward networks (Alpaydin, 2010) with multilayer perceptrons (MLPs).

Another study (Velazquez et al., 2011) utilised wind speed and direction from various reference stations. These were introduced into the input layer of an ANN. It was concluded that when wind direction was used as an angular magnitude to the input signal, the model gave better results. Estimation errors also decreased as the number of reference stations was increased. The authors concluded that ANNs are superior to other methods for predicting long-term wind data.

The use of ANNs for long-term predictions was also investigated by Bechrakis et al. (2004) using wind speed and direction measurements from just one reference station and compared these to standard MCP algorithms. This resulted in an improved prediction accuracy of 5 % to 12 %. Unfortunately, many models that use various reference stations use only the recorded wind speeds as input. The topologies of the ANNs used have only a single neuron in the input layer, with the output signal being the wind speed at the candidate site (Monfared et al., 2009; Oztopal, 2006; Bilgili et al., 2009).

Data from meteorological stations possessing long measurement periods provide a large number of potential inputs for MCP methods. Apart from wind speed and direction, inputs can also include other climatological variables such as air temperature, relative humidity and atmospheric pressure. Hence, a multivariate MCP methodology may be utilised (Patane et al., 2011). This technique considers all the inputs and extracts the maximum amount of information at the sites. Since some input variables may be intercorrelated, or may not provide information about the target site wind characteristics, the methodology is a two-stage process. Input variables are analysed, and those that contain little or redundant information about the candidate site wind characteristics are discarded, after which a multivariate regression is performed. It was concluded from the results of the tests made that the methodology was more accurate than standard MCP methods, with the quality of the estimation of the long-term wind resource increasing by 19 %.

SVR is the adaptation of support vector machines to the regression problem. This technique was developed by Vapnik (Vapnik, 1995; Vapnik et al., 1998) to solve classification problems. SVR (Alpaydin, 2010) is popular within the renewable energy community since it is a unique way to construct smooth and non-linear regression approximations (Diaz et al., 2017). The analysis of MCP models using SVR techniques shows that SVR is one of the techniques which best represents the ML state of the art (Diaz et al., 2017). This is not only due to its prediction capability, but also to its property of universal approximation to any continuous function and an efficient and stable algorithm that provides a unique solution to the estimation problem (Diaz et al., 2017). Different hyperparameters were used to study the SVR methodology. Other studies describe how SVR may be adapted to wind speed prediction (Zhao et al., 2010).

Another recent study shows the importance of DTs in improving the regression results for MCP (Diaz et al., 2018). The study applied five different MCP techniques to mean hourly wind speed and direction, together with air density, using the data from 10 weather stations in the Canary Islands. The study showed that the models using SVR and DTs provided better results than ANNs. A DT is a hierarchical data structure which implements the “divide and conquer” rule, and it may also be applied to the regression problem (Hastie et al., 2009; Alpaydin, 2010; James et al., 2015).

The use of lidar for wind resource assessment (Probst and Cardenas, 2010) shows a distinct advantage of this method over the traditional cup and wind vane measurements. This is demonstrated by studies carried out using different MCP methods such as SLR and ratio analysis. However, no analysis with ANNs, DTs or SVR is carried out. A more recent study (Mifsud et al., 2018), which utilised the same data as this current study, analysed the accuracy of different MCP methodologies and their capability according to lidar measurement height. The study concluded that the MCP accuracy depended on both methodology and measurement height at the candidate site. Other studies using lidar at the same measurement site were also carried out. These analysed the turbulent behaviour of the wind data (Cordina et al., 2017).

The issue of wake losses in a wind farm has been described by several authors and can be minimised by optimising the layout of the wind farm (Manwell et al., 2009). A short literature review of wake models is now presented.

Wake models are classified into four categories (Manwell et al., 2009) which are surface roughness models (Bossanyi et al., 1980), semi-empirical models (Lissaman and Bates, 1977; Vermeulen, 1980), eddy viscosity models (Ainslie, 1985) and Navier–Stokes solutions (Crespo and Hernandez, 1986, 1993). A review of wind turbine wake models (Sanderse, 2009) shows the effects of reduced power production due to lower incident wind speed and the effect on the wind turbine rotors due to increased turbulence. The author presents a number of reasons on why the focus on numerical simulation is preferred to experimentation; this is mainly due to the use of computational fluid dynamics (CFD). One study presents the mathematical theory behind a simple wake model and that for a multiple wake model (Gonzalez-Longatt et al., 2012) while another study (Churchfield, 2013) describes a hierarchy of wake models ranging from the empirical to large-eddy simulation (LES). Some of the models compared include Ainslie's model (Ainslie, 1985), Frandsen's model (Frandsen, 2005) and Jensen's model (Jensen, 1983). The dynamic wake meandering model is another method which is described (Larsen et al., 2008) and also validated (Larsen et al., 2013) in a study carried out on the Egmond aan Zee offshore wind farm. Another study (Barthelmie et al., 2006) compares wake model simulations for offshore wind farms, with the wake profiles measured by sonic detection and ranging (sodar). In this case, the models gave a wide range of predictions, and it was not possible to identify a model with superior projections with respect to the measurements.

https://www.wind-energ-sci.net/5/601/2020/wes-5-601-2020-f01

Figure 1Difference between the meteorological wind direction and the mathematical wind direction and the component of the wind vector.

ANN	Artificial neural network
CFD	Computational fluid dynamics
DTs	Decision trees
Lidar	Light detection and ranging
LES	Large-eddy simulation
MCP	Measure–correlate–predict
MIA	Malta International Airport
MLR	Multiple linear regression
MLP	Multilayer perceptron
MSE	Mean-squared error
NMAE	Normalised mean absolute error
NMSE	Normalised mean-squared error
SLR	Simple linear regression
Sodar	Sonic detection and ranging
SVR	Support vector regression
WT	Wind turbine
V_i	Magnitude of wind speed in metres per second
$e_{{norm}_{i}}$	Normalised residual
e_eng	Percentage error in energy yield
e_i	Residual, MW
$u_{i_{p}}$	Predicted component of wind speed vector in the easterly direction at the candidate site in metres per second
$u_{i_{ref}}$	Component of wind speed vector in the easterly direction at the reference site in metres per second
$u_{i_{ref}}$	Component of wind speed vector in the easterly direction at the reference site in metres per second
u_i	Component of wind speed vector in the easterly direction in metres per second
$v_{i_{can}}$	Component of wind speed vector in the northerly direction at the candidate site in metres per second
$v_{i_{p}}$	Predicted component of wind speed vector in the northerly direction at the candidate site in metres per second
$v_{i_{ref}}$	Component of wind speed vector in the northerly direction at the reference site in metres per second
v_i	Component of wind speed vector in the northerly direction in metres per second
z₀	Surface roughness
V_i	Wind speed vector (speed in metres per second and wind direction in degrees)
$θ_{{math}_{i_{p}}}$	Predicted mathematical wind direction at the candidate site in degrees
$θ_{{met}_{i_{p}}}$	Predicted meteorological wind direction at the reference site in degrees
$θ_{{met}_{can}}$	Meteorological wind direction at the candidate site in degrees
$θ_{{met}_{ref}}$	Meteorological wind direction at the reference site in degrees
θ_math	Mathematical wind direction
θ_met	meteorological wind direction
D	Wind turbine diameter, m
N	Number of data points
P	Predicted power output from wind farm, MW
P_act	Actual power output from wind farm, MW

Analysing uncertainties in offshore wind farm power output using measure–correlate–predict methodologies

4.1 The reference and candidate sites

4.2 The available wind data

4.3 The wind farm design in windPRO®

6.1 Wind speed and wind direction with MCP methodology

6.1.1 Wind speed with MCP methodology

6.1.2 Wind direction with MCP methodology

6.2 Wind farm power output with MCP methodology, for a wind farm capacity of 250 MW

6.3 The actual wind data for 2015 measured by the lidar system

6.4 Wind speed and direction predicted using the MCP methodologies

4.3 The wind farm design in windPRO^®