Research article 31 Oct 2020
Operational-based annual energy production uncertainty: are its components actually uncorrelated?
 National Renewable Energy Laboratory, Golden, Colorado, USA
Correspondence: Mike Optis (mike.optis@nrel.gov)
Calculations of annual energy production (AEP) from a wind power plant – whether based on preconstruction or operational data – are critical for wind plant financial transactions. The uncertainty in the AEP calculation is especially important in quantifying risk and is a key factor in determining financing terms. A popular industry practice is to assume that different uncertainty components within an AEP calculation are uncorrelated and can therefore be combined as the sum of their squares. We assess the practical validity of this assumption for operational-based uncertainty by performing operational AEP estimates for more than 470 wind plants in the United States, mostly in simple terrain. We apply a Monte Carlo approach to quantify uncertainty in five categories: revenue meter data, wind speed data, regression relationship between density-corrected wind speed (from reanalysis data) and measured wind power, length of long-term-correction data set, and future interannual variability. We identify correlations between categories by comparing the results across all 470 wind plants. We observe a positive correlation between interannual variability and the linearized long-term correction; a negative correlation between wind resource interannual variability and linear regression; and a positive correlation between reference wind speed uncertainty and linear regression. Then, we contrast total operational AEP uncertainty values calculated by omitting and considering correlations between the uncertainty components. We quantify that ignoring these correlations leads to an underestimation of total AEP uncertainty of, on average, 0.1 % and as large as 0.5 % for specific sites. Although these are not large increases, these would still impact wind plant financing rates; further, we expect these values to increase for wind plants in complex terrain.
Based on these results, we conclude that correlations between the identified uncertainty components should be considered when computing the total AEP uncertainty.
This work was authored by the National Renewable Energy Laboratory, operated by Alliance for Sustainable Energy, LLC, for the US Department of Energy (DOE) under contract no. DE-AC36-08GO28308. Funding provided by the US Department of Energy Office of Energy Efficiency and Renewable Energy Wind Energy Technologies Office. The views expressed in the article do not necessarily represent the views of the DOE or the US Government. The US Government retains and the publisher, by accepting the article for publication, acknowledges that the US Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for US Government purposes.
Calculations of wind plant annual energy production (AEP) – whether based on preconstruction data before a wind power plant is built or on operational data after a wind plant has started its operations – are vital for wind plant financial transactions. Preconstruction estimates of AEP are needed to secure and set the terms for new project financing, whereas operational estimates of long-term AEP are required for important wind plant transactions, such as refinancing, purchasing/selling, and mergers/acquisitions. The need for AEP analyses of wind plants is increasing because global wind capacity increased to 539 GW in 2017, representing 11 % and 91 % increases over 1- and 5-year periods, respectively; capacity is expected to increase by another 56 %, to 841 GW, by 2022 (Global Wind Energy Council, 2018). In the United States, wind plants generated more than 300 000 GWh in 2019, about 7.5 % of the total US electricity generation from utility-scale facilities that year, with a 50 % increase over a 6-year period (Energy Information Administration, 2020).
This rapid growth of the wind energy industry is putting an increased spotlight on the accuracy and consistency of AEP calculations. For preconstruction AEP estimates, there has been considerable movement toward standardization. The International Electrotechnical Commission (IEC) is currently developing a standard (IEC 61400-15:draft, 2020), and there have long been guidance and best practices available (Brower, 2012). By contrast, long-term operational AEP estimates do not have such extensive guidance or standards. Only limited standards covering operational analyses exist; IEC 61400-12-1:2017 (2017) addresses turbine power curve testing, and IEC 61400-26-3:2016 (2016) addresses the derivation and categorization of availability loss metrics. However, to our knowledge, there are no standards and very limited published guidance on calculating long-term AEP from operational data. Rather, documentation seems to be limited to a consultant report (Lindvall et al., 2016), an academic thesis (Khatab, 2017), and limited conference proceedings (Cameron, 2012; Lunacek et al., 2018).
Documentation and standards for preconstruction AEP methods are of limited use for operational-based AEP methods given the many differences between the two approaches. In general, operational AEP calculations are simpler than preconstruction estimates because actual measurements of wind plant power production at the revenue meter replace the complicated preconstruction estimate process (e.g., meteorological measurements, wind and wake-flow modeling, turbine performance, estimates of wind plant losses). However, the two methods do share several similarities, including regression relationships between on-site measurements and a long-term wind speed reference, the associated long-term (windiness) correction applied to the on-site measurements, estimates of future interannual variability (IAV), and estimates of uncertainty in the resulting AEP calculation. The shared components between operational AEP calculations and preconstruction estimates (IEC 61400-15:draft, 2020) are listed in Table 1.
The uncertainty values from each component listed in Table 1 must be combined to produce a total estimate of AEP uncertainty. While general guidelines on how to combine (measurement) uncertainty components exist (JCGM 100:2008, 2008) and can be applied to this task, we found no specific guidance in the literature for combining uncertainty components in an operational AEP estimate. On the other hand, considerable guidance exists for combining preconstruction AEP uncertainties (Lackner et al., 2008; Brower, 2012; Vaisala, 2014; Kalkan, 2015; Clifton et al., 2016). In every case, recommended best practices assume that all uncertainties, σ_{i}, are uncorrelated and can therefore be combined using a sum of squares approach to give the total AEP uncertainty, σ_{tot,uncorr}:
$$\begin{array}{}\text{(1)}& {\sigma}_{\mathrm{tot},\mathrm{uncorr}}=\sqrt{\sum _{i}{\sigma}_{i}^{\mathrm{2}}}.\end{array}$$
To better understand how uncertainties are combined in long-term operational AEP calculations, we reached out to several wind energy consultants who regularly perform these analyses. These conversations revealed that uncertainties in a long-term operational AEP calculation are also assumed uncorrelated and combined using Eq. (1).
1.1 Goal of study
The purpose of this study is to examine the extent to which the assumption of uncorrelated uncertainties – and, therefore, the combination of those uncertainties through a sum of squares approach – is accurate and appropriate for operational AEP calculations. Specifically, this study aims to identify potential correlations between AEP uncertainty components using data for over 470 wind plants. While in the analysis we focus on operational AEP calculations, we expect that the results from this analysis – namely, the potential identification of correlated uncertainty components – can be equally relevant for informing and improving preconstruction AEP methods.
In Sect. 2, we first describe the data sources used in this analysis (wind plant operational data and reanalysis products), the Monte Carlo approach to quantify single uncertainty components in operational AEP, and the approaches used to combine these uncertainty components. Section 3 presents the main results of our analysis in terms of uncertainty contributions and correlations among the different components. We conclude and suggest future work in Sect. 4.
2.1 Wind plant operational data and reanalysis products
Operational wind plant energy production data for this analysis are obtained from the publicly available Energy Information Administration (EIA) 923 database (EIA, 2018). This database provides reporting of monthly net energy production from all power plants in the United States, including wind plants. More than 670 unique wind plants are available from this data set.
Long-term wind speed data (needed to perform the long-term or windiness correction in an AEP estimate) are used from three reanalysis products over the period of January 1997 through December 2017:

Version 2 of the Modern-Era Retrospective analysis for Research and Applications (MERRA-2) (Gelaro et al., 2017). We specifically use the M2T1NXSLV data product which provides diagnostic wind speed at 50 m above ground level (a.g.l.), interpolated from the lowest model level output (on average about 32 m a.g.l.), using Monin–Obukhov similarity theory^{1}. Data are provided at an hourly time resolution.

The European Reanalysis Interim (ERA-Interim) data set (Dee et al., 2011). We specifically use output at the 58th model level, which on average corresponds to a height of about 72 m a.g.l. Data are provided at a 6-hourly time resolution.

The National Centers for Environmental Prediction v2 (NCEP-2) data set (Saha et al., 2014). We specifically use diagnostic wind speed data at 10 m a.g.l. Data are provided at a 6-hourly time resolution.
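The wind speed data are density corrected at their native time resolutions to correlate more strongly with wind plant power production (i.e., higher-density air in winter produces more power than lower-density air in summer, wind speed being the same):
$$\begin{array}{}\text{(2)}& {U}_{\mathrm{dens},\mathrm{corr}}=U{\left(\frac{\rho }{{\rho }_{\mathrm{mean}}}\right)}^{\mathrm{1}/\mathrm{3}},\end{array}$$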
where U_{dens,corr} is the density-corrected wind speed, U is the wind speed, ρ is air density (calculated at the same height as wind speed), ρ_{mean} is the mean density over the entire period of record of the reanalysis product, and the exponent 1∕3 is derived from the basic relationship between wind power and wind speed cubed (Manwell et al., 2010). To calculate air density at the same height as wind speed, we first extrapolate the reported surface pressure to the wind speed measurement height, assuming hydrostatic equilibrium (ISO 2533:1975, 1975):
$$\begin{array}{}\text{(3)}& p={p}_{\mathrm{surf}}\mathrm{exp}\left(-\frac{gz}{R{T}_{\mathrm{avg}}}\right),\end{array}$$
where p is the pressure at the wind speed measurement height, p_{surf} is the surface pressure, g is the acceleration caused by gravity, z is the wind speed measurement height, R is the gas constant, and T_{avg} is the average temperature between the reported value at 2 m a.g.l. and that at the wind speed measurement height.
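As a minimal sketch of the density correction described above, the following Python functions apply the hydrostatic pressure extrapolation and the cube-root density scaling. The function names, the dry-air gas constant, and the use of the ideal gas law to obtain density from pressure and temperature are illustrative assumptions; the text does not specify these details.

```python
import math

G = 9.80665         # gravitational acceleration, m s^-2
R_DRY_AIR = 287.05  # specific gas constant of dry air, J kg^-1 K^-1 (assumed value)

def pressure_at_height(p_surf, z, t_2m, t_z):
    """Extrapolate surface pressure (Pa) to height z (m) under hydrostatic
    equilibrium, using the mean of the 2 m and height-z temperatures (K)."""
    t_avg = 0.5 * (t_2m + t_z)
    return p_surf * math.exp(-G * z / (R_DRY_AIR * t_avg))

def air_density(p, t):
    """Dry-air density (kg m^-3) from the ideal gas law (an assumption here)."""
    return p / (R_DRY_AIR * t)

def density_corrected_speeds(speeds, densities):
    """Scale each wind speed by (rho / rho_mean)^(1/3), where rho_mean is the
    mean density over the whole record passed in."""
    rho_mean = sum(densities) / len(densities)
    return [u * (rho / rho_mean) ** (1.0 / 3.0)
            for u, rho in zip(speeds, densities)]
```

For equal wind speeds, a sample with above-average density is scaled up and one with below-average density is scaled down, which is the intended winter/summer effect.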
To lessen the impact of limited and/or poor-quality data on the results of our analysis, we filter for wind plants with a moderate-to-strong correlation with all three reanalysis products (R^{2}>0.6). About 25 % of the EIA wind plants are discarded with this filter. We also impose a threshold of 8 months of wind plant data availability in order to investigate uncertainty as it relates to a low number of data points – but not so low as to make the use of a regression relationship questionable. A total of 472 wind plants are kept for the final analysis, and their locations are shown in Fig. 1. Because obtaining an accurate representation of wind data in complex terrain by reanalysis products is challenging (Shravan Kumar and Anandan, 2009), most of the selected wind plants are located in the Midwest and Southern Great Plains. Notably, no wind plants in California pass the filtering criteria because they are predominately located in areas with thermally driven wind regimes, such as Tehachapi Pass, where coarse-resolution reanalysis products are poor predictors of wind energy production.
The fundamental step in an AEP calculation involves a regression between density-corrected wind speed (here, from the reanalysis products) and energy production (here, from the EIA-923 database). To investigate whether a simple linear function can be assumed to express the relationship between density-corrected wind speed and wind plant energy production when considering monthly data, we show a scatterplot between MERRA-2 density-corrected monthly wind speed and monthly energy production across all 472 sites in Fig. 2. For each site, data have been normalized by the respective site mean. We show best fits using a linear, quadratic, and cubic function and calculate the mean absolute error (MAE) of each fit.
We find that the difference between the normalized MAE values from the considered functions is less than 0.7 %. Therefore, the uncertainty connected with the choice of using a linear regression in the operational AEP methodology at a monthly time resolution appears minimal. Moreover, through conversations with wind industry professionals, we found that a linear regression based on monthly data is the standard industry approach when performing bankable^{2} operational AEP analyses.
2.2 Operational AEP methodology
Given the lack of existing guidelines for a standard approach for operational AEP calculations, we base our methodology on conversations with four major wind energy consultants who represent most of the operational market share in North America. These conversations overwhelmingly revealed the following characteristics for operational AEP analysis, and we follow the same approach in our analysis.

1. Wind speed data (measured or modeled) are density corrected at their native time resolution using Eq. (2).

2. Monthly revenue meter data, monthly average availability and curtailment losses, and monthly average wind speeds from a long-term wind resource product are calculated.

3. Monthly revenue meter data are normalized to 30 d months (e.g., for January, the revenue meter values are multiplied by 30∕31).

4. Monthly revenue meter data are corrected for monthly availability and curtailment (i.e., monthly gross energy data are calculated).

5. A linear regression between monthly gross energy production and concurrent density-corrected monthly average wind speeds is performed.

6. Long-term density-corrected monthly average wind speed is then calculated for each calendar month (i.e., average January wind speed, average February wind speed, and so forth) with a hindcast approach using 10–20 years of the available long-term reference monthly wind resource data (reanalysis products, long-term reference measurements, etc.).

7. Slope and intercept values from the regression relationship are then applied to the long-term density-corrected monthly average wind speed data using the long-term or so-called windiness correction. A long-term data set of monthly (January, February, etc.) estimated gross energy production is obtained.

8. The resulting long-term monthly gross energy estimates, which are based on 30 d months, are then denormalized to the actual number of days in each calendar month (e.g., for January, the obtained value is multiplied by 31∕30).

9. Long-term estimates of availability and curtailment losses are finally applied to the denormalized long-term monthly gross energy data, leading to a long-term calculation of operational AEP.
In the EIA-923 database, availability and curtailment data are not available. Therefore, in our analysis, we omit steps 4 and 9 of the list and only perform calculations on net energy data.
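The listed steps, with steps 4 and 9 omitted as in our analysis, can be sketched in a few lines of Python. This is an illustrative outline rather than the OpenOA implementation: the function names and data layout are hypothetical, wind speeds are assumed to be already density corrected and averaged to monthly values, and February is fixed at 28 d when denormalizing.

```python
import calendar

# Days per calendar month used to denormalize (assumption: non-leap year).
DAYS_PER_MONTH = [31, 28, 31, 30, 31, 30, 31, 31, 30, 31, 30, 31]

def fit_line(x, y):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(x)
    x_mean, y_mean = sum(x) / n, sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, y_mean - slope * x_mean

def long_term_aep(period_of_record, long_term_speed):
    """period_of_record: list of (year, month, net_energy, wind_speed) tuples.
    long_term_speed: 12 long-term mean wind speeds, January to December.
    Returns the long-term AEP in the energy units of the input."""
    # Step 3: normalize monthly energy to 30 d months.
    energy_30d = [energy * 30.0 / calendar.monthrange(year, month)[1]
                  for year, month, energy, _ in period_of_record]
    speeds = [speed for _, _, _, speed in period_of_record]
    # Step 5: regress normalized energy on concurrent wind speed.
    slope, intercept = fit_line(speeds, energy_30d)
    # Steps 7 and 8: apply the fit to long-term speeds, then denormalize.
    aep = 0.0
    for month_index, days in enumerate(DAYS_PER_MONTH):
        energy_30d_lt = slope * long_term_speed[month_index] + intercept
        aep += energy_30d_lt * days / 30.0
    return aep
```

The long-term wind speeds passed as `long_term_speed` correspond to step 6 and would be computed beforehand from 10–20 years of reference data.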
2.3 Monte Carlo analysis
To quantify the impact of the single uncertainty components on the long-term operational AEP estimate obtained using the methodology described in the previous section, we implement a Monte Carlo approach. In general, a Monte Carlo method involves the randomized sampling of inputs to, or calculations within, a method which, when repeated many times, results in a distribution of possible outcomes from which uncertainty can be deduced. This is usually calculated as the standard deviation or the coefficient of variation (i.e., standard deviation normalized by mean) of the resulting distribution (JCGM 100:2008, 2008; Dimitrov et al., 2018). Monte Carlo methods have been used in different applications for uncertainty quantification within the wind energy industry, ranging from the prediction of extreme wind speed events (Ishihara and Yamaguchi, 2015), to offshore fatigue design (Müller and Cheng, 2018), and to the economic analysis of the benefits of wind energy projects (Williams et al., 2008). Here, we apply this approach to derive a distribution of long-term operational AEP values from which the uncertainty can be calculated. Using a Monte Carlo approach provides a direct estimate of AEP uncertainty by sampling the relevant parameters connected to the various uncertainty components. By contrast, traditional approaches to assessing uncertainty are often less direct. For example, wind resource interannual variability is often calculated and then converted to AEP uncertainty through an “energy / velocity” (EV) ratio estimated from the wind and energy data. A Monte Carlo approach avoids this intermediate ratio and any uncertainty or error associated with it.
In our analysis, we separately consider five operational-based uncertainty components so that only the sampling of one parameter is performed in each Monte Carlo configuration. The following uncertainty components are included in our proposed Monte Carlo methodology for long-term operational AEP.

Revenue meter measurement error. To incorporate this uncertainty component in the Monte Carlo simulation, we sample monthly revenue meter data from a synthesized normal distribution centered on the reported value and a 0.5 % imposed standard deviation. In fact, a value of 0.5 % is consistent with what is typically assumed in the wind energy community as revenue meter uncertainty (IEC 60688:2012, 2012).

Reference wind speed data modeling error. Quantifying the uncertainty of the long-term wind resource data used in the operational AEP assessment is challenging because it can vary based on the location, long-term wind speed product used, or instrument from which reference observations are taken. To include this uncertainty component in a systematic way across the 472 locations considered in our analysis, we adopt an ensemble uncertainty approach (Taylor et al., 2009; Zhang et al., 2015) and use as proxy the variability of the wind resource between different reanalysis products. Therefore, at each Monte Carlo iteration at each site, we randomly select wind resource data from one of the three considered reanalysis products.

Linear regression model uncertainty. We adopt a novel way, directly enabled by the use of Monte Carlo, to incorporate this uncertainty component in the operational AEP assessment. We sample the regression slope and intercept values from a bivariate normal distribution centered on their best-fit values, with a covariance matrix equal to that of the best-fit parameters. The diagonal terms in the covariance matrix are given by the square of the slope and intercept standard errors. For a regression model between an independent variable, x, and a dependent variable, y, the standard error of the regression is defined (JCGM 100:2008, 2008) as follows:
$$\begin{array}{}\text{(4)}& {e}_{y}=\sqrt{{\displaystyle \frac{\sum {\left({y}_{i}-{\widehat{y}}_{i}\right)}^{\mathrm{2}}}{n-\mathrm{2}}}},\end{array}$$where ${\widehat{y}}_{i}$ is the regression-predicted value for y_{i} and n is the number of data points used in the regression. The standard error of the regression slope is
$$\begin{array}{}\text{(5)}& {e}_{a}={\displaystyle \frac{{e}_{y}}{\sqrt{\sum {\left({x}_{i}-\overline{x}\right)}^{\mathrm{2}}}}},\end{array}$$where $\overline{x}$ is the mean of the x_{i}, and the standard error of the intercept is
$$\begin{array}{}\text{(6)}& {e}_{b}={e}_{a}\sqrt{{\displaystyle \frac{\sum {x}_{i}^{\mathrm{2}}}{n}}},\end{array}$$where ${e}_{a}^{\mathrm{2}}$ and ${e}_{b}^{\mathrm{2}}$ are the diagonal terms in the covariance matrix of the bivariate normal distribution of regression slope and intercept from which Monte Carlo values are drawn. Slope and intercept values are strongly negatively correlated, which is captured by their covariance when performing the linear regression. The off-diagonal terms in the covariance matrix of the bivariate normal distribution constrain the random sampling of slope and intercept values to avoid sampling unrealistic combinations. An example of this sampling is shown in Fig. 4 for two projects of different regression strengths. We sample 500 slope and intercept values from a bivariate normal distribution centered around the best-fit parameters, with the covariance matrix derived from the standard errors of slope and intercept and their covariance. As shown in Fig. 4, the low standard errors found for the leftmost regression relationship constrain the possible slope and intercept values that can be sampled, while the high standard errors in the rightmost regression relationship allow for a much wider sampling.

Long-term (windiness) correction uncertainty. We incorporate this component by sampling the number of years (randomly picked between 10 and 20) to use as the long-term wind resource data to which the regression coefficients are applied to derive long-term energy production data (the so-called windiness correction).

Wind resource interannual variability (IAV) uncertainty. We incorporate this uncertainty component in the Monte Carlo method by sampling the long-term (reanalysis) average calendar-monthly wind speeds (i.e., average January, average February) used to calculate long-term monthly energy production data. The sampling distribution is normal, centered on the calculated long-term average calendar-monthly wind speed, and with a standard deviation equal to the 20-year standard deviation of the long-term average monthly wind speed for each calendar month. In doing so, we assume that wind speeds in contiguous months are independent.
Each of the listed sources of uncertainty corresponds to a Monte Carlo sampling and is highlighted by a probability distribution in the flowchart in Fig. 3. Note that uncertainty components related to availability and curtailment losses are not considered in our approach because the EIA-923 database does not include measurements of these losses.
To calculate these uncertainty components at each wind plant, we run the Monte Carlo simulation under five different setups, each of them having only a single sampling performed (i.e., either revenue meter, reference wind speed data, IAV, linear regression, or windiness correction). For each component, we run the Monte Carlo simulation 10 000 times. We quantify the impact of each single uncertainty component on the long-term operational AEP in terms of the coefficient of variation of the distribution of operational AEP resulting from the Monte Carlo simulation run. Convergence of the AEP distribution within 0.5 % of the true mean after the 10 000 Monte Carlo runs was verified for all projects with 95 % confidence.
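To make one of the five setups concrete, the sketch below runs a Monte Carlo simulation in which only the regression slope and intercept are sampled, then reports the coefficient of variation of the resulting AEP distribution. It is a simplified illustration under stated assumptions, not the OpenOA code: function names are hypothetical, the parameter covariance is the standard ordinary least-squares result, the bivariate normal draw is built by hand from two standard normal variates, and the 30 d normalization is omitted for brevity.

```python
import math
import random
import statistics

def ols_with_covariance(x, y):
    """Best-fit slope and intercept with their variances and covariance."""
    n = len(x)
    x_mean, y_mean = sum(x) / n, sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    slope = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y)) / sxx
    intercept = y_mean - slope * x_mean
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    var_y = ss_res / (n - 2)                       # squared standard error of regression
    var_a = var_y / sxx                            # squared slope standard error
    var_b = var_y * sum(xi ** 2 for xi in x) / (n * sxx)  # squared intercept standard error
    cov_ab = -x_mean * var_y / sxx                 # off-diagonal term
    return slope, intercept, var_a, var_b, cov_ab

def sample_line(slope, intercept, var_a, var_b, cov_ab, rng):
    """Draw one (slope, intercept) pair from the bivariate normal."""
    if var_a == 0.0:                               # perfect fit: nothing to sample
        return slope, intercept
    z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
    sd_a = math.sqrt(var_a)
    resid_var = max(var_b - cov_ab ** 2 / var_a, 0.0)
    a = slope + sd_a * z1
    b = intercept + (cov_ab / sd_a) * z1 + math.sqrt(resid_var) * z2
    return a, b

def regression_cov_of_aep(x, y, long_term_speeds, n_runs=10_000, seed=0):
    """Coefficient of variation of AEP when only the regression is sampled."""
    rng = random.Random(seed)
    fit = ols_with_covariance(x, y)
    aep = []
    for _ in range(n_runs):
        a, b = sample_line(*fit, rng)
        aep.append(sum(a * u + b for u in long_term_speeds))
    return statistics.stdev(aep) / statistics.mean(aep)
```

The other four components follow the same pattern, with the sampled quantity replaced by the revenue meter data, the choice of reanalysis product, the number of long-term years, or the long-term calendar-monthly wind speeds.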
The code used to perform the AEP calculations is published and documented in NREL's (National Renewable Energy Laboratory) open-source operational assessment software, OpenOA^{3}. Calculations were performed on Eagle, NREL's high-performance computing cluster. Specifically, each wind plant was assigned a different processor and run in parallel. Given the general simplicity of the AEP method used here, computational requirements were moderate despite the 50 000 simulations (10 000 runs times 5 uncertainty setups) required for each wind plant.
2.4 Combination of uncertainty components
Once the contribution from each uncertainty component to the long-term operational AEP uncertainty has been quantified, the different components need to be combined to obtain the total AEP uncertainty. As stated in the Introduction, it is common practice for wind energy consultants to assume that all uncertainty components are uncorrelated and combine them using Eq. (1) to obtain σ_{tot,uncorr}. To test the validity of this assumption, we apply Eq. (1) in which each of the five considered uncertainty components, σ_{i}, is quantified as the coefficient of variation of the corresponding operational AEP distribution obtained by running the Monte Carlo simulation with a single sampling performed. We note that the same values of σ_{tot,uncorr} would be obtained by running the Monte Carlo simulation with, at each iteration, all of the five samplings performed independently of each other.
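We contrast the total AEP uncertainty calculated assuming uncorrelated components with what we obtain by taking into account these correlations in the calculation. Following the guidance in JCGM 100:2008 (2008), we combine the various uncertainty components and calculate the total long-term operational AEP uncertainty for each wind plant as follows:
$$\begin{array}{}\text{(7)}& {\sigma}_{\mathrm{tot},\mathrm{corr}}=\sqrt{\sum _{i=\mathrm{1}}^{N}{\sigma}_{i}^{\mathrm{2}}+\mathrm{2}\sum _{i=\mathrm{1}}^{N-\mathrm{1}}\sum _{j=i+\mathrm{1}}^{N}{R}_{ij}{\sigma}_{i}{\sigma}_{j}},\end{array}$$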
where, in our analysis, N equals 5 and R_{ij} is the correlation coefficient between each pair of uncertainty components calculated from the results obtained for all 472 wind plants considered in the analysis.
The comparison between σ_{tot,uncorr} and σ_{tot,corr} will give insights into the error arising from ignoring the correlations existing between the various uncertainty components.
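The two ways of combining the components can be written compactly as follows. This is a small sketch with hypothetical function names; `corr` is assumed to be a symmetric N×N matrix of correlation coefficients R_{ij}, of which only the upper triangle is read.

```python
import math

def total_uncorrelated(sigmas):
    """Root sum of squares of the components (uncorrelated assumption)."""
    return math.sqrt(sum(s * s for s in sigmas))

def total_correlated(sigmas, corr):
    """Total uncertainty including the pairwise cross terms
    2 * R_ij * sigma_i * sigma_j for every pair i < j."""
    n = len(sigmas)
    total = sum(s * s for s in sigmas)
    for i in range(n - 1):
        for j in range(i + 1, n):
            total += 2.0 * corr[i][j] * sigmas[i] * sigmas[j]
    return math.sqrt(total)

# Two hypothetical components of 3 % and 4 %:
print(total_uncorrelated([3.0, 4.0]))                          # 5.0
print(total_correlated([3.0, 4.0], [[1.0, 1.0], [1.0, 1.0]]))  # 7.0
```

With zero off-diagonal correlations the two functions agree; positive correlations increase the total and negative correlations reduce it.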
3.1 Operational-based AEP uncertainty contributions
Distributions of each uncertainty component, expressed in terms of the percent coefficient of variation of the resulting AEP distributions, across all 472 wind plants are shown in Fig. 5. Uncertainty connected to wind resource IAV is found to contribute the most (average 4.1 % across all wind plants). The uncertainty in the linear regression model has the second-largest contribution (1.5 %), followed by the uncertainty of the reference wind speed data (0.8 %; here, of the reanalysis products) and revenue meter data (here, imposed at 0.5 %). The long-term windiness correction has the smallest uncertainty component (0.4 %). Therefore, the number of years used for the long-term windiness correction does not have a large impact on the overall uncertainty in operational AEP, at least for the sampled range of 10–20 years. Using as few as 10 years seems sufficient to give stability to the long-term AEP estimate, and adding additional years does not provide a significant reduction in the uncertainty connected with the long-term estimate. As already mentioned in Sect. 2, these results are obtained for wind plants in mostly simple terrain and with a moderate-to-strong correlation between reanalysis wind resource and wind energy production and, therefore, with an overall low operational AEP uncertainty. We acknowledge that the inclusion of wind plants with a weaker correlation with the reanalysis products would modify the relative contribution of the various uncertainty components (e.g., the importance of the regression uncertainty would increase).
3.2 Correlation between operational-based AEP uncertainty components
To be able to assess the validity of the uncorrelated assumption when combining different uncertainty components, we assess potential correlations between uncertainty components by analyzing the Pearson's correlation coefficients, R_{ij} (needed in Eq. 7 to calculate σ_{tot,corr}), from each pair of AEP uncertainty components across the 472 wind plants, and we summarize the results in the correlation matrix in Fig. 6.
To assess which of the obtained correlations have statistical significance, we calculate the p value (Westfall and Young, 1993) associated with the 10 correlation coefficients. The test reveals that for three pairs of uncertainty components, the probability of finding the observed nonzero correlation coefficients if the actual correlation coefficient were, in fact, zero (p value) is less than 10^{−5}. Therefore, the following three correlations have strong statistical significance.

The wind resource IAV and the long-term windiness correction uncertainties are moderately correlated (R=0.49, $p=\mathrm{1.9}\times {\mathrm{10}}^{-\mathrm{29}}$).

The linear regression and reference wind speed data uncertainties are weakly correlated (R=0.35, $p=\mathrm{2.5}\times {\mathrm{10}}^{-\mathrm{15}}$).

The wind resource IAV and the linear regression uncertainties appear weakly negatively correlated ($R=-\mathrm{0.21}$, $p=\mathrm{2.6}\times {\mathrm{10}}^{-\mathrm{6}}$).
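For illustration, the correlation coefficient between two uncertainty components across plants, and a significance estimate for it, could be computed as sketched below. The permutation test shown here is a stand-in chosen to keep the example self-contained and is not necessarily the procedure used to obtain the p values quoted above; function names are hypothetical.

```python
import math
import random

def pearson_r(a, b):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def permutation_p_value(a, b, n_perm=2000, seed=0):
    """Two-sided p value: share of random re-pairings of a and b whose
    |R| is at least as large as the observed |R|."""
    rng = random.Random(seed)
    observed = abs(pearson_r(a, b))
    shuffled = list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(pearson_r(a, shuffled)) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)
```

A permutation test cannot resolve p values smaller than about 1/`n_perm`, so very small values such as those reported here would require an analytical test or a much larger number of permutations.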
The first correlation noted earlier (wind resource IAV and long-term windiness correction) is explained simply by the fact that both uncertainty components are driven by wind resource variability. At a site with large wind variability, IAV will be large by definition and so will the uncertainty introduced by different lengths of time series used for the long-term AEP calculation.
The correlation between linear regression and reference wind speed data uncertainties can be justified given the dependence of both these uncertainty components on the number of data points used in the regression between energy production data and concurrent wind speed data (Fig. 7).
Both the slope and intercept errors (Eqs. 5 and 6), on which the linear regression uncertainty depends (as described in Sect. 2.3), decrease as the number of data points grows, so that a regression performed on only a few data points carries a larger uncertainty. This dependence is exemplified in Fig. 4, in which we have compared the sampling sets of regression lines for two stations in the EIA data set; for these two cases, the standard errors of regression slope and intercept for the station with 8 data points (on the right) are 30–50 times larger than what is found for the station with 90 data points (on the left).
The number of data points used for the regression also has an impact on the reference wind speed data uncertainty. In fact, a short period of record of a wind plant's operation can lead to different interpretations from the reference wind resource data sets used as to whether that short period of record was above, equal to, or below the long-term average resource. Over a longer period of record, these potential discrepancies between different wind resource data sets (in our case, reanalysis products) tend to average out, leading, therefore, to a reduced uncertainty. We illustrate this phenomenon by exploring the long-term trend of the reanalysis products for the wind plant with one of the highest reported reference wind speed data uncertainties (EIA ID 60502 reported a 3.7 % reference wind speed data uncertainty). Figure 8 shows the result. The period of record for wind plant operation (shown by a shaded blue area in Fig. 8) was only 12 months. As shown in the figure, the various reanalysis products have very different interpretations of the wind resource in the short period of record relative to the long term (ERA-I: 4 % above average; MERRA-2: 1 % below average; NCEP-2: 1 % above average). Consequently, the use of each reanalysis product will lead to different magnitudes (both positive and negative) in the long-term windiness corrections, leading to high uncertainty in the resulting operational AEP calculation. By increasing the period of record (i.e., increasing the number of data points used in the regression), such discrepancies tend to average out. This is illustrated in Fig. 9, where we show how the ratio of period-of-record to long-term wind speed varies as we extend the period of record by increasing the number of months while keeping December 2017 as the fixed ending time. For short periods of record, there is considerable deviation of this ratio among the different reanalysis products (i.e., the reference wind speed data uncertainty is high).
As the length of the period of record increases, this ratio tends to converge to 1.0, and the spread between the three reanalysis products decreases (i.e., the reference wind speed data uncertainty is low).
Finally, the (weak) negative correlation between linear regression and wind resource IAV uncertainties arises because the two respond in opposite ways to the R^{2} coefficient between the reanalysis wind speed and the energy production data (Fig. 10). As expected, the linear regression uncertainty decreases as the coefficient of determination increases, because a stronger correlation between wind speed and energy production reduces the uncertainty of the regression between the two variables. Wind resource IAV uncertainty, on the other hand, shows a positive correlation with the regression R^{2} coefficient. This dependence arises because both quantities are positively correlated with the total variance of wind speed or, equivalently, of produced energy. Figure 11 shows the relationship between IAV uncertainty and the total sum of squares, SS_{tot,WS}, of reanalysis wind speed (here, using MERRA-2 monthly data), which is proportional to the variance of the data:
$$SS_{\mathrm{tot,WS}} = \sum_{i} \left( WS_{i} - \overline{WS} \right)^{2} \qquad (8)$$
where WS_{i} is the ith wind speed value and $\overline{WS}$ is the mean of the wind speed data.
A positive correlation between IAV uncertainty and SS_{tot,WS} emerges. At the same time, the linear regression R^{2} coefficient also depends on the variance of the produced energy (and, equivalently, of wind speed), as it is defined as follows:
$$R^{2} = 1 - \frac{SS_{\mathrm{res}}}{SS_{\mathrm{tot}}} \qquad (9)$$
where SS_{res} is the sum of the squared residuals from the linear regression. Equation (9) shows that, for a given SS_{res}, R^{2} increases as the total sum of squares SS_{tot} increases, thus confirming the positive correlation between R^{2} and the variance in the data.
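The dependence of R^{2} on the total sum of squares can be checked numerically. The sketch below assumes the usual definition R^{2} = 1 - SS_{res}/SS_{tot} for an ordinary least-squares line; the data, the slope of 2, and the helper names are hypothetical and chosen so that the scatter about the line is the same in both cases while the spread of the predictor changes.

```python
import numpy as np

def total_sum_of_squares(y):
    """Sum of squared deviations from the mean (proportional to the variance)."""
    y = np.asarray(y, dtype=float)
    return float(np.sum((y - y.mean()) ** 2))

def r_squared(x, y):
    """Coefficient of determination of a least-squares line fitted to (x, y)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    slope, intercept = np.polyfit(x, y, 1)
    ss_res = float(np.sum((y - (slope * x + intercept)) ** 2))
    return 1.0 - ss_res / total_sum_of_squares(y)

# Hypothetical predictor and a fixed scatter pattern that is orthogonal to both
# a constant and x, so the fitted residuals are the same for every scaling.
x = np.arange(6, dtype=float)
noise = 0.5 * np.array([1.0, -1.0, 0.0, 0.0, -1.0, 1.0])

def scaled_case(k):
    """Stretch the spread of x by a factor k about its mean; keep the scatter."""
    x_k = x.mean() + k * (x - x.mean())
    y_k = 2.0 * x_k + noise  # assumed linear relationship with slope 2
    return x_k, y_k

r2_low = r_squared(*scaled_case(1.0))   # smaller variance in the data
r2_high = r_squared(*scaled_case(2.0))  # larger variance, same residuals
print(r2_low, r2_high)
```

With the residual scatter held fixed, the case with the larger variance yields the larger R^{2}, which matches the argument made with Eq. (9).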
3.3 Comparison between total operational-based AEP uncertainty under different assumptions
Having revealed the correlations between different AEP uncertainty components and explained their sources, we can compare the total operational AEP uncertainty calculated when allowing for these correlations (Eq. 7) with the total uncertainty calculated under the uncorrelated assumption using the conventional sum of squares approach (Eq. 1). Figure 12 shows the results of this comparison for the 472 wind plants considered, both as a scatterplot and as a histogram of the difference $\sigma_{\mathrm{tot,corr}} - \sigma_{\mathrm{tot,uncorr}}$. A weak bias can be observed, with a mean value of +0.1 % in uncertainty difference (and differences up to 0.5 % for specific wind plants). In other words, if correlations between the different uncertainty components are ignored in the calculation method, the total operational AEP uncertainty is, on average, slightly underestimated.
This difference can be explained by comparing the contributions $R_{ij}\,\overline{\sigma_{i}}\,\overline{\sigma_{j}}$ from the various uncertainty pairs in Eq. (7), averaged over the 472 considered wind plants. Figure 13a shows the mean magnitude (across all wind plants) of these contributions for all of the considered uncertainty pairs. The negative correlation between IAV and linear regression has the largest single impact because this correlation involves the two largest uncertainty components (Fig. 5). However, the sum of the contributions from all of the positive correlations exceeds the sum of the contributions from the negatively correlated components (Fig. 13b), thus resulting in the overall average increase in total operational AEP uncertainty when the correlations are taken into account in the calculation.
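The two combination rules can be compared in a few lines. This is a minimal sketch, assuming that Eq. (7) has the standard correlated form given in JCGM 100:2008, $\sigma_{\mathrm{tot}}^{2} = \sum_{i} \sigma_{i}^{2} + 2 \sum_{i<j} R_{ij} \sigma_{i} \sigma_{j}$, and that Eq. (1) is the root sum of squares. The function name, the component values, and the correlation coefficients are hypothetical and are not results from this study.

```python
import numpy as np

def total_uncertainty(sigmas, corr=None):
    """Combine uncertainty components, optionally with a correlation matrix.

    With corr=None the components are treated as uncorrelated (root sum of
    squares). Otherwise corr must be symmetric with ones on the diagonal.
    """
    sigmas = np.asarray(sigmas, dtype=float)
    if corr is None:
        corr = np.eye(len(sigmas))
    corr = np.asarray(corr, dtype=float)
    # sigma^T R sigma = sum(sigma_i^2) + 2 * sum_{i<j} R_ij * sigma_i * sigma_j
    return float(np.sqrt(sigmas @ corr @ sigmas))

# Hypothetical component uncertainties in percent, ordered as: revenue meter,
# reference wind speed, linear regression, long-term correction, IAV.
sigmas = [0.5, 1.0, 2.0, 1.5, 3.0]

# Hypothetical correlation matrix with only the three pairs named in the text.
R = np.eye(5)
R[4, 3] = R[3, 4] = 0.3    # IAV and long-term correction (positive)
R[4, 2] = R[2, 4] = -0.2   # IAV and linear regression (negative)
R[1, 2] = R[2, 1] = 0.3    # reference wind speed and linear regression (positive)

sigma_uncorr = total_uncertainty(sigmas)
sigma_corr = total_uncertainty(sigmas, R)
print(sigma_uncorr, sigma_corr, sigma_corr - sigma_uncorr)
```

In this made-up example the positive cross terms outweigh the negative one, so the correlated total is larger than the uncorrelated one; with a different set of coefficients the sign of the difference could reverse.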
Financial operations related to wind plants require accurate calculations of the annual energy production (AEP) and its uncertainty, both prior to the construction of the plant and, often, in the context of its operational analysis. As wind energy penetration increases globally, techniques to accurately assess AEP uncertainty are a priority for the wind energy industry. Current industry practice typically assumes that uncertainty components in AEP estimates are uncorrelated. However, we have shown that this assumption is not valid for the five components that comprise an operational-based uncertainty. We used a Monte Carlo approach to assess AEP, which provides quantitative insight into the aspects of the AEP calculation that drive its uncertainty. We applied this approach to operational data from 472 wind plants in the EIA-923 database, located across the United States and mostly in simple terrain, in order to study potential correlations between uncertainty components. Three pairs of uncertainty components revealed a statistically significant correlation: wind resource interannual variability (IAV) and long-term windiness correction (positive correlation); wind resource IAV and linear regression (negative); and reference wind speed data and linear regression (positive). Wind resource IAV and long-term windiness correction uncertainties are correlated because they both depend on wind resource variability. Reference wind speed data uncertainty is positively correlated with linear regression uncertainty because both decrease as the number of data points in the period of record increases. Finally, wind resource IAV uncertainty and linear regression uncertainty show a negative correlation because they respond oppositely to the R^{2} coefficient between the (reanalysis) wind speed and energy production data.
Our results show that ignoring these correlations between uncertainty components causes an underestimation of the total operational AEP uncertainty of, on average, about 0.1 %, with peak differences of 0.5 % for specific sites. These differences, though not large, would still affect wind plant financing rates. Moreover, we expect the differences to become larger for sites characterized by more complex wind flow. Therefore, our results suggest that correlations between uncertainty components should be taken into account when assessing the total operational AEP uncertainty.
Additional components of uncertainty in an operational AEP were not considered in our study because of limited reporting in the EIA-923 database. These components include reported availability, curtailment uncertainty, and various uncertainties introduced through analyst decision-making (e.g., filtering high-loss months from analysis and regression outlier detection). Future studies could include the impact of these additional sources of uncertainty on the operational AEP assessment. Moreover, our analysis excluded sites, mostly in complex terrain, with a weak correlation between reanalysis wind resource data and wind power production. Future work could explore the magnitude of operational AEP uncertainty and the correlation between its components for such complex flow regimes. Finally, this study focused on correlations between operational AEP uncertainty components. Future work could explore correlations between the numerous preconstruction AEP uncertainty components (e.g., wake loss, wind speed extrapolation, wind flow model).
EIA data used in this study are accessible from https://www.eia.gov/electricity/data/eia923/ (US Energy Information Administration, 2020a). Geographical data of the EIA wind plants are available at https://www.eia.gov/maps/layer_infom.php (US Energy Information Administration, 2020b). Software used to assess operational AEP is available from https://github.com/NREL/OpenOA (last access: 1 October 2020, https://doi.org/10.11578/dc.20181023.1, Perr-Sauer et al., 2018).
NB and MO are equal contributors to this work. MO performed the AEP estimates on the wind plants considered in the study. NB and MO analyzed the processed data. NB wrote the paper with significant contributions from MO.
The authors declare that they have no conflicts of interest.
This research was performed using computational resources sponsored by the Department of Energy's Office of Energy Efficiency and Renewable Energy and located at the National Renewable Energy Laboratory.
This paper was edited by Carlo L. Bottasso and reviewed by Mark C. Kelly and Curran Crawford.
ANSI C12.1-2014: Electric Meters – Code For Electricity Metering, Standard, National Electrical Manufacturers Association, Virginia, available at: https://webstore.ansi.org/previewpages/NEMA/preview_ANSI+C12.12014.pdf (last access: 1 October 2020), 2014.
Brower, M.: Wind resource assessment: a practical guide to developing a wind project, John Wiley & Sons, Hoboken, New Jersey, https://doi.org/10.1002/9781118249864, 2012.
Cameron, J.: Post-construction Yield Analysis, European Wind Energy Association Technical Workshop, available at: http://www.ewea.org/events/workshops/wpcontent/uploads/proceedings/Analysis_of_Operating_Wind_farms/EWEA Workshop Lyon  23 Jessica Cameron Natural Power.pdf (last access: 1 October 2020), 2012.
Clifton, A., Smith, A., and Field, M.: Wind Plant Preconstruction Energy Estimates: Current Practice and Opportunities, Tech. rep., available at: https://www.nrel.gov/docs/fy16osti/64735.pdf (last access: 1 October 2020), 2016.
Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler, M., Matricardi, M., McNally, A. P., Monge-Sanz, B. M., Morcrette, J.-J., Park, B.-K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut, J.-N., and Vitart, F.: The ERA-Interim reanalysis: Configuration and performance of the data assimilation system, Q. J. Roy. Meteorol. Soc., 137, 553–597, 2011.
Dimitrov, N., Kelly, M. C., Vignaroli, A., and Berg, J.: From wind to loads: wind turbine site-specific load estimation with surrogate models trained on high-fidelity load databases, Wind Energ. Sci., 3, 767–790, https://doi.org/10.5194/wes37672018, 2018.
EIA: A Guide to EIA Electric Power Data, Standard, Energy Information Administration, available at: https://www.eia.gov/electricity/data/guide/pdf/guide.pdf (last access: 1 October 2020), 2018.
Energy Information Administration: Monthly Energy Review – March 2020, Tech. rep., US Department of Energy, 2020.
Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A. M., Gu, W., Kim, G., Koster, R., Lucchesi, R., Merkova, R., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., and Zhao, B.: The modern-era retrospective analysis for research and applications, version 2 (MERRA-2), J. Climate, 30, 5419–5454, 2017.
Global Wind Energy Council: Global Wind Report – Annual Market Update 2017, Tech. rep., Global Wind Energy Council, 2018.
IEC 60688:2012: Electrical measuring transducers for converting A.C. and D.C. electrical quantities to analogue or digital signals, Standard, International Electrotechnical Commission, 2012.
IEC 61400-12-1:2017: Wind energy generation systems – Part 12-1: Power performance measurements of electricity producing wind turbines, Standard, International Electrotechnical Commission, 2017.
IEC 61400-15:draft: Assessment of site specific wind conditions for wind power stations, Standard, International Electrotechnical Commission, in review, 2020.
IEC 61400-26-3:2016: Wind energy generation systems – Part 26-3: Availability for wind power stations, Standard, International Electrotechnical Commission, 2016.
Ishihara, T. and Yamaguchi, A.: Prediction of the extreme wind speed in the mixed climate region by using Monte Carlo simulation and measure-correlate-predict method, Wind Energy, 18, 171–186, 2015.
ISO 2533:1975: Standard Atmosphere, International Organization for Standardization, 11–12, 1975.
JCGM 100:2008: Evaluation of measurement data – Guide to the expression of uncertainty in measurement, Joint Committee for Guides in Metrology, available at: https://www.bipm.org/utils/common/documents/jcgm/JCGM_100_2008_E.pdf (last access: 1 October 2020), 2008.
Kalkan, A.: Uncertainty in Wind Energy Assessment, available at: http://www.windsim.com/documentation/UM2015/1506_WindSim_UM_Inores_Akgun_Kalkan.pdf (last access: 1 October 2020), 2015.
Khatab, A. M.: Performance Analysis of Operating Wind Farms, Master's thesis, Uppsala University, Department of Earth Sciences, Campus Gotland, 2017.
Lackner, M. A., Rogers, A. L., and Manwell, J. F.: Uncertainty analysis in MCP-based wind resource assessment and energy production estimation, J. Sol. Energy Eng., 130, 031006, https://doi.org/10.1115/1.2931499, 2008.
Lindvall, J., Hansson, J., Undheim, O., and Vindteknikk, J.: Post-construction production assessment of wind farms, Tech. Rep. 2016:297, Energyforsk, 2016.
Lunacek, M., Fields, M. J., Craig, A., Lee, J. C. Y., Meissner, J., Philips, C., Sheng, S., and King, R.: Understanding Biases in Pre-Construction Estimates, J. Phys.: Conference Series, 1037, 062009, 2018.
Manwell, J. F., McGowan, J. G., and Rogers, A. L.: Wind energy explained: theory, design and application, John Wiley & Sons, Hoboken, NJ, 2010.
Müller, K. and Cheng, P. W.: Application of a Monte Carlo procedure for probabilistic fatigue design of floating offshore wind turbines, Wind Energ. Sci., 3, 149–162, https://doi.org/10.5194/wes31492018, 2018.
Perr-Sauer, J., Fields, M., Craig, A., Optis, M., Kemper, T., Sheng, S., Meissner, J., and Phillips, C.: Open OA, FKA: Wind Plant Performance Project (WP3) Benchmarking, https://doi.org/10.11578/dc.20181023.1, 2018.
Saha, S., Moorthi, S., Wu, X., Wang, J., Nadiga, S., Tripp, P., Behringer, D., Hou, Y.-T., Chuang, H.-Y., Iredell, M., Ek, M., Meng, J., Yang, R., Mendez, M. P., van den Dool, H., Zhang, Q., Wang, W. H., Chen, M., and Becker, E.: The NCEP climate forecast system version 2, J. Climate, 27, 2185–2208, 2014.
Shravan Kumar, M. and Anandan, V.: Comparision of the NCEP/NCAR Reanalysis II winds with those observed over a complex terrain in lower atmospheric boundary layer, Geophys. Res. Lett., 36, L01805, https://doi.org/10.1029/2008GL036246, 2009.
Taylor, J. W., McSharry, P. E., and Buizza, R.: Wind power density forecasting using ensemble predictions and time series models, IEEE Transactions on Energy Conversion, 24, 775–782, 2009.
US Energy Information Administration: Form EIA-923, available at: https://www.eia.gov/electricity/data/eia923/, last access: 1 October 2020a.
US Energy Information Administration: EIA Maps, available at: https://www.eia.gov/maps/layer_infom.php, last access: 1 October 2020b.
Vaisala: Reducing Uncertainty in Wind Project Energy Estimates, Tech. rep., available at: https://www.vaisala.com/sites/default/files/documents/TritonDNVWhitePaper.pdf (last access: 1 October 2020), 2014.
Westfall, P. H. and Young, S. S.: Resampling-based multiple testing: Examples and methods for p-value adjustment, vol. 279, John Wiley & Sons, Hoboken, NJ, 1993.
Williams, S. K., Acker, T., Goldberg, M., and Greve, M.: Estimating the economic benefits of wind energy projects using Monte Carlo simulation with economic input/output analysis, Wind Energy: An International Journal for Progress and Applications in Wind Power Conversion Technology, 11, 397–414, 2008.
Zhang, J., Draxl, C., Hopson, T., Delle Monache, L., Vanvyve, E., and Hodge, B.-M.: Comparison of numerical weather prediction based deterministic and probabilistic wind resource assessment methods, Appl. Energ., 156, 528–541, 2015.
Please note that this product is provided in MERRA-2 directly and no further interpolation was performed.
Results are accepted by banks, investors, and so on for use in financing, buying/selling, and acquiring wind plants.
https://github.com/NREL/OpenOA, last access: 1 October 2020