the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Minimum Open Data Subset for Wind Power Prediction
Abstract. Accurate wind power prediction is required for grid integration of renewables, minimizing curtailment of renewable energy, and performing resource assessments. Prior research has explored the use of numerical weather prediction, reanalysis datasets, and observational data in power prediction and resource assessment applications. Observational data is spatially limited and often proprietary. Reanalysis datasets are available globally, but have a large spatial resolution and therefore do not capture the effects of complex geography well. Numerical weather prediction simulations allow for high spatial resolution flow models, but require significant processing resources and computational time. This work combines historical wind power production data, observational data, MERRA-2 reanalysis, and WRF model data at three wind farms in Ontario, Canada to determine the optimal data source, combination of data sources, and variables for prediction of wind power using a random forests model. Results show that a model combining select data from all three data sources, including a combination of wind speed, time, and other weather variables, improves predictive performance by up to 57 % over the benchmark power curve model. Analysis of feature importance shows that aggregating wind speed allows the model to make better use of additional weather features. The minimum subset of input data for the best performing model, which achieves a mean absolute error (MAE) of 0.071 across all sites, consists of averaged wind speed, temperature, wind direction, pressure, air density, and time variables (hour, day and month).
- Preprint
(9611 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on wes-2025-29', Anonymous Referee #1, 12 Apr 2025
Dear authors and editor, while this manuscript is of fair technical value I do not consider it worth a publication in WES in its current state for the following reasons:
- objectives and context: the objectives of a Wind Resource and Energy Yield Assessments (WRA and EYA) are primarily to estimate the long term net production of a wind farm. The paper is focusing on metrics (MAE and RMSE) which are not commonly used for WRA and EYA. A revised version of the paper should use the commonly accepted framework, so as to bring value to practitioners in the field.
- methods: the methods used by practitioners (measurement data analysis, long term correction, spatial extrapolation, gross and net (including wakes) energy yield, operational losses) are not discussed in this paper. A revised value of the paper should at least discuss these methods and how the proposed alternatives add value compare to these traditional methods. See for reference the WES article https://wes.copernicus.org/articles/6/311/2021/ and the reports from the Wind Plant Performance Prediction (WP3) NREL project. Also, a revised version of the manuscript should dwelve much more into details regarding the wind climate (macro-meso-micro) at every wind farm, through a description of the sites in terms of large-scale forcing, orography, roughness, atmospheric stability, wind turbine layout and characteristics. This information (derived from model and measurements) should be used to discuss the differences in model results (the model works differently for the third wind farm). See example such as https://doi.org/10.1127/metz/2021/1068. Lastly, a revised value of the paper should compare several reanalysis (at least ERA5 should be added).
- readibility: the manuscript needs to be shortened, in particular the part on the literature review. I understand the work derives from a Master Thesis, for which the style used (technical report) is fine. But here for a WES paper, the manuscript should be more concise and clearly insist on the novelty/added value of the proposed approach compared to existing frameworks.All the best,
Rémi Gandoin
Citation: https://doi.org/10.5194/wes-2025-29-RC1 -
RC2: 'Comment on wes-2025-29', Anonymous Referee #2, 23 Apr 2025
"general comments"
This paper investigates different ways of estimating the historical wind power available in a region. It uses the recorded actual wind power generation from three test locations in Canada to evaluate the skill of different approaches. Three different sources of atmospheric data are used to generate wind power: meteorological stations, WRF forecast, and MERRA-2 reanalysis. The skill of the sources are considered separately, as well as in combination. The importance of different atmospheric variables is also estimated. The model with the best skill scores used variables from all sources, and used 10 variables, emphasizing the importance of using atmospheric variables beyond simply wind speed. This is a well-written paper, with a logical and clear methodology, and presents interesting results and discussion.
"specific comments"
- As the paper uses inputs from reanalysis and met stations, it should be made clear that this method can not be used for operational forecasting.
- There are well-published biases within the MERRA-2 reanalysis dataset. As the authors mention in their conclusions, it would probably have been better to have used the ERA5 dataset instead, or, ideally, to have compared the skill of the two for the particular locations used in this study.
- 10 metre met station wind speed was extrapolated (not interpolated) to hub height using a logarithmic wind speed profile. There are many issues with this, including the importance of atmospheric stability, this could be highlighted as a source of uncertainty.
- Is MERRA-2 also extrapolated from 10m wind? Note: ERA5 records wind speed at 10m and 100m, which is an advantage here.
- Was WRF wind output at turbine hub height?
- The paper is based on only four months of data, which is quite a short time period. Results would be more rigorous if at least one year of simulations were generated.
- Table 4: min and max don’t mean much for wind direction. Perhaps something else, like standard deviation, would be more relevant?
"technical corrections"
- Table 1 does not seem to be referred to in the text.
- Line 168: typo “acquired from the and”
- Table 4: typo: column heading “WRF1” should, I think, be “WF1”
- Line 194: missing reference “acquired from the and”provided power curves (?)”
- Line 222: type: “Figure 3a” should be “Figure 4a”
- Line 265: perhaps change “Scenario 5” to “the last column”?
Conor Sweeney, UCD, Dublin, Ireland.
Citation: https://doi.org/10.5194/wes-2025-29-RC2
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
111 | 32 | 7 | 150 | 7 | 7 |
- HTML: 111
- PDF: 32
- XML: 7
- Total: 150
- BibTeX: 7
- EndNote: 7
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1