Preprints
https://doi.org/10.5194/wes-2025-29
https://doi.org/10.5194/wes-2025-29
20 Mar 2025
 | 20 Mar 2025
Status: this preprint is currently under review for the journal WES.

Minimum Open Data Subset for Wind Power Prediction

Elizabeth von Zuben and Kristen R. Schell

Abstract. Accurate wind power prediction is required for grid integration of renewables, minimizing curtailment of renewable energy, and performing resource assessments. Prior research has explored the use of numerical weather prediction, reanalysis datasets, and observational data in power prediction and resource assessment applications. Observational data is spatially limited and often proprietary. Reanalysis datasets are available globally, but have a large spatial resolution and therefore do not capture the effects of complex geography well. Numerical weather prediction simulations allow for high spatial resolution flow models, but require significant processing resources and computational time. This work combines historical wind power production data, observational data, MERRA-2 reanalysis, and WRF model data at three wind farms in Ontario, Canada to determine the optimal data source, combination of data sources, and variables for prediction of wind power using a random forests model. Results show that a model combining select data from all three data sources, including a combination of wind speed, time, and other weather variables, improves predictive performance by up to 57 % over the benchmark power curve model. Analysis of feature importance shows that aggregating wind speed allows the model to make better use of additional weather features. The minimum subset of input data for the best performing model, which achieves a mean absolute error (MAE) of 0.071 across all sites, consists of averaged wind speed, temperature, wind direction, pressure, air density, and time variables (hour, day and month).

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Elizabeth von Zuben and Kristen R. Schell

Status: open (until 17 Apr 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Elizabeth von Zuben and Kristen R. Schell
Elizabeth von Zuben and Kristen R. Schell

Viewed

Total article views: 41 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
34 6 1 41 0 0
  • HTML: 34
  • PDF: 6
  • XML: 1
  • Total: 41
  • BibTeX: 0
  • EndNote: 0
Views and downloads (calculated since 20 Mar 2025)
Cumulative views and downloads (calculated since 20 Mar 2025)

Viewed (geographical distribution)

Total article views: 41 (including HTML, PDF, and XML) Thereof 41 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 23 Mar 2025
Download
Short summary
Wind energy production is directly related to several meteorological variables, but such information is not publicly available at the wind farm location. This research seeks to understand what combination of publicly available meteorological data can accurately predict wind energy production. We find that a combination of publicly available data on wind speeds, temperature, wind direction, air pressure and density, and date and time can improve wind power prediction by over 57 %.
Share
Altmetrics