https://doi.org/10.5194/wes-2025-62
13 Jun 2025
Status: a revised version of this preprint is currently under review for the journal WES.

Simulating run-to-failure SCADA time series to enhance wind turbine fault detection and prognosis

Ali Eftekhari Milani, Donatella Zappalá, Francesco Castellani, and Simon Watson

Abstract. Wind turbine Supervisory Control and Data Acquisition (SCADA) datasets available for research usually contain a limited number of failure events. This limitation hinders the successful application of Deep Learning (DL) methods for fault detection and prognosis, as they require large datasets for robust training and generalisation. This work proposes a method using Conditional Generative Adversarial Networks (cGANs) to generate synthetic SCADA time series that replicate wind turbine behaviour under controllable operational, environmental, and degradation conditions. Given a set of SCADA time series representing these conditions, the cGAN generates temperature and pressure time series simulating gearbox operation. Results show that augmenting the training set of an Artificial Neural Network (ANN) fault detection model with synthetic time series reduces false positives in the detected gearbox faults by 84 % on average, enabling the model to blindly detect a fault in a test wind turbine without prior knowledge of the event. Furthermore, training a Convolutional Autoencoder-based unsupervised health indicator (HI) model with both real and synthetic SCADA time series leads to an HI that more accurately captures the expected degradation trend. Using this HI, the gearbox's remaining useful life (RUL) can be predicted within the defined error bounds from around 4.5 months before the detection of the fault, while the HI obtained without the synthetic data fails to produce reliable RUL estimates.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Status: final response (author comments only)


Viewed

Total article views: 262 (including HTML, PDF, and XML)
  • HTML: 185
  • PDF: 43
  • XML: 34
  • Total: 262
  • BibTeX: 14
  • EndNote: 24
Views and downloads (calculated since 13 Jun 2025)

Viewed (geographical distribution)

Total article views: 262 (including HTML, PDF, and XML) Thereof 262 with geography defined and 0 with unknown origin.
 
Latest update: 15 Aug 2025
Short summary
This paper proposes a data-driven approach to simulating wind turbine sensor time series, such as temperature and pressure signals, that describe the behaviour of a wind turbine component as it degrades over time up to the point of failure. The approach allows new failure events to be simulated and a given failure to be replicated under different conditions. The results show that the synthetic signals generated using this approach improve the performance of fault detection and prognosis methods.