Simulating run-to-failure SCADA time series to enhance wind turbine fault detection and prognosis

Eftekhari Milani, Ali; Zappalá, Donatella; Castellani, Francesco; Watson, Simon

doi:10.5194/wes-10-2563-2025

Articles | Volume 10, issue 11

https://doi.org/10.5194/wes-10-2563-2025

Articles | Volume 10, issue 11

Research article

12 Nov 2025

Research article |

| 12 Nov 2025

Simulating run-to-failure SCADA time series to enhance wind turbine fault detection and prognosis

Ali Eftekhari Milani, Donatella Zappalá, Francesco Castellani, and Simon Watson

Abstract

Wind turbine supervisory control and data acquisition (SCADA) datasets available for research usually contain a limited number of failure events. This limitation hinders the successful application of deep learning (DL) methods for fault detection and prognosis, as they require large datasets for robust training and generalisation. This work proposes a method using conditional generative adversarial networks (cGANs) to generate synthetic SCADA time series that replicate wind turbine behaviour under controllable operational, environmental, and degradation conditions. Given a set of SCADA time series representing these conditions, the cGAN generates temperature and pressure time series simulating gearbox operation. Results show that augmenting the training set of an artificial neural network (ANN) fault detection model with synthetic time series reduces false positives in the detected gearbox faults by 84 % on average, enabling the model to blindly detect a fault in a test wind turbine without prior knowledge of the event. Furthermore, training a convolutional autoencoder-based unsupervised health indicator (HI) model with both real and synthetic SCADA time series leads to an HI that more accurately captures the expected degradation trend. Using this HI, the gearbox's remaining useful life (RUL) can be predicted within the defined error bounds from around 4.5 months before the detection of the fault, while the HI obtained without the synthetic data fails to produce reliable RUL estimations.

Download & links

Article (PDF, 5052 KB)

Download & links

How to cite.

Received: 04 Apr 2025 – Discussion started: 13 Jun 2025 – Revised: 04 Aug 2025 – Accepted: 30 Sep 2025 – Published: 12 Nov 2025

1 Introduction

State-of-the-art deep learning (DL) methods for wind turbine fault detection and prognosis rely on large datasets for robust training and generalisation. However, component failures in wind turbines are rare events (Spinato et al., 2009), and wind farm operators are often reluctant to disclose detailed information about them due to privacy concerns (Chatterjee and Dethlefs, 2021). Therefore, supervisory control and data acquisition (SCADA) datasets available for research usually include very few failure events, limiting the successful implementation of DL methods. A viable solution to this challenge is to simulate new failure events within SCADA datasets. This involves generating time series data that mimic sensor signals reflecting turbine component behaviour as degradation progresses over a specific time window leading to failure. Rather than merely replicating existing failure event signals, these synthetically generated run-to-failure time series should instead capture diverse degradation scenarios under varying operational and environmental conditions. This diversity is crucial for enhancing the training data of DL fault detection and prognosis models, improving their robustness and practicality.

Existing approaches for generating synthetic signals have mostly used physics-based and hybrid physics-data-driven models of wind turbines. They often simulate damage by methods such as inserting additional mass or reducing local structural stiffness. For example, synthetic vibrational signals generated using the OpenFAST software (Jonkman et al., 2022) have been used to validate condition monitoring methods in Tatsis et al. (2017), Tatsis et al. (2021), and Song et al. (2024). However, they are unsuitable for training DL methods to be applied to real wind turbines with different configurations. In Pujana et al. (2023), a hybrid digital twin of a wind turbine drivetrain is developed to generate synthetic stator winding temperature signals, with the temperature increase due to a generator failure modelled as a heat exchanger. These synthetic signals are used to train a fault detection model. While useful, these methods oversimplify the actual component behaviour and cannot model gradual degradation. Therefore, they fail to generate realistic run-to-failure sequences across multiple SCADA signals, which are critical for prognostic applications.

Data-driven approaches for generating synthetic SCADA time series are rare in the literature. An artificial neural network (ANN)-based framework for generating synthetic SCADA signals, given operational, environmental, and degradation conditions, is proposed in Eftekhari Milani et al. (2024 a). While the generated synthetic signals are in good agreement with the corresponding field data, this approach assumes that sensor signals at each timestamp are deterministic functions of the current conditions. This assumption overlooks the inertia and temporal dependencies present in SCADA signals. These signals, especially temperature data, often exhibit significant inertia, with measurements strongly dependent on their previous values (Mello et al., 2021). Furthermore, this approach does not address the inherent stochasticity of SCADA signals and does not demonstrate whether it can simulate new failure events.

Generative adversarial networks (GANs) (Goodfellow et al., 2020) have been proven successful in generating diverse and realistic synthetic data across many domains, including images (Shorten and Khoshgoftaar, 2019), text (Li et al., 2018), audio (Liu et al., 2022), and video (Chu et al., 2020). These models consist of two neural networks trained in a competitive framework: a generator generating realistic-looking synthetic samples and a discriminator evaluating whether the data are real or synthetic. In the wind turbine SCADA data domain, the application of GANs has been mostly limited to addressing class imbalances in fault detection tasks by augmenting faulty data instances. For example, faulty data instances are generated in Liu et al. (2019) to enhance fault detection performance. In Wang et al. (2022), a variant of GAN called the least squares GAN is used to generate synthetic data instances and improve the performance of an autoencoder-based condition monitoring framework. Similarly, in Liu et al. (2023), a GAN is used to overcome the limitation of scarce faulty data by generating synthetic faulty instances, which are then used to enhance the performance of an autoencoder-based anomaly detection method. These methods outperform more traditional oversampling approaches (Antoniou et al., 2017), such as those based on the synthetic minority oversampling technique (SMOTE) (Chawla et al., 2002), which have been extensively used to address the problem of class imbalance in SCADA datasets (Peng et al., 2020; Yang et al., 2021; Li et al., 2023; Tao et al., 2024). However, they are limited to generating individual signal instances rather than run-to-failure time series. Another limitation of these methods is that they cannot simulate entirely new failure events and are limited to oversampling faulty data instances corresponding to existing failure events in SCADA datasets (Chesterman et al., 2023). Extending these approaches to generate entire time series is challenging, as a generative model must learn not only the feature distributions but also their temporal dynamics. Furthermore, it must generate a diverse set of time series under predefined operational, environmental, and degradation conditions to be useful for wind turbine fault detection and prognosis.

This work addresses these limitations by developing a method based on a conditional GAN (cGAN) (Mirza and Osindero, 2014). Unlike the standard vanilla GAN, the proposed model allows conditioning each generated signal instance to a vector of predefined conditions, including component degradation levels and SCADA measurements related to environmental conditions and the operational states of the wind turbine. To capture temporal dynamics of the signal instances, gated recurrent unit (GRU)-based recurrent neural networks (Cho et al., 2014) are used for both the generator and the discriminator networks, enabling the model to retain a memory of condition vectors from previous timestamps. Furthermore, as suggested by Yoon et al. (2019), a supervised loss term is added to the generator's training loss function to enhance the cGAN's ability to generate realistic SCADA time series. The effectiveness of this approach for fault detection and remaining useful life (RUL) prediction is demonstrated using field SCADA data. The results show that augmenting field data with synthetic time series generated by the cGAN significantly reduces false positives caused by the scarcity of failure events in training data. This enables the model to blindly detect a fault in one of the test wind turbines without prior knowledge of the event. Furthermore, including synthetic time series enhances the performance of a health indicator (HI) construction model. The resulting HIs better capture the degradation trend than those generated without the synthetic data, leading to more accurate RUL predictions. The rest of this paper is organised as follows. Section 2 describes the method developed for synthetic SCADA time series generation. Section 3 introduces the SCADA dataset used, data preprocessing, selected signals, and the healthy and faulty wind turbines. Sections 4 and 5 present the application of the proposed method in fault detection and RUL prediction, respectively. Finally, conclusions are drawn, and future work is discussed in Sect. 6.

2 Generation of synthetic SCADA signals

A set of SCADA signals representing the operation of a wind turbine component, such as gearbox temperature and pressure signals, is a discrete multivariate time series $S = {s_{i, t}}$ with distribution p_s, where $t = 1, \dots, T$ , $i = 1, \dots, N_{s}$ , N_s is the number of component signals, t is the time instance, and T is the length of the dataset. Similarly, $O = {o_{i, t}}$ , where $i = 1, \dots, N_{o}$ , and $E = {e_{i, t}}$ , where $i = 1, \dots, N_{e}$ , are sets of SCADA signals representing operational and environmental conditions over the same time interval $t = 1, \dots, T$ , such as rotor speed and ambient temperature, with N_o and N_e being the number of signals representing these conditions, respectively. D is an HI representing the degradation of the component at each point in time, which can be extracted from the SCADA signals. The set of ${O, E, D}$ is called the condition time series $C = {c_{i, t}}$ , where $i = 1, \dots, N_{c}$ , and $N_{c} = N_{o} + N_{e} + 1$ is the number of condition signals.

It is assumed that S is a function of C (Eftekhari Milani et al., 2024 a), with some stochasticity inherent in the SCADA signals. At each instance t, s_t is a function of not only c_t but also all the previous instances due to the inertia inherent in the SCADA signals. However, in practice, this dependence becomes negligible beyond a certain time window. This relationship can be expressed through a stochastic generative function ℱ:

\begin{matrix} (1) & s_{t} = F (c_{1 : t}, z_{t}), \end{matrix}

where z_t is a vector of random noise with distribution p_z. The objective of this work is to model ℱ through a GAN-based framework and use it to generate a set of synthetic SCADA signals S^s given any C.

2.1 HI construction

As mentioned in the previous section, D is an HI extracted from the SCADA dataset and, together with O and E, forms the condition time series C. In this work, D is obtained using the unsupervised method developed in Eftekhari Milani et al. (2024 b), where it is demonstrated that it can construct HIs that track the true component degradation trend more accurately than other methods proposed in the literature. This approach adopts a convolutional autoencoder (CAE), which is trained using a hybrid of particle swarm optimisation (PSO) and backpropagation to simultaneously maximise the HI monotonicity built in the middle layer and minimise the reconstruction error. Due to the generally irreversible nature of component degradation, an HI is expected to demonstrate a monotonic trend, and monotonicity has been widely used as one of the main criteria to build HIs (She and Jia, 2019; Yang et al., 2022). Therefore, maximising monotonicity leads to an HI that better represents the component degradation, leading to a more accurate RUL prediction. The training fitness function to be maximised is

\begin{matrix} (2) & f = f_{M} - f_{R} - f_{0} - f_{1}, \end{matrix}

where f_M is the monotonicity of the HI built in the middle layer of the CAE, measured using the Mann–Kendall (MK) metric (Pohlert, 2015); f_R is the CAE reconstruction loss, i.e., the mean squared error (MSE) of the difference between the CAE input and output; $f_{0} = | HI (0) |$ is the absolute HI value at its initial timestamp; and $f_{1} = | 1 - HI (end) |$ , where HI(end) is the HI value at its final timestamp. Maximising f corresponds to minimising both f₀ and f₁ and training the CAE to associate the healthy state at the initial timestamp with an HI value of 0 and the failed state at the final timestamp with an HI value of 1. In this work, f₀ is removed from f because the component is not necessarily in a pristine state at the initial timestamp of a run-to-failure SCADA time series.

SCADA measurements are characterised by high noise levels and varying operational and environmental conditions. These factors make it more challenging for the CAE to effectively reconstruct and denoise the signals compared to more controlled vibration signals obtained from bearing test beds used in Eftekhari Milani et al. (2024 b). Therefore, in this work, an additional term f_SD is considered, which measures the average weekly rolling standard deviation of the HI, and minimising this term leads to a less noisy HI. An equal weight of 1 for the four terms f_M, f_R, f₁, and f_SD results in slow convergence during the training process due to the slow minimisation of the reconstruction error, and the obtained training HI tends to be noisy. For this reason, the weights of the f_R and f_SD terms are set to 3 using trial and error to balance the four terms and resolve these issues.

The CAE training fitness function thus used in this work is

\begin{matrix} (3) & f = f_{M} - 3 f_{R} - f_{1} - 3 f_{SD} . \end{matrix}

The CAE architecture and the training algorithm hyperparameters are set according to those proposed in Eftekhari Milani et al. (2024 b).

Since wind turbines operate under highly variable operational and environmental conditions, the constructed HIs usually exhibit local variations, which can reduce the RUL prediction accuracy. To mitigate these variations, a post-processing algorithm is developed, leveraging the usual irreversible nature of component degradation. As shown in Algorithm 1, a curve is fitted to the HI using the non-parametric locally weighted scatterplot smoothing (LOWESS) regression approach (Cleveland and Devlin, 1988). This curve is then subtracted from the HI, and subsequently, the cumulative maximum of the curve is re-added to the HI.

Algorithm 1HI post-processing algorithm.

Fit a curve to the HI using LOESS.
Compute residue: $r e s i d u e = H I - c u r v e$ .
Compute curve_cm: $c u r v e_{c m} (t) = m a x [c u r v e (1 : t)]$ .
Compute the post-processed HI: $H I_{p p} = r e s i d u e + c u r v e_{c m}$ .

Figure 1 provides a visual explanation of this algorithm and its impact on RUL prediction. Figure 1a shows a hypothetical HI and a curve fitted using LOWESS. In Fig. 1b and c, which show the HI at t=200 s and t=300 s, respectively, it is evident that the slope of the regression line oscillates between negative and positive values. This undesirable behaviour is resolved after post-processing the HI with Algorithm 1, as shown in Fig. 1d–f.

https://wes.copernicus.org/articles/10/2563/2025/wes-10-2563-2025-f01

Figure 1Example of HI post-processing on a hypothetical HI: (a–c) raw HI, its first 200 s, and its first 300 s and (d–f) post-processed HI, its first 200 s, and its first 300 s.

Simulating run-to-failure SCADA time series to enhance wind turbine fault detection and prognosis

2.1 HI construction

2.2 Vanilla GAN

2.3 Proposed method for generating synthetic SCADA signals

3.1 Data preprocessing