the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Scalable SCADA-driven Failure Prediction for Offshore Wind Turbines Using Autoencoder-Based NBM and Fleet-Median Filtering
Abstract. Offshore wind turbines are crucial for sustainable energy production but face significant challenges in operational reliability and maintenance costs. In particular, the scalability and practicality of failure detection systems are a key challenge in large-scale wind farms. This paper presents a scalable, comprehensive approach to failure prediction based on the Normal Behavior Modeling (NBM) framework that integrates three components: a cloud-based pipeline, an undercomplete autoencoder for temperature-based anomaly detection, and a physics-informed, time-aware anomaly filtering method. The pipeline enables dynamic scaling and streamlined deployment across multiple wind farms. The autoencoder was trained exclusively on healthy 10-minute SCADA data and produces detailed anomaly scores that serve as the input for our filtering technique. It was trained on four years of data from a large offshore wind farm in the Dutch-Belgian zone and achieved UHH-ratios (UnHealthy-Healthy) of up to 1.69 and 1.21 for the generator and gearbox models, respectively. The filtering method refines the raw anomaly scores by comparing turbine signals to a windowed fleet median. By aggregating scores via sliding windows and employing robust distance metrics, the method reduces the volume of anomaly scores by up to 65 % without sacrificing predictive accuracy. This selective filtering effectively minimizes noise and non-relevant anomalies, enhancing the efficiency of maintenance analysis.
- Preprint
(2323 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on wes-2025-49', Anonymous Referee #1, 09 Jun 2025
This manuscript presents a well-structured and methodologically rigorous approach to scalable failure prediction in offshore wind turbines using SCADA data, autoencoder-based normal behavior modeling, and fleet median filtering. The authors have developed and validated a cloud-based, modular pipeline and propose a post-processing technique to reduce false positives in anomaly detection. While the work is timely and technically sound, several aspects could benefit from further clarification.
1. The filtering method is described as novel, however similar fleet based anomaly filtering strategies have been discussed in prior work (Hendrickx et al. 2020, Li et al. 2020). A clearer articulation of what distinguishes this work is needed.
2. The fleet median filtering method assumes most turbines operate under the same conditions at any given time. This assumption may break down, when turbines are shut down for maintenance. Furthermore, in region I downstream turbines produce less power due to wake losses, hence their generator and gearbox temperatures are lower than those of upstream turbines. The authors should discuss how such conditions might affect the effectiveness of the filtering method.
3. The scalability of the pipeline is asserted and architecturally supported, but not empirically demonstrated in the manuscript. If this is claimed as a major contribution, the authors should have included for example:
- Report runtime performance under different fleet sizes
- Demonstrate linear or sublinear scaling
- Show cost, memory or latency metrics as functions of loadCitation: https://doi.org/10.5194/wes-2025-49-RC1 -
RC2: 'Comment on wes-2025-49', Anonymous Referee #2, 16 Jun 2025
This paper presents an autoencoder-based anomaly detection approach for failure prediction in offshore wind turbines that analyzes temperature signals from SCADA data. The proposed approach also uses a fleet-level median filtering technique to reduce non-relevant anomalies. While the work addresses important challenges in wind turbine condition monitoring, several aspects require clarification and additional validation.
- The term "physics-informed" used to describe the filtering method could benefit from further clarification. The description of the filtering method in section 3.3 (distance to fleet median, windowing, multidimensional distances) appears to be primarily statistical and temporal, rather than directly incorporating physical models or principles. It would enhance clarity if the authors could explicitly detail how "physics-informed" aspects are integrated into the filtering logic.
- The paper describes its cloud-based pipeline, highlighting its modularity and scalability for managing anomaly detection across wind farms. However, the contribution of this solution remains unclear as the results section focuses solely on the autoencoder and filtering methods. There are no empirical data or quantitative metrics presented to validate the pipeline's actual performance, scalability, or efficiency.
- There is not enough detail about the specific failure types examined in this work. The authors mention gearbox and generator failures, but more information is needed about the failure sub-types and their locations for enhanced clarity.
- While the paper acknowledges that data can differ greatly across the fleet and emphasizes the importance of having a large enough fleet for reliable median calculation, I think more discussion is needed about specific sources of variability that could affect the fleet median approach. The paper assumes that a large fleet size will normalize variations, but factors like seasonal variation, turbine location within the wind farm (wake effects, wind exposure differences), and individual operational patterns might create systematic rather than random variations. It would be helpful to have more analysis of how these location-based and operational differences are distinguished from actual anomalies, especially since some turbines might consistently operate differently due to their position rather than equipment issues.
- Figures 5-7 show negative reconstruction errors and Figures 12-14 show negative anomaly scores. Since reconstruction errors are typically positive differences between predicted and actual values, it's unclear how to interpret negative values in this anomaly detection context.
- I would recommend comparing the autoencoder model with other normal behavior modeling approaches, especially since a cloud-based solution has been provided in this work and deployment of autoencoder models could be expensively high. Alternative models like isolation forest, one-class SVM, or statistical approaches might offer better cost-effectiveness and computational efficiency for cloud deployment while achieving similar anomaly detection performance.
Citation: https://doi.org/10.5194/wes-2025-49-RC2
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
100 | 25 | 8 | 133 | 7 | 6 |
- HTML: 100
- PDF: 25
- XML: 8
- Total: 133
- BibTeX: 7
- EndNote: 6
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1