the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Leveraging Signal Processing and Machine Learning for Automated Fault Detection in Wind Turbine Drivetrains
Abstract. Wind energy is considered a sustainable renewable energy source; however, it faces the challenge of significant operating and maintenance costs. The research proposes a hybrid fault detection method to combine the physical domain knowledge with the machine learning models to provide an overview of the health of wind turbine drivetrain components. Signal processing indicators are computed from raw vibration signals measured from strategically placed accelerometers over drivetrain components. It produces an immense number of indicators as each indicator is sensitive towards certain types of faults, and manual monitoring becomes an unfeasible task. The machine learning models are trained using signal processing indicators and SCADA data. The normal behaviour modelling technique is employed to learn the healthy operation of the machine from data collected during healthy machine operation. The trained normal behaviour machine learning models label each indicator in a healthy or faulty state over time. The labelled state-of-the-art signal processing indicators are fused to provide a high-level health status overview of wind turbine drivetrain components. It helps to derive the required details from many condition indicators, which is valuable when managing multiple components in a single wind turbine across an entire wind farm. The proposed hybrid fault detection method is validated on an offshore wind farm with multiple years of condition monitoring data. It provides a high-level health overview that is readily understandable for non-expert wind farm operators, and for more detailed fault analysis, experts can conduct a comprehensive inspection.
- Preprint
(2671 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on wes-2024-114', Anonymous Referee #1, 07 Dec 2024
The paper presents a robust hybrid methodology for fault detection. However, several areas require further improvement to enhance its clarity, rigor, and general applicability.
-
Clarity of Presentation: Figures, especially 6–9, are vital for understanding the results but are difficult to interpret due to anonymized axes and scales. While confidentiality is necessary, providing normalized or generalized labels (e.g., "Normalized Time," "Fault Indicator Count") would make the visualizations more accessible without compromising sensitive data. Additionally, the Bayesian Ridge Regression model is only briefly introduced; a more detailed justification for its use compared to other regression techniques would improve the methodological section.
-
Technical Gaps: While SCADA data is mentioned as an input, the integration process with vibration data is underexplored. Elaborating on preprocessing steps or synchronization challenges would provide a more comprehensive picture. Moreover, the paper does not report performance metrics such as precision, recall, or false alarm rates for the NBMs or hybrid system. Quantitative validation is crucial to assessing the method’s practical value.
-
Scalability Concerns: The approach requires training individual NBMs for each condition indicator across different operating regimes, which is computationally intensive and may be impractical for large-scale deployment. Although the paper acknowledges this issue, it does not propose concrete steps to address it. Exploring techniques like transfer learning, ensemble models, or feature reduction could mitigate this limitation.
-
Limited Discussion of Related Work: The paper could better contextualize its contributions by comparing the proposed method to other state-of-the-art approaches, such as fully data-driven deep learning systems. Highlighting the relative strengths and weaknesses of the hybrid approach would provide a clearer perspective on its novelty and utility.
-
Broader Applicability: The reliance on both vibration and SCADA data limits the applicability of the method to turbines equipped with these systems. The paper would benefit from discussing adaptations for turbines with only partial data availability or exploring how the method might generalize to other industrial systems.
-
Results Presentation: While the paper discusses case studies qualitatively, it lacks numerical summaries or statistical analysis of the detection performance. Providing such data would strengthen the evidence for the method’s effectiveness. Additionally, the scalability of the method across a fleet of wind turbines needs to be demonstrated more convincingly.
In summary, the paper provides valuable contributions to wind turbine condition monitoring but requires refinements in clarity, technical depth, and validation to maximize its impact. These revisions would ensure a more comprehensive and convincing presentation of the hybrid methodology.
Citation: https://doi.org/10.5194/wes-2024-114-RC1 -
-
RC2: 'Comment on wes-2024-114', Anonymous Referee #2, 21 Dec 2024
Would be interesting to see one of these studies beyond simulation.
Citation: https://doi.org/10.5194/wes-2024-114-RC2 -
RC3: 'Comment on wes-2024-114', Anonymous Referee #3, 23 Dec 2024
The health monitoring or fault detection is one of the most significant research directions to improve the management of offshore wind turbines. This manuscript aims to develop a hybrid fault detection method that combines physical domain knowledge with machine learning models to provide an overview of the health of wind turbine drivetrain components. It provides a solid methodology of health evaluation for wind turbines and the performance of the model is validated by collected real data from wind farm.
Overall, I consider this manuscript well written with clear overview of the problem addressed and contribution provided. The methodology used in the manuscript is well designed and presented. I have one suggestion below that I would like to get discussed and addressed by the authors before publication.
In the model validation session, the authors only showcase the prediction result in figure 7-9 without further statistics on the results. It is not straightforward to get a sense of the model performance. Hence, could the authors provide a more quantified measurement for the performance of the trained machine learning model? For example, what is the percentage of correctly predicted faults? What is the ratio of false positive and false negative?
Citation: https://doi.org/10.5194/wes-2024-114-RC3
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
112 | 40 | 6 | 158 | 1 | 1 |
- HTML: 112
- PDF: 40
- XML: 6
- Total: 158
- BibTeX: 1
- EndNote: 1
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1