IEA Wind Task 46 Aerodynamic Benchmark: Computational Aerodynamics Approaches for Assessing Blade Airfoil Performance Reduction due to Leading Edge Degradation

Campobasso, Michele Sergio; Castorrini, Alessio; Bretos, David; Mendez, Beatriz; Maniaci, David C.; Theron, Johannes N.; Meyer Forsting, Alexander; Sørensen, Niels Nørmark; Vimalakanthan, Kisorthman; Caboni, Marco; Gutierrez, Ruben; Gorbachova, Yana; Aihara, Aya; Grasso, Francesco

doi:10.5194/wes-2026-89

Preprints

https://doi.org/10.5194/wes-2026-89

Preprints

03 Jun 2026

| 03 Jun 2026

Status: this preprint is currently under review for the journal WES.

IEA Wind Task 46 Aerodynamic Benchmark: Computational Aerodynamics Approaches for Assessing Blade Airfoil Performance Reduction due to Leading Edge Degradation

Michele Sergio Campobasso, Alessio Castorrini, David Bretos, Beatriz Mendez, David C. Maniaci, Johannes N. Theron, Alexander Meyer Forsting, Niels Nørmark Sørensen, Kisorthman Vimalakanthan, Marco Caboni, Ruben Gutierrez, Yana Gorbachova, Aya Aihara, and Francesco Grasso

Abstract. Leading edge (LE) surface degradation of wind turbine (WT) blades caused by insect accumulation, erosion and other environmental agents reduces the aerodynamic performance of the blades, causing WT power and energy yield losses. Estimating these losses is paramount for cost-informed maintenance planning. Computational Fluid Dynamics (CFD) can predict aerodynamic performance losses. However, sensitivity of these predictions to physical model choice and detailed model settings can be large. To assess this sensitivity, the International Energy Agency Task 46 – Erosion of Wind Turbine Blades, developed the First Aerodynamic Benchmark, presented herein. The performance degradation of the NACA 63₃-418 airfoil due to moderate and severe LE degradation, assessed experimentally in two wind tunnel measurement campaigns, is studied. The clean and degraded airfoil performance predicted by seven CFD codes and two low-fidelity methods are cross-compared and benchmarked against measurements. A utility-scale WT featuring the NACA 63₃-418 airfoil on the outboard blade is used to determine the resulting power and energy losses onshore and offshore. Most codes succeed in predicting the measured airfoil performance reduction due to moderate LE degradation before stall. Consequently, all energy loss estimates are close. Conversely, the variability of the predicted aerodynamic performance reduction due to severe LE degradation is larger, and the variability of the resulting energy losses is also larger than at moderate LE degradation. These results underline both the significant sensitivity to the specific analysis set-up and the need for further research into methods for predicting the impact of advanced LE degradation, such as geometry perturbation-resolving simulations.

How to cite. Campobasso, M. S., Castorrini, A., Bretos, D., Mendez, B., Maniaci, D. C., Theron, J. N., Meyer Forsting, A., Sørensen, N. N., Vimalakanthan, K., Caboni, M., Gutierrez, R., Gorbachova, Y., Aihara, A., and Grasso, F.: IEA Wind Task 46 Aerodynamic Benchmark: Computational Aerodynamics Approaches for Assessing Blade Airfoil Performance Reduction due to Leading Edge Degradation, Wind Energ. Sci. Discuss. [preprint], https://doi.org/10.5194/wes-2026-89, in review, 2026.

Received: 17 May 2026 – Discussion started: 03 Jun 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Status: final response (author comments only)

Subscribe to comment alert

CC1: 'Comment on wes-2026-89', J. Gordon Leishman, 21 Jun 2026

The manuscript addresses a relevant benchmark problem, namely, predicting the aerodynamic performance loss of wind-turbine airfoils caused by leading-edge degradation. However, the paper is unnecessarily long and poorly focused. Its actual contribution is the comparison of several CFD and low-fidelity methods against wind-tunnel data for the NACA 63_3-418 airfoil, followed by propagation of the resulting airfoil polars into turbine power and AEP estimates. That is a useful benchmark exercise, but the manuscript buries it under excessive background material, generic CFD exposition, and lengthy descriptions of standard models.
A major weakness is the inclusion of textbook-level turbulence, transition, and rough-wall-function equations that are neither original nor necessary to interpret the benchmark. Reproducing the $\gamma-Re_\theta$ SST transport equations, SST closure equations, wall-function formulae, and routine solver details does not strengthen the paper. It creates the appearance of technical depth while distracting from the actual scientific issue: whether current modeling practices can reliably predict performance degradation due to leading-edge roughness and erosion. Most of this material belongs either in cited references, a compact methods table, or an appendix.
The paper should be reorganized around the benchmark outcomes rather than generic CFD documentation. The main text should identify the modeling choices that differ among participants, explain which of those choices materially affect the results, and quantify the consequences for lift, drag, stall behavior, power loss, and AEP loss. The authors should also be more critical about the limitations revealed by the benchmark. If the predictions are highly sensitive to roughness modeling, transition treatment, wall functions, turbulence model constants, grid choices, or user-specific setup decisions, then the main conclusion should not be framed as a successful validation of current methods. It should be noted that the current RANS-based roughness modeling remains fragile and only partly predictive, especially for severe leading-edge degradation.
In summary, in its present form, the manuscript reads more like a compilation of participant CFD setup notes than a sharply written benchmark paper. The useful findings are diluted by redundant methodological padding and insufficient critical interpretation. I recommend a major revision. The manuscript should be substantially shortened, with standard equations removed from the main text, solver descriptions compressed, and the discussion refocused on what the benchmark actually demonstrates about predictive capability, uncertainty, and the limitations of current aerodynamic degradation models.

Disclaimer: this community comment is written by an individual and does not necessarily reflect the opinion of their employer.

Citation: https://doi.org/10.5194/wes-2026-89-CC1
RC1: 'Comment on wes-2026-89', Anonymous Referee #1, 23 Jun 2026

Overall Assessment

This manuscript presents a collaborative benchmark study investigating the effects of airfoil-surface roughness and leading-edge degradation on airfoil performance and wind-turbine annual energy production (AEP) using CFD and lower-fidelity methods. The manuscript is particularly valuable as it compares a diverse range of turbulence, transition, and roughness-modeling approaches used by different participants. It evaluates airfoil performance for two roughness heights and subsequently assesses their effects on wind-turbine AEP under both onshore and offshore operating conditions.
However, the manuscript has several limitations that should be addressed before publication. Therefore, I recommend major revision. Please refer to the comments below.
Major Comments

1. Presentation and Comparison of the Turbulence and Transition Models

The manuscript should provide a clearer and more concise comparison of the turbulence and transition models used by the different participants. In the current version, it is difficult to identify the differences among the participating groups without repeatedly consulting several paragraphs and the appendices. No consolidated summary of the turbulence-modeling approaches is provided in the main body of the manuscript. I recommend moving Tables A1, A2, and A3, or condensed versions of these tables, into the main body of the manuscript. These tables contain essential information needed to interpret the results. Presenting this information earlier would allow readers to understand the differences among the modeling approaches before examining the comparative results.
2. Precise Identification and Citation of the Turbulence and Transition Models

The manuscript should pay more attention to the precise identification and citation of the turbulence and transition models. For example, the SST model is cited using both Menter (1994) and Menter et al. (2003). Although the differences between these SST formulations are relatively limited, the specific model version used by each participant should nevertheless be stated clearly.
This issue is more important for the transition model. The manuscript cites Menter et al. (2006), Langtry et al. (2006), and Langtry and Menter (2009) when discussing the transition-model formulation. However, the model formulations and empirical correlations differ considerably among these publications. For example, the earlier formulation included an acceleration parameter, (K), and did not publish the complete set of empirical correlations, whereas the later formulation removed this parameter and provided the complete correlations.
As demonstrated in studies such as the NASA Turbulence Modeling Resource validation cases and the AIAA Transition Modeling Workshops, relatively small differences in model formulations and implementations can produce substantially different predictions. Therefore, it is not sufficient to refer generally to the “(\gamma-Re_{\theta})” transition model. The authors should identify the exact model version and implementation used by each participant.
Considering the already considerable length of the manuscript and the diversity of the numerical approaches, the authors could consider removing or reducing the general explanatory subsections on the SST and transition models (Section 3.1) particularly because not all participants used these models. Instead, the manuscript could provide accurate references and concise descriptions of the exact model versions used by the individual participants.
3. Readability and Presentation of the Figures

The overall readability of many figures is poor, including Figures 2, 4, 5, 7, 8, 11, 12, 14, 15, 17, and 18. It is difficult to distinguish the results of the different participants because of the large number of curves and the limited differentiation among line styles, symbols, and colors. In addition, some of the y-axis ranges are too broad or otherwise unsuitable for identifying meaningful differences among the predictions. The authors should revise these figures by using more distinguishable line styles and symbols and by increasing the font and marker sizes. The axis ranges should also be adjusted where necessary to make differences among the predictions more visible.
For Figures 14, 15, 17, and 18, please also provide a caption or legend identifying the dashed line.
4. Differences in Equivalent Sand-Grain Roughness: Tables A2 and A3

The equivalent sand-grain roughness heights differ among the participants, and several participants appear to have used values that were not derived directly from the experiments. Please explain why different equivalent sand-grain roughness heights were selected. The use of different roughness values raises concerns regarding the fairness and interpretation of the benchmark comparison. If the objective is to compare different approaches for roughness modelling, readers would generally expect all participants to use the same equivalent sand-grain roughness height.

The authors should discuss this issue explicitly and clarify why the experimental equivalent sand-grain roughness heights were not used by all participants.
Specific and Minor Comments

Please verify the citation format throughout the manuscript, including Lines 35 and 40.

- Line 35

Current text: rain Keegan et al. (2013)

Suggested revision: rain (Keegan et al., 2013)

- Line 40

Current text: LE structure at severe damage stages Han et al. (2018); Im and Kim. (2019); Cappugi et al. (2021); Campobasso et al. (2023).

Suggested revision: LE structure at severe damage stages (Han et al., 2018; Im and Kim, 2019; Cappugi et al., 2021; Campobasso et al., 2023).
Lines 129 and 160

The manuscript states:

“The roughness element height is (K=200\ \mu\text{m}), corresponding to a value of (K=246\ \mu\text{m}) for a unitary chord.” and

“The equivalent sand-grain roughness of the P40 sandpaper is (K_s \approx 425\ \mu\text{m}), corresponding to a value of about (708\ \mu\text{m}) for a unitary chord.”

Please also provide the equivalent sand-grain roughness for the lower-roughness case. The discussion currently presented in Lines 469–470 could be moved to immediately after Line 129. The equivalent sand-grain roughness could then be briefly restated in the later discussion for the reader’s convenience.
Line 385

The roughness-induced transition model of Langel et al. (2017) was originally developed in conjunction with the Langtry–Menter two-equation transition model. Please briefly explain how this roughness model was implemented in RFOIL and how it was coupled with the transition-prediction method used in the present study.
Table A1

Please define “Ambient turb.” more clearly. Does this term indicate the use of the sustaining terms by Spalart and Rumsey (2007)?

The table currently reports the physical first-cell height. Please provide the corresponding nondimensional wall spacing, (y^+), because this quantity is more informative. It is unclear whether the mesh-topology column provides useful differentiation because all participants used structured (all-quadrilateral) meshes. Other mesh parameters may be more informative, including:

• Wall-normal growth ratio

• Leading edge / trailing edge spacings.

Citation: https://doi.org/10.5194/wes-2026-89-RC1
RC2: 'Comment on wes-2026-89', Anonymous Referee #2, 15 Jul 2026

The paper presents results from the IEA Wind Task 46 Aerodynamic Benchmark, which concerns the modelling of leading edge erosion effect on airfoil and turbine performance. A number of different institutions/OEMs participate employing a number of tools of varying fidelity. Even when the same CFD solvers are used implementations differ, offering a broader range of approaches. The topic is very relevant and timely. The paper is generally well structured and easy to follow. However, some spects can be considerably improved. The two main points are listed below and more detailed comments are given in the attached file.

The presentation and critical discussion of the experimental validation datasets should be improved. Similarly the discussion on the differences between predictions and experiments can be improved. A statement on LE roughness being beneficial at Re=3M needs to be justified, as it contradicts present experience.
The conclusions are purely descriptive. Based on all the nice work presented here, the paper should include some form of best practice guidelines, otherwise its usefulness remains limited.

Citation: https://doi.org/10.5194/wes-2026-89-RC2

Viewed

Total article views: 149 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
118	26	5	149	7	5

HTML: 118
PDF: 26
XML: 5
Total: 149
BibTeX: 7
EndNote: 5

Views and downloads (calculated since 03 Jun 2026)

Month	HTML	PDF	XML	Total
Jun 2026	81	19	4	104
Jul 2026	37	7	1	45

Cumulative views and downloads (calculated since 03 Jun 2026)

Month	HTML	PDF	XML	Total
Jun 2026	81	19	4	104
Jul 2026	37	7	1	45

Viewed (geographical distribution)

Total article views: 98 (including HTML, PDF, and XML) Thereof 98 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 16 Jul 2026

Short summary

We assess against wind tunnel data the predictions of aerodynamics codes used in industry and research of wind turbine airfoil performance loss due to leading edge degradation. Predictions are close to measurements for moderate surface degradation. At severe degradation, differences among predictions increase due to user model and subjectivity in detailed set-up. Also quantified is the impact of airfoil performance variability on wind turbine energy losses, providing detailed sensitivity data.


Total:	0
HTML:	0
PDF:	0
XML:	0