How wind speed shear and directional veer affect the power production of a megawatt-scale operational wind turbine

Most megawatt-scale wind turbines align themselves into the wind as defined by the wind speed at or near the center of the rotor (hub height). However, both wind speed and wind direction can change with height across the area swept by the turbine blades. A turbine aligned to hub-height winds might experience suboptimal or superoptimal power production, depending on the changes in the vertical profile of wind, also known as shear. Using observed winds and power production over 6 months at a site in the high plains of North America, we quantify the sensitivity of a wind turbine's power production to wind speed shear and directional veer as well as atmospheric stability. We measure shear using metrics such as $\alpha$ (the log-law wind shear exponent), $\beta_{\mathrm{bulk}}$ (a measure of bulk rotor-disk-layer veer), $\beta_{\mathrm{total}}$ (a measure of total rotor-disk-layer veer), and rotor-equivalent wind speed (REWS; a measure of actual momentum encountered by the turbine by accounting for shear). We also consider the REWS with the inclusion of directional veer, $\mathrm{REWS}_\theta$, although statistically significant differences in power production do not occur between REWS and $\mathrm{REWS}_\theta$ at our site. When REWS differs from the hub-height wind speed (as measured by either the lidar or a transfer function-corrected nacelle anemometer), the turbine power generation also differs from the mean power curve in a statistically significant way. This change in power can be more than 70 kW or up to 5 % of the rated power for a single 1.5 MW utility-scale turbine. Over a theoretical 100-turbine wind farm, these changes could lead to instantaneous power prediction gains or losses equivalent to the addition or loss of multiple utility-scale turbines. At this site, REWS is the most useful metric for segregating the turbine's power curve into high and low cases of power production when compared to the other shear or stability metrics. Therefore, REWS enables improved forecasts of power production.


Introduction
Wind energy is already the second-largest source of renewable energy in the United States and is the fastest-growing source of renewable energy, providing 6.3 % of the total energy in the United States (EIA, 2017). As wind energy continues to grow, so will the challenge of predicting power output and integrating that power with the rest of the electric grid (Marquis et al., 2011; Woodford, 2011; Xie et al., 2011; Vittal and Ayyanar, 2013; Heier, 2014; Heydarian-Forushani et al., 2014; Sarrias-Mena et al., 2014).
Currently, wind farm operators and control engineers rely on wind turbine power curves to predict the power production of a given model of turbine for various inflow wind speeds (Brower, 2012). The inflow wind speeds are typically measured by instrumentation on top of the nacelle at or near hub height, where the blades of a turbine connect to its hub. Wind turbines are designed to optimize these inflow wind speeds by orienting themselves into the inflow.
Typical turbines use a wind vane located on top of the hub to determine the wind direction at that altitude. The turbine then rotates (yaws) into that inflow so that the hub is aligned with and parallel to the wind vane (Fleming et al., 2014; Wan et al., 2015). This yaw correction happens periodically, and the exact frequency depends on the specific turbine and many other factors. However, hub-height wind speeds and directions do not necessarily represent the inflow across the turbine rotor disk. Wind speed and direction can change with height across the rotor disk, a phenomenon known as shear. "Wind shear" simply considers the change in wind speed with height, whereas a change in wind direction is considered "wind veer" (Holton, 1992). In atmospheric science, the direction of the change in wind direction can also be useful; in the Northern Hemisphere, clockwise rotation with height is considered "veering", while counterclockwise rotation is considered "backing." Several common atmospheric phenomena cause vertical wind shear or veering or backing over the depth of a turbine's rotor disk. Wind speeds tend to increase with height in the atmosphere as the effects of surface friction decrease. In the planetary boundary layer this increase is, on average, logarithmic (Tennekes, 1973). Flows over land exhibit more shear because friction is larger over the land than the ocean. At night, the lack of mixing from convective eddies allows winds in the boundary layer to decouple from the surface such that both wind speed and direction can change with height (Blackadar, 1957; Walter et al., 2009). Nocturnal low-level jets, characterized by a maximum in wind speed in the stable boundary layer, often form over the Great Plains because of the decoupling phenomenon and inertial oscillations as well as the nocturnal change in the thermal wind (Blackadar, 1957; Whiteman et al., 1997; Banta et al., 2002; Vanderwende et al., 2015).
Shear or veer associated with inertial oscillations also occurs because of frontal passages (Lundquist, 2003). Low-level jets can form offshore, leading to wind speed shear (Kraus et al., 1985; Hsu, 1988; Smedman et al., 1993; Ranjha et al., 2013; Pichugina et al., 2017) or wind directional veer (Bodini et al., 2019b) across the altitudes of a turbine rotor disk. Turbines located near the mouth of a canyon might experience shear effects of nocturnal valley exit jets (Banta et al., 1996; Jiménez et al., 2019). Warm and cold air advection can lead to directional veer (Holton, 1992). Outflow from thunderstorms can introduce density currents that affect both speed shear and directional veer (Goff, 1976; Lynch and Cassano, 2006). Finally, land-based topographic effects allow for the formation of localized circulations and microclimatic effects that could interact with the mean airflow across a rotor disk and create shear (Mahrt et al., 2014; Fernando et al., 2019).
Over the past 3 decades, various observational studies have related shear to turbine power production. As early as 1990, observations of three 2.5 MW turbines showed that shear affected power curves (Elliott and Cadogan, 1990). Shear decreases the power coefficient, compared to no-shear cases, for multimegawatt turbines (Albers et al., 2007). Diurnal variations in power production have been found to result from diurnal variations in shear in a region of complex terrain at a site in the interior of the continental United States. Walter et al. (2009) found that using observed shear values (rather than no-shear values) could increase the power of a theoretical wind farm by up to 0.5 %, while decreases in power could approach 3 %. Model power curves (or power surfaces where the power production of a turbine is a function of both wind speed and air density) made from equivalent wind speeds from actual 2.5 MW turbine power data are more accurate than a standard power curve (Vahidzadeh and Markfort, 2019).
In addition, other simulation-based studies quantify the magnitude of the effects found observationally (Pedersen, 2004;Wagner et al., 2010). The power productions found in both Pedersen (2004) and Wagner et al. (2010) are dependent on the magnitude of the shear and whether the shear is based on direction or velocity. Wagner et al. (2010) additionally find that directional veer was less influential on the power production than speed shear. Sanchez Gomez and Lundquist (2020) suggest a combination of directional veer and shear should be considered.
Actual observations of wind shear and veer exhibit a significant variety of shapes (Pé et al., 2018), as shown in Fig. 1, with four wind speed profiles from vertically profiling Doppler lidar and relevant idealized linear and logarithmic profiles. All profiles show differences between the idealized profiles and the actual profiles, as well as differences between the 80 m wind speed (effectively the height of the nacelle anemometer and vane) and the speeds at other heights. Though the first three of the four real profiles appear similar to the idealized profiles, differences occur between the winds at all non-80 m heights and the idealized profiles. The fourth profile (Fig. 1d) shows the most nonlinear and nonlogarithmic wind speed profile and also shows the greatest difference between the 80 m wind speeds and wind speeds at other heights (Fig. 1h). Because differences exist between the height levels for all profiles, the 80 m wind speed, and thus the nacelle wind speed, is not truly representative of the average wind speed across the rotor for any of the wind speed profiles.
This poor representation has consequences for turbine power production. The power produced by a turbine varies with the cube of the inflow wind speed in region II of a power curve (where turbines spend most of their time operating and where each of the profiles was taken from), as seen by

$$P(t) = \frac{1}{2}\,\rho\,A\,C_p\,U(t)^3, \tag{1}$$

where $P(t)$ is the power at a given time $t$, $\rho$ represents the air density, $A$ represents the area swept out by the rotor disk, $C_p$ represents the coefficient of power, which has a maximum of 0.59, and $U(t)$ represents the inflow wind speed across the rotor disk at time $t$ (Brower, 2012). Directional veer can mitigate or worsen the effects of speed shear. A rotor-equivalent wind speed (REWS) metric can describe the actual momentum encountered by a turbine rotor disk by accounting for the vertical shear. The simplest REWS, proposed by Wagner et al. (2009), accounts for only the wind speed shear and does so by dividing a turbine's rotor disk into discrete vertical layers or bins:

$$\mathrm{REWS}_{\mathrm{Wagner}} = \left(\sum_i u_i^3\,\frac{A_i}{A}\right)^{1/3},$$

where $\mathrm{REWS}_{\mathrm{Wagner}}$ is the equivalent wind speed, $A$ represents the area swept out by the rotor disk, $A_i$ represents the area of a discretized section of the rotor disk, and $u_i$ represents the wind speed measured for the given section. Using a blade element momentum model to simulate a 3.6 MW turbine, Wagner et al. (2008) show that power production correlates better with the REWS than with the hub-height wind speed. Later work specified a $\mathrm{REWS}_\theta$, which considers both speed shear and directional veer (Wagner et al., 2010; Choukulkar et al., 2015; Clack et al., 2016). Though similar to $\mathrm{REWS}_{\mathrm{Wagner}}$, this method considers only the component of the inflow wind speed orthogonal to the plane of the turbine's rotor disk at each height bin. Although the combined effects of wind speed shear and wind directional veer on a turbine's power production are often stronger than either speed shear or directional veer alone, speed shear exerts more influence than directional veer in most circumstances.
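To make the cubic sensitivity in Eq. (1) concrete, the following is a minimal sketch rather than the authors' code. The function name and the numerical inputs (air density, power coefficient) are illustrative assumptions; only the 80 m rotor diameter comes from the text.

```python
import math


def rotor_power_kw(wind_speed, air_density, rotor_diameter, power_coefficient):
    """Power from the cube-law relation P = 0.5 * rho * A * C_p * U**3, in kW.

    All argument values used below are illustrative assumptions, not site data.
    """
    swept_area = math.pi * (rotor_diameter / 2.0) ** 2  # rotor disk area, m^2
    watts = 0.5 * air_density * swept_area * power_coefficient * wind_speed ** 3
    return watts / 1000.0


# Hypothetical inputs: rho = 1.0 kg m^-3, D = 80 m, C_p = 0.4.
p_hub = rotor_power_kw(8.0, 1.0, 80.0, 0.4)
p_rews = rotor_power_kw(8.4, 1.0, 80.0, 0.4)  # a 5 % higher equivalent wind speed
print(f"relative power change: {p_rews / p_hub - 1.0:.3f}")
```

Because power scales with the cube of wind speed, a 5 % difference between the hub-height wind speed and the rotor-equivalent wind speed corresponds to a factor of $1.05^3 \approx 1.158$ in power while the turbine remains in region II.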
Turbulence can also affect the momentum accessible to a wind turbine rotor and is accounted for in the method of Choukulkar et al. (2015). Although former studies used REWS and similar metrics to explore the impact of shear and atmospheric stability on the prediction of power production from megawatt-scale turbines (Elliott and Cadogan, 1990; Rohatgi and Barbezier, 1999; Pedersen, 2004; Sumner and Masson, 2006; Albers et al., 2007; Van den Berg, 2008; Antoniou et al., 2009; Walter et al., 2009; Belu and Koracin, 2012; Wharton and Lundquist, 2012b; Vanderwende and Lundquist, 2012; Sanchez Gomez and Lundquist, 2020; Vahidzadeh and Markfort, 2019), a more recent study (Sark et al., 2019) concludes that turbines in regions with flat terrain do not benefit from using REWS rather than a hub-height wind speed. Here, we explore how different regimes of speed shear and directional veer across the turbine rotor disk affect power production of a megawatt-scale onshore turbine in a wind farm in the high plains of North America. Defining several wind speed and direction-based shear metrics, we compare power production in different regimes. We distinguish the importance of wind shear and veer and suggest the influence of topography. Finally, we address how the regimes differ from a mean power curve.
In Sect. 2, we describe the observational data set and data processing steps. In Sect. 3, we define REWS metrics and other shear metrics to characterize speed shear and directional veer. In Sect. 4, we describe distributions of the metrics for this site, demonstrate the superiority of REWS over hub-height wind speed for power prediction, and explore how other shear metrics relate to power production. We summarize results in Sect. 5 and pose suggestions for future work.

Observational data set
The data discussed in this paper were collected as part of a wake-steering campaign conducted by the National Renewable Energy Laboratory on five turbines at a commercial wind farm in the high plains of North America (Fig. 2; more details in Fleming et al., 2019). Data for this study were collected from 04:00 UTC on 2 May 2018 through 23:59 UTC on 31 October 2018. This paper focuses on the turbine shown in red in Fig. 2. Although this turbine is not waked under typical wind directions at the site, waked data are removed as described in Sect. 2.3. Wind profile observations are collected by the lidar 350 m east-northeast of the chosen turbine. Exact locations and elevations are not given at the request of the wind farm owner and operator.

Figure 2. The westernmost red circle represents the turbine studied in the paper. The triangle represents the vertically profiling Doppler lidar and meteorological tower (co-located). The four black circles to the east-southeast represent other turbines that could potentially wake the lidar and studied turbine.

Turbine data set
The turbine and lidar are located at the same elevation on a flat plateau. To the east and southeast, four other turbines are located within 1 km (Fig. 2). Methods for filtering waked data are described in Sect. 2.3. The plateau's escarpment, which descends around 100 m km$^{-1}$, lies south of the focus area. Southerly winds are not filtered out of the data set because such terrain can lead to the formation of speed shear and directional veer. The northerly fetch is relatively complex as well, though to a much lesser extent than the southerly fetch. To the northeast, the terrain descends to a depth of about half that of the escarpment to the south and does so over a much gentler slope. To the northwest, the terrain descends to a depth of about one-ninth that of the south. To the north, the terrain descends to a depth of around one-fourth that of the south.
The turbine of interest is a 1.5 MW General Electric superlong extended cold-weather extreme model with a cut-in wind speed of 3.5 m s$^{-1}$, a rated wind speed of 14 m s$^{-1}$, and a cut-out speed of 25 m s$^{-1}$. Both the turbine rotor diameter $D$ and the hub height are nominally 80 m. Power production, nacelle wind speed and direction, fault codes (such as "turbine ok", "weather conditions", "grid loss"), and blade pitch angle from the turbine were recorded at 1 Hz by the turbine's supervisory control and data acquisition (SCADA) systems. Data processing methods applied to the turbine data, including the treatment of potential curtailments and periods of inactivity, are addressed in Sect. 2.3.
For comparison to the power production, we consider the power curve of a generic 1.5 MW turbine (Schmitz, 2015).

Lidar data set
Wind speed and direction profiles are collected by a Leosphere WINDCUBE v2 located ∼ 4 $D$ east-northeast of the turbine, identical to the model used in Bodini et al. (2019a) and Bodini et al. (2019b). The lidar takes three-dimensional wind speed and direction measurements at approximately 1 Hz every 20 m a.g.l. from 40 to 180 m. The lidar samples sequential line-of-sight velocity measurements along the four cardinal directions at 28° ($\theta$) from the vertical followed by an additional beam oriented vertically. It completes a cycle of measurements nearly every 5 s. The lidar synthesizes the beams' line-of-sight measurements into a 1 Hz sample of horizontal and vertical wind speed component measurements. The manufacturer reports horizontal wind speed accuracies of 0.1 m s$^{-1}$ and wind direction accuracies of 2°. Time lags between the lidar and the turbine were not considered because of challenges in considering the advection of the wind. The horizontal wind speed components, $u$ (west-east) and $v$ (south-north), are found by

$$u = \frac{V_{\mathrm{los,E}} - V_{\mathrm{los,W}}}{2\sin\theta}, \qquad v = \frac{V_{\mathrm{los,N}} - V_{\mathrm{los,S}}}{2\sin\theta},$$

where $V_{\mathrm{los}}$ denotes the line-of-sight velocities at the cardinal directions north (N), east (E), south (S), and west (W). A meteorological tower with a Campbell CSAT3 sonic anemometer at 10 m, a Vaisala PTB110 pressure sensor at 1.5 m, a relative humidity measurement at 2 m, and an RTD temperature measurement at 2 m is co-located with the lidar.
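The beam-to-component conversion described above can be sketched as follows. This is an illustrative reconstruction, not the lidar manufacturer's processing: the function name is hypothetical, and the sign convention (line-of-sight velocity positive away from the lidar along each beam) is an assumption.

```python
import math


def horizontal_wind(v_north, v_east, v_south, v_west, cone_angle_deg=28.0):
    """Recover u (west-east) and v (south-north) from four line-of-sight beams.

    Assumes each line-of-sight velocity is positive away from the lidar and
    that the flow is homogeneous across the scan cone (both are assumptions).
    """
    scale = 2.0 * math.sin(math.radians(cone_angle_deg))
    u = (v_east - v_west) / scale
    v = (v_north - v_south) / scale
    speed = math.hypot(u, v)
    # Meteorological convention: the direction the wind blows FROM, in degrees.
    direction = math.degrees(math.atan2(-u, -v)) % 360.0
    return u, v, speed, direction


# Synthetic check: a 10 m/s westerly wind with a 0.5 m/s vertical component.
theta = math.radians(28.0)
u0, v0, w0 = 10.0, 0.0, 0.5
beams = {
    "north": v0 * math.sin(theta) + w0 * math.cos(theta),
    "east": u0 * math.sin(theta) + w0 * math.cos(theta),
    "south": -v0 * math.sin(theta) + w0 * math.cos(theta),
    "west": -u0 * math.sin(theta) + w0 * math.cos(theta),
}
u, v, speed, direction = horizontal_wind(
    beams["north"], beams["east"], beams["south"], beams["west"]
)
```

Differencing opposite beams cancels the vertical velocity contribution, which is why the vertical component does not appear in the horizontal estimates under the homogeneity assumption.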
To quantify atmospheric stability, the Obukhov length $L$ is calculated using 20 Hz 10 m sonic anemometer data, 1 Hz 1.5 m pressure data, 1 Hz 2 m temperature measurements, and 1 Hz 2 m relative humidity measurements. The calculation is a ratio: the numerator combines the friction velocity $u_*$ with a virtual potential temperature $\theta_v$, and the denominator combines the von Kármán constant $k = 0.4$, the acceleration of gravity $g = 9.81$ m s$^{-2}$, and the kinematic sensible heat flux $\overline{w'\theta_v'}$. The friction velocity is calculated by

$$u_* = \left[\overline{u'w'}^{\,2} + \overline{v'w'}^{\,2}\right]^{1/4}.$$

The $\theta_v$ in the numerator is the virtual potential temperature in Kelvin calculated from the 1 Hz 2 m temperature $T_d$ in Celsius, with modifications from the 1 Hz 2 m relative humidity RH and the 1.5 m pressure $p$ to convert the temperature to a virtual temperature $T_v$; this conversion starts from the saturation vapor pressure, $e_s = 6.11 \times 10^{(\ldots)}$, where the exponent depends on $T_d$. Further modifications from the 1.5 m pressure $p$ by

$$\theta_v = (T_v + 273.15)\left(\frac{p_0}{p}\right)^{R/c_p},$$

with $p_0 = 1000$ mbar and $R/c_p \approx 0.286$, convert the virtual temperature to a virtual potential temperature. The $\theta_v$ in the denominator is the virtual potential temperature in Kelvin calculated from the 20 Hz virtual temperature from the speed of sound and the same potential pressure calculation as the numerator, and $\overline{w'\theta_v'}$ is the kinematic sensible heat flux. The covariances for the heat flux and friction velocity are calculated from a Reynolds decomposition over a 30 min averaging time.
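The ingredients named above (Reynolds-decomposed covariances, the friction velocity, and the pressure correction to potential temperature) can be sketched as below. This is a minimal illustration under stated assumptions: the function names are hypothetical, the averaging window is simply whatever samples are passed in, and no detrending or coordinate rotation of the sonic data is attempted.

```python
from statistics import fmean


def covariance(a, b):
    """Covariance of fluctuations about the window mean (Reynolds decomposition)."""
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_a, mean_b = fmean(a), fmean(b)
    return fmean((x - mean_a) * (y - mean_b) for x, y in zip(a, b))


def friction_velocity(cov_uw, cov_vw):
    """u* = [cov(u,w)^2 + cov(v,w)^2]^(1/4), as stated in the text."""
    return (cov_uw ** 2 + cov_vw ** 2) ** 0.25


def virtual_potential_temperature(t_virtual_celsius, pressure_mbar,
                                  reference_pressure_mbar=1000.0, kappa=0.286):
    """theta_v = (T_v + 273.15) * (p0 / p)^(R/c_p), in Kelvin."""
    return (t_virtual_celsius + 273.15) * (reference_pressure_mbar / pressure_mbar) ** kappa


# Hypothetical covariances for one 30 min window (values are illustrative only).
u_star = friction_velocity(-0.09, 0.0)
```

In practice the 20 Hz samples from one 30 min window would be passed to `covariance` for each pair of variables before the friction velocity and heat flux are formed.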
To quantify atmospheric stability we use two regimes, convective and stable, based on the nondimensional stability parameter (otherwise known as the surface-layer scaling parameter), $\zeta = z/L$, where $z$ is the height above ground level (10 m) of the flux measurements for $L$. Note that these categories are similar but not identical to the stable and nonstable categories of Fleming et al. (2019). Convective conditions occur during $0 < \zeta < \infty$, while stable conditions occur when $-\infty < \zeta < 0$. Values farther from 0 indicate more strongly convective or more strongly stable conditions. Values that could be considered neutral ($-0.01 \le \zeta \le 0.01$ as in Wharton and Lundquist, 2012a) only occur in 3.9 % of the postfiltered data and so are classified as stable or convective based on their sign. Figure 3 shows the dominant winds as measured by the lidar at 80 m a.g.l. (hub height) during the campaign through three wind roses using (a) all data, (b) data from convective conditions, and (c) data from stable conditions. This figure is made with prefiltered data.

Data filtering
Data collection extended from 04:00 UTC on 2 May 2018 until 23:59 UTC on 31 October 2018, nearly $15.8 \times 10^6$ s (nearly 6 months of data). Several data filters are applied.
Because of our focus on power production, we first removed time periods with turbine fault codes given in the SCADA data. Data are considered acceptable for four SCADA codes, "turbine ok", "turbine with grid connection", "run up/idling", and "weather conditions". The codes that are filtered out are related to maintenance, repair, grid loss, stops, wind direction curtailments, and further codes that are determined by the utility company to be bad but are not specified further. This filter removed 14.2 % of the data.
A further 11.5 % of the data were removed because the turbine was not producing power (only data with power greater than 0 kW were retained). Another 8.4 % of the data were removed because the lidar was not functioning at one or more of its five measurement heights within the turbine rotor disk.
Data with blade pitch angles greater than 6° were filtered out as well to remove data that could be affected by curtailments. Blade pitch angles were used to filter data in other studies (St. Martin et al., 2016; Sanchez Gomez and Lundquist, 2020). The 6° threshold for this 1 Hz data set was chosen experimentally to retain as many data as possible while still removing outliers. This approach removed a further 8.1 % of the data.
Times when ζ could not be calculated because of issues with any of the instrumentation used in creating ζ were removed. This filter removed around 0.67 % of the data.
Because of our focus on power production in region II of the turbine, we only considered data with REWS less than or equal to the turbine's rated wind speed. Once the REWS is at rated speed, the turbine can be assumed to be operating at rated power, regardless of whether the REWS is greater or less than the nacelle wind speed. This filter removed 0.48 % of the data.
Once the data had been filtered, we considered turbine yaw error. The lidar 80 m wind direction may differ from the turbine nacelle wind vane (Fig. 4a). Differences in direction greater than 25° in either direction were filtered out because of the large effects of yaw misalignment, as shown in Fig. 4b, which shows the theoretical effect of the cosine, cosine$^2$, and cosine$^3$ relationships between the yaw misalignment and power production by a yaw-misaligned turbine (Pedersen, 2004; Choukulkar et al., 2015; Mittelmeier and Kühn, 2018). The curve that a yaw-misaligned turbine follows depends on the aeroelastic properties of a given turbine itself (Fleming et al., 2014). Note that although these theoretical power impacts are symmetric, some work (Wagner et al., 2010; Sanchez Gomez and Lundquist, 2020) suggests that veering and backing have nonsymmetric effects. This filter removed 4.1 % of the data.
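The three theoretical yaw-loss relationships mentioned above can be compared with a short sketch. The function name is hypothetical, and which exponent applies to a given turbine is not specified here; the loop simply evaluates all three at the 25° filter threshold.

```python
import math


def yaw_power_fraction(misalignment_deg, exponent):
    """Fraction of aligned power retained under a cos**exponent yaw-loss model."""
    return math.cos(math.radians(misalignment_deg)) ** exponent


# Evaluate the cosine, cosine^2, and cosine^3 models at the 25 degree threshold.
for exponent in (1, 2, 3):
    fraction = yaw_power_fraction(25.0, exponent)
    print(f"cos^{exponent}: {fraction:.3f} of aligned power")
```

Because the cosine is an even function, each of these models gives the same loss for misalignment to the left and to the right, which is the symmetry the text contrasts with observed veering and backing effects.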
Finally, wind directions were removed during which either the lidar or the turbine could be waked (gray areas in Fig. 3 resulting from the turbine locations shown in Fig. 2). To specify these directions, the difference in wind speed between the lidar at 80 m (hub height) and the nacelle is calculated for 1° direction bins (direction measured by the lidar). A 99 % two-tailed confidence interval is calculated for each bin:

$$\bar{x} - t_{0.005}\,\frac{\sigma}{\sqrt{N}} \;\le\; \mu_{\mathrm{metric}} \;\le\; \bar{x} + t_{0.005}\,\frac{\sigma}{\sqrt{N}}, \tag{9}$$

where $\mu_{\mathrm{metric}}$ is the true population mean of the wind speed difference in a bin, $\bar{x}$ is the sample mean of the wind speed difference in a bin, $t_{0.005}$ is the critical value of $t$ at 99 % confidence, $\sigma$ is the sample standard deviation of the wind speed difference in a bin, and $N$ is the number of values in the bin (Fig. 5; Wilks, 1962). Based on Fig. 5, we removed directions where the 99 % confidence interval on the mean difference between the two wind speeds over a 15° group of direction bins changed smoothly to be 1 m s$^{-1}$ different from the mean without inclusion of those directions (70-160 and 235-275°). This removal was done iteratively by hand by changing the removed directions (and thus changing the mean without those directions). The southeasterly flow does not completely conform to the quantitative process because of the physically based inflection point, where the turbine is waked more strongly closer to the east and the lidar is waked more strongly closer to the south because of the layout of the equipment (Fig. 2). However, those directions were removed as well. Discarding these wind directions removed an additional 22.2 % of the data. We repeated the same process based on the nacelle wind direction, which resulted in smaller ranges of wind directions (not shown). The wider direction bins (from the lidar direction) were filtered. All of these filtering processes left a total of nearly $4.8 \times 10^6$ s for analysis, or the equivalent of almost 2 months of 1 Hz data (30.4 % of the total). Subsequent analyses were applied to this subset of the data.
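The per-bin confidence interval used for the wake filter can be sketched as follows. This is an illustrative outline only: the function names are hypothetical, the critical value of $t$ must be supplied by the caller (for example from a statistical table), and the iterative, by-hand selection of removed sectors described above is not reproduced.

```python
import math
from statistics import mean, stdev


def mean_confidence_interval(values, t_critical):
    """Two-tailed interval: mean +/- t_critical * sample SD / sqrt(N)."""
    n = len(values)
    if n < 2:
        raise ValueError("at least two values are needed for a sample SD")
    centre = mean(values)
    half_width = t_critical * stdev(values) / math.sqrt(n)
    return centre - half_width, centre + half_width


def bin_by_direction(directions_deg, values, bin_width_deg=1.0):
    """Group values (e.g. lidar-minus-nacelle speed) into wind direction bins."""
    bins = {}
    for direction, value in zip(directions_deg, values):
        key = int((direction % 360.0) // bin_width_deg)
        bins.setdefault(key, []).append(value)
    return bins


# Hypothetical speed differences in one bin; t_critical = 2.0 is illustrative.
low, high = mean_confidence_interval([1.0, 2.0, 3.0, 4.0, 5.0], t_critical=2.0)
```

For the very large 1 Hz samples typical of each bin, the 99 % critical value of $t$ approaches the normal value of about 2.576, but bins with few samples need the value for their own degrees of freedom.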
All subsequent data percentage plots are based on the filtered data set.

Methods
Calculations of shear metrics are described in Sect. 3.1. Methods for creating power curves are described in Sect. 3.2.

Shear calculation methods
REWS represents the effect of wind speed shear across the rotor disk using discretized wind speed profiles. REWS is calculated by

$$\mathrm{REWS} = \left[\sum_{i=0}^{3} \frac{A_{z(i)\,\mathrm{to}\,z(i+1)}}{A_{\mathrm{turbine}}} \left(\frac{U_{z(i)} + U_{z(i+1)}}{2}\right)^{3}\right]^{1/3}, \tag{11}$$

where $z$ represents a height from the list of discrete heights that the lidar measures across the rotor diameter (40, 60, 80, 100, and 120 m) and $i \in \{0, 1, 2, 3, 4\}$ indexes through those heights, $A_{z(i)\,\mathrm{to}\,z(i+1)}$ represents the area of the rotor disk between two discrete heights $z(i)$ and $z(i+1)$, $U_{z(i)}$ represents the wind speed at the height $z(i)$ and $U_{z(i+1)}$ represents the wind speed at the height $z(i+1)$, and $A_{\mathrm{turbine}}$ represents the overall area of the turbine rotor disk (approximated to be a perfect circle of radius 40 m for our purposes). This calculation follows the method of Wagner et al. (2008) but with slight modifications because of the lidar data collection at discrete heights, including the rotor disk bottom and top, rather than heights found in the middle of each discrete interval (Fig. 6). This averaging assumes that the winds vary linearly across each 20 m span of the turbine and that their average represents the true inflow across that area.

Figure 6. Schematic for calculation of the REWS. The turbine rotor disk (circle) is divided into four discrete areas. $A_i$ denotes the area of the colored section from $z(i)$ to $z(i+1)$. The wind speed terms denote the averaged horizontal wind speed used for a given colored area. Lidar measurement heights are shown at right.
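The REWS calculation described above can be sketched as below, assuming a circular rotor disk centered at hub height and the layer-averaged wind speed between adjacent lidar heights. The function names and the example profile are hypothetical, and the closed-form strip area is one way to obtain the area between two heights rather than a method confirmed by the source.

```python
import math


def strip_area(z_low, z_high, hub_height, radius):
    """Area of the circular rotor disk lying between heights z_low and z_high."""
    def antiderivative(y):
        # Integral of the chord width 2*sqrt(R^2 - y^2), clamped to the disk.
        y = max(-radius, min(radius, y))
        return y * math.sqrt(radius ** 2 - y ** 2) + radius ** 2 * math.asin(y / radius)
    return antiderivative(z_high - hub_height) - antiderivative(z_low - hub_height)


def rotor_equivalent_wind_speed(heights, speeds, hub_height=80.0, radius=40.0):
    """Area-weighted cube-root mean of the cubed layer-averaged wind speeds."""
    disk_area = math.pi * radius ** 2
    weighted_cubes = 0.0
    for i in range(len(heights) - 1):
        fraction = strip_area(heights[i], heights[i + 1], hub_height, radius) / disk_area
        layer_speed = 0.5 * (speeds[i] + speeds[i + 1])  # linear variation assumed
        weighted_cubes += fraction * layer_speed ** 3
    return weighted_cubes ** (1.0 / 3.0)


lidar_heights = [40.0, 60.0, 80.0, 100.0, 120.0]
uniform = rotor_equivalent_wind_speed(lidar_heights, [8.0] * 5)
sheared = rotor_equivalent_wind_speed(lidar_heights, [6.0, 7.0, 8.0, 9.0, 10.0])
```

For a uniform profile the REWS reduces to the hub-height wind speed, while a profile that is linear and symmetric about hub height yields a slightly larger REWS because the cube weights the faster layers more heavily.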
We use the REWS to calculate a difference from the nacelle wind speed as $\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}}$:

$$\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}} = \mathrm{REWS} - \left(U_{\mathrm{nacelle}} + \mathrm{NTF}\right),$$

where REWS is as calculated in Eq. (11), $U_{\mathrm{nacelle}}$ is the wind speed measured by the nacelle-mounted anemometer, and $\mathrm{NTF} = \overline{(U_{\mathrm{lidar}} - U_{\mathrm{nacelle}})}$ is a simple nacelle transfer function (NTF). This simple NTF is a bias calculation between the lidar wind speed and nacelle wind speed of 0.686 m s$^{-1}$ based on all wind directions over the entire filtered data set. Although the NTF varies slightly with direction (Fig. 5), those variations are less than 10 % of the NTF itself. A true NTF is not applied in part because the lidar does what a true transfer-function-corrected nacelle measurement is supposed to do: measure the wind speed most accurately, disregarding rotor wake effects. The application of the NTF shifts the peak of the histogram of $\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}}$ to 0 as well (Fig. 8a). A similar metric comparing the lidar hub-height wind speed with the REWS, $\Delta\mathrm{REWS}_{\mathrm{L}}$, is calculated by

$$\Delta\mathrm{REWS}_{\mathrm{L}} = \mathrm{REWS} - U_{\mathrm{lidar}},$$

where $U_{\mathrm{lidar}}$ is the hub-height lidar wind speed measurement. With these definitions, $\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}}$ and $\Delta\mathrm{REWS}_{\mathrm{L}}$ quantify whether using the hub-height wind speed (corrected nacelle or lidar) overestimates ($\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}}$ or $\Delta\mathrm{REWS}_{\mathrm{L}}$ is negative) or underestimates ($\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}}$ or $\Delta\mathrm{REWS}_{\mathrm{L}}$ is positive) the rotor-disk-integrated winds encountered by the turbine.
The REWS with direction, $\mathrm{REWS}_\theta$, represents the effect of both wind speed shear and wind directional veer across the rotor disk using discretized wind speed and direction profiles (Choukulkar et al., 2015). Similar to how Eq. (11) integrates wind speed across the rotor disk, $\mathrm{REWS}_\theta$ integrates the normal component of the flow across the rotor disk and therefore considers the directional veering and backing:

$$\mathrm{REWS}_\theta = \left[\sum_{i=0}^{3} \frac{A_{z(i)\,\mathrm{to}\,z(i+1)}}{A_{\mathrm{turbine}}} \left(\frac{U_{z(i)}\cos\theta_{z(i)} + U_{z(i+1)}\cos\theta_{z(i+1)}}{2}\right)^{3}\right]^{1/3}, \tag{13}$$

where $z$, $i$, $A_{z(i)\,\mathrm{to}\,z(i+1)}$, $U_{z(i)}$, $U_{z(i+1)}$, and $A_{\mathrm{turbine}}$ are as described for Eq. (11) and $\theta_{z(i)} = \theta_{\mathrm{lidar},z(i)} - \theta_{\mathrm{nacelle}}$ is the difference between the lidar wind direction at height $z(i)$ and nacelle wind direction (and is always between −180 and 180°). $\theta_{z(i)} < 0$ specifies that the lidar-measured wind direction is "to the left" of the turbine as seen facing upwind, while $\theta_{z(i)} > 0$ specifies the lidar wind direction is "to the right" of the turbine as seen facing upwind.
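A sketch of the direction-aware variant follows, assuming that the rotor-normal component of the wind at each height is $U\cos\theta$ and that adjacent heights are averaged before cubing. The function names, the example directions, and the rounded area fractions passed in are illustrative assumptions rather than values taken from the source.

```python
import math


def wrap_degrees(angle):
    """Wrap an angle difference into the interval [-180, 180) degrees."""
    return (angle + 180.0) % 360.0 - 180.0


def rews_theta(speeds, directions_deg, nacelle_direction_deg, area_fractions):
    """REWS using only the wind component normal to the rotor plane.

    area_fractions[i] is the share of the rotor disk between heights i and i+1;
    the caller supplies it (for example from the disk geometry).
    """
    if len(speeds) != len(directions_deg) or len(area_fractions) != len(speeds) - 1:
        raise ValueError("need one area fraction per pair of adjacent heights")
    normal = [
        u * math.cos(math.radians(wrap_degrees(d - nacelle_direction_deg)))
        for u, d in zip(speeds, directions_deg)
    ]
    total = sum(
        fraction * (0.5 * (normal[i] + normal[i + 1])) ** 3
        for i, fraction in enumerate(area_fractions)
    )
    # Keep the sign so a net reverse flow does not produce a complex root.
    return math.copysign(abs(total) ** (1.0 / 3.0), total)


fractions = [0.2, 0.3, 0.3, 0.2]  # rounded, illustrative layer shares
aligned = rews_theta([8.0] * 5, [285.0] * 5, 285.0, fractions)
veered = rews_theta([8.0] * 5, [275.0, 280.0, 285.0, 290.0, 295.0], 285.0, fractions)
```

With no directional difference the result equals the speed-only REWS for the same profile, and any veer or backing relative to the nacelle direction can only reduce the rotor-normal component.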
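To quantify the difference from the nacelle wind speed, $\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}}$ is calculated by

$$\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}} = \mathrm{REWS}_\theta - \left(U_{\mathrm{nacelle}} + \mathrm{NTF}\right),$$

where $\mathrm{REWS}_\theta$ is as calculated in Eq. (13), $U_{\mathrm{nacelle}}$ is the wind speed measured by the nacelle-mounted anemometer, and NTF is the simple nacelle transfer function discussed previously. The application of the NTF also shifts the peak of the histogram of $\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}}$ to 0. Similarly, $\Delta\mathrm{REWS}_{\theta,\mathrm{L}}$ is calculated by

$$\Delta\mathrm{REWS}_{\theta,\mathrm{L}} = \mathrm{REWS}_\theta - U_{\mathrm{lidar}},$$

where $U_{\mathrm{lidar}}$ is the hub-height wind speed measurement. With these definitions, $\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}}$ and $\Delta\mathrm{REWS}_{\theta,\mathrm{L}}$ quantitatively show whether using the hub-height wind speed (corrected nacelle or lidar) overestimates ($\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}}$ or $\Delta\mathrm{REWS}_{\theta,\mathrm{L}}$ is negative) or underestimates ($\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}}$ or $\Delta\mathrm{REWS}_{\theta,\mathrm{L}}$ is positive) the rotor-disk-integrated winds encountered by the turbine, considering veering and backing.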
Wind shear is also quantified with the wind shear exponent, $\alpha$ (Peterson and Hennessey, 1978; Emeis, 2013), calculated in a bulk fashion by considering only wind speed at the top and bottom of a vertical layer of atmosphere, presuming a logarithmically increasing profile:

$$\alpha = \frac{\ln\left(U_{\mathrm{top}} / U_{\mathrm{bottom}}\right)}{\ln\left(z_{\mathrm{top}} / z_{\mathrm{bottom}}\right)},$$

where $U_{\mathrm{top}}$ and $U_{\mathrm{bottom}}$ are the lidar-measured horizontal wind speeds at the top (120 m) and bottom (40 m) of the rotor disk and $z_{\mathrm{top}}$ and $z_{\mathrm{bottom}}$ are the heights of 120 and 40 m, respectively. While $\alpha$ may be simple to calculate and is thus widely used (Peterson and Hennessey, 1978; Wharton and Lundquist, 2012b; Vanderwende and Lundquist, 2012; Emeis, 2013), wind profiles may differ from a logarithmic profile across the rotor diameter of a turbine (Wagner et al., 2008). Additionally, $\alpha$ does not consider veering or backing or even the magnitude of the wind speed. We consider directional veer with two further metrics. The simplest metric, $\beta_{\mathrm{bulk}}$, considers only differences in wind direction at the top and bottom of the rotor disk:

$$\beta_{\mathrm{bulk}} = \frac{\theta_{\mathrm{top}} - \theta_{\mathrm{bottom}}}{z_{\mathrm{top}} - z_{\mathrm{bottom}}},$$

where $\theta_{\mathrm{top}}$ and $\theta_{\mathrm{bottom}}$ are the lidar-measured horizontal wind directions at the top (120 m) and bottom (40 m) of the rotor disk (with their difference constrained to lie between −180 and 180°), and $z_{\mathrm{top}}$ and $z_{\mathrm{bottom}}$ are the heights of 120 and 40 m, respectively. $\beta_{\mathrm{bulk}}$ resembles depictions of layer-wise directional veer in hodographs (MacKay, 1971), where the shear is only considered as a bulk quantity. A negative $\beta_{\mathrm{bulk}}$ implies backing of the wind across the turbine rotor disk (the wind rotates counterclockwise as it increases in height), while a positive $\beta_{\mathrm{bulk}}$ implies veer (the wind rotates clockwise as it increases in height). In a simulation, Wagner et al. (2010) found that a clockwise veer increases turbine power production while counterclockwise backing decreases the power produced because of differences in angle of attack for the turbine blades.
However, Sanchez Gomez and Lundquist (2020) found different results during an observational study such that veer leads to a larger decrease in turbine power production than backing. The $\beta_{\mathrm{bulk}}$ calculation does not consider any general yaw misalignment from the 80 m hub-height wind direction as measured by the lidar that might occur at the same time as directional shear. Thus, it is impossible to know whether power changes in $\beta_{\mathrm{bulk}}$ conditions are a result of yaw misalignments or directional shear. Like $\alpha$, $\beta_{\mathrm{bulk}}$ does not consider the hub-height wind speed. A more discrete veer metric, $\beta_{\mathrm{total}}$, considers shear at each level:

$$\beta_{\mathrm{total}} = \frac{\sum_{i=0}^{3} \left|\theta_{z(i+1)} - \theta_{z(i)}\right|}{z_{\mathrm{top}} - z_{\mathrm{bottom}}},$$

where $z$ and $i$ are as described for Eq. (11), $z_{\mathrm{top}}$ and $z_{\mathrm{bottom}}$ are the heights of the top (120 m) and bottom (40 m) of the rotor disk, and $\theta_{z(i+1)} - \theta_{z(i)}$ is the difference between the lidar wind direction at height $z(i+1)$ and the lidar wind direction at height $z(i)$, constrained to be between −180 and 180°. This measurement assumes that both veer and backing will decrease the power output of a turbine and will do so symmetrically. $\beta_{\mathrm{total}}$ should be considered for cases where the directional veer is nonmonotonic across the rotor. Like $\beta_{\mathrm{bulk}}$, $\beta_{\mathrm{total}}$ does not consider yaw misalignment or the hub-height wind speed. These metrics were visualized using an example lidar-measured wind profile (Fig. 7) during a time period with a $\zeta$ of 0.45 (convective). The turbine was producing power at this time, though the exact power is not given at the request of the utility company. The nacelle wind speed was 4.50 m s$^{-1}$; the turbine was oriented to winds from 285°; the lidar wind speed at hub height was 3.7 m s$^{-1}$, and the lidar wind direction at hub height was 286.8°.
The shear metrics vary: the REWS was 5.36 m s$^{-1}$, so the $\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}}$ was 0.15 m s$^{-1}$ and the $\Delta\mathrm{REWS}_{\mathrm{L}}$ was 1.66 m s$^{-1}$; the $\mathrm{REWS}_\theta$ was 5.28 m s$^{-1}$ with a $\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}}$ of 0.08 m s$^{-1}$ and a $\Delta\mathrm{REWS}_{\theta,\mathrm{L}}$ of 1.58 m s$^{-1}$; $\alpha$ was 1.83 (very large, according to Walter et al., 2009); $\beta_{\mathrm{bulk}}$ was −0.76° m$^{-1}$, suggesting backing; and $\beta_{\mathrm{total}}$ was 0.76° m$^{-1}$. This case underscores challenges with any NTF. Because the nacelle speed was actually larger than the lidar speed for this case and the NTF was created under the mean case assumption that the lidar speed is greater than the nacelle speed, the addition of our NTF caused $\Delta\mathrm{REWS}_{\mathrm{N\text{-}NTF}}$ and $\Delta\mathrm{REWS}_{\theta,\mathrm{N\text{-}NTF}}$ to be lower than they should be.
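The three bulk metrics can be sketched as below, using the definitions given in words above (shear exponent from the top and bottom speeds, bulk veer from the top and bottom directions, total veer from the summed absolute layer-to-layer direction changes). The function names and the direction profile are hypothetical; the profile was chosen only so that the bulk veer matches the magnitude quoted in the example and is not the measured profile.

```python
import math


def wrap_degrees(angle):
    """Wrap an angle difference into the interval [-180, 180) degrees."""
    return (angle + 180.0) % 360.0 - 180.0


def shear_exponent(u_bottom, u_top, z_bottom=40.0, z_top=120.0):
    """alpha = ln(U_top / U_bottom) / ln(z_top / z_bottom)."""
    return math.log(u_top / u_bottom) / math.log(z_top / z_bottom)


def beta_bulk(dir_bottom, dir_top, z_bottom=40.0, z_top=120.0):
    """Bulk veer in degrees per meter; negative values indicate backing."""
    return wrap_degrees(dir_top - dir_bottom) / (z_top - z_bottom)


def beta_total(directions_deg, heights):
    """Sum of absolute layer-to-layer direction changes per meter of rotor depth."""
    changes = (
        abs(wrap_degrees(directions_deg[i + 1] - directions_deg[i]))
        for i in range(len(directions_deg) - 1)
    )
    return sum(changes) / (heights[-1] - heights[0])


heights = [40.0, 60.0, 80.0, 100.0, 120.0]
backing_profile = [300.0, 285.0, 270.0, 255.0, 239.2]  # hypothetical, monotonic
zigzag_profile = [300.0, 310.0, 300.0, 310.0, 300.0]   # hypothetical, nonmonotonic

bulk = beta_bulk(backing_profile[0], backing_profile[-1])
total = beta_total(backing_profile, heights)
```

For a monotonic profile the two veer metrics have the same magnitude, as in the example case; for the zigzag profile the bulk veer is zero while the total veer is not, which is the situation the total metric is meant to capture.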
Depending on which wind speed is used, the turbine power production for this case varies significantly, as calculated from Eq. (1) and the variable wind-speed-dependent C p values of Schmitz (2015), interpolated to 0.01 m s −1 bins. The air density is disregarded so as not to reveal the elevation of the test site. Instead, power is expressed as a percentage of rated. These powers are meant only as example values from a simple power curve created from basic principles and do not represent the real, more complicated, power curve.
The lidar wind speed suggests a power of 4.7 % of rated; the nacelle wind speed suggests 8.4 % of rated; the REWS suggests 14 % of rated; and the REWS θ suggests 13.4 % of rated (Table 2). For this case, the discrepancies in power are ∼ 10 % of rated power simply because of the different wind speed assessments. Although exact turbine power production cannot be given for this time, REWS θ and REWS come closest to the actual power production but still vary from it somewhat.
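As a rough check on why the choice of wind speed matters so much in this case, the following sketch compares the four wind speeds under the simplifying assumption of a constant power coefficient and constant air density, so that power scales with the cube of wind speed. This is not the calculation used above, which relies on wind-speed-dependent C p values; the function name and the constant-C p assumption are illustrative only.

```python
def cubic_power_ratio(speed, reference_speed):
    """Ratio of available power at two wind speeds.

    Assumes constant power coefficient and air density, so that
    power is proportional to the cube of wind speed.
    """
    return (speed / reference_speed) ** 3


# Wind speeds (m/s) quoted for the example profile.
speeds = {"lidar": 3.7, "nacelle": 4.50, "REWS": 5.36, "REWS_theta": 5.28}

for name, value in speeds.items():
    ratio = cubic_power_ratio(value, speeds["lidar"])
    print(f"{name}: {ratio:.2f} x the power implied by the lidar speed")
```

Under this assumption the REWS implies roughly three times the power implied by the hub-height lidar speed, which is comparable in magnitude to the 14 % versus 4.7 % of rated quoted above.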

Power curve calculation
For each shear metric, we calculated three power curves by segregating the actual 1 Hz power production recorded by the turbine's SCADA system (rather than using an idealized curve) into 0.5 m s −1 wind speed bins. The three power curves are defined as follows: a mean power curve (all the power data in the bin), a high-case power curve (all the powers for which the shear metric at the time index of the power is greater than a certain critical value of the shear metric), and a low-case power curve (all the powers for which the shear metric at the time index of the power is less than a certain critical value of the shear metric). The critical values are determined in Sect. 4.1.
Around the shear metric-based power curves, 99 % confidence intervals were calculated using a two-tailed t test at each bin following the confidence interval given in Eq. (9). The mean power curve (regardless of shear conditions) is considered to be the overall population mean for power production, µ, so a confidence interval is not placed around the data.
Two different independent variables (wind speeds) can apply to our data set, the lidar wind speed at 80 m (L) and the nacelle wind speed offset by the NTF (N-NTF). For the ∆REWS L case, the lidar wind speed (L) is used as the x axis. For the other plots, the N-NTF is used for the x axis. If the wrong wind speeds are used for the ∆REWS case power curves, the case means tend to collapse onto the mean power curve.
Additionally, differences between the overall mean power curve and the shear metric-based power curves were plotted. The confidence intervals on these plots come from the subtraction of the mean power curve from the bounds of the confidence intervals.
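The binning procedure described in this section can be sketched as follows. This is a minimal illustration, not the processing code used for the paper: the function and argument names are hypothetical, a single critical value is assumed (α uses separate high and low thresholds), and the t value is approximated by the normal quantile, which is reasonable only when each bin holds many samples.

```python
from collections import defaultdict
from math import floor, sqrt
from statistics import NormalDist, mean, stdev


def binned_power_curves(wind_speed, power, metric, critical,
                        bin_width=0.5, confidence=0.99):
    """Mean, high-case and low-case power curves per wind speed bin.

    Returns {bin_start: {"mean": m, "high": (m, half_width), "low": (...)}}.
    """
    # Two-tailed critical value; normal approximation to the t distribution.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    bins = defaultdict(lambda: {"all": [], "high": [], "low": []})
    for v, p, m in zip(wind_speed, power, metric):
        start = floor(v / bin_width) * bin_width
        bins[start]["all"].append(p)
        if m > critical:
            bins[start]["high"].append(p)
        elif m < critical:
            bins[start]["low"].append(p)

    curves = {}
    for start, groups in sorted(bins.items()):
        entry = {"mean": mean(groups["all"])}  # treated as population mean
        for case in ("high", "low"):
            sample = groups[case]
            if len(sample) >= 2:  # need two points for a standard deviation
                half_width = z * stdev(sample) / sqrt(len(sample))
                entry[case] = (mean(sample), half_width)
        curves[start] = entry
    return curves


curves = binned_power_curves(
    wind_speed=[5.1, 5.2, 5.3, 5.4],
    power=[100.0, 110.0, 90.0, 80.0],
    metric=[1.0, 1.0, -1.0, -1.0],
    critical=0.0,
)
```

Subtracting `entry["mean"]` from a case mean and from its upper and lower bounds gives the difference curves; a case is significant in a bin when the resulting interval excludes zero.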

Results
Sections 4.1-4.3 describe distributions of shear metrics, determinations of critical values of the metrics, and correlations between the metrics. Sections 4.4-4.10 describe how the shear metric cases affect power production.

Histogram distributions of shear metrics and determination of critical values
Histograms and cumulative distribution functions of the shear metrics suggest a range of stability and shear conditions during the test period (Fig. 8). In Fig. 8, the histograms and the cumulative distribution functions are normalized separately so that the maximum value of each respective plot is 1.
The differences between ∆REWS N-NTF and ∆REWS L (Fig. 8a and b) emphasize the difference between the lidar and nacelle measurements of hub-height wind speed as well as the role of integrating the winds across the rotor disk. Although ∆REWS N-NTF (Fig. 8a) exhibits a wide distribution, ∆REWS L (Fig. 8b) is centered more tightly around zero, likely because the REWS is calculated from lidar values and some variation in the wind occurs between the lidar and the nacelle. The critical value used for ∆REWS N-NTF is 0, which segregates data with REWS greater than the offset nacelle wind speed (0 < ∆REWS N-NTF ) and those with REWS less than the offset nacelle wind speed (∆REWS N-NTF < 0). Likewise, the critical value used for ∆REWS L is 0. Low-∆REWS N-NTF cases make up 51.6 % of the data, while high cases make up 48.4 %. For ∆REWS L , low cases make up 49.8 % of the data and high cases make up 50.2 %. Neither of the ∆REWS θ cases (N-NTF and L) appears because the respective N-NTF and L histograms are nearly identical to their ∆REWS counterparts.
The distribution of α (Fig. 8c) shows that winds tend to increase with height but that some cases of winds decreasing from 40 to 120 m do occur, similar to Walter et al. (2009). To segregate between high and low values of α, we use a high threshold of 0.2 (as in Vanderwende and Lundquist, 2012, and Wharton and Lundquist, 2012b) and a low threshold of 0.1 (same as Vanderwende and Lundquist, 2012, and slightly greater than Wharton and Lundquist, 2012b, who use 0.09). High cases of α make up 37.4 % of the data, and low cases comprise 40.7 % of the data.
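The α segregation can be illustrated with a short sketch. It assumes that α is the power-law exponent evaluated between the 40 and 120 m lidar levels and that values exactly on a threshold are treated as neutral; both choices, and the function names, are assumptions made for illustration.

```python
from math import log


def shear_exponent(v_low, v_high, z_low=40.0, z_high=120.0):
    """Power-law shear exponent between two heights (assumed form)."""
    return log(v_high / v_low) / log(z_high / z_low)


def classify_alpha(alpha, low=0.1, high=0.2):
    """Label a shear exponent using the thresholds quoted in the text."""
    if alpha > high:
        return "high"
    if alpha < low:
        return "low"
    return "neutral"


# Wind speed decreasing with height gives a negative exponent (low case).
print(classify_alpha(shear_exponent(v_low=6.0, v_high=5.5)))
```

Because the neutral band between 0.1 and 0.2 belongs to neither case, some data contribute only to the mean power curve.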
Just as with ∆REWS N-NTF and ∆REWS L , a nearly 50-50 split of the ζ segregation occurs (Fig. 8d). The critical value is chosen to be 0, to split stable and unstable cases from each other, as explained in Sect. 2.2. Stable cases make up 52.8 % of the data, while convective cases make up 47.2 % of the data. Only ζ values between −100 and 100 are shown in Fig. 8d to resolve most of the data.
The β bulk distribution (Fig. 8e), divided between veering (β bulk > 0) and backing (β bulk < 0), shows a surprising prevalence of backing conditions, in contrast to other observations (Walter et al., 2009; Bodini et al., 2019b; Sanchez Gomez and Lundquist, 2020). Veer occurs 34.7 % of the time, while backing occurs 64.9 % of the time. We suspect that the complex nature of the local terrain and/or the prevalence of cold front passages during this summertime period support more backing than veering.
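The veering and backing split can be sketched as below. The form of β bulk used here, the wrapped direction difference between the top and bottom of the rotor disk divided by the height difference, is an assumption consistent with its units of ° m −1 ; the function names are hypothetical.

```python
def wrap_direction_difference(delta_deg):
    """Constrain a direction difference to the interval [-180, 180) degrees."""
    return (delta_deg + 180.0) % 360.0 - 180.0


def beta_bulk(theta_top, theta_bottom, z_top=120.0, z_bottom=40.0):
    """Bulk veer across the rotor disk in degrees per metre (assumed form)."""
    return wrap_direction_difference(theta_top - theta_bottom) / (z_top - z_bottom)


def veer_case(beta):
    """Positive values are veering, negative values are backing."""
    if beta > 0:
        return "veering"
    if beta < 0:
        return "backing"
    return "none"


# Directions either side of north: 350 deg at 40 m turning to 10 deg at 120 m.
print(veer_case(beta_bulk(theta_top=10.0, theta_bottom=350.0)))
```

Wrapping the difference matters for profiles that straddle north, where a naive subtraction would report a 340° turn instead of a 20° one.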
The β total distribution (Fig. 8f) is effectively an absolute-valued β bulk with an increased number of low values because of occurrences of nonmonotonic shear. For β total , the critical value of 0.15° m −1 was chosen experimentally by varying the critical value and examining how it split the histogram of β total . Using 0.15° m −1 splits the data almost in half. The low-β total case accounts for more than 54.1 % of the filtered data, and the high-β total case accounts for more than 45.6 % of the filtered data. Values other than 0.15° m −1 were explored, such as 0.1 and 0.2° m −1 . Similar results were found with 0.1° m −1 but with wider confidence intervals on the high-β total case that lead to less significance. The 0.2° m −1 case was also similar to the 0.15° m −1 case but divided the data less evenly between high and low.

Polar distributions of shear metrics
To explore variations of the metrics with wind direction, we created polar plots for each shear metric (Fig. 9) by binning data into 5° bins using the lidar wind direction and plotting the median of the data in the bins. Medians were chosen rather than means to account for the long tails on measurements, such as ζ and β bulk . For ∆REWS N-NTF and ∆REWS L cases (and those including direction, not shown), a strong variation with wind direction occurs (Fig. 9a and b). Nearly all northerly wind direction bins are low-∆REWS cases, and nearly all southerly wind direction bins are high-∆REWS cases. This variation with wind direction seems to arise from the terrain, with extremely complex terrain to the south because of an escarpment and relatively flat terrain to the north (compared to the escarpment).
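A minimal sketch of the directional binning is given below; the function name is hypothetical and the sample values are invented solely to show the effect of a long-tailed observation.

```python
from collections import defaultdict
from statistics import median


def median_by_direction(directions_deg, values, bin_width=5.0):
    """Median of a metric within wind direction bins of the given width."""
    bins = defaultdict(list)
    for direction, value in zip(directions_deg, values):
        # Map 360 deg onto 0 deg, then find the lower edge of the bin.
        start = (direction % 360.0) // bin_width * bin_width
        bins[start].append(value)
    return {start: median(vals) for start, vals in sorted(bins.items())}


medians = median_by_direction(
    directions_deg=[1.0, 2.0, 3.0, 358.0, 359.5, 360.0],
    values=[1.0, 2.0, 30.0, 5.0, 7.0, 4.0],
)
```

In the 0 to 5° bin the outlying value of 30 pulls the mean to 9.25 but leaves the median at 3, which is the behavior that motivates using medians for long-tailed metrics.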
Similarly, α varies strongly with wind direction (Fig. 9c), though the variation is not as distinct as that of the REWS cases. All the southerly wind directions are stable except for the south to south-southeasterly neutral cases. Northerly flow is typically neutral, with one convective point on the data boundary to the west-northwest and a cluster of convective data ranging from northerly to north-northeasterly. The north-northeasterly directions are the ones with the lowest terrain elevation change in any direction, while the topography just upwind of the equipment to the west-northwest and east-northeast actually descends before the turbine. Stability, as defined by surface-layer scaling parameter ζ (Fig. 9d), resembles that defined by α (Fig. 9c). All southerly cases are stable except one (on the boundary of south-southeasterly flow), and some northerly directions are stable as well. However, a majority of the data with northerly flow are convective. North-northeasterly winds are convective (as with convective α), though some westerly convective points occur, which are not seen with α. However, stable points still exist to the north, generally with westerly components. This distribution could arise because the plateau's (mainly southerly) escarpment wraps around the turbines to the west somewhat. Because ζ involves friction velocity, this terrain could be enough to shear the flow and cause ζ to be stable from those directions. However, this might not be the case because the terrain is not enough to cause westerly REWS metrics to increase.
β bulk does not show a strong directional dependence: nearly all directions have median low-β bulk values, which implies a uniform dominance of backing winds (Fig. 9e). However, the west to west-northwest values are high and therefore generally positive, which implies a dominance of veering winds from those directions. Given how few winds come from the west-northwest, proposing a mechanism for this veering is difficult.
The directional distribution of β total is somewhat similar to that of α and ζ , where lower values of β total occur under directions of convection (as denoted by α and ζ ) and greater values of β total happen under directions of stability (Fig. 9f). However, not all stable directions correspond to high β total and not all convective directions correspond to low β total . These results are somewhat expected and physically reasonable because the lack of convection during the night allows the atmosphere to decouple with height, increasing veering or backing. However, these results are not as directionally consistent as for β bulk .

Temporal distributions of shear metrics
To find variations of the metrics with time, each shear metric is binned by local time hour and the median of the data in each hour bin is plotted (Fig. 10). Medians were again chosen rather than means to account for the long tails on certain measurements such as ζ and β total .
Temporally, neither ∆REWS N-NTF nor ∆REWS L exhibits a clear diurnal cycle. Both high- and low-∆REWS periods occur during both daytime and nighttime hours (Fig. 10a and b). Additionally, the two cases do not covary with each other by hour, as the L case changes sign between high and low eight times, while the NTF-shifted nacelle wind speed case only changes sign four times. The times at which the sign changes are not always the same for the two cases. However, when the two cases do change sign at the same times (04:00-05:00, 15:00-16:00 LT), they change in the same direction.
A clear diurnal cycle manifests for α (Fig. 10c), with stable values at night decreasing to neutral values during the morning transition and convective values during the day. During the evening transition, neutral values reoccur with stable cases reemerging later at night. The morning transition takes longer than the evening transition because solar heating requires a few hours to heat the ground enough to begin convection (Lapworth, 2005;Lapworth, 2009). Once the sun begins to set, most of the remaining heat from the ground is lost quickly because of convection, leaving the ground to cool radiatively (on a clear night), meaning the evening transition should be relatively rapid (Lee and Lundquist, 2017). Like α, ζ shows a strong temporal cycle (Fig. 10d). During daytime hours ζ becomes negative (convective), and during nighttime hours ζ becomes positive (stable).
Previous investigations of stability metrics for wind energy studies have relied on α as a stability metric (Wharton and Lundquist, 2012b; Vanderwende and Lundquist, 2012). We break up our α data based on those stability delineations and see that α does have a strong daily cycle, which would be expected for a stability metric in such a location, and refer to high- and low-α cases as stable and convective, respectively, to match with prior research. However, directionally, there appears to be a strong influence of terrain on stability. Thus, untangling the interaction between complex terrain and stability is challenging in this location.
ζ is treated in a similar manner to α. A strong diurnal cycle emerges, which is to be expected; however, the directional variation is dominated by stable cases from directions that could likely be influenced by topography. Because the Obukhov length calculation incorporates friction velocity, it (and thus ζ ) is clearly influenced by the terrain at this location.
The diurnal cycle also emerges in β bulk (Fig. 10e). All hours have median low-β bulk values which implies a dominance of backing winds at all times of the day at this complex terrain site. No hours exhibit a median veer. However, the backing is weaker (less negative) during the convective hours (as also suggested by α and ζ ). This behavior is physically reasonable because convective eddies mix momentum through the boundary layer, coupling winds throughout the boundary layer, such that the wind direction should vary little with height during convection.
As explained earlier, β total is effectively the absolute value of β bulk (but with a nonzero critical value of 0.15), and so the temporal distribution of β total (Fig. 10f) somewhat resembles that of β bulk (Fig. 10e). Stronger veer dominates from midnight until 08:00 LT, likely because of nocturnal decoupling. The overall temporal distribution of veer appears in sync with the temporal distribution of α; however, the choice of the critical value of 0.15 (the choice of which is explained in Sect. 4.1) affects the visualization of this distribution.

Further comparisons between selected metrics
While the median diurnal cycle suggests a relationship between ζ and α, we would like more robust evidence of this correlation. To find such a correlation, we computed linear correlation coefficients between ζ and α across 5° lidar direction bins, treating each 5° wind direction bin separately because of the influence of terrain on the location. However, after calculating correlations of metrics within these 5° wind direction bins, we found little evidence of agreement between these metrics. The strongest linear correlation values between ζ and α are only 0.4; these values occur in the southerly bins. The maximum linear correlation between ζ and α for northerly bins is less than 0.2, indicating very poor correlation. We applied the same directional binning linear correlation method to both types of REWS and ζ and both types of REWS and α. No combinations had greater correlations than 0.18 for any direction bin (figures not shown). This lack of any directional correlation further suggests that the metrics do not map directly to atmospheric stability metrics in this region with complex terrain.
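The directional correlation analysis can be sketched as follows, assuming the linear correlation coefficient is the Pearson coefficient. The function names and the minimum sample count per bin are illustrative choices, not values taken from the analysis.

```python
from collections import defaultdict
from math import sqrt


def pearson(x, y):
    """Pearson linear correlation coefficient of two equal-length samples."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")  # undefined when either sample is constant
    return sxy / sqrt(sxx * syy)


def correlation_by_direction(directions_deg, x, y, bin_width=5.0, min_count=3):
    """Correlate two metrics separately within each wind direction bin."""
    bins = defaultdict(lambda: ([], []))
    for direction, a, b in zip(directions_deg, x, y):
        start = (direction % 360.0) // bin_width * bin_width
        bins[start][0].append(a)
        bins[start][1].append(b)
    return {
        start: pearson(xs, ys)
        for start, (xs, ys) in sorted(bins.items())
        if len(xs) >= min_count
    }


r = correlation_by_direction(
    directions_deg=[1, 2, 3, 4, 181, 182, 183],
    x=[1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0],
    y=[2.0, 4.0, 6.0, 8.0, 3.0, 2.0, 1.0],
)
```

Treating each bin separately keeps terrain-driven differences between directions from masking or inflating a relationship that holds only within one sector.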
Additionally, because the histograms of the directional and nondirectional REWS metrics are so similar (Sect. 4.1), nondirectional and directional REWS power curves strongly resemble each other. Power curves based on ∆REWS θ,N-NTF and ∆REWS θ,L are not statistically significantly different from those of ∆REWS N-NTF and ∆REWS L , respectively, so only results for ∆REWS N-NTF and ∆REWS L are shown (Sects. 4.5 and 4.6, respectively). The greatest differences between the directional and nondirectional REWS metrics occur at high yaw misalignments, suggesting that a general yaw misalignment is more impactful than any further veer across the rotor disk under the specific conditions our location faced.

∆REWS N-NTF impacts on power production
∆REWS N-NTF shows statistically significant differences in actual turbine power production during cases of high ∆REWS N-NTF (generally high shear) and low ∆REWS N-NTF (generally low shear or negative shear; Fig. 11). The difference between the cases is greatest around 7.5 and 12.5 m s −1 , as measured by the NTF-shifted nacelle wind speed.
Further, power production during high-∆REWS N-NTF conditions significantly exceeds the mean power production for conditions with NTF-shifted nacelle wind speeds between 3.19 and 13.70 m s −1 (Fig. 11). Generally, increases range from around 20 to 40 kW (1.3 % to 2.7 % of rated) but can exceed 60 kW (4 % of rated; Fig. 11b). The maximum average increase in power from the mean in the significant range is between 45.73 and 60.44 kW (3 % to 4 % of rated) at 12.70 m s −1 .
Power production during low-∆REWS N-NTF conditions is significantly less than the mean power production for NTF-shifted nacelle wind speeds between 2.20 and 13.70 m s −1 (Fig. 11b). The maximum average decrease in power from the mean in that range is between 28.20 and 29.27 kW (1.9 % to 2 % of rated), which occurs at the NTF-shifted nacelle wind speed of 7.70 m s −1 (Fig. 11b).
Although the impact on power is somewhat symmetric, the high-∆REWS N-NTF case leads to greater increases than the decreases in the low-∆REWS N-NTF case at NTF-shifted nacelle wind speeds above 8 m s −1 or so.
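The conversion between the power changes in kilowatts and the percentages of rated power quoted in this section is simple arithmetic for a 1.5 MW turbine, as the sketch below shows. The farm-scale helper assumes, purely for illustration, that every turbine in a 100-turbine farm experiences the same change at the same time; both function names are hypothetical.

```python
RATED_POWER_KW = 1500.0  # rated power of the 1.5 MW turbine


def percent_of_rated(delta_kw, rated_kw=RATED_POWER_KW):
    """Express a change in power as a percentage of rated power."""
    return 100.0 * delta_kw / rated_kw


def equivalent_turbines(delta_kw, n_turbines=100, rated_kw=RATED_POWER_KW):
    """Farm-wide change expressed in rated turbines (uniform-change assumption)."""
    return delta_kw * n_turbines / rated_kw


for delta in (45.73, 60.44, 28.20, 29.27):
    print(f"{delta:.2f} kW = {percent_of_rated(delta):.1f} % of rated")
```

Under the uniform-change assumption, a 60.44 kW change per turbine corresponds to about four rated turbines across the farm.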

∆REWS L impacts on power production
Actual turbine power production differs in a statistically significant way between high- and low-∆REWS L conditions (Fig. 12). The difference between the cases is greatest around 11 m s −1 . Further, power production during high-∆REWS L (typically high-shear) conditions is significantly greater than the mean power production for conditions with hub-height lidar wind speeds between 4.07 and 12.57 m s −1 (Fig. 12b). However, just as with ∆REWS N-NTF , that difference varies depending on the hub-height wind speed. Increases in power, compared to the mean power curve, generally range from around 5 to 40 kW (0.3 % to 2.7 % of rated) but can exceed 70 kW (4.7 % of rated; Fig. 12b). The maximum average increase in power from the mean in the significant range is between 31.08 and 74.86 kW (2.1 % to 5 % of rated) at 11.07 m s −1 .
In contrast, power production during low-∆REWS L (typically low-shear or negative-shear) conditions is significantly less than the mean power production with hub-height lidar wind speeds between 3.07 and 12.57 m s −1 (Fig. 12). The maximum average decrease in power from the mean in that range is between 22.56 and 25.10 kW (1.5 % to 1.7 % of rated), which occurs at 9.57 m s −1 (Fig. 12b). At high lidar wind speeds above 8 m s −1 or so, the high-∆REWS L case leads to greater increases than the decreases in the low-∆REWS L case.
Figure 11. (a) Power curves generated for both ∆REWS N-NTF cases with 99 % confidence intervals. The mean power curve is shown by the solid black line. (b) Difference between two ∆REWS N-NTF cases and the mean power curve where an overlap with 0 shows insignificance. The dashed red line corresponds to the nacelle rated wind speed of 14 m s −1 but is shifted up because of the NTF-shifted nacelle wind speed being offset from the nacelle wind speed. The high uncertainty above rated nacelle wind speeds is an artifact of low data availability and curtailment at rated speeds that we were unable to filter out.

α impacts on power production
Turbine power production does not vary clearly as a function of α (Fig. 13), suggesting that α is not a powerful metric for assessing power production at this site. The low-α case shows significantly greater power production than the high-α case for nearly all NTF-shifted nacelle wind speeds between around 8 and 12.5 m s −1 . The high-α case generates significantly less power than the mean by 5 to 20 kW (0.3 % to 1.3 % of rated) for wind speeds from around 8 to 12.5 m s −1 (Fig. 13). The maximum average decrease in power from the mean in that range is between 10.15 and 19.15 kW (0.7 % to 1.3 % of rated), which occurs at the NTF-shifted nacelle wind speed of 11.20 m s −1 (Fig. 13b). The low-α case generates significantly greater power than the mean by around 1 to 20 kW (0.1 % to 1.3 % of rated) from around 8 to 13 m s −1 . The maximum average increase in power from the mean in that range is between 17.29 and 20.58 kW (1.2 % to 1.4 % of rated), which occurs at the NTF-shifted nacelle wind speed of 12.20 m s −1 (Fig. 13b). However, at lesser wind speeds (below 8 m s −1 ), both the high-and low-α cases demonstrate inconsistent oscillatory variability and even switch sign with each other at NTF-shifted nacelle wind speeds just past the cut-in wind speed. Some significant wind speed cases exist below 8 m s −1 ; however, the differences in power from the mean are very small. This inconsistent and unsatisfying picture of the utility of α in predicting power production led us to experiment with changing the threshold critical α values. Setting a smaller low bound (reducing the number of convective cases) only increases significance in Fig. 13b until an α of 0.07, but that α threshold fails to match the diurnal cycle. As such, the original critical low bound of 0.10 is used. Setting a lower low threshold than 0.10 or a higher high threshold than 0.20 does not enhance differences between the metrics and the means. 
Rather, the confidence intervals widen, because of fewer low or high data points, while the mean values do not change, leading to insignificance. Furthermore, because of the preponderance of neutral α values, only 78.2 % of the filtered data set is used to create the high- and low-α curves. Neutral values are included in the mean power curve. However, changing our critical values (and thus placing neutral data into the high and low cases) leads to greater insignificance. The data for such insignificant results are not shown.
These results of α impacts on power production are somewhat counterintuitive to physically based expectations but are similar to the results of Vanderwende and Lundquist (2012), based on 2 months of data at this site several years previously. High α is a measurement of high shear, and high shear implies that the top of the turbine rotor disk is associated with a greater wind speed than the hub height, which should be associated with a greater wind speed than the bottom of the turbine rotor disk. However, the greater wind speeds near the top of the rotor disk may not be able to compensate enough for the lesser wind speeds near the bottom of the rotor disk because of complicated wind profiles that result from the locally complex terrain. The greater wind speeds near the top of the rotor disk also may not be orthogonal to the rotor disk, because of veering or backing, and therefore cannot be harvested efficiently by the turbine blades.
Figure 12. (b) Difference between two ∆REWS L cases and the mean power curve where an overlap with 0 shows insignificance. The dashed red line corresponds to the nacelle rated wind speed of 14 m s −1 but is shifted up because of the NTF-shifted nacelle wind speed being offset from the lidar wind speed. The high uncertainty above rated nacelle wind speeds is an artifact of low data availability and curtailment at rated speeds that we were unable to filter out.

ζ impacts on power production
The impact of stability as quantified by ζ (Fig. 14) is more easily interpretable than that of α ( Fig. 13) but is not as clear as that of the REWS metrics (Figs. 11 and 12), suggesting that ζ has some skill in assessing power production even though ζ is based on data collected near the surface.
The low-ζ case, associated with daytime conditions, shows significantly greater power production than the high-ζ case, associated with nighttime conditions, for nearly all NTF-shifted nacelle wind speeds between 4 and 13 m s −1 . The high-ζ case generates significantly less power than the mean by around 1 to 20 kW (0.1 % to 1.3 % of rated) for wind speeds from around 4 to 12.5 m s −1 (Fig. 14). The maximum average decrease in power from the mean in that range is between 2.49 and 18.18 kW (0.2 % to 1.2 % of rated), which occurs at the NTF-shifted nacelle wind speed of 12.20 m s −1 (Fig. 14b). The low-ζ case generates significantly greater power than the mean as well as significantly greater power than the high case by 1 to 16 kW (0.1 % to 1.1 % of rated) from 8 to 13 m s −1 . The maximum average increase in power from the mean in that range is between 13.26 and 15.82 kW (0.9 % to 1.1 % of rated), which occurs at the NTF-shifted nacelle wind speed of 12.20 m s −1 (Fig. 14b).

β bulk impacts on power production
The influence of β bulk on turbine power production depends very closely on wind speed. Below 10 m s −1 , β bulk has almost wholly insignificant results (Fig. 15). However, above that speed, small but significant oscillatory gains and losses in power occur. High β bulk (veering) leads to power gains, while low β bulk (backing) leads to power deficits. At wind speeds below rated, confidence bounds on the high-β bulk case do not exceed 20 kW (1.3 % of rated) of power increase and confidence bounds on the low-β bulk case do not exceed 10 kW (0.7 % of rated).
The difference in power production seen between veer and backing at wind speeds above 10 m s −1 resembles the results of Wagner et al. (2010). However, turbine yaw misalignment is not explicitly controlled for in our paper and only mean veer and backing are examined, when different values could have different effects on power production. Additionally, values of β bulk tend to approach 0 for both high and low cases (Fig. 16). As such, the significant portions of the power curve above 10 m s −1 are not a result of higher or lower values of β bulk occurring but rather of lower values of β bulk occurring with faster wind speeds. That greater wind speeds see less shear and veer is also physically reasonable because greater wind speeds tend to mechanically mix momentum through winds at all heights.
Figure 13. (a) Power curves generated for both α cases with 99 % confidence intervals. The mean power curve is shown by the solid black line. (b) Difference between two α cases and the mean power curve where an overlap with 0 shows insignificance. The dashed red line corresponds to the nacelle rated wind speed of 14 m s −1 but is shifted up because of the NTF-shifted nacelle wind speed being offset from the nacelle wind speed. The high uncertainty above rated nacelle wind speeds is an artifact of low data availability and curtailment at rated speeds that we were unable to filter out.
Finally, overall, nearly twice as many low-β bulk data (backing) exist as high-β bulk data (veering), remarkably different from other field campaigns in flat terrain (Walter et al., 2009; Sanchez Gomez and Lundquist, 2020) or offshore (Bodini et al., 2019b). This disparity suggests that the confidence intervals around the high (veer) case would be tightened with more data.

β total impacts on power production
Power gains and losses for β total exhibit differences between high directional veering or backing and low directional veering or backing from 4.5 to 12.5 m s −1 (Fig. 17). Veering or backing undermines power production. Low values of β total imply a lack of directional shear across the turbine rotor disk, meaning that the winds across the rotor point orthogonally at the rotor plane and thus will not decrease power. Veering or backing reduces the magnitude of the winds orthogonal to the rotor disk, undermining power production. Low β bulk happens more often than high β bulk by a factor of nearly 2. This lack of symmetry leads to a decrease in power production because low β bulk leads to a decrease in power production and high β bulk does not occur frequently enough to make up for it.
Additionally, high values of directional shear exert a greater impact on power production (just over 10 kW or 0.7 % of rated) than low values of directional shear (which never exceed 10 kW or 0.7 % of rated). At greater wind speeds, the high-β total case appears to lose even more power. This disparity is physically reasonable because the more the direction veers, the less power the turbine can extract from the atmosphere compared to a nonveered flow.

Discussion and conclusions
In this article, we explore how wind shear, wind veer, and atmospheric stability impact actual power production of an operational megawatt-scale wind turbine at a commercial wind farm in the high plains of North America. SCADA systems measured the turbine's power production at 1 Hz over a period of nearly 6 months. Additional measurements from a vertically profiling Doppler lidar and a meteorological mast allow us to derive wind shear and stability metrics ∆REWS L , ∆REWS N-NTF , α, ζ , β bulk , and β total . After intercomparing these metrics, we use them to evaluate the power production in different regimes of shear by creating power curves for the different shear regimes. We evaluate power curves in terms of absolute changes in the power production of the turbine for the given regimes of shear. Percent changes (in rated power) are recorded as well. REWS and its difference from hub-height wind speed from either the upstream lidar (∆REWS L ) or the nacelle anemometer (∆REWS N-NTF ; Figs. 11 and 12) demonstrate the clearest impact of the wind profile on power production. These REWS-based metrics also rely on the most clear-cut bounds that could straightforwardly be applied to other turbines and wind farms. Significant differences between power curves created with REWS with and without direction (between ∆REWS θ,L and ∆REWS L and between ∆REWS θ,N-NTF and ∆REWS N-NTF , respectively) do not occur at this site. However, small differences between REWS with and without direction do exist.
Both high-∆REWS cases (0 < ∆REWS L and 0 < ∆REWS N-NTF ) lead to significantly greater power production than the mean power production (by up to 74.86 and 60.44 kW or 5 % and 4 % of rated, respectively) from lidar speeds of 4.07 to 12.57 m s −1 and NTF-shifted nacelle wind speeds of 3.19 to 13.70 m s −1 , respectively. Both low-∆REWS cases (∆REWS L < 0 and ∆REWS N-NTF < 0) lead to significantly less power production than the mean power production (by up to 25.10 and 29.27 kW or 1.7 % and 2 % of rated, respectively) from lidar speeds of 3.07 to 12.57 m s −1 and NTF-shifted nacelle wind speeds of 2.20 to 13.70 m s −1 , respectively. The wind speed ranges where REWS is effective are the widest wind speed ranges of any of the metrics.
Although REWS is the most illuminating metric at this site, neither the high nor the low cases of lidar-based or nacelle-based ∆REWS occur with a consistent temporal pattern through the data set (Fig. 10a and b). Terrain influences may dominate REWS at this site. High ∆REWS, whether lidar-based or nacelle-based, occurs more often during southerly flow (Fig. 9a and b), with inflow coming from low elevations up and over an escarpment, than for northerly flow, generally descending from higher terrain. Although this terrain influence is site specific, the REWS approach is likely more general and can be applied to other sites.
These results confirm the Sark et al. (2019) conclusion that measurement of REWS for power production purposes is necessary for complex terrain sites. Cost-benefit analyses are advised to weigh the cost of installing and maintaining inflow-sensing equipment (like a Doppler lidar) that provides REWS measurements against the benefit of REWS for power production prediction. Of course, such equipment may be necessary for other purposes, such as adaptive alignment of turbines for wake control (Fleming et al., 2019).
Although previous results for the effect of the power law coefficient α on power production (Wharton and Lundquist, 2012b; Vanderwende and Lundquist, 2012) suggest useful relationships, we find that, at this site, α results are too sensitive to chosen critical values and are not as clearly interpretable as the REWS results. For low-α cases, significantly more power is produced than the mean around the middle of region II (from 8 to 12.5 m s −1 or so; Fig. 13). High-α cases at nacelle speeds in that same portion of region II lead to significantly less power production than the mean (Fig. 13). However, at slower wind speeds (below 8 m s −1 ), these same results only apply to a lesser change in power production, and the two cases are often not significantly different from each other or the mean case. Part of the explanation of the muddled results is that α is only a measure of the shear and not of the actual wind speeds that comprise the inflow profile. Although the power curves are plotted as a function of the nacelle wind speed, this value may differ from the true wind speed at nacelle height and that speed may vary more over the rest of the rotor disk as well.
The power law coefficient α exhibits other weaknesses. Interestingly, wind speed shear α and wind direction veer, in the form of β bulk and β total, fail to show a clear relationship with each other at this location. Likewise, α does not correlate with REWS metrics or ζ. Finally, α suffers from data loss: data from neutral conditions are not considered, meaning that around 22 % of the filtered data were not used. In contrast, because of the clear demarcations for the REWS metrics, 100 % of the REWS data could be used.
Additionally, these α results contrast somewhat with previous findings by Wharton and Lundquist (2012b). At a different site with channeled flow that could not exhibit veer, they found that high α increased power at wind speeds from 8 to 10 m s⁻¹. Although α does exhibit a strong daily cycle (convective in local daytime hours and stable at night), it also varies strongly with direction (stable when coming over very complex terrain, neutral otherwise, and convective when the fetch covers the flattest terrain). As such, the α in our case functions largely as a descriptive indicator of inflow characteristics. This disparity in topography could account for the difference in findings.
However, our results agree well with those found by Vanderwende and Lundquist (2012), whose study used many more turbines over a shorter time period several years ago at this site. They assessed power curves with α bounds as well. They found that low α increases power at wind speeds in the higher-wind-speed portion of region II of the power curve, which generally agrees with our results between roughly 8 and 12.5 m s⁻¹. Our findings for α require that winds with low α must take on a REWS profile that raises the turbine's equivalent wind speed above the hub-height wind speed (and vice versa for the high-α case).
The surface-layer scaling parameter ζ efficiently segregates this turbine's power production into high and low cases. However, the ζ impacts on power are small, constrained to less than 20 kW (1.3 % of rated) difference from the mean in either the high or the low case (Fig. 14). Like α, ζ varies strongly with both time of day (convective in local daytime hours and stable at night) and direction (stable when coming over complex terrain but convective otherwise), but α and ζ do not correlate linearly with each other by direction, further obfuscating attempts to draw stability conclusions from these metrics at this location.
The direct assessment of wind veering and backing, β bulk, shows only small significant changes in power at wind speeds above 10 m s⁻¹ (Fig. 15). At those speeds, low β bulk (backing) leads to less power production than the mean case (under 10 kW or 0.7 % of rated), while high-β bulk cases (veering) lead to greater power production than the mean case (up to 20 kW or 1.3 % of rated). These results agree with simulations (Wagner et al., 2010). However, at another (flat) site, Sanchez Gomez and Lundquist (2020) found that both veer and backing decrease power compared to cases with no veering or backing; that study distinguished high veer from low veer, whereas we only contrast veering and backing. Like α, β bulk lacks information about the inflow wind speeds; simply using REWS would mitigate this problem. β bulk shows a consistent daily cycle: all hours are dominated by backing at our site, but backing weakens during the day (when α and ζ are convective; Fig. 10e). β bulk does not show a strong directional cycle, except that westerly flow, which is uncommon at this location, tends to be the only flow that introduces veer rather than backing (Fig. 9e). As with α, care should be taken to consider the root cause of the directional veer if it is to be used by itself in future work. β bulk also suggests that β total is only a useful measurement at wind speeds of less than 10 m s⁻¹, where the changes in power for veer and backing do not significantly differ from the mean (Fig. 17).
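As a rough illustration of how veering and backing can be told apart from two direction measurements, the following Python sketch computes a signed bulk veer. It assumes that bulk veer is the wrapped direction change between the bottom and top of the rotor layer divided by the layer depth; this is an assumption made for illustration, and the definition of β bulk given earlier in the paper takes precedence. The heights, directions, and function names are hypothetical.

```python
def wrapped_difference(theta_top, theta_bottom):
    """Signed direction change (degrees) from bottom to top, wrapped to [-180, 180)."""
    return (theta_top - theta_bottom + 180.0) % 360.0 - 180.0

def bulk_veer(theta_top, theta_bottom, z_top, z_bottom):
    """Assumed bulk veer in degrees per metre.

    Positive values indicate veering (clockwise turning with height);
    negative values indicate backing (counterclockwise turning with height).
    """
    return wrapped_difference(theta_top, theta_bottom) / (z_top - z_bottom)

# Hypothetical wind directions (degrees) at hypothetical heights of 40 m and 120 m.
veering = bulk_veer(10.0, 350.0, 120.0, 40.0)   # 350 -> 10 degrees: clockwise by 20
backing = bulk_veer(350.0, 10.0, 120.0, 40.0)   # 10 -> 350 degrees: counterclockwise by 20

print(f"veering case: {veering:+.2f} deg/m")
print(f"backing case: {backing:+.2f} deg/m")
```

Wrapping the difference matters because directions near north (for example 350° and 10°) would otherwise appear to differ by 340° rather than 20°.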
Overall, we find that REWS has the most predictive power for power production from an operational megawatt-scale wind turbine. REWS has the most significant results that occur over the largest portion of the power curve. In addition, because REWS simply functions as a description of the wind at a given instant, rather than a prescription (such as stability that might be affected by factors such as topography), REWS is the simplest metric to understand and apply. Thus, findings for both high and low REWS L and REWS N-NTF likely hold at other locations and for other seasons and conditions, although the relationship between the frequency of occurrence of high and low cases would likely change at other locations.
Such results show that improvements in power production prediction in region II of a power curve are greater on average than 15 kW (1 % of rated power) for both high and low cases of REWS N-NTF or REWS L compared to the mean. The maximum increases in power production prediction can also exceed 4 % of rated power when compared to the average power at a given wind speed. REWS is straightforward to implement and does not rely on assumptions or presumptions about the wind or stability.
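The conversion between kilowatts and percent of rated power, and its scaling to a wind farm, can be checked with a short Python sketch. The 1.5 MW rating and the theoretical 100-turbine farm come from the abstract; the function names are hypothetical, and the farm-scale figure assumes, purely for illustration, that every turbine experiences the same per-turbine change at the same instant.

```python
RATED_POWER_KW = 1500.0   # 1.5 MW turbine, as stated in the abstract
N_TURBINES = 100          # theoretical farm size used in the abstract

def percent_of_rated(delta_kw, rated_kw=RATED_POWER_KW):
    """Express a change in power (kW) as a percentage of rated power."""
    return 100.0 * delta_kw / rated_kw

def farm_equivalent_turbines(delta_kw, n_turbines=N_TURBINES, rated_kw=RATED_POWER_KW):
    """Farm-wide power change in units of one turbine's rated power.

    Illustrative assumption: every turbine sees the same per-turbine change.
    """
    return n_turbines * delta_kw / rated_kw

for delta_kw in (15.0, 70.0):
    print(
        f"{delta_kw:.0f} kW per turbine = {percent_of_rated(delta_kw):.1f} % of rated, "
        f"or {farm_equivalent_turbines(delta_kw):.1f} turbine-equivalents across the farm"
    )
```

Under these assumptions, a 15 kW change per turbine corresponds to the rated output of one turbine across the farm, and a 70 kW change corresponds to several.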
The next step of this work would be to implement REWS into control schemes for individual turbines or for entire wind farms. However, to do so, accurate measurements of the inflow across the rotor disk must be made from towers or remote sensing instruments. Likewise, for implementation into a wind farm's controls, these measurements would have to be made reasonably close to the turbine(s) they would affect, as inflow directions can change across the dimensions of a wind farm. Hub-mounted lidars are a promising method of such inflow characterization (Harris et al., 2007; Mikkelsen et al., 2013). Applying these methods to that inflow could help align the turbines further to maximize the potential of the inflow (Wagner et al., 2010; Fleming et al., 2014). Although this study found no meaningful difference between REWS and REWS θ, other locations with greater directional veer, influenced by meteorological phenomena such as cold pools (Wilczak et al., 2019; Redfern et al., 2019) or offshore decoupling (Bodini et al., 2019b), could find a more significant impact of the wind direction on the REWS.
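As a starting point for such an implementation, the following Python sketch computes REWS and a veer-aware REWS θ from a profile of wind speeds and directions. It uses the commonly cited area-weighted cubic-mean form, with each speed projected onto the hub-height direction for the veer-aware variant; the equations given earlier in the paper take precedence. The function names, the way the rotor disk is split into horizontal strips (edges at the midpoints between measurement heights), and all numerical values are assumptions made for illustration.

```python
import math

def strip_area(y1, y2, radius):
    """Area of a circle of the given radius between heights y1 < y2 (relative to its centre)."""
    def g(y):
        y = max(-radius, min(radius, y))  # clip to the disk
        return y * math.sqrt(radius**2 - y**2) + radius**2 * math.asin(y / radius)
    return g(y2) - g(y1)

def rews(heights, speeds, hub_height, radius, directions=None):
    """Area-weighted cubic-mean wind speed over the rotor disk.

    If directions (degrees) are given, each speed is first projected onto the
    direction measured at the height closest to hub height (REWS with veer).
    """
    n = len(heights)
    # Strip edges: rotor bottom, midpoints between measurement heights, rotor top.
    edges = [hub_height - radius]
    edges += [(heights[i] + heights[i + 1]) / 2.0 for i in range(n - 1)]
    edges += [hub_height + radius]
    if directions is not None:
        i_hub = min(range(n), key=lambda i: abs(heights[i] - hub_height))
        speeds = [u * math.cos(math.radians(d - directions[i_hub]))
                  for u, d in zip(speeds, directions)]
    total = 0.0
    for i in range(n):
        area = strip_area(edges[i] - hub_height, edges[i + 1] - hub_height, radius)
        total += speeds[i] ** 3 * area
    return (total / (math.pi * radius**2)) ** (1.0 / 3.0)

# Hypothetical geometry and profiles; not values from the study.
HUB, R = 80.0, 40.0
heights = [40.0, 60.0, 80.0, 100.0, 120.0]

rews_uniform = rews(heights, [8.0] * 5, HUB, R)
rews_sheared = rews(heights, [6.0, 7.0, 8.0, 9.0, 10.0], HUB, R)
rews_veered = rews(heights, [6.0, 7.0, 8.0, 9.0, 10.0], HUB, R,
                   directions=[170.0, 175.0, 180.0, 185.0, 190.0])

print(f"uniform profile: {rews_uniform:.3f} m/s")
print(f"sheared profile: {rews_sheared:.3f} m/s")
print(f"sheared + veer:  {rews_veered:.3f} m/s")
```

In this hypothetical case, a uniform profile returns the hub-height speed, the linearly sheared profile returns a REWS slightly above the hub-height speed of 8 m/s because the cubic weighting favors the faster upper strips, and adding veer lowers the result slightly. The sketch does not handle profiles in which a projected speed becomes negative.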
Code and data availability. Currently, the data are not publicly available at the request of the wind farm owner and operator. The meteorological data may become available in the future at the DOE A2e data portal at https://a2e.energy.gov/about/dap (U.S. Department of Energy, 2020).
Author contributions. JKL brought this issue to the attention of PM during his senior undergraduate year at the University of Colorado Boulder for use as an independent study project. JKL and PM coordinated with PF to conduct analysis on the issue on a data set that PF, PM, and JKL were already using for other research. PM wrote the initial draft and created all figures; this draft was then reviewed, edited, and revised by PF and JKL. The final draft was made by PM based on suggested changes.
Competing interests. The authors declare that they have no conflict of interest.
Disclaimer. The views expressed in the article do not necessarily represent the views of the DOE or the US Government. The US Government retains and the publisher, by accepting the article for publication, acknowledges that the US Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for US Government purposes.
Financial support. This research has been supported by the National Science Foundation CAREER Award (grant no. AGS-1554055) and the US Department of Energy Office of Energy Efficiency and Renewable Energy, Wind Energy Technologies Office (grant no. DE-AC36-08GO28308).
Review statement. This paper was edited by Joachim Peinke and reviewed by Melinda Marquis and one anonymous referee.