<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">WES</journal-id><journal-title-group>
    <journal-title>Wind Energy Science</journal-title>
    <abbrev-journal-title abbrev-type="publisher">WES</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Wind Energ. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">2366-7451</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/wes-4-343-2019</article-id><title-group><article-title>The super-turbine wind power conversion paradox:<?xmltex \hack{\break}?> using machine learning to
reduce errors<?xmltex \hack{\break}?> caused by Jensen's inequality</article-title><alt-title>Super-turbine wind power conversion paradox</alt-title>
      </title-group><?xmltex \runningtitle{Super-turbine wind power conversion paradox}?><?xmltex \runningauthor{T. C. McCandless and S. E. Haupt}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <name><surname>McCandless</surname><given-names>Tyler C.</given-names></name>
          <email>mccandle@ucar.edu</email>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Haupt</surname><given-names>Sue Ellen</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-1142-7184</ext-link></contrib>
        <aff id="aff1"><institution>National Center for Atmospheric Research, 3450 Mitchell Lane, Boulder,
CO 80301, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Tyler C. McCandless (mccandle@ucar.edu)</corresp></author-notes><pub-date><day>4</day><month>June</month><year>2019</year></pub-date>
      
      <volume>4</volume>
      <issue>2</issue>
      <fpage>343</fpage><lpage>353</lpage>
      <history>
        <date date-type="received"><day>19</day><month>December</month><year>2018</year></date>
           <date date-type="rev-request"><day>20</day><month>February</month><year>2019</year></date>
           <date date-type="rev-recd"><day>3</day><month>May</month><year>2019</year></date>
           <date date-type="accepted"><day>16</day><month>May</month><year>2019</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2019 Tyler C. McCandless</copyright-statement>
        <copyright-year>2019</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019.html">This article is available from https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019.html</self-uri><self-uri xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019.pdf">The full text article is available as a PDF file from https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e90">Wind power is a variable generation resource and therefore
requires accurate forecasts to enable integration into the electric grid.
Generally, the wind speed is forecast for a wind plant and the forecasted
wind speed is converted to power to provide an estimate of the expected
generating capacity of the plant. The average wind speed forecast for the
plant is a function of the underlying meteorological phenomena being
predicted; however, the wind speed for each turbine at the farm is also a
function of the local terrain and the array orientation. Conversion
algorithms that assume an average wind speed for the plant, i.e., the
super-turbine power conversion, assume that the effects of the local terrain
and array orientation are insignificant in producing variability in the wind
speeds across the turbines at the farm. Here, we quantify the differences in
converting wind speed to power at the turbine level compared with a
super-turbine power conversion for a hypothetical wind farm of 100 2 MW
turbines as well as from empirical data. The simulations with simulated
turbines show a maximum difference of approximately 3 % at
11 m s<inline-formula><mml:math id="M1" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> with a 1 m s<inline-formula><mml:math id="M2" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> standard deviation of wind speeds and
8 % at 11 m s<inline-formula><mml:math id="M3" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> with a 2 m s<inline-formula><mml:math id="M4" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> standard deviation of wind
speeds as a consequence of Jensen's inequality. The empirical analysis shows
similar results with mean differences between converted wind speed to power
and measured power of approximately 68 kW per 2 MW turbine. However, using
a random forest machine learning method to convert to power reduces the error
in the wind speed to power conversion when given the predictors that quantify
the differences due to Jensen's inequality. These significant differences can
lead to wind power forecasters overestimating the wind generation when
utilizing a super-turbine power conversion for high wind speeds, and
indicate that power conversion is more accurately done at the turbine level
if no other compensatory mechanism is used to account for Jensen's
inequality.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e150">As the capacity of renewable energy resources increases, accurate forecasts
of power production are becoming increasingly instrumental for efficient and
effective management of the energy grid. In 2017, the worldwide wind power
capacity grew by 10.8 % to a total capacity of 539 GW (World Wind Energy
Association, 2018). This capacity covers only about 5 % of the total
global energy demand, so continued growth of wind power generation capacity
is expected. Large wind power plants that have tens to hundreds of turbines
pose many challenges for forecasting, as the meteorological conditions, the
topography, the array orientation, and the resulting wake effects may affect wind and power variability across the turbines at the farm. Ultimately, the
variability in the wind power needs to be accounted for in farm-level,
day-ahead wind power forecasts that are used in unit commitment and
electricity market bidding strategies, as well as in intra-day wind power
forecasts that are used for reliability, regulation, or sales on the spot
market (Ahlstrom et al., 2013; Orwig et al., 2014).</p>
      <p id="d1e153">There are two main sources of error in wind power forecasting: the error in
the underlying weather forecast of wind speed and to a lesser degree air
density (Pandit et al., 2018;<?pagebreak page344?> Bulaevskaya et al., 2015), and the error in
converting the wind speed to power. Past research has indicated an advantage
in using machine learning methods for wind power conversion (Parks et al.,
2011), and we further investigate this in the context of the super-turbine
approach. In the super-turbine conversion methodology, the wind speed is
forecast as a farm-average value, and that wind speed is converted to
farm-level power. Bartlett (2018) pointed out that the super-turbine approach
can result in substantial errors in power conversion, especially when
variability exists across a wind farm. He further analyzed wind farm data to
explore alternate methods to convert wind speed to power. The underlying
assumption of the super-turbine approach ignores the variability in wind
speed across the turbines and the nonlinearity of the power curve. Some
methods, however, have added wind variability as an explanatory variable in
wind power conversion (Pieralli et al., 2015). The issue of wind farm variability is
illustrated in Fig. 1, where the blue line indicates the power conversion for
the super turbine (i.e., farm-level mean wind speed), whereas the red
distribution illustrates that each turbine may have different wind speeds
centered around the mean value, which results in a distribution of power with
a lower mean value than the super turbine for a 10 m s<inline-formula><mml:math id="M5" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> wind speed.
For this notional example, the super-turbine approach would predict an
average power of approximately 1600 kW whereas the turbine-level approach
would predict approximately 1500 kW.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><?xmltex \currentcnt{1}?><label>Figure 1</label><caption><p id="d1e170">Illustration of an instance of converting wind speed to power for a
number of turbines and for the average wind speed across the turbines, which
is termed the super-turbine power conversion methodology. This shows that the
mean power converted from the distribution of wind speeds at the turbines is
less than the super-turbine power conversion for a mean wind speed of
10 m s<inline-formula><mml:math id="M6" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f01.png"/>

      </fig>

      <p id="d1e192">The explanation for this phenomenon is in Jensen's inequality, which states
that the convex transformation of a mean is less than or equal to the mean
applied after convex transformation, and vice versa for a concave
transformation (Jensen, 1906). More generally, Jensen's inequality states
that if you have a nonlinear function, the average of the function is not
equivalent to the function of the average, and the magnitude of this
inequality depends on the nonlinearity of the function and the variability
(Pickett et al., 2015). Jensen published this mathematical proof over
100 years ago, and while there are some fields such as ecological physiology and evolutionary biology that have studied the impact of Jensen's inequality
(Denny, 2017), the impact on wind power forecasting and methods that overcome
the impact have not been studied. Denny (2017) provides details regarding the
basic concepts of Jensen's inequality with specific examples in biology.</p>
      <p id="d1e195">The impact of Jensen's inequality in wind power forecasting is best
illustrated in the steep portion of the curve for converting wind speed to
wind power, which is generally taken as a cubic function following the power
density function. Thus, at low wind speed values the transformation is convex
and at high wind speed values the transformation is concave, which is
illustrated by the orange line in Fig. 1. Therefore, at low wind speeds we
expect the super-turbine power conversion (i.e., the mean applied before) to
be less than the turbine-level power conversion, but at high wind speeds we
expect the super-turbine power conversion to be greater than the turbine
level power conversion. The application of Jensen's inequality to wind power
conversion is herein described as the super-turbine wind power conversion
paradox.</p>
      <p id="d1e198">Our goal is to quantify the error due to Jensen's inequality for a
super-turbine power conversion using both simulated hypothetical wind farm
data and empirical data. We show the expected difference using simulations
for a hypothetical wind farm that has 100 turbines and empirical data from
the Shagaya Renewable Energy Park in western Kuwait. In Sect. 2, we describe
the methodology for the simulation of the hypothetical wind farm and present
results. In Sect. 3, we present and discuss the impact of Jensen's inequality
in an empirical analysis for the 10 MW Shagaya wind farm. In Sect. 4, we
discuss the results of the hypothetical and empirical analysis. In Sect. 5,
we propose a machine learning technique for predicting the total wind farm
power and discuss the impact of those results in overcoming Jensen's
inequality. Section 6 presents conclusions and suggestions for next steps.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Hypothetical wind farm</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Simulation methodology for a hypothetical wind farm</title>
      <p id="d1e216">We statistically simulate wind speeds for a hypothetical wind farm to
quantify the expected differences for turbine-level and farm-level power
conversions for a variety of theoretical<?pagebreak page345?> meteorological conditions. Our
hypothetical wind farm has 100 2 MW wind turbines, and we simulated these
100 turbine wind speeds 1000 times for each mean wind speed considered. To do
this, we sampled from a Gaussian distribution with multiple different mean
wind speeds and two different wind speed standard deviations. We tested mean
wind speeds of 6, 7, 8, 9, 10, 11, 12, and 13 m s<inline-formula><mml:math id="M7" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> with a standard
deviation of 1 m s<inline-formula><mml:math id="M8" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and we tested the same mean wind speeds with a
standard deviation of 2 m s<inline-formula><mml:math id="M9" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The standard deviation represents the
variability of wind speeds across the wind turbines at the farm that would be
affected by the meteorological conditions, the topography, and the array
orientation. Although one would expect a general wind speed distribution to
be best fit with a Weibull distribution, here we use a Gaussian distribution because we
were sampling from specific points designed as the wind farm's mean wind speed
in the overall distribution of wind speeds. We also opted for a Gaussian
distribution because the variability across the turbines at a wind farm are
caused by multiple factors including turbulence, wake effects, local terrain,
turbine mechanics, and other microscale weather, which is a different
underlying driver of variability than using a Weibull distribution to
characterize the long-term climatology of wind speeds at a farm.</p>
      <p id="d1e255">Next, we convert the wind speed to power for the turbine-level and farm-level
wind speed for each simulation using a typical power curve. For our power
conversion methodology, we use a 10th-order polynomial fit to 2 MW Vestas
turbine data (Vestas, 2017). The 10th-order polynomial fit adequately
captures the convex shape below 9 m s<inline-formula><mml:math id="M10" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and the concave nature of the
cubic wind speed to power conversion for wind speeds above 9 m s<inline-formula><mml:math id="M11" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, as
illustrated in Fig. 2. Although wind turbine manufacturers typically provide
power curves under ideal conditions (while turbines operate in a wide variety
of meteorological conditions that are seldom ideal), this conversion is
standard and serves our purpose of having a consistent conversion of wind
speed to wind power for individual wind turbines and for the super turbine
for quantifying differences.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><?xmltex \currentcnt{2}?><label>Figure 2</label><caption><p id="d1e284">Tenth-order polynomial fit (orange line) to 2 MW Vestas turbine data
from Vestas (2017).</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f02.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Simulation of a hypothetical wind farm with low wind variability</title>
      <p id="d1e301">The simulation results for the hypothetical wind farm with lower wind
variability (standard deviation of 1 m s<inline-formula><mml:math id="M12" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) match our hypothesis that
the super-turbine power conversion value is less than the
turbine-level power conversion value at low wind speeds, whereas at high wind speeds the
super-turbine power conversion value is greater than the turbine-level power
conversion value. For each of the 1000 instances we simulated, we used the
polynomial equation to convert the wind speed to wind power for each wind
turbine as well as the average of the wind speed for the 100 turbines for
each instance. Then, we took the average of the wind power calculated over
the 1000 simulated instances for each wind speed mean and standard deviation.
The turbine-level mean values are plotted as red asterisks in Fig. 3 for mean
wind speeds from 6 to 13 m s<inline-formula><mml:math id="M13" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> drawn from a Gaussian distribution with
wind speed standard deviations of 1 m s<inline-formula><mml:math id="M14" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. These are compared to the
super-turbine mean averaged across the 1000 instances, which are indicated by
blue dots in Fig. 3. The plot shows that the mean for the turbine-level power
conversion is less than the mean for the super-turbine power conversion for
wind speeds of 9 m s<inline-formula><mml:math id="M15" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and greater. For wind speeds less than
9 m s<inline-formula><mml:math id="M16" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, the opposite is true, as is expected from Jensen's
inequality.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3"><?xmltex \currentcnt{3}?><label>Figure 3</label><caption><p id="d1e366">Comparison of the turbine-level power conversion to the
super-turbine power conversion for the hypothetical wind farm with a
1 m s<inline-formula><mml:math id="M17" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> standard deviation across the turbines.</p></caption>
          <?xmltex \igopts{width=227.622047pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f03.png"/>

        </fig>

      <p id="d1e387">The difference between the turbine-level power conversion and the
super-turbine power conversion, shown in Table 1, indicates that the
turbine-level power conversion has a greater average value than the
super-turbine power conversion up to 8 m s<inline-formula><mml:math id="M18" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, but at 9 m s<inline-formula><mml:math id="M19" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>
and greater the reverse is true with the maximum difference of <inline-formula><mml:math id="M20" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>61.28 kW
per turbine. For a wind farm with 100 turbines with a 2 MW capacity each, the
super-turbine wind conversion would result in an overestimate of power by
over 6 MW, an error of approximately 3 %. For a two-sample related
<inline-formula><mml:math id="M21" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> test, all wind speeds are significantly different at the 95 % level
as shown in the third column of Table 1.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e432">Difference of the super-turbine power conversion subtracted from the
turbine-level power conversion for the hypothetical wind farm of 100 turbines
with 1 m s<inline-formula><mml:math id="M22" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> variability simulated 1000 times. The <inline-formula><mml:math id="M23" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value indicates
statistical significance at the 95 % level for a two-sample related
<inline-formula><mml:math id="M24" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> test.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Wind speed</oasis:entry>
         <oasis:entry colname="col2">Difference</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M25" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(m s<inline-formula><mml:math id="M26" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">(kW/turbine)</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">24.16</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">30.49</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">35.77</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">33.75</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">2.07</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">34.43</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">61.38</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">41.33</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">14.51</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">14</oasis:entry>
         <oasis:entry colname="col2">2.18</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Simulation of a hypothetical wind farm with high wind
variability</title>
      <p id="d1e680">The simulation results with higher wind variability (standard deviation of
2 m s<inline-formula><mml:math id="M31" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) similarly match our hypothesis that at low wind speeds the
super-turbine power conversion value is less than the turbine-level power
conversion value, whereas at high wind speeds the super-turbine<?pagebreak page346?> power
conversion value is greater than the turbine-level power conversion value;
however, the magnitude of the differences is greater than with lower wind
speed variability.</p>
      <p id="d1e695">The results for the simulations drawing from a Gaussian distribution with
mean wind speeds of 6–13 m s<inline-formula><mml:math id="M32" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and a 2 m s<inline-formula><mml:math id="M33" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> wind speed
standard deviation appear in Fig. 4. For larger variability in the wind
across the wind turbines, the deviation is more pronounced between the
turbine-level wind power conversion and the super-turbine power conversion,
especially at wind speeds of 10, 11, and 12 m s<inline-formula><mml:math id="M34" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The differences
between conversion methodologies are shown in Table 2, with the maximum
difference of <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">165.66</mml:mn></mml:mrow></mml:math></inline-formula> kW per turbine at a mean wind speed of
11 m s<inline-formula><mml:math id="M36" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. For a wind farm with 100 turbines with a 2 MW capacity, the
super-turbine wind conversion would overestimate the wind speed
by over 16 MW, which represents an error of more than 8 %. For a two-sample related
<inline-formula><mml:math id="M37" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> test, all wind speeds are significantly different at the 95 % level
as shown in the third column of Table 2.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4"><?xmltex \currentcnt{4}?><label>Figure 4</label><caption><p id="d1e766">Comparison of the turbine-level power conversion to the
super-turbine power conversion for the hypothetical wind farm with
2 m s<inline-formula><mml:math id="M38" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> standard deviation across the turbines.</p></caption>
          <?xmltex \igopts{width=227.622047pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f04.png"/>

        </fig>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e791">Difference of the super-turbine power conversion subtracted from the
turbine-level power conversion for the hypothetical wind farm of 100 turbines
with 2 m s<inline-formula><mml:math id="M39" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> variability simulated 1000 times. The <inline-formula><mml:math id="M40" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value indicates
statistical significance at the 95 % level for a two-sample related
<inline-formula><mml:math id="M41" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> test.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Wind speed</oasis:entry>
         <oasis:entry colname="col2">Difference</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M42" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(m s<inline-formula><mml:math id="M43" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">(kW/turbine)</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">30.92</oasis:entry>
         <oasis:entry colname="col3">0.01</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">83.76</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">82.14</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6.47</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">120.23</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">165.66</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">137.44</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">40.97</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><label>Figure 5</label><caption><p id="d1e1017">Map of the Shagaya wind farm location in western Kuwait
characterized by flat, arid desert.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f05.jpg"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Shagaya wind farm</title>
      <p id="d1e1035">In addition to the simulated hypothetical data, we examined data from a
10 MW wind farm located at the Shagaya<?pagebreak page347?> Renewable Energy Park in Kuwait. The
location of the wind farm is labeled “Shagaya” in Fig. 5 and the turbines are
located at an elevation of 240 m, a latitude of approximately
29.22<inline-formula><mml:math id="M49" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> N, and a longitude of approximately 47.05<inline-formula><mml:math id="M50" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> E. The local topography
is flat and the climate is characterized by persistent arid conditions with
large temperature differences between summer and winter. There are five 2 MW
turbines currently located at Shagaya with data available at a 10 min
frequency from 1 September 2017 until 31 May 2018, which comprised 34 393
instances in the initial dataset. The SCADA dataset includes the mean power
produced by each turbine over a 10 min period as well as the standard
deviation, minimum, and maximum of the 1 min raw data over this 10 min
period.</p>
      <p id="d1e1056">The data were preprocessed for quality control beginning with removing
instances with missing data that occurred in approximately 22.5 % of the
original dataset. Negative power observations in the dataset were set equal
to 0 MW power as there were small negative values recorded when the wind
turbine was not generating power, likely a result of the turbine consuming a
small amount of power. Finally, if the wind speed at the turbine was measured
at greater than 3 m s<inline-formula><mml:math id="M51" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> but no power was reported, those instances
were removed from the dataset as they reflected times of possible maintenance
or other forced shutdown of the turbine, which occurred in 2.09 % of the
original dataset. We converted the measured wind speed to a converted wind
power using the 10th-order polynomial and all wind power values converted
from wind speeds that were above 2020 kW were replaced with 2020 kW, as
that was approximately the maximum observed. The total dataset size after
quality control comprised 23 679 instances that included measured power at all
five turbines. Over this period of time, the average power at each turbine
ranged between 787 and 813 kW and the average wind speed ranged between 7.00
and 7.13 m s<inline-formula><mml:math id="M52" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> with a standard deviation across the turbines of
0.26 m s<inline-formula><mml:math id="M53" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. To quantify the correlation between the wind speed and the
power measured at each turbine, we computed the <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> between wind speed
and power for each turbine independently. Using all data, the wind speed to
power <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> was in the range of 0.76–0.86 for each of the turbines;
when limiting the data to the range of 3–12 m s<inline-formula><mml:math id="M56" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, we found that the
<inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> was in the range of 0.79–0.90 for each of the turbines.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3" specific-use="star"><?xmltex \currentcnt{3}?><label>Table 3</label><caption><p id="d1e1144">Mean wind speed and mean power differences between one turbine and
all other turbines at the Shagaya wind farm.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Difference</oasis:entry>
         <oasis:entry colname="col2">Mean wind</oasis:entry>
         <oasis:entry colname="col3">Mean</oasis:entry>
         <oasis:entry colname="col4">Mean absolute</oasis:entry>
         <oasis:entry colname="col5">Mean absolute</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">between</oasis:entry>
         <oasis:entry colname="col2">speed</oasis:entry>
         <oasis:entry colname="col3">power</oasis:entry>
         <oasis:entry colname="col4">wind speed</oasis:entry>
         <oasis:entry colname="col5">power</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">turbine 1</oasis:entry>
         <oasis:entry colname="col2">difference</oasis:entry>
         <oasis:entry colname="col3">difference</oasis:entry>
         <oasis:entry colname="col4">difference</oasis:entry>
         <oasis:entry colname="col5">difference</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">and ...</oasis:entry>
         <oasis:entry colname="col2">(m s<inline-formula><mml:math id="M58" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">(kW)</oasis:entry>
         <oasis:entry colname="col4">(m s<inline-formula><mml:math id="M59" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5">(kW)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">0.13</oasis:entry>
         <oasis:entry colname="col3">19.27</oasis:entry>
         <oasis:entry colname="col4">0.28</oasis:entry>
         <oasis:entry colname="col5">61.02</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">0.09</oasis:entry>
         <oasis:entry colname="col3">22.38</oasis:entry>
         <oasis:entry colname="col4">0.31</oasis:entry>
         <oasis:entry colname="col5">70.16</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">0.03</oasis:entry>
         <oasis:entry colname="col3">23.24</oasis:entry>
         <oasis:entry colname="col4">0.35</oasis:entry>
         <oasis:entry colname="col5">73.80</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">0.08</oasis:entry>
         <oasis:entry colname="col3">27.19</oasis:entry>
         <oasis:entry colname="col4">0.41</oasis:entry>
         <oasis:entry colname="col5">100.11</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e1338">Next, we quantified the difference in nacelle wind speed, measured wind
power, and converted wind power among the turbines at the Shagaya wind farm.
Table 3 shows the mean wind speed (second column) and measured mean power
(third column) differences between turbine 1 and all of the other turbines. The mean difference in wind speed varied from 0.03 to 0.13 m s<inline-formula><mml:math id="M60" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and the mean difference in power ranged between 19.27 and 27.19 kW. We then computed the farm-level power with the super-turbine approach using the mean wind speed across the
turbines and the polynomial fit to convert to power. We also computed
the turbine-level total wind farm power by converting the wind speed at<?pagebreak page348?> each
turbine to power and taking the sum across all turbines. Comparing both power
conversion techniques to the actual power produced we found a mean absolute
difference of 2.63 kW per 2 MW turbine, or a total wind farm power
difference of 13.15 kW. We then computed a mean absolute error of 68.83 kW
per 2 MW turbine between the super-turbine power conversion and the measured
power and a mean absolute error of 68.52 kW per 2 MW turbine between the
turbine-level power conversion and the measured power.</p>
      <p id="d1e1353">The differences between the power conversion using a polynomial fit to the
wind speed data and the measured power are not only due to the effect of
Jensen's inequality, but are also due to the Shagaya wind speeds measurements by nacelle anemometers. These measurements occur behind the blades of a turbine and the wind speeds are consequently impacted by wake effects. St. Martin et
al. (2017) showed that there is a substantial difference at wind speeds of
greater than 9 m s<inline-formula><mml:math id="M61" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and accounted for the wake effects of using
nacelle wind speeds for power conversion by applying a fifth-order polynomial
fit between an upwind met tower and the nacelle wind speed data. We avoid the
use of a transfer function to map between a met tower and the nacelle wind
speeds because we want to isolate the impact from Jensen's inequality;
however, an operational power conversion methodology should attempt to take
the impact of using nacelle wind speeds to convert to power into account
and should therefore either include the met tower observations as a predictor
in the power conversion machine learning or should apply a transfer function
to the nacelle wind speed data.</p>
      <p id="d1e1368">Finally, we compared the super-turbine power conversion to the turbine-level
power conversion from the mean wind speed and nacelle wind speeds at the
individual turbines. The data are plotted in Fig. 6 with the super-turbine
mean power per 2 MW turbine on the <inline-formula><mml:math id="M62" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis and the turbine-level mean power
per 2 MW turbine on the <inline-formula><mml:math id="M63" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis. The scatter along the 1 : 1 line aligns
with the hypothetical data analysis: for wind speeds less than
8 m s<inline-formula><mml:math id="M64" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> the super-turbine mean underestimated the power, and for wind
speeds greater than 8 m s<inline-formula><mml:math id="M65" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> the super-turbine mean overestimated the
power.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6"><?xmltex \currentcnt{6}?><label>Figure 6</label><caption><p id="d1e1411">Scatterplot of the super-turbine power conversion to the
turbine-level power conversion from the mean wind speed and wind speeds at
the individual turbines at the Shagaya wind farm.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f06.png"/>

      </fig>

</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Discussion</title>
      <p id="d1e1428">The greater difference for the power conversion in the 2 m s<inline-formula><mml:math id="M66" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>
standard deviation hypothetical wind speed variability scenario of the
simulated data is a result of a higher frequency of turbine-level power
conversions further from the mean of the wind speed. This is illustrated in
Fig. 7 where the blue distribution on the <inline-formula><mml:math id="M67" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis indicates the mean wind
speeds used for the super-turbine power conversion, which is shown as the
blue distribution on the <inline-formula><mml:math id="M68" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis. The red distribution on the <inline-formula><mml:math id="M69" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis
indicates the turbine-level wind speeds used in the conversion to power,
which is shown as the red distribution on the <inline-formula><mml:math id="M70" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis. This analysis is for
the 1000 simulated instances in the 2 m s<inline-formula><mml:math id="M71" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> variability scenario. The
turbine-level power conversion draws from a wider distribution whereas the mean value in the super-turbine power conversion draws from a narrower
distribution. This result occurs because taking the mean of the individual
simulations narrows the distribution via the law of large numbers where<?pagebreak page349?> the
mean of a large number of simulations or observations should approach the
expected value as the number of simulations or observations increases (Wilks,
2011). The wind speed and wind power
are asymmetric around the mean of the distribution of the wind speed at
10 m s<inline-formula><mml:math id="M72" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, as illustrated in Fig. 1 where the orange line is the
polynomial fit to the data for the cubic power transformation. At wind speeds
of 12 m s<inline-formula><mml:math id="M73" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and greater, the power stays approximately constant at the
maximum value of 2000 kW. However, below wind speeds of 10 m s<inline-formula><mml:math id="M74" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> on
the left-hand side of the wind speed distribution, the power decreases
according to the orange line and does not hit a minimum value in the same way
the power achieves a maximum value on the right-hand side of the wind speed
distribution. This simulated dataset illustrates the impact of Jensen's
inequality on the wind speed to power conversion and how larger variability
will introduce greater differences due to more samples drawn from the
distribution further from the mean.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7"><?xmltex \currentcnt{7}?><label>Figure 7</label><caption><p id="d1e1522">Illustration of the distribution of simulated wind speeds for each
turbine and the super-turbine mean. Red indicates the super-turbine power
conversion distribution for the 1000 simulated instances in the
2 m s<inline-formula><mml:math id="M75" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> variability scenario, and blue indicates the turbine-level
power conversion in the same variability scenario.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f07.png"/>

      </fig>

      <p id="d1e1543">The empirical data from the Shagaya wind farm in Kuwait highlights the same
structural differences between the super-turbine wind power conversion and
the turbine-level wind power conversion. Although the magnitude of the
differences is less than the magnitude of the simulated hypothetical data,
the wind farm in Kuwait is characterized by less variability among the
turbines than would be expected from a wind farm that covers a larger spatial
area, that is located in more diverse geography, or that experiences more variable
weather which could produce greater wind speed variability among turbines.
Wind farms may not measure wind speed and wind power at each turbine
individually; therefore, a technique to predict the total wind power at the
connection node given information about the mean wind speed and the
variability across the wind farm would be valuable, and machine learning may
provide an alternative solution.</p>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Machine learning</title>
      <p id="d1e1555">Machine learning has been used to convert wind speed to power for wind farms
where data are available (Mahoney et al., 2012; Parks et al., 2011). Machine
learning is best utilized when there is a nonlinear relationship among the
predictors and the predictand and the true relationships can be found in the
dataset, which is a characteristic of this wind power conversion problem. The
machine learning model used here is the random forest supervised learning
method (Breiman, 2001). The random forest represents an ensemble of
regression trees where the final prediction is an average of the prediction
from each of the trees. Figure 8 illustrates the structure of the random
forest: the final prediction is an average of the predictions from each
tree in the forest where each tree is given a subset of the available
predictors and training data. Regression trees utilize the predictive power
of dividing a dataset into smaller subsets based on the predictive
relationships between the predictor and the predictand until the subsets
minimize the cost function (Witten and Frank, 2005). Regression trees do not
search for the most important predictor in order to split a node, but rather
exclusively search for the best predictor among a random subset of the predictors. This technique results in a final model that reduces overfitting the training
data and ultimately generalizes better (Witten and Frank,
2005). The random forest used here is the python package
“scikit-learn” random forest regressor (Pedregosa et al., 2011). Note
that we opted to use the random forest method because it is a machine
learning method that captures nonlinear relationships between predictors and
the predictand, and has the added benefit of avoiding overfitting as it is
an ensemble approach. Other machine learning methods such as the artificial
neural network or gradient boosted regression trees may work similarly well,
however, our goal is not to find the most optimal machine learning approach
but rather to highlight that machine learning can be used to learn the impact of Jensen's inequality in this application.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8"><?xmltex \currentcnt{8}?><label>Figure 8</label><caption><p id="d1e1560">Diagram of the random forest machine learning method, which is an
ensemble of regression trees.</p></caption>
        <?xmltex \igopts{width=213.395669pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f08.png"/>

      </fig>

<sec id="Ch1.S5.SS1">
  <label>5.1</label><title>Hypothetical wind farm</title>
      <p id="d1e1576">Our goal in applying the random forest to the hypothetical data is to show
that this machine learning method is able to learn the effect of Jensen's
inequality on the super-turbine wind power conversion by using the mean
wind speed rather than the individual wind speeds at each turbine. We use
the turbine-level power conversion as the “observed” data that we are
trying to predict as this is the power aggregated from each turbine to
the total farm level. Random forest models<?pagebreak page350?> were trained on each simulated
wind speed dataset independently, which means that there were 10 random
forests for each wind speed from 5 to 14 m s<inline-formula><mml:math id="M76" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The predictors provided
to the random forest were the super-turbine power conversion, the mean wind
speed, and the standard deviation of the wind speed. The optimal random
forest configuration was found to have a maximum number of 200 trees. The
maximum number of predictors the random forest uses in an individual tree
was found to be two, and the minimum number of leaves that are required to
split an internal node was determined to be one. We randomly split the
dataset into 80 % training and 20 % testing and all results are shown on
the test dataset.</p>
      <p id="d1e1591">The random forest is able to substantially reduce the error from the
super-turbine power conversion for all wind speeds, except for
9 m s<inline-formula><mml:math id="M77" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, which is right at the inflection point in the polynomial
power conversion and has minimal effect from Jensen's inequality, as shown by
the difference of 2.07 kW per 2 MW turbine prior to applying machine
learning. The results are shown in Table 4 where the average difference
between the super-turbine power conversion and the turbine-level power
conversion appears in the middle column and the mean absolute error (MAE) of
the random forest appears in the right column. For wind speeds less than or equal to
9 m s<inline-formula><mml:math id="M78" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, the super-turbine power conversion overestimates the power
whereas the opposite is true for wind speeds greater than or equal
9 m s<inline-formula><mml:math id="M79" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. However, the random forest is able to reduce the MAE to
between 0.51 and 2.42 kW per 2 MW turbine for all wind speeds. This minimal
remaining amount of error could be due to randomness in the simulations of
the 100 turbines because we provided the random forest the standard deviation
of the wind speeds for the 100 turbines.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T4"><?xmltex \currentcnt{4}?><label>Table 4</label><caption><p id="d1e1633">Difference of the super-turbine power conversion subtracted from the
turbine-level power conversion for the hypothetical wind farm of 100 turbines
with 1 m s<inline-formula><mml:math id="M80" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> variability simulated 1000 times in the middle column.
The MAE of the random forest for each wind speed is shown in the right-hand
column.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.95}[.95]?><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Wind speed</oasis:entry>
         <oasis:entry colname="col2">Super-turbine power</oasis:entry>
         <oasis:entry colname="col3">Random forecast power</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">(m s<inline-formula><mml:math id="M81" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">conversion difference</oasis:entry>
         <oasis:entry colname="col3">conversion error</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(kW/turbine)</oasis:entry>
         <oasis:entry colname="col3">(kW/turbine)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">24.16</oasis:entry>
         <oasis:entry colname="col3">2.42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">30.49</oasis:entry>
         <oasis:entry colname="col3">0.51</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">35.77</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">33.75</oasis:entry>
         <oasis:entry colname="col3">1.44</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">2.07</oasis:entry>
         <oasis:entry colname="col3">2.34</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">34.43</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">2.01</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">61.38</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">1.03</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">12</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">41.33</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">1.54</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">13</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">14.51</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">1.75</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">14</oasis:entry>
         <oasis:entry colname="col2">2.18</oasis:entry>
         <oasis:entry colname="col3">1.13</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>Shagaya wind farm</title>
      <p id="d1e1874">We next applied a random forest to the Shagaya empirical data to determine
whether we could improve upon the mean difference of 68.83 kW per 2 MW
turbine between the super-turbine power conversion and measured power. The
configuration of the random forest that reduced the error and maintained a
close balance between training and testing dataset error had 200 trees, which resulted in the
minimum number of samples to split an internal node of 20 and the minimum number of samples to be a
leaf node of 20. Once again, we randomly split the dataset into 80 %
training and 20 % testing and all results are shown on the test dataset.</p>
      <p id="d1e1877">We systematically tested multiple variations of predictors available in
order to minimize the error in converting the wind speeds at each farm to
the measured power. First, we tested giving the random forest the predictors
of the turbine-level converted mean power and the standard deviation across
turbines and calculated an MAE of 51.15 kW per 2 MW turbine (47.46 kW error
on training data). We then computed the mean wind speed and the standard
deviation of the wind speed and used those as predictors along with the
super-turbine power, and the error was nearly the same at 51.21 kW per 2 MW
turbine (47.55 kW error on training data). Next, we tested using the
super-turbine mean power and each turbine's individual wind speed as
predictors and found that the MAE was reduced to 50.41 kW per 2 MW turbine
(45.67 kW error on training data). These different predictor sets show that
providing the machine learning model with each of the individual turbine wind
speeds allows the model to better train to the variability across the
turbines. Note that as there were only five wind turbines in this
dataset, the standard deviation may not adequately represent the variability
across the turbines compared with a wind farm with a hundred turbines and
likely a more normal distribution with variability better represented by the
standard deviation. Finally, we tested adding in the five-turbine mean
temporal standard deviation of the 1 min wind speeds over the
10 min interval as a predictor. This predictor set that included<?pagebreak page351?> only
the individual turbine wind speeds and the temporal standard deviation of
the wind speeds produced the lowest error with the MAE decreasing to 44.27 kW
per 2 MW turbine (40.19 kW error on training data). This is a 35.7 %
reduction in error from the original super-turbine power conversion using
the mean wind speed and machine learning compared with the super-turbine
power conversion using the mean wind speed.</p>
      <p id="d1e1880">Finally, we compared the predictions to the measured power to evaluate the
distribution of differences across the range of measured power and found that
the differences increased as the power increased, although the majority of the instances fell along the 1 : 1 line. The predictions from the random forest
are shown on the <inline-formula><mml:math id="M86" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis with the measured power on the <inline-formula><mml:math id="M87" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis of Fig. 9.
The <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> value of 0.9898 for a linear fit to the observed and predicted
power highlights the predictive power of the random forest in power
conversion. The greater variability in the differences between the measured
power and the predicted power at higher power values could be a function
of mechanical reasons that cause the turbines to produce lower power than
expected at a given wind speed. Ultimately, however, one would not expect a
machine learning model to capture decreases in power produced when the
turbines may not be functioning at rated capacity unless the dataset included
information about curtailment or other mechanical causes.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9"><?xmltex \currentcnt{9}?><label>Figure 9</label><caption><p id="d1e1911">Scatterplot of the measured mean power at the Shagaya wind farm
compared to the random forest predicted mean, with the black line indicating
a linear fit with a <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of 0.99.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://wes.copernicus.org/articles/4/343/2019/wes-4-343-2019-f09.png"/>

        </fig>

<?xmltex \hack{\newpage}?>
</sec>
</sec>
<sec id="Ch1.S6" sec-type="conclusions">
  <label>6</label><title>Conclusions</title>
      <p id="d1e1942">The wind power forecasted at the farm level is of the utmost importance for a
utility or system operator; however, the variability at the farm level is a
function of the variability across the turbines at the farm. In this study we
use both hypothetical simulated data and empirical data to analyze the effect
of Jensen's inequality on the application of a super-turbine power conversion
where a mean wind speed is used to convert to power. We showed that there are
systematic nonlinear differences between a turbine-level power conversion
and a super-turbine power conversion at a range of wind speeds from 5 to
14 m s<inline-formula><mml:math id="M90" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The effect of Jensen's inequality was found to be most
pronounced at approximately 7 and 11 m s<inline-formula><mml:math id="M91" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> in the simulated
hypothetical wind farm, where the curvature of the power curve is the
greatest. Understanding the impact of Jensen's inequality on the total power
at a wind farm, or the super-turbine wind conversion paradox, allows a
utility to choose a power conversion methodology that incorporates this
effect for a more accurate power conversion estimate.</p>
      <p id="d1e1969">In the empirical data analysis, we were similarly able to show differences
between the turbine-level power conversion and a super-turbine power
conversion even for a relatively small 10 MW wind farm consisting of five
individual turbines in flat desert terrain. One would expect that a wind
farm with more turbines in a larger area would exhibit more variability,
especially if there were local terrain or wake effects. In the hypothetical
data we showed there is a larger effect of Jensen's inequality as the wind
variability increases, as would be expected for a larger wind farm.</p>
      <p id="d1e1972">Finally, we showed that the random forest machine learning method is able to
reduce the error in the wind speed to power conversion when provided with
predictors that quantify the differences due to Jensen's inequality. This was
first done using the hypothetical simulated data where the error was reduced
to under 2.5 kW per 2 MW turbine for all wind speeds from 5 to
14 m s<inline-formula><mml:math id="M92" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. In the empirical data analysis, we were able to reduce the
error from an average difference of over 68 kW per 2 MW turbine to a little
over 44 kW per 2 MW turbine, which represents an error reduction of greater than 35 %.</p>
      <p id="d1e1987">In this study, we focused on utilizing machine learning to isolate and
remediate the differences caused by Jensen's inequality on wind speed to
power conversion. We did not try to find the lowest error methodology for
converting wind speed to power, as a machine learning model would likely have
lower error when including other meteorological variables such as wind
direction, temperature, and humidity. The error for the machine learning
method would also be impacted by measurement error including the error
caused by using nacelle wind speeds without a transfer function, so we would
not expect any method to produce an error of 0 kW per 2 MW turbine in this
study. The super turbine approach will typically use power measured at a
meter for the entire farm<?pagebreak page352?> whereas the turbine approach will use the power
measured at each turbine; however, there can be discrepancy between the sum
of the turbine power values and the power measured at the farm's meter due to
losses in transmission. Ultimately, utilities are interested in the power
measured at the farm's meter or at a meter on the transmission line away
from the farm, and the machine learning method should produce accurate
predictions of power at that meter considering the effect of Jensen's
inequality.</p>
      <p id="d1e1991">Jensen's inequality can produce significant differences in wind power
conversion between a super-turbine approach and a turbine-level approach to power
conversion, which we have named the super-turbine wind power conversion
paradox. This analysis suggests that forecasters responsible for predicting
power for a utility should perform power conversion at the turbine level or
use machine learning to reduce the effects of Jensen's inequality in power
conversion. Additionally, if the temporal standard deviation of wind speed
is known, machine learning can incorporate both the effects from Jensen's
inequality on the spatial and temporal variability of wind speeds across
wind turbines at a wind farm.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e1998">The data and code from this study are subject to a nondisclosure agreement and cannot be released.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e2004">SEH brought attention to the power conversion discrepancies after discussions with utility stakeholders at the 2018 Energy Systems Integration Group Forecasting Workshop. SEH and TCM jointly identified Jensen's inequality as a potential underlying cause of the power conversion methodologies and proposed using machine learning to overcome it.  TCM performed the simulations for the hypothetical wind turbine analysis, analyzed the data from the Shagaya wind farm, and applied the machine learning models to overcome the super-turbine power conversion paradox.  TCM produced all figures and an initial draft of the article, which was revised by SEH and jointly finalized.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e2010">The authors declare that they have no conflict of
interest.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e2016">The authors thank Drake Bartlett of Xcel Energy for bringing attention to
this issue. We also wish to acknowledge the Kuwait Institute for Scientific
Research (KISR) and Majed Al-Rasheedi for use of the Shagaya wind farm data.
We thank Jared Lee, Gerry Wiener, and Branko Kosovic for helpful comments.
The National Center for Atmospheric Research is sponsored by the National
Science Foundation.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e2021">This research has been supported by the Kuwait
Institute for Scientific Research (grant no. P-KISR-12). This material is also based upon work supported by the National Center for Atmospheric Research, which is a major facility sponsored by the National Science Foundation under cooperative agreement no. 1852977.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e2027">This paper was edited by Athanasios Kolios and reviewed by
two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>
Ahlstrom, M., Bartlett, D., Collier, C., Duchesne, J., Edelson, D., Gesino,
A., Keyser, M., Maggio, D., Milligan, M., Mohrlen, C., O'Sullivan, J., Sharp,
J., Storck, P., and Rodriguez, M.: Knowledge is power: Efficiently
integrating wind energy and wind forecasts, IEEE Power Energy M., 11, 45–52,
2013.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>
Bartlett, D.: Power Conversion: Plant-level vs. Turbine-Level, Temperature,
Static vs. Self-learning, Energy System Integration Group Forecasting
Workshop, St. Paul, MN, 21 June 2018.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>
Breiman, L.: Random Forest, Mach. Learn., 45, 5–32, 2001.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>Bulaevskaya, V., Wharton, S., Clifton, A., Qualley, G., and Miller, W. O.:
Wind power curve modeling in complex terrain using statistical models, J.
Renew. Sustain. Energ., 7, 013103, <ext-link xlink:href="https://doi.org/10.1063/1.4904430" ext-link-type="DOI">10.1063/1.4904430</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Denny, M.: The fallacy of the average: on the ubiquity, utility and
continuing novelty of Jensen's inequality, J. Exper. Biol., 220, 139–146,
<ext-link xlink:href="https://doi.org/10.1242/jeb.140368" ext-link-type="DOI">10.1242/jeb.140368</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>Jensen, J. L. W. V.: Sur les fonctions convexes et les inégalités
entre les valeurs moyennes, Acta Mathematica, 30, 175–193,
<ext-link xlink:href="https://doi.org/10.1007/BF02418571" ext-link-type="DOI">10.1007/BF02418571</ext-link>, 1906.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>
Mahoney, W. P., Parks, K., Wiener, G., Liu, Y., Myers, B., Sun, J., Delle
Monache, L., Johnson, D., Hopson, T., and Haupt, S. E.: A Wind Power
Forecasting System to Optimize Grid Integration, IEEE T. Sustain. Energ., 3,
670–682, 2012.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Orwig, K. D., Ahlstrom, M., Banunarayanan, V., Sharp, J., Wilczak, J. M.,
Freedman, J., Haupt, S. E., Cline, J., Bartholomie, O., Hamman, H., Hodge,
B.-M., Finley, C., Nakafuji, D., Peterson, J., Maggio, D., and Marquis, M.:
Recent Trends in Variable Generation Forecasting and Its Value to the Power
System, IEEE T. Renew. Energ., 6, 924–933, <ext-link xlink:href="https://doi.org/10.1109/TSTE.2014.2366118" ext-link-type="DOI">10.1109/TSTE.2014.2366118</ext-link>,
2014.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Pandit, R. K., Infield, D., and Carroll, J.: Incorporating air density into a
Gaussian process wind turbine power curve model for improving fitting
accuracy, Wind Energy, 22, 302–315, <ext-link xlink:href="https://doi.org/10.1002/we.2285" ext-link-type="DOI">10.1002/we.2285</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>
Parks, K., Wan, Y.-H., Wiener, G., and Liu, Y.: Wind Energy Forecasting – A
Collaboration of the National Center for Atmospheric Research (NCAR) and Xcel
Energy, National Renewable Energy Laboratory, Golden, CO, 2011.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12,
2825–2830, 2011.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>Pickett, E. J., Thomson, D. L., Li, T. A., and Xing, S.: Jensen's Inequality
and the Impact of Short-Term Environmental Variability on Long-Term
Population Growth Rates, PLoS ONE, 10, e0136072,
<ext-link xlink:href="https://doi.org/10.1371/journal.pone.0136072" ext-link-type="DOI">10.1371/journal.pone.0136072</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>
Pieralli, S., Ritter, M., and Odening, M.: Efficiency of Wind Power
Production and its Determinants, Agricultural &amp; Applied<?pagebreak page353?> Economics
Association and Western Agricultural Economics Association Annual Meeting,
San Francisco, CA, 26–28 July 2015.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>St. Martin, C. M., Lundquist, J. K., Clifton, A., Poulos, G. S., and Schreck,
S. J.: Atmospheric turbulence affects wind turbine nacelle transfer
functions, Wind Energ. Sci., 2, 295–306,
<ext-link xlink:href="https://doi.org/10.5194/wes-2-295-2017" ext-link-type="DOI">10.5194/wes-2-295-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Vestas V90 2 MW:
<uri>https://en.wind-turbine-models.com/turbines/16-vestas-v90#powercurve</uri>
(last access: 10 September 2018), 14 July 2017.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Wilks, D. S.: Statistical methods in the atmospheric sciences, Elsevier
Academic Press, Amsterdam, Boston, 2011.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>
Witten, I. H. and Frank, E., Data Mining: Practical Machine Learning Tools
and Techniques, 2nd edn., Morgan Kaufmann Publishers Inc., San Francisco, CA,
2005.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>World Wind Energy Association: Wind Power Capacity Reaches 539 GW, 52.6 GW
Added in 2017, Press Release, available at:
<uri>https://wwindea.org/blog/2018/02/12/2017-statistics/6/</uri>, last access: 30 November 2018.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>The super-turbine wind power conversion paradox: using machine learning to reduce errors caused by Jensen's inequality</article-title-html>
<abstract-html><p>Wind power is a variable generation resource and therefore
requires accurate forecasts to enable integration into the electric grid.
Generally, the wind speed is forecast for a wind plant and the forecasted
wind speed is converted to power to provide an estimate of the expected
generating capacity of the plant. The average wind speed forecast for the
plant is a function of the underlying meteorological phenomena being
predicted; however, the wind speed for each turbine at the farm is also a
function of the local terrain and the array orientation. Conversion
algorithms that assume an average wind speed for the plant, i.e., the
super-turbine power conversion, assume that the effects of the local terrain
and array orientation are insignificant in producing variability in the wind
speeds across the turbines at the farm. Here, we quantify the differences in
converting wind speed to power at the turbine level compared with a
super-turbine power conversion for a hypothetical wind farm of 100 2&thinsp;MW
turbines as well as from empirical data. The simulations with simulated
turbines show a maximum difference of approximately 3&thinsp;% at
11&thinsp;m&thinsp;s<sup>−1</sup> with a 1&thinsp;m&thinsp;s<sup>−1</sup> standard deviation of wind speeds and
8&thinsp;% at 11&thinsp;m&thinsp;s<sup>−1</sup> with a 2&thinsp;m&thinsp;s<sup>−1</sup> standard deviation of wind
speeds as a consequence of Jensen's inequality. The empirical analysis shows
similar results with mean differences between converted wind speed to power
and measured power of approximately 68&thinsp;kW per 2&thinsp;MW turbine. However, using
a random forest machine learning method to convert to power reduces the error
in the wind speed to power conversion when given the predictors that quantify
the differences due to Jensen's inequality. These significant differences can
lead to wind power forecasters overestimating the wind generation when
utilizing a super-turbine power conversion for high wind speeds, and
indicate that power conversion is more accurately done at the turbine level
if no other compensatory mechanism is used to account for Jensen's
inequality.</p></abstract-html>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Ahlstrom, M., Bartlett, D., Collier, C., Duchesne, J., Edelson, D., Gesino,
A., Keyser, M., Maggio, D., Milligan, M., Mohrlen, C., O'Sullivan, J., Sharp,
J., Storck, P., and Rodriguez, M.: Knowledge is power: Efficiently
integrating wind energy and wind forecasts, IEEE Power Energy M., 11, 45–52,
2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Bartlett, D.: Power Conversion: Plant-level vs. Turbine-Level, Temperature,
Static vs. Self-learning, Energy System Integration Group Forecasting
Workshop, St. Paul, MN, 21 June 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Breiman, L.: Random Forest, Mach. Learn., 45, 5–32, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Bulaevskaya, V., Wharton, S., Clifton, A., Qualley, G., and Miller, W. O.:
Wind power curve modeling in complex terrain using statistical models, J.
Renew. Sustain. Energ., 7, 013103, <a href="https://doi.org/10.1063/1.4904430" target="_blank">https://doi.org/10.1063/1.4904430</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Denny, M.: The fallacy of the average: on the ubiquity, utility and
continuing novelty of Jensen's inequality, J. Exper. Biol., 220, 139–146,
<a href="https://doi.org/10.1242/jeb.140368" target="_blank">https://doi.org/10.1242/jeb.140368</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Jensen, J. L. W. V.: Sur les fonctions convexes et les inégalités
entre les valeurs moyennes, Acta Mathematica, 30, 175–193,
<a href="https://doi.org/10.1007/BF02418571" target="_blank">https://doi.org/10.1007/BF02418571</a>, 1906.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Mahoney, W. P., Parks, K., Wiener, G., Liu, Y., Myers, B., Sun, J., Delle
Monache, L., Johnson, D., Hopson, T., and Haupt, S. E.: A Wind Power
Forecasting System to Optimize Grid Integration, IEEE T. Sustain. Energ., 3,
670–682, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Orwig, K. D., Ahlstrom, M., Banunarayanan, V., Sharp, J., Wilczak, J. M.,
Freedman, J., Haupt, S. E., Cline, J., Bartholomie, O., Hamman, H., Hodge,
B.-M., Finley, C., Nakafuji, D., Peterson, J., Maggio, D., and Marquis, M.:
Recent Trends in Variable Generation Forecasting and Its Value to the Power
System, IEEE T. Renew. Energ., 6, 924–933, <a href="https://doi.org/10.1109/TSTE.2014.2366118" target="_blank">https://doi.org/10.1109/TSTE.2014.2366118</a>,
2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Pandit, R. K., Infield, D., and Carroll, J.: Incorporating air density into a
Gaussian process wind turbine power curve model for improving fitting
accuracy, Wind Energy, 22, 302–315, <a href="https://doi.org/10.1002/we.2285" target="_blank">https://doi.org/10.1002/we.2285</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Parks, K., Wan, Y.-H., Wiener, G., and Liu, Y.: Wind Energy Forecasting – A
Collaboration of the National Center for Atmospheric Research (NCAR) and Xcel
Energy, National Renewable Energy Laboratory, Golden, CO, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12,
2825–2830, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Pickett, E. J., Thomson, D. L., Li, T. A., and Xing, S.: Jensen's Inequality
and the Impact of Short-Term Environmental Variability on Long-Term
Population Growth Rates, PLoS ONE, 10, e0136072,
<a href="https://doi.org/10.1371/journal.pone.0136072" target="_blank">https://doi.org/10.1371/journal.pone.0136072</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
Pieralli, S., Ritter, M., and Odening, M.: Efficiency of Wind Power
Production and its Determinants, Agricultural &amp; Applied Economics
Association and Western Agricultural Economics Association Annual Meeting,
San Francisco, CA, 26–28 July 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
St. Martin, C. M., Lundquist, J. K., Clifton, A., Poulos, G. S., and Schreck,
S. J.: Atmospheric turbulence affects wind turbine nacelle transfer
functions, Wind Energ. Sci., 2, 295–306,
<a href="https://doi.org/10.5194/wes-2-295-2017" target="_blank">https://doi.org/10.5194/wes-2-295-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Vestas V90 2&thinsp;MW:
<a href="https://en.wind-turbine-models.com/turbines/16-vestas-v90#powercurve" target="_blank">https://en.wind-turbine-models.com/turbines/16-vestas-v90#powercurve</a>
(last access: 10 September 2018), 14 July 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Wilks, D. S.: Statistical methods in the atmospheric sciences, Elsevier
Academic Press, Amsterdam, Boston, 2011.

</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Witten, I. H. and Frank, E., Data Mining: Practical Machine Learning Tools
and Techniques, 2nd edn., Morgan Kaufmann Publishers Inc., San Francisco, CA,
2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
World Wind Energy Association: Wind Power Capacity Reaches 539&thinsp;GW, 52.6&thinsp;GW
Added in 2017, Press Release, available at:
<a href="https://wwindea.org/blog/2018/02/12/2017-statistics/6/" target="_blank">https://wwindea.org/blog/2018/02/12/2017-statistics/6/</a>, last access: 30 November 2018.
</mixed-citation></ref-html>--></article>
