Articles | Volume 10, issue 3
https://doi.org/10.5194/wes-10-559-2025
https://doi.org/10.5194/wes-10-559-2025
Research article
 | 
19 Mar 2025
Research article |  | 19 Mar 2025

Enabling efficient sizing of hybrid power plants: a surrogate-based approach to energy management system modeling

Charbel Assaad, Juan Pablo Murcia Leon, Julian Quick, Tuhfe Göçmen, Sami Ghazouani, and Kaushik Das
Abstract

Optimal sizing of hybrid power plants (HPPs), which include wind power plants and battery energy systems, is essential to prevent financial losses from under- or over-sizing relative to grid connection capacities. Accurate sizing requires high-fidelity energy management systems (EMSs) to model bidding strategies and operations in electricity markets, resulting in precise operational revenues and costs. However, due to the computational burden of such models, sizing methodologies often resort to low-fidelity EMS models, leading to faulty sizing evaluations. To address the need for a computationally efficient and accurate model that enables quantitative assessment of HPPs, we evaluate the potential of surrogate models to replace a high-fidelity EMS participating in the day-ahead electricity market in Denmark with perfect forecasts. Given the limited literature on surrogates of EMS models for utility-scale, grid-connected HPPs with batteries, we develop and compare four different surrogate models to approximate a state-of-the-art EMS model. The best-performing surrogate employs singular value decomposition for dimensionality reduction and a feed-forward neural network for regression. This surrogate achieves a normalized root mean square error of 0.81 % in approximating yearly revenues. This method proves effective in accurately evaluating the operation of HPPs across various geographical locations and hence in multiple sizing problems. Furthermore, we utilize the surrogate to evaluate the profitability of several HPP sizes, achieving a root mean square error of 0.010 on the profitability index, with values ranging between −0.13 and 0.18. This demonstrates that the developed surrogate model is suitable for HPP sizing under the given cost and financial assumptions.

Share
1 Introduction

As the renewable energy and storage industry matures, governmental incentives that sustain this rapid development have started to shift. Initially sustained by government-supported feed-in tariffs, wind power plants are transitioning to feed-in premiums or contracts for differences. As noted by Kitzing et al. (2024), contracts for differences have become increasingly significant in the European market, particularly following an agreement reached in May 2024 between the European Commission, the European Parliament, and the European Council (Council of the European Union2024). This agreement mandates that the support for renewable technologies is provided through two-sided contracts for differences or equivalent schemes, applying to both new investments and repowering. These support levels are now determined through competitive bidding procedures (Busch et al.2023). Additionally, the power plants are expected to maximize their value from energy markets such as the electricity spot market or balancing and/or reserve markets. This change exposes power plant developers to the dynamics of the wholesale electricity market. In this evolving landscape, hybrid power plants (HPPs) that include storage technology are becoming valuable for establishing robust business cases. Despite the absence of a universal definition for HPPs – highlighted by varying interpretations in the literature (Dykes et al.2020; Long et al.2022; Paska et al.2009) – for the purposes of this research, we define renewable-based HPPs as power plants that combine several generation technologies, including wind turbines and possibly energy storage, to produce electricity and other energy vectors. They operate from a single geographical location, with all generated power being transmitted to the electrical grid via a unified point of grid connection.

In the presence of a battery energy storage system, HPPs can use strategies such as market arbitrage, which involves buying and storing electricity when market prices are low and selling it when prices are high. Additionally, the integration of energy storage is crucial for smoothing power supply fluctuations, mitigating power curtailment, enabling HPPs to offer system services, and reducing grid congestion (Das et al.2019).

As HPPs transition to market-driven revenue models, optimizing their operations becomes essential to capitalize on market opportunities and address operational challenges. An energy management system (EMS) fulfills this role by devising optimal operational strategies throughout the power plant's lifetime. For an HPP with a battery energy storage system, the EMS optimizes the charge and discharge power of the battery, given the available resources (i.e., wind speed, solar power), the battery’s capacity, available grid connection capacity, and other constraints. In electricity trading, various markets allow power plant operators to sell their energy. The spot market (SM) is currently the most lucrative market where power is traded for immediate delivery. Within the SM, operators can participate in the day-ahead and hour-ahead markets. Day-ahead bidding establishes hourly prices for the following day, while hour-ahead bidding allows for adjustments based on updated generation forecasts and cleared SM prices. Since electricity markets require power producers to bid in advance on the amount of electricity they will generate, these bids rely on forecasts of renewable energy generation and market prices. If actual generation deviates from scheduled bids, financial penalties are incurred. Additionally, the balancing market (BM) offers HPPs another revenue source, operating alongside the SM to handle discrepancies between forecasted and actual demand and supply. The BM allows transmission system operators to adjust for differences arising from SM bids that are based on earlier forecasts and the real-time conditions closer to delivery. This market acts in near real time, addressing deviations from scheduled generation and imposing penalties as necessary. Consequently, an EMS aims to maximize profits by strategically storing and selling electricity in both the SM and the BM while also minimizing imbalance costs due to deviations from scheduled energy bids.

In the field of hybrid renewable energy systems, particularly in microgrids, the dispatch problem has been extensively studied, as evidenced by numerous review articles on the topic (Barbosa et al.2024; Shivarama Krishna and Sathish Kumar2015; Fathima and Palanisamy2015). However, the primary purpose of microgrids is to manage or follow load profiles within a network, whereas HPPs operate as distinct generation facilities with a connection to the power grid. This unique connection emphasizes the role of HPPs in active power generation, rather than load management alone, which calls for a distinct EMS model such as the ones developed by Toubeau et al. (2021) and Ding et al. (2016).

EMS models vary in complexity and computational demand, and for this article, we categorize them into high-fidelity (HF) and low-fidelity (LF) models. High-fidelity EMS models provide detailed and accurate representations of HPPs, capturing intricate system dynamics, precise component behaviors, and sophisticated market interactions. These models incorporate forecasting and real-time data, include comprehensive operational constraints, and optimize bidding strategies to maximize profits in electricity markets (Taha et al.2018; Zhu et al.2024; Ochoa et al.2022; Han and Hug2020; Li and Qiu2016; Abdeltawab and Mohamed2015; Yang et al.2018; Huang et al.2021; Das et al.2020). While HF EMS models offer high accuracy in estimating operational performance and financial outcomes, they require significant computational resources and time. For instance, Huang et al. (2021) develop a stochastic optimization-based HF EMS where solving the dispatch for 1 week of operation takes between 329 and 2991 s, depending on the chosen optimization algorithm among five compared. Similarly, Li and Qiu (2016) present a deterministic HF EMS model that requires using a monthly time step to reduce simulation costs.

In contrast, LF EMS models simplify the representation of HPP operations by using aggregated system models, basic forecasting methods (or none), and simplified market participation strategies (An et al.2020; Luo et al.2015; Cai et al.2016; Zhang et al.2018, 2017). They reduce computational demand by neglecting detailed component behaviors and operational constraints. Although they enable rapid simulations and are easier to integrate into optimization frameworks, their oversimplifications can lead to inaccuracies in revenue and cost estimations, potentially resulting in sub-optimal or erroneous sizing decisions. Table 1 provides an overview of the distinctions between HF and LF EMS models. Notably, there is no universally accepted standard for defining these classifications, so the table serves as a guideline based on key characteristics relevant to this study.

Table 1Comparison of HF and LF EMS models.

Download Print Version | Download XLSX

The trade-off between computational efficiency and model accuracy presents a significant challenge for the optimal sizing of HPPs. A sizing optimization of an HPP involves maximizing a financial metric by varying the wind power plant rating, battery energy, and power ratings. Calculating that financial metric requires solving an EMS model for each potential HPP configuration. Consequently, HF EMS models offer precise assessments; however, relying solely on them is impractical due to their substantial computational demands. Conversely, using LF EMS models reduces computational time but risks compromising the financial viability of the project due to inaccurate assessments. To illustrate the computational burden of an HF EMS model, we evaluate the state-of-the-art EMS developed by Zhu et al. (2023). This model requires 1250 min to solve for 25 years of operation (the assumed lifetime) of a given HPP using a single-node high-performance computing (HPC) cluster, Sophia (DTU HPC Cluster2019), which has 32 physical cores ( 16-core AMD EPYC 7351) and 128 GB of RAM (4 GB per core, DDR4 2666 MHz). Therefore, even if we need to evaluate only a few sizings for the optimizer to converge, we require a substantial amount of time to reach a solution. For example, evaluating 10 sizings takes 12 500 min, or approximately 208 h. Additionally, in previous work familiar to the authors (Leon et al.2024), a sizing optimization can take up to several hundred iterations to approach optimality. In that study, the authors use a low-fidelity EMS model to evaluate the operation of an HPP over its lifetime in a matter of 15 s. The comparison of the optimization time is based on the same computational resources. Given these computational benefits, HPP sizing optimization often relies on LF EMS models. For example, Leon et al. (2024) propose a methodology for sizing HPPs as a nested optimization problem, using two LF EMS models: a short-term EMS formulated as linear programming and a long-term rule-based EMS. The short-term EMS provides a baseline for daily optimal operations, while the long-term EMS modifies these operations to account for degradation effects and forecast inaccuracies over the plant's lifetime. Similarly, in a study aimed at optimizing the design and layout of a hybrid wind-solar–storage plant, Stanley and King (2022) employ a simple battery dispatch model, where the battery is only discharged to meet minimum power requirements. While using LF EMS models may result in reduced accuracy in revenue estimation, they are widely adopted in HPP sizing due to computational efficiency. Indeed, several review studies underscore the prevalence of LF EMS models in sizing methodologies (Roy et al.2022; Lian et al.2019; Thirunavukkarasu et al.2023).

It is challenging to quantify the accuracy loss when using LF EMS instead of HF EMS models. Research studies often test EMS models on varied configurations of HPPs, and only a few conduct direct comparative analyses within the same setup, primarily focusing on high-fidelity EMS models. For instance, Ochoa et al. (2022) compare deep reinforcement learning with both stochastic optimization and robust optimization for photovoltaic-battery HPPs using US market data, finding that deep reinforcement learning offers superior economic performance and significantly reduces computational time compared to the other two studied methods. Similarly, Han and Hug (2020) report that, in a 1-year simulation using Nord Pool data, the distributionally robust optimization model achieves higher revenues than deterministic forecasting approaches. Zhu et al. (2024) further explore this by comparing EMS models that utilize distributionally robust optimization with those based on deterministic optimization and stochastic optimization for wind–battery hybrid plants in Nordic day-ahead markets, taking imbalance settlements into account. By adjusting the parameters of the distributionally robust optimization model, they demonstrate that the economic performance ranks highest for this approach, followed by risk-neutral stochastic optimization and, finally, deterministic optimization. This approach enables more resilient offering strategies, especially in markets with high penalties for energy imbalances. Additionally, Zhu et al. (2024) examine the accuracy of total profits across three HF EMS models and show that even within these models, certain simplifications commonly found in low-fidelity models – such as the use of deterministic forecasts – can lead to revenue discrepancies of up to 7.6 % compared to the best-performing model (refer to Table 3 of the referenced paper). Given these considerations, this paper primarily focuses on reducing the computational demand of high-fidelity EMS models.

To address the computational challenges associated with implementing a realistic EMS for HPP sizing while maintaining high accuracy, a promising approach involves using data-driven surrogate-based modeling. This technique demonstrates potential in tackling computationally intensive problems across various domains (Zhang et al.2021; Lin et al.2023; Pang et al.2023). These reduced-order models (ROMs) aim to replace high-dimensional, resource-intensive problems with models that are significantly faster to simulate while accurately representing the original solution behavior. In particular, Hesthaven et al. (2022) review the development of surrogates for time-dependent problems, including those with nonlinear dynamics, which are of interest in our work. In this context, data-driven surrogate models stand out as promising solutions, thanks to major advancements in machine learning methods. These models often follow an offline–online paradigm. During the offline phase, a reduced basis is extracted from a collection of high-fidelity solutions; this reduced basis is then used to train the surrogate model by optimizing weights or coefficients that capture the system dynamics. Although this step is computationally intensive, it only needs to be performed once. In the online phase, the surrogate model uses the precomputed weights to compute new outputs almost instantly, with minimal computational cost. This paradigm enables the surrogate model not only to learn the mapping from inputs to outputs but also to understand underlying patterns within the input data, leading to faster and more accurate simulations. Numerous successful implementations of data-driven surrogate models exist in the literature. For instance, Hesthaven and Ubbiali (2018) develop an ROM using proper orthogonal decomposition (POD) to extract a reduced basis from high-fidelity solutions and employ multi-layer perceptron neural networks to approximate the coefficients of the reduced model, although time dependency is not considered. Similarly, Guo and Hesthaven (2019) use a POD projection and map the time and parameter values onto the reduced basis using tensor products of two Gaussian processes – one for time and one for parameters. Hess et al. (2023) utilize a data-driven ROM approach to efficiently compute the Rayleigh–Bénard cavity problem, integrating POD, dynamic mode decomposition, and manifold interpolation for a robust and computationally efficient model. Departing from POD, Bhatt et al. (2023) employ deep auto-encoder networks to compress high-fidelity snapshots before using these in forecasting models – specifically, long short-term memory and temporal convolutional networks for time series forecasts and convolutional neural networks for spatial feature extraction – significantly reducing computational costs in both the offline and the online stages. Most ROMs have been applied to problems described by partial differential equations with sharp gradients. In contrast, our aim is to apply similar techniques to high-fidelity EMS models for HPPs. Although surrogate modeling has advanced across multiple fields, a gap remains in developing models tailored to EMSs for utility-scale HPPs, particularly those that incorporate detailed operational strategies for market participation. This gap exists not only due to the scarcity of existing applications of surrogate models for EMSs in HPPs but also because of the complexity involved in designing an accurate surrogate model based on a multitude of input and output time series. Additionally, integrating the surrogate model within a sizing evaluation framework adds another layer of complexity.

This article seeks to evaluate the potential of data-driven surrogate models in reducing the computational burden of high-fidelity EMS models while preserving the high accuracy needed for a reliable assessment of HPPs. To achieve this, we develop four surrogate models to approximate the outputs of the HF EMS. We begin by employing two models based on multivariate linear regression, which serve as baselines due to their simplicity, interpretability, and low computational cost. Recognizing that the EMS exhibits complex and nonlinear behaviors that linear models may not capture adequately, we also develop two models based on neural networks (NNs). Neural networks, particularly feed-forward neural networks (FNNs), are capable of modeling intricate nonlinear relationships through their layered architectures and nonlinear activation functions. This makes them well suited for approximating the inherent nonlinear dynamics of the HF EMS. By leveraging NNs, we can capture complex patterns and interactions within the data that linear models might overlook, potentially achieving higher accuracy in estimating outputs. The choice of these surrogate models allows us to explore the trade-offs between model complexity, computational efficiency, and accuracy. By comparing the performance of multivariate linear regression models and neural networks, we assess the extent to which incorporating nonlinearity improves the surrogate's ability to replicate the HF EMS outputs. Additionally, NNs have demonstrated success in surrogate modeling across various fields due to their flexibility and scalability, making them a promising candidate for this application.

Building upon the EMS model developed by Zhu et al. (2023), which is detailed in the following section, this paper seeks to answer the following question: how can we enable the sizing evaluation of utility-scale HPPs based on an accurate and computationally efficient EMS model?

The major contributions of this article are as follows:

  • development of a fast and accurate surrogate model for a utility-scale HPP EMS participating in the spot market

  • demonstration of the surrogate's generalizability in different geographical locations within the same electricity market region

  • integration of the developed surrogate within a sizing evaluation framework to accurately assess the profitability of various HPP configurations.

The remainder of this paper is organized as follows: Sect. 2 describes the HF EMS model that the surrogates are based on and the methodology devised for the surrogate modeling of the EMS. Section 3 details the sizing evaluation framework for analyzing the profitability of an HPP using the HF EMS and the surrogate models. Section 4 provides details on the case study and the data used to train and validate the surrogate models, while Sect. 5 offers an in-depth analysis of the best-performing surrogate and its application. The performance of all surrogates is detailed in Appendix A. These results are put into perspective in Sect. 6 and summarized in Sect. 7.

2 Methodology

In this section, we begin by presenting the high-fidelity model that serves as the foundation for the surrogate models in Sect. 2.1, followed by the methodology for developing the four surrogate models in Sect. 2.2.

2.1 HPP operation model (EMS)

The EMS model, on which the surrogate is built, is presented in this section. The EMS model is based on a study by Zhu et al. (2023) that focuses on a co-located wind–battery HPP. This novel EMS model is formulated to optimize market participation within two sequential electricity markets: the SM and the BM, which encompass the regulatory periods of the Danish market structure. This state-of-the-art EMS has the advantage of considering

  • long-term operation of the HPP with comprehensive revenue stream calculations;

  • grid capacity as a practical constraint for the HPP;

  • the possibility of considering overplanting, which has been shown to increase the value of HPPs.

However, this paper primarily focuses on the EMS's role in day-ahead SM participation. Additionally, our study considers the Danish market structure with a dispatch interval of 15 min. As in real power plants, the SM bidding process (also referred to as SM optimization) communicates with a real-time (RT) dispatch optimization. In this framework, the SM optimization provides energy set points based on weather and market forecast data to the RT dispatch, which, in turn, uses real-time measurement data to derive real-time power values. Real-time measurements allow for the calculation of deviations and the application of penalties. The inputs to the SM optimization are time series forecasts of wind power and market prices, while the RT dispatch uses the same input time series updated with real-time measurements for each dispatch interval, as well as the bidding schedule generated from the SM optimization. For clarity, the inputs and outputs of the SM optimization and RT dispatch are listed in Table 2. The combined models, SM optimization and RT dispatch, are referred to as a high-fidelity EMS model in this paper.

Table 2Inputs and outputs of SM optimization and RT dispatch from Zhu et al. (2023).

Download Print Version | Download XLSX

While the SM optimization's input and output time series are based on hourly time steps, the RT dispatch's outputs and real-time input time series have a time step equal to the dispatch interval, i.e., 15 min for the Danish market structure. Additionally, both models assume a given HPP configuration, also referred to as sizing parameters in this article. These are defined as the wind power plant rated power (PW), the rated battery power (BP), battery energy capacity (BE), and grid connection power capacity (PG). Other battery parameters, such as charge or discharge efficiency, are assumed from Zhu et al. (2023).

Figure 1 illustrates the considered EMS workflow. The EMS operates through a structured daily cycle, beginning with the forecasting stage. On the previous day (d−1), a forecast of wind generation and spot market prices for the following day (d) is obtained. Using these data, the SM optimization is conducted at noon on day d−1, aligning with the day-ahead market closure, to determine the optimal hourly power bidding for the HPP. This optimization is formulated as a mixed-integer linear programming (MILP) problem, aiming to maximize the plant's revenue across the day by strategically bidding power on the SM. On day d, the RT dispatch optimization is executed at 15 min intervals throughout the day. This phase focuses on minimizing discrepancies between the power that was bid on the spot market and the actual real-time available power. The RT dispatch is modeled as a mixed-integer quadratic programming (MIQP) problem, which dynamically adjusts the HPP's output to meet market commitments as closely as possible, responding to variations in generation. Finally, the day concludes with a settlement process on day d+1, where the outcomes of the day's operations are reconciled. The details of all models can be found in the referenced work.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f01

Figure 1Workflow of the HF EMS of this article, developed by Zhu et al. (2023).

The optimization models were solved using the solver of IBM's Decision Optimization Studio CPLEX through the docplex Python library (IBM2023), operating on DTU’s high-performance computing cluster, Sophia (DTU HPC Cluster2019). It was observed that for a given HPP configuration, 47 min was required to compute the outputs for 1 year of operation of the HF EMS model. The underlying reason for this is due to the iterative and sequential nature of the framework. For each day, the MILP optimization is solved first, followed by the MIQP for each dispatch interval (e.g., 96 times per day). While each iteration of the MILP and MIQP problems requires a minimal amount of time (less than 0.15 s), the frequency of these optimizations is substantial. Moreover, since each time step depends on the previous one, it is necessary to perform the optimization sequentially. Table 3 shows the number of decision variables and constraints required to optimize for inputs spanning 1 year. This highlights the substantial computational time required to optimize the sizing of an HPP based on such an operational model.

Table 3Computational complexity of HF EMS models for 1 year of input data.

Download Print Version | Download XLSX

While the combination of both models allows for a realistic representation of the operation of an HPP, it has its own limitations. These limitations are also carried over to the surrogate that is built upon both models. No battery degradation model is considered in the optimization process. It is well known that lithium-ion batteries' energy capacity degrades over time in a nonlinear fashion (Xu et al.2018), which directly impacts revenues as opportunities for energy arbitrage are reduced. Additionally, as we focus on demonstrating the potential of surrogate modeling for EMSs of HPPs, perfect forecast data are used. Finally, the balancing market is not considered in this article and is left for future work.

2.2 Surrogate methodology

In this article, a surrogate model consists of several sub-components: data pre-processing, a regressor, and data post-processing. The pre-processing involves scaling, which ensures that all inputs contribute equally to the model's estimations and supports the surrogate's convergence algorithm. The post-processing is applied in accordance with the pre-processing to interpret the results in their original scale. The regressor is the model tasked with approximating the high-fidelity EMS. Section 2.2.1 details the inputs and outputs for training and evaluating the surrogate models. Section 2.2.2 describes four surrogate models, differing in their data processing and regressor models. Section 2.2.3 and 2.2.4 cover the training and validation of these models, respectively.

2.2.1 Surrogate model's inputs and outputs

Regardless of the surrogate model being evaluated, all models aim to approximate the same output time series given the same input data. Table 4 lists the various input and output time series used to train and validate the surrogate.

Table 4Input and output time series of surrogate models.

Download Print Version | Download XLSX

The input time series of the surrogate match those of the SM optimization, and its outputs align with the RT dispatch's outputs. In addition to the input time series, the regressor also considers three scalar parameters representing an HPP, which help differentiate between various HPP configurations. These parameters are represented by the ratios PW/PG, BP/PG, and BE/BP.

2.2.2 Surrogate models

Table 5 presents all surrogate models evaluated. The first surrogate, S1, which serves as our benchmark, normalizes the input and output time series for each HPP configuration and employs multivariate linear regression to compute the normalized outputs from normalized inputs. The second surrogate, S2, incorporates a dimensionality reduction method known as singular value decomposition (SVD), as developed by Gene H. Golub (1996). After normalizing inputs and outputs, we apply SVD, a common tool in numerical analysis, particularly for dimensionality reduction. The specific use of this method is detailed in this section. Surrogate models S3 and S4 are similar to S1 and S2, but they differ in their choice of regressors. Instead of employing a multivariate linear model, these models utilize a tuned FNN to capture the nonlinear relationships between inputs and outputs.

Table 5Surrogate models tested.

Download Print Version | Download XLSX

For all surrogate models, we apply data post-processing consistent with the pre-processing to ensure the output data are interpretable in their original scale. For all surrogate models, we normalize the wind power generation time series (Wt) using the turbine's rated power and the spot market price time series (SMt) by the maximum price, achieving a scaling between zero and one. Since the output time series magnitudes depend on the sizing inputs, we use these parameters as the basis for normalization following these equations:

(1)Pt,normHPP=PtHPP/PG,(2)Pt,normdis=Ptdis/BP,(3)Pt,normcha=Ptcha/BP,(4)Pt,normcurt=Ptcurt/(PW-PG).

We apply the SVD as described in Zhu et al. (2010) to derive the principal component matrices. This method is used independently for the matrices containing the input time series Min and the output time series Mout. The SVD is applied following the normalization described above and is used for surrogate models S2 and S4. Figure 2 illustrates the matrix Min, which includes all normalized input time series for a single HPP configuration: the normalized wind power time series for day d and for the HPP configuration n, Wt,norm,dHPPn, and the normalized SM prices, SMt,norm,dHPPn.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f02

Figure 2Matrix M containing input time series, denoted Min. Wt,norm,dHPPn is the normalized wind power time series for day d and for the HPP configuration n. SMt,norm,dHPPn is the normalized SM prices.

Download

As the high-fidelity EMS uses hourly time steps for forecasted wind power and SM prices, each of the input vectors, SMt,norm,dHPPn and Wt,norm,dHPPn, has 24 time steps. To expand this matrix for all HPP configurations, we concatenate horizontally (i.e., along the second dimension) each matrix Min corresponding to an HPP configuration. The output time series matrix, Mout, is constructed in a similar fashion. However, unlike Min, this matrix contains four time series, the ones defined in Eqs. (1) to (4). Note that these time series have a time step equal to the dispatch interval, e.g., 15 min.

After applying singular value decomposition (SVD) to both matrices, Min and Mout, we extract their principal component matrices and truncate them to the desired level. As a result, we obtain two sets of matrices with different truncation levels, denoted as rin and rout, respectively. The truncation level is chosen so that the explained variance is 99 %; for the definition, see Eq. (4) of Freire and Ulrych (1988). Table 6 presents an overview of the features and samples of each data-processing method for input and output data spanning a year.

Table 6Features and samples of data-processing methods.

Download Print Version | Download XLSX

From this table, we observe that the features of the SVD method are likely higher than those derived from the normalization method (this depends on the truncation level). However, the number of samples is substantially lower, which allows us to achieve a reduced representation of the data. Note that the surrogate models using only normalization for data processing (S1 and S3) have five input features: SMt, Wt, PW/PG, BP/PG, and BE/BP. Additionally, they output four features: PtHPP, Ptdis, Ptcha, and Ptcurt.

2.2.3 Surrogate training

To train a surrogate model, a training dataset is defined based on a number of HPP configurations with distinct sizing parameters. The details of this dataset can be found in Sect. 4. More specifically, this training dataset is used to train the SVD transformation and the two regressor models. Note that normalization does not require training. Two models are used to approximate the outputs of the high-fidelity EMS: a tuned FNN (models S3 and S4) and a multivariate linear regression (models S1 and S2). The latter is used as a baseline model to compare the accuracy of the neural networks.

Since the regressors used in S1 and S2 differ significantly from those in S3 and S4, their training processes also vary. Models S3 and S4 use an FNN; hence, the training is done in two steps and applied to both models individually. The first step involves a tuning process and is carried out using two hyperparameters, shown in Table 7. Within this tuning process, it is important to note that each layer can have a varying number of neurons within the provided range. Afterwards, the best-performing model from the tuning process, for both S3 and S4, is selected for more exhaustive training. This two-step approach is necessary to reduce the computational burden introduced by the tuning process, in which several hundred NN architectures are evaluated; however, the NNs are not trained until they converge. Instead, the best-performing model from the tuning process is selected for further training. To efficiently select the hyperparameters among the search space, Hyperband by Li et al. (2018) is used. Hyperband uses random sampling of hyperparameters to explore a wide range of settings. For both model S3 and model S4, a rectified linear unit (ReLU) activation function is used for all hidden layers, and for the output layer, a linear activation function is used. ReLU is an appropriate activation function for the data, particularly following the normalization process, as all input and output time series become non-negative. To train an NN, at least two settings need to be defined. First, a loss function, which measures the error between the training data and the model's estimations – the mean squared error is used. Second, an optimizer, which modifies the model's weights and biases during the training process to minimize the loss function. Each optimizer has its own set of hyperparameters. The Adam optimizer by Kingma and Ba (2014) is used with a learning rate of 10−4. The tuning results in the architectures are presented in Tables B1 and B2 in Appendix B.

Table 7FNN grid search hyperparameter space.

Download Print Version | Download XLSX

Models S1 and S2 use multivariate linear regression. They are trained using the same dataset with the objective of minimizing the mean squared error, using the same optimizer as for the FNN. Models S1 and S2 are not subject to any tuning, as there are no hidden layers or neurons. Similarly, the SVD transformations are trained on the training dataset. The transformations are trained separately for both input and output time series, resulting in two distinct transformations.

2.2.4 Surrogate validation

We aim to identify the surrogate model that offers the best compromise between training time, inference time, and accuracy. First, we assess each surrogate's accuracy on a validation dataset, which is separate from but defined similarly to the training dataset. We measure the accuracy of each surrogate using the root mean square error (RMSE) between the estimated (y^i,norm) and actual (yi,norm) values for the normalized hourly time series and for all data points in the validation dataset (N). The RMSE (ϵRMS) is computed as follows:

(5) ϵ RMS = 1 N i = 1 N ( y i , norm - y ^ i , norm ) 2 .

Since this RMSE is calculated across all output time series, it provides a broad assessment of the model's accuracy, without specific insights into each individual series. We use this metric to compare the performance of the surrogate models presented in Table 5. Additionally, we measure the training and inference time of each model.

For the best-performing surrogate model, among S1–S4, we further investigate the accuracy of each output time series using RMSE for a deeper understanding of the model. Our focus then shifts to one specific output time series, the normalized power output Pt,normHPP. This time series allows us to calculate the yearly revenues, which are required to compute the profitability index (PI), enabling us to evaluate the profitability of a given sizing, the key application of our surrogate in this article.

To explore the methodology's potential further, we assess the surrogate's generalizability across various locations within the western Danish price region, DK1. In this intra-generalizability analysis, we calculate the normalized root mean square error (NRMSE) of the yearly revenues to assess the accuracy of the surrogate's approximations. Specifically, we compare the approximated revenues (Π^k) with the actual revenues from the HF EMS (Πk) for each HPP configuration k across all configurations in the selected set K. To express the RMSE as a relative measure, we normalize it by dividing by the median of the yearly revenues for all selected HPP configurations (Median(Π)), as computed by the HF EMS. The NRMSE (ϵNRMS) is given as

(6) ϵ NRMS = 1 K k = 1 K ( Π k - Π ^ k ) 2 Median ( Π ) .

The revenue time series is extracted from observed data (from the high-fidelity model) or approximated (from the surrogate model) power time series, considering the dispatch interval Δt and the total time steps within a year, T.

(7) Π = i = t T P t HPP SM t Δ t
3 Application: PI evaluation

To assess the business case of an HPP, we can use financial metrics like the internal rate of return (IRR) and net present value (NPV). IRR calculates the HPP's annual investment return, while NPV assesses its profitability in today's value. However, when an HPP is not profitable, resulting in a negative NPV, the IRR becomes undetermined. A more meaningful measure is the profitability index (PI), calculated as NPV/CAPEX. The PI indicates how many dollars of present value benefit are generated per dollar of investment, offering a more intuitive understanding of the investment's profitability. This metric allows for a direct comparison of the relative profitability of each project, regardless of its absolute size. Additionally, when resources are limited, NPV/CAPEX can aid in prioritizing projects. Projects with higher PIs can be prioritized as they promise greater returns per unit of investment. A PI greater than 1 signifies that the NPV of future cash flows exceeds the initial investment. Note that traditionally, for power plants using one type of generation technology, the levelized cost of energy (LCoE) is used to evaluate profitability. However, to assess the various potential revenue streams, metrics such as NPV (Dykes et al.2020) or the NPV/CAPEX are more relevant. This is because storage inherently increases costs and thus the LCoE, even though it has the potential to substantially increase revenue or profit.

To compute the PI, we require the NPV, which in turn requires accurate yearly revenues and costs over the HPP's lifetime, aligning with the ideal framework shown in Fig. 3a. Yet, as discussed in the Introduction and Sect. 2.1, this method is computationally demanding. We instead use an alternative framework in Fig. 3b, utilizing the developed surrogate. This surrogate replaces the high-fidelity EMS, significantly reducing computational time and making the framework's execution feasible. The accuracy of this framework, employing the surrogate model to evaluate the PI, is presented in Sect. 5.4.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f03

Figure 3High-level sizing framework. Left: ideal sizing evaluation framework using the high-fidelity EMS. Right: developed sizing evaluation framework using a surrogate of the EMS.

Download

The PI varies between each sizing configuration (denoted as x), represented by PG,PW,BE, and BP. The PI is calculated as follows:

(8) PI = NPV ( x ) CAPEX ( x ) .

The financial model is based on the yearly cash flow (CFy) and the discount rate after tax (rAT), throughout the lifetime of the power plant Y

NPV=yYCFy(1+rAT)y

Here, Y is the lifetime of the power plant. The cash flow is calculated based on yearly profits (Py) and CAPEX.

CFy=Pyfor y>0-CAPEXfor y=0

It is important to highlight that the yearly profits are based on the revenues from the surrogate or the high-fidelity EMS (Πy), as well as the operational expenditure (OPEXy), the tax rate (τtax), and the rAT.

Py=(Πy-OPEXy)(1-τtax)

The cost model used to calculated the CAPEX and OPEX is described in Appendix C. It should be noted that the calculation of NPV/CAPEX requires only the HPP power output time series from either the high-fidelity model or the surrogate.

4 Case study

In Sect. 4.1, we introduce the training and validation dataset. Following this, Sect. 4.2 discusses the data related to the intra-generalizability analysis. In Appendix C, the cost model is presented, and the related data are provided in Table D2 of Appendix D.

4.1 Training and validation dataset

4.1.1 HPP configurations

As we rely on a surrogate to replace the high-fidelity EMS, we require a comprehensive dataset to train and validate this surrogate. Therefore, a wide range of HPP configurations needs to be covered. In addition, these configurations must be realistic and in line with industry practices. Table 8 summarizes the parameter ranges. For this article, the grid connection varies between 50 and 700 MW.

Table 8Sizing parameters and ranges.

Download Print Version | Download XLSX

To ensure an equal distribution of all variables across the entire parameter space, the Latin hypercube sampling method, by Jin et al. (2005), is used to randomly select 250 sizing configurations, of which 200 HPPs (80 %) are used to train the regressor and 50 HPPs (20 %) are used to evaluate the accuracy of the surrogate, as detailed in Sect. 2.2.2. Subsequently, the high-fidelity EMS is solved using these configurations with the input time series presented in the section below.

4.1.2 Input time series and wind turbine technology

The input time series required for the high-fidelity model, mentioned in Table 2, are generated using two tools. Wind power time series are simulated with the CorRES simulation tool (Murcia Leon et al.2021; Koivisto et al.2019). This tool is based on meteorological reanalysis data from the Weather Research and Forecasting model. CorRES' stochastic model (Koivisto et al.2020b) is integrated to add fluctuations, resulting in wind power time series with 15 min resolution. The simulation is based on meteorological data from the year 2012, with the assumption that the climate in 2030 remains unchanged from 2012. CorRES requires specific inputs, including the HPP's longitude and latitude, the hub height of the wind turbine, the power curve, and the simulation period. The considered turbine is the Gamesa G80 with a rated power of 2 MW and a hub height of 100 m.

SM price time series for the 2030 electricity markets are obtained from the Balancing Tool Chain (BTC) (Kanellas et al.2020). BTC is built upon Balmorel, an open-source energy system model (Wiese et al.2018) that simulates electricity market operations, ranging from day-ahead to real-time dynamics for the northern central European region. Additionally, an investment optimization is implemented to simulate a 2030 energy system scenario (Koivisto et al.2020a).

4.1.3 Output time series

For all 250 HPP configurations and the aforementioned input time series, the high-fidelity model is used to generate all output time series described in Table 4. Out of these HPP configurations, 200 are used for training the surrogate model, and the remaining 50 are used to validate the model.

4.2 Intra-generalizability analysis data

For the intra-generalizability analysis, we use the best surrogate model following the methodology described in Sect. 2.2. We then test the surrogate's accuracy across four randomly chosen locations within the same market region, labeled A to D. At each location, we randomly select 10 HPP configurations from the training dataset and another 10 from the validation dataset. The coordinates of each location are listed in Table D1. Figure 4 displays these locations, indicating the training location with an “X” and the evaluation sites for the HPPs. As all locations are in the same market region, the SM price time series is the same for all locations. The weather data for locations A–D are provided by CorRES, and the output time series per location and per HPP configuration are generated using the high-fidelity EMS model. These data are used to compare the performance of the surrogate trained on location X and evaluated on locations A–D.

The wind generation distribution across all locations is available in Fig. 4. This violin plot illustrates the distribution of normalized wind power generation across five different locations (X, A, B, C, and D). Each half violin represents the density of wind power measurements for a location, showing where values are most concentrated. The symmetrical nature of each violin plot, with mirrored halves for each location, is a standard feature of violin plots that allows for a clearer visualization of the data distribution, where each half represents the same distribution of wind power measurements. The width of each violin indicates the density: wider sections reflect more frequent occurrences of those power levels, while narrower sections suggest less common values. The plot uses a logarithmic scale on the y axis, making it possible to visualize variations in power generation across a broad range, from very low to high outputs. Inside each violin, the black bar marks the interquartile range, while the white dot represents the median of the wind power measurements for that location. This combination allows for a clear comparison of both the range and the central tendencies of wind power output across different sites. For example, a location with a narrower and higher median distribution might experience more consistent and higher wind power generation (i.e., location X), while one with a broader distribution and lower median could have more variability (i.e., location C). Locations A, B, and D share similar distributions, where the shape of these distributions suggests that low power output is more common, with occasional rises to higher values.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f04

Figure 4Intra-generalizability data. (a) Location of trained surrogate model, X (in red), and evaluated model (A–D). (b) Normalized wind power distribution across all locations. A log scale was used to highlight the difference between locations.

5 Surrogate results

This section details the accuracy of the best surrogate model and its main application in evaluating the PI of HPPs. The comparison of all surrogate models is provided in Appendix A. After introducing the results of the most accurate surrogate model in Sect. 5.1, we examine how the accuracy of that model varies with different training dataset sizes in Sect. 5.2. Next, in Sect. 5.3, we assess the surrogate's performance across various locations where it has not been trained. Finally, Sect. 5.4 compares the PI accuracy when evaluated using both the surrogate and the high-fidelity EMSs. For all the results shown in this section, the validation dataset was used to evaluate the accuracy of the surrogate model and its application.

5.1 Best-performing surrogate model

The best-performing surrogate is the one titled S4 in Table 5. From Fig. A1, we observe that all tuned NNs (S3 and S4) outperform their linear counterparts (S1 and S2) in terms of accuracy. This result is expected to a certain degree: while the linear model may not fully capture the nonlinear dynamics of the high-fidelity model, we selected it to assess the extent to which a simpler model can approximate the EMS, given that many of the HF EMS model's constraints are linear. Among the linear models, model S2, which uses SVD in addition to normalization, slightly outperforms model S1, which uses only normalization. The application of SVD effectively captures key trends within the broad distribution of HPP configurations, thereby improving the accuracy of the linear model S2. However, we do not observe similar results when looking at the tuned models, S3 and S4. The NN of model S3 can make better use of all the data in the absence of SVD rather than using a reduced representation of them, as in model S4, which explains the difference between both tuned FNNs. There is a substantial difference between surrogates using only normalization (S1 and S3) and those using SVD in addition to normalization (S2 and S4). This difference is even more pronounced when comparing the tuned neural network models: model S4 (with SVD) converges in 5 h, whereas model S3 (without SVD) takes 7 d to converge. This disparity is mainly due to the difference in training data dimensionality, as highlighted in Table 6. The use of SVD reduces the number of features, simplifying the model and accelerating the training process with very little compromise on accuracy.

Based on the results presented in both the figure and the table, we conclude that the tuned neural network using SVD – model S4 – provides the best compromise between accuracy, training time, and execution time.

To gain deeper insights into the performance of model S4, we investigated its capability to approximate each normalized hourly output time series individually across all validation data. Figure 5 provides an overview of these results. From this figure, we observe that the power output of the HPP and the curtailed power are well estimated by the surrogate model. However, the battery charge and discharge profiles are more challenging to compute accurately. To further understand these discrepancies, we examine the calculated output time series for a given day from the surrogate, as well as the corresponding output time series from the high-fidelity model.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f05

Figure 5RMSE for each surrogate's output time series across all validation HPP configurations. For the definition of each variable, refer to Eqs. (1) to (4).

Download

Figure 6 shows the difference between the surrogate's approximation and the HF EMS' outputs. The surrogate captures the daily trend well across all time series. While it accurately estimates the intra-day fluctuations for power bidding, it is less precise when estimating battery charge and discharge power. This is due to the abrupt power fluctuations in the high-fidelity model, as seen in Fig. 6b and c. Additionally, as shown in Fig. 6d, the surrogate occasionally struggles to approximate consistent zero values over an entire day – a challenge characteristic of FNNs. Nonetheless, these discrepancies are minor, with estimated curtailed power fluctuating within a ±1.5 MW range instead of the expected steady 0 MW. Such variances are negligible relative to the HPP's export capacity, which can reach up to 700 MW.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f06

Figure 6Output time series for a given day, from the high-fidelity model (blue) and the surrogate (red). All time series are in megawatt. (a) Power output of the HPP, (b) battery charge profile, (c) battery discharge profile, and (d) curtailed power.

Download

The application presented in Sect. 3 requires only the HPP power output out of all the calculated output time series. That is why we further examine this output time series. The left panel of Fig. 7 presents a hexbin plot that compares hourly estimated and normalized HPP power outputs across all HPP configurations in the validation dataset (PSurrogateHPP). The hexagonal bins group nearby points (denoted as count in Fig. 7a) and show the density of data points within each bin. The density value is shown on the color bar; the darker the color, the denser the hexagon. A log scale is used for clarity. A one-to-one line, representing a perfect model, is also depicted for comparison. The power bidding on the SM aligns closely with the observed values (PEMSHPP).

The probability distribution function (PDF) of errors for the same data is shown on the right-hand side of Fig. 7. The histogram (in blue) shows the frequency distribution of these errors, while the red line represents a Gaussian (normal) distribution fitted to the data. The parameters of the Gaussian fit – mean (μ) and standard deviation (σ) – are shown in the legend and are 0.00 and 0.07, respectively. The RMSE is also 0.07, indicating the typical magnitude of computation errors.

The mean (μ) being equal to zero suggests that the surrogate's calculations are unbiased on average. The Gaussian fit's close alignment with the histogram suggests that the errors are distributed in a manner consistent with a normal distribution, which often implies that the surrogate model's residuals are well behaved in a statistical sense.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f07

Figure 7Accuracy of model S4 in approximating hourly normalized power output (see Eq. 1) for all HPPs from the validation dataset. (a) The hexbin plot groups nearby points and shows the density of data points within each bin; refer to the color bar for the density of each bin. (b) PDF of error; the red line indicates a Gaussian fit, with parameters detailed in the legend.

Download

5.2 Surrogate convergence to training dataset

The previous study demonstrated the capacity of the NN to replicate the daily trends of the high-fidelity EMS. However, the chosen data were based on an arbitrarily large number of HPP configurations. Consequently, we sought to examine how the NN's accuracy varies with different dataset sizes. Several surrogates were trained based on model S4 with varying training dataset sizes, ranging from 4 to 200 HPP configurations. The validation dataset from the previous study is not modified to ensure a fair comparison. Results of these simulations are illustrated in Fig. 8. Interestingly, the RMSE seems to plateau when reaching a training dataset size of 110 HPP configurations. We also note that there is only a marginal gain in accuracy between 50 and 100+ HPP configurations. This is relevant because it suggests potential reductions in the data generated by the high-fidelity EMS and, therefore, shorter training durations for the surrogate. As a reminder, each HPP configuration, which spans 1 year of data, requires 47 min to generate outputs using the high-fidelity EMS. It is also interesting to compare the normalized root mean square error (NRMSE) of yearly revenues, computed as per Eq. (6). The model trained with 200 HPP configurations has an NRMSE of 0.81 %, while the model trained with 32 HPP configurations has an NRMSE of 1.0 % across the entire validation dataset. Again, the difference between both outcomes is marginal, suggesting further reductions in training time and during the data generation process.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f08

Figure 8Evolution of accuracy with increasing training dataset size. A fixed validation dataset of 50 configurations is used across all simulations.

Download

5.3 Intra-generalizability accuracy

In this section, we evaluate the surrogate accuracy of model S4 at four different locations (A to D). Surrogate S4 is the one whose results are detailed in Sect. 5.1. As a reminder, this surrogate is trained using weather data from location X and with a training dataset of 200 HPP configurations. We use the NRMSE of yearly revenues to measure the accuracy of the surrogate at each location. This was done on 10 randomly selected HPP configurations from the training dataset and 10 others from the validation dataset. All accuracy results are compared to the baseline, i.e., using location X. The accuracy of the surrogate is illustrated in Fig. 9.

The surrogate model's NRMSE for computing revenue shows a marginal difference between training and validation datasets. Specifically, the NRMSE for the training dataset (location X) is 0.79 % compared to 0.81 % for the validation dataset. When looking at locations A to D, the average NRMSE for the training dataset samples is 0.79 % (aligning with the train baseline), whereas it is 1.3 % for validation dataset samples. Notably, location D shows the greatest discrepancy in NRMSE between training and validation samples. This variation may be attributed to the combination of HPP configurations and distinct weather time series at location D, as detailed in Fig. 4. Overall, despite location X's distribution with two distinct peaks (around 0.08 and 0.1) that are not observed in other locations, the surrogate's performance remains consistent across all locations.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f09

Figure 9Performance of surrogate S4's generalizability for different locations and for HPP configurations from different datasets. The baseline refers to location X, where the surrogate has been trained. Train baseline and val baseline correspond to the NRMSE of revenues from the training dataset of location X (200 HPP configurations) and from the validation dataset (50 HPP configurations). Val refers to 10 HPP configurations randomly selected from the validation dataset, while train refers to 10 random HPP configurations from the training dataset. Both val and train are evaluated on locations A–D.

Download

5.4 PI evaluation accuracy

In this section, we evaluate the PI of several HPPs using the surrogate S4 (presented in Sect. 5.1) and the high-fidelity EMS model. Both frameworks are described in Fig. 3. To evaluate the accuracy of the PI computed with the surrogate, we use the same 50 HPP configurations from the validation dataset for both frameworks. Figure 10 shows the PI calculated using the high-fidelity EMS on the y axis and the PI inferred using the surrogate on the x axis for the corresponding HPP configuration.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f10

Figure 10PI comparison based on surrogate inference and high-fidelity EMS evaluation for the validation dataset.

Download

The RMSE of the PI across the validation dataset is 0.010, indicating the average magnitude of the errors between the surrogate's estimations and the high-fidelity EMS evaluations. The scatter plot shows that most points are close to the line of perfect calculations (red line), with some scatter around it. Most points are below the perfect line, indicating that the surrogate slightly overestimates the profitability of the HPP. However, this tendency is reversed for higher NPV/CAPEX, where the surrogate provides a conservative estimate of the PI. Overall, the tight clustering of points around the red line suggests that the surrogate model is reliable for computing the PI when compared to the high-fidelity EMS.

6 Discussion

This study aims to evaluate the potential of applying surrogate modeling to emulate the behavior of a complex EMS for an HPP with bidding on the spot market. Given the increasing integration of renewable energy systems – particularly wind power – this research provides a practical approach to optimize HPP operations, enabling more efficient system integration and financial assessment.

Our investigation highlights the importance of both pre- and post-processing of data with an appropriate choice of the surrogate model. Among the options explored, the tuned FNN that utilizes SVD (surrogate S4) emerged as the optimal balance. Indeed, Fig. A1 shows that the tuned NN using only normalization (surrogate S3) is the most accurate, while adding SVD (surrogate S4) results in similar performance. Yet, when we look at the computational time, as shown in Table A1, using SVD (in surrogates S2 and S4) significantly reduces the training duration. This difference is even more pronounced when the tuning time is considered: it requires 5 d to tune the NN using normalization (surrogate S3), whereas it takes 4.3 h when SVD (surrogate S4) is used. This discrepancy is attributed to the inherent capability of SVD to extract a reduced order of data that contain meaningful coefficients and daily temporal trends.

However, challenges persist, particularly in estimating battery charge and discharge profiles. As depicted in Figs. 5 and 6, this difficulty arises from the high-fidelity model's abrupt power output fluctuations and the intrinsic nonlinearity of these variables. In particular, Fig. 6c and d highlight the surrogate's limitations in accurately and consistently representing the battery's hourly operation. In several time steps, the model shows the battery charging and discharging simultaneously, and at times, the charge and discharge power take on negative values, which is physically unrealistic. These issues stem from the nature of the regressor; the FNN cannot inherently capture the physical constraints that would typically be enforced by multiple equations. Specifically, the FNN lacks explicit equations to govern its outputs. However, for the purposes of this study, focusing primarily on the power output of the HPP is sufficient, as this is the only variable required in revenue calculation and subsequent PI evaluation. Additionally, the power output of the HPP is well estimated, as shown in the PDF plot of Fig. 7: there is no bias, and the standard deviation is very small. While this result only holds for a surrogate trained with 200 HPP configurations, it is still reasonable to assume similar behavior for a surrogate trained with fewer data points. Indeed, Fig. 8 demonstrates a marginal difference in accuracy between a surrogate trained with 200 HPP configurations and one trained with 50 HPP configurations. Ultimately, it is a trade-off between training time and accuracy. In terms of intra-regional generalizability, the accuracy across different locations is more consistent than between dataset types. This uniformity in accuracy within each dataset type can be partly attributed to the region's relatively homogeneous wind profiles, facilitated by its largely flat terrain. A loss in accuracy is observed when unseen HPP configurations are used (e.g., validation dataset). Nonetheless, these results demonstrate the surrogate's ability to capture essential data trends (Fig. 9). However, it is important to note that this study's scope was confined to the DK1 market region, characterized by uniform wind profiles due to its flat terrain (Fig. 4). The fast and accurate surrogate allows us to evaluate an HPP's profitability throughout its lifetime with little computational burden. Indeed, the surrogate model is capable of evaluating the NPV/CAPEX for all 50 HPP configurations in Fig. 10 in 25 s. In contrast, computing the same evaluations using the high-fidelity model for each HPP configuration, with inputs spanning a year, would take approximately 39 h. However, it is important to understand the impact of the surrogate's accuracy on the PI. Figure 10 shows that the surrogate can be reliably used if slight deviations of around 0.010 in the PI are acceptable for the intended business evaluation. In other words, the error in the computed NPV is around 1 % of the CAPEX. It is also relevant to note that not all HPP configurations are profitable, resulting in negative PI, further supporting the use of NPV/CAPEX as an evaluation metric. Hence, this emphasizes the importance of optimization in the context of HPP sizing, which is enabled with the developed framework. However, a detailed exploration of the sizing optimizer is beyond the scope of this article and will be the subject of future investigations. It is important to emphasize that these findings are site-specific and heavily dependent on the cost model employed; hence, they should not be generalized across different HPP sites.

There are certain limitations and future works worth acknowledging. For one, the full capabilities of the high-fidelity model have not been leveraged. While the EMS can consider a realization of the forecast error in both weather and market data, our initial approach prioritized a methodology using perfect forecast data. Nonetheless, this is a natural next step, where a sizing framework can be developed based on a surrogate that can handle the inherent uncertainties in weather and market forecast errors. While our research mainly focused on the spot market, currently the major source of revenues for power plants, market dynamics might shift. As the share of intermittent power plants increases in the grid system, which is becoming more decentralized, the balancing market is forecasted to become a considerable revenue stream. Thus, there is a pressing need for a more comprehensive surrogate considering operational strategies in both spot and balancing markets. Moreover, an FNN has its own limitations when it comes to time series representation, as seen in the battery charge and discharge profiles in Fig. 6. This highlights the importance of further exploring the machine learning field. A promising avenue would be models that blend physical constraints, such as physics-informed neural networks. Additionally, to develop a more robust sizing methodology, it is necessary to account for various forecast scenarios for wind generation and market prices within the EMS. This can be achieved through methods from the field of surrogate-based optimization under uncertainty. These considerations will be addressed in future work.

7 Conclusion

In this paper, we have introduced a new methodology to accurately and efficiently approximate a state-of-the-art EMS for HPPs involved in spot market power bidding. This model leverages singular value decomposition to extract temporal trends in the input and utilizes an FNN to represent the nonlinear dynamics of the EMS. This method has demonstrated over twice the accuracy of traditional multivariate linear regression models. A key innovation of our study is the combined use of SVD and FNN, which represents a novel approach in this field. This approach successfully replicates the annual revenues of an HPP with an NRMSE of 0.81 % for the best model. To fully demonstrate the capabilities of our surrogate model, we have integrated it into a sizing evaluation framework designed to calculate the profitability index (NPV/CAPEX) based on the technology mix rating within the HPP. This framework not only enabled substantial computational savings – reducing processing time from 39 h to 25 s compared to a high-fidelity model – but also achieved an RMSE of 0.010. Although our methodology is straightforward, it is nonetheless powerful and opens up new possibilities for optimizing HPP sizing in the context of renewable energy integration. This study emphasizes the relevance of surrogate modeling to the wind energy field, where efficient and accurate tools are essential for navigating the increasing complexity of renewable energy markets and supporting the transition toward sustainable energy systems.

Appendix A: Surrogate model comparison

The accuracy of the four models, presented in Table 5, can be found in Fig. A1. The RMSE of all normalized hourly output time series is used to compare the accuracy of the models. Moreover, the training and inference times are reported in Table A1.

https://wes.copernicus.org/articles/10/559/2025/wes-10-559-2025-f11

Figure A1Validation RMSE by data surrogate model: “norm” is for models using a normalization only, and “SVD” is for models using a combined normalization and SVD.

Download

Table A1 compares the time needed to execute the methodology for each surrogate model. The pre-processing (pre-proc.) time considers both training and validation datasets. However, the training time (train time) accounts only for the training dataset, while the inference time (Inf. time) reflects the inference on a single HPP configuration spanning 1 year of data.

Table A1Time metrics of surrogate models.

Download Print Version | Download XLSX

Appendix B: FNN architecture

Tables B1 and B2 show tuned FNN architecture.

Table B1Architecture of model S4.

Download Print Version | Download XLSX

Table B2Architecture of model S3.

Download Print Version | Download XLSX

Appendix C: Cost model

The cost model is described in this section. The CAPEX depends on Cw, Cb, and Cel, which are the CAPEXs of the wind power plant, batteries, and the balance of system (BOS). Similarly, the OPEX is the sum of Ow,y, Ob,y, and Oel,y, which are the yearly OPEXs from the wind power plant, batteries, and BOS. Cw is proportional to the wind turbine's cost (WTcost) and the cost of civil works (WTcivil) in EUR MW−1. Meanwhile, Cel is proportional to a combination of the number of battery equivalents in today's value (Nbeq), the battery energy cost per megawatt hour (BcostE), the battery power cost (BcostP), civil costs (BcivilP), and control system costs (BcontrolP) per megawatt. Cel depends on the shared BOS cost (CBOS), the grid connection cost of the HPP (PcostG), and the cost of land in EUR km−2 (Lrent) . The OPEX of the wind power plant is calculated based on the fixed and variable operation and maintenance (O&M) costs of the wind turbine per year and per megawatt (WTfixed,yOM and WTvariable,yOM), as well as the mean annual energy production (AEP) of the wind power plant (mean(AEP)). Meanwhile, the battery's yearly OPEX is proportional to the yearly O&M cost of the battery per megawatt hour (ByE,OM). The equations are as follows:

CAPEX=Cw+Cb+CelOPEXy=Ow,y+Ob,y+Oel,yCw=(WTcost+WTcivil)PWCb=NbeqBcostEBE+(BcostP+BcivilP+BcontrolP)BPCel=(CBOS+PcostG)PG+LrentOw,y=WTfixed,yOMPW+mean(AEP)WTvariable,yOMOb,y=ByE,OMBEOel,y=0.

In this study, we set a fixed lifetime for the battery (ib) as battery degradation is not considered. Additionally, to address the decreasing costs of batteries over time, we employ the concept of the equivalent number of present batteries (Nbeq). This method incorporates the annual battery price reduction rate (fb) and the designated replacement year for each battery (yb(ib)).

Nbeq=ib=0Nb-1(1-fb)yb(ib)
Appendix D: Data supplement

Table D1 lists the coordinates of each location used in the intra-generalizability study.

Table D2 presents a summary of the cost assumptions used in this article. As the battery's lifetime is 7 years, each HPP will require three batteries during its lifetime. Given a battery price reduction of 10 % per year, we obtain an equivalent number of batteries (Nbeq) of 1.84.

Table D1Location coordinates for the generalizability study. Coordinates are shown in decimal degrees.

Download Print Version | Download XLSX

Table D2Cost assumptions.

Download Print Version | Download XLSX

Data availability

The weather and spot market price time series data are available on request from the corresponding author.

Author contributions

CA, JPML, and KD conceived the research; CA, JPML, and JQ developed the methodology; CA and JPML developed the code for the surrogate and gathered the required data; CA, JPML, JQ, and TG performed the formal analysis; JPML, KD, SG, and TG supervised the work; CA, JPML, and JQ wrote the original draft; and JPML, KD, SG, and TG reviewed and suggested modifications to the draft.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We would like to thank Juan-Andrés Pérez-Rúa and Poul Ejnar Sørensen for their critical feedback on the methodology and the figures, Jenna Iori for her feedback after the first draft was written, and – finally – Rujie Zhu for sharing the high-fidelity EMS script.

Financial support

This work is funded by TotalEnergies under the project “Hybrid Power Plant Life-cycle Optimization”.

Review statement

This paper was edited by Jennifer King and reviewed by two anonymous referees.

References

Abdeltawab, H. H. and Mohamed, Y. A. R. I.: Market-oriented energy management of a hybrid wind-battery energy storage system via model predictive control with constraint optimizer, IEEE T. Ind. Electron., 62, 6658–6670, https://doi.org/10.1109/TIE.2015.2435694, 2015. a

An, Y., Zhao, Z., Wang, S., Huang, Q., and Xie, X.: Coordinative optimization of hydro-photovoltaic-wind-battery complementary power stations, CSEE Journal of Power and Energy Systems, 6, 410–418, https://doi.org/10.17775/CSEEJPES.2019.00330, 2020. a

Barbosa, A. M., Rotella Junior, P., Rocha, L. C. S., Barbosa, A. D. S., and Bolis, I.: Optimization methods of distributed hybrid power systems with battery storage system: A systematic review, Journal of Energy Storage, 97, 112909, https://doi.org/10.1016/j.est.2024.112909, 2024. a

Bhatt, P., Kumar, Y., and Soulaïmani, A.: Deep convolutional architectures for extrapolative forecasts in time-dependent flow problems, Advanced Modeling and Simulation in Engineering Sciences, 10, 1–35, https://doi.org/10.1186/s40323-023-00254-y, 2023. a

Busch, S., Kasdorp, R., Koolen, D., Mercier, A., and Spooner, M.: The Development of Renewable Energy in the Electricity Market, Tech. Rep. KC-BD-23-004-EN-N, European Commission, https://doi.org/10.2765/411281, 2023. a

Cai, Z., Bussar, C., Stöcker, P., Moraes, L., Magnor, D., and Sauer, D. U.: Optimal Dispatch Scheduling of a Wind-battery-System in German Power Market, Energ. Proced., 99, 137–146, https://doi.org/10.1016/J.EGYPRO.2016.10.105, 2016. a

Council of the European Union: Electricity Market Reform: Council Signs Off on Updated Rules, https://www.consilium.europa.eu/en/press/press-releases/2024/05/21/electricity-market-reform-council-signs-off-on-updated-rules/ (last access: 9 October 2024), 2024. a

Das, K., Hansen, A. D., Koivisto, M., and Sørensen, P. E.: Enhanced features of wind-based hybrid power plants, Proceedings of the 4th International Hybrid Power Systems Workshop, 22–23 May 2019, Crete, Greece, 2019. a

Das, K., Philippe Grapperon, A. L. T., Sørensen, P. E., and Hansen, A. D.: Optimal battery operation for revenue maximization of wind-storage hybrid power plant, Electr. Pow. Syst. Res., 189, 106631, https://doi.org/10.1016/J.EPSR.2020.106631, 2020. a

Ding, H., Pinson, P., Hu, Z., and Song, Y.: Integrated bidding and operating strategies for wind-storage systems, IEEE T. Sustain. Energ., 7, 163–172, https://doi.org/10.1109/TSTE.2015.2472576, 2016. a

DTU HPC Cluster: Sophia HPC Cluster, DTU Sophua, https://doi.org/10.57940/FAFC-6M81, 2019. a, b

Dykes, K., King, J., DiOrio, N., King, R., Gevorgian, V., Corbus, D., Blair, N., Anderson, K., Stark, G., Turchi, C., and Moriarty, P.: Opportunities for Research and Development of Hybrid Power Plants, Tech. rep., National Renewable Energy Lab. (NREL), Golden, CO650 (United States), https://doi.org/10.2172/1659803, 2020. a, b

Fathima, A. H. and Palanisamy, K.: Optimization in microgrids with hybrid energy systems – A review, Renewable and Sustainable Energy Reviews, 45, 431–446, https://doi.org/10.1016/j.rser.2015.01.059, 2015. a

Freire, S. L. M. and Ulrych, T. J.: Application of singular value decomposition to vertical seismic profiling, Geophysics, 53, 778–785, https://doi.org/10.1190/1.1442513, 1988. a

Gene H. Golub, C. V. L.: Matrix computations, 3rd edn., Johns Hopkins University Press, ISBN 0801854148, 1996. a

Guo, M. and Hesthaven, J. S.: Data-driven reduced order modeling for time-dependent problems, Comput. Method. Appl. M., 345, 75–99, https://doi.org/10.1016/j.cma.2018.10.029, 2019. a

Han, X. and Hug, G.: A distributionally robust bidding strategy for a wind-storage aggregator, Electr. Pow. Syst. Res., 189, 106745, https://doi.org/10.1016/J.EPSR.2020.106745, 2020. a, b

Hess, M. W., Quaini, A., and Rozza, G.: A data-driven surrogate modeling approach for time-dependent incompressible Navier-Stokes equations with dynamic mode decomposition and manifold interpolation, Adv. Comput. Math., 49, 1–27, https://doi.org/10.1007/s10444-023-10016-4, 2023. a

Hesthaven, J. S. and Ubbiali, S.: Non-intrusive reduced order modeling of nonlinear problems using neural networks, J. Comput. Phys., 363, 55–78, https://doi.org/10.1016/J.JCP.2018.02.037, 2018. a

Hesthaven, J. S., Pagliantini, C., and Rozza, G.: Reduced basis methods for time-dependent problems, Acta Numer., 31, 265–345, https://doi.org/10.1017/S0962492922000058, 2022. a

Huang, X., Wang, J., Huang, T., Peng, H., Song, X., and Cheng, S.: An optimal operation method of cascade hydro-PV-pumped storage generation system based on multi-objective stochastic numerical P systems, J. Renew. Sustain. Ener., 13, 16301, https://doi.org/10.1063/5.0032455, 2021. a, b

IBM: IBM® Decision Optimization Modeling for Python (DOcplex), https://github.com/IBMDecisionOptimization/docplex-doc (last access: 14 September 2023), 2023. a

Jin, R., Chen, W., and Sudjianto, A.: An efficient algorithm for constructing optimal design of computer experiments, J. Stat. Plann. Infer., 134, 268–287, https://doi.org/10.1016/j.jspi.2004.02.014, 2005. a

Kanellas, P., Das, K., Gea-Bermudez, J., and Sørensen, P.: Balancing Tool Chain: Balancing and automatic control in North Sea Countries in 2020, 2030 and 2050, DTU Wind Energy E, DTU Wind Energy, Denmark, 2020. a

Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, CoRR, abs/1412.6980, https://api.semanticscholar.org/CorpusID:6628106 (last access: 18 March 2025), 2014. a

Kitzing, L., Held, A., Gephart, M., Wagner, F., Anatolitis, V., and Klessmann, C.: Contracts-for-difference to support renewable energy technologies – Considerations for design and implementation, European University Institute, https://doi.org/10.2870/379508, 2024. a

Koivisto, M., Das, K., Guo, F., Sørensen, P., Nuño, E., Cutululis, N., and Maule, P.: Using time series simulation tools for assessing the effects of variable renewable energy generation on power and energy systems, WIREs Energy and Environment, 8, e329, https://doi.org/10.1002/wene.329, 2019. a

Koivisto, M., Gea-Bermúdez, J., and Sørensen, P.: North Sea offshore grid development: combined optimisation of grid and generation investments towards 2050, IET Renew. Power Gen., 14, 1259–1267, https://doi.org/10.1049/iet-rpg.2019.0693, 2020a. a

Koivisto, M., Jónsdóttir, G. M., Sørensen, P., Plakas, K., and Cutululis, N.: Combination of meteorological reanalysis data and stochastic simulation for modelling wind generation variability, Renew. Energ., 159, 991–999, https://doi.org/10.1016/j.renene.2020.06.033, 2020b. a

Murcia Leon, J. P., Habbou, H., Friis-Møller, M., Gupta, M., Zhu, R., and Das, K.: HyDesign: a tool for sizing optimization of grid-connected hybrid power plants including wind, solar photovoltaic, and lithium-ion batteries, Wind Energ. Sci., 9, 759–776, https://doi.org/10.5194/wes-9-759-2024, 2024. a, b

Li, F. F. and Qiu, J.: Multi-objective optimization for integrated hydro–photovoltaic power system, Appl. Energ., 167, 377–384, https://doi.org/10.1016/J.APENERGY.2015.09.018, 2016. a, b

Li, L., Jamieson, K., DeSalvo, G., Rostamizadeh, A., and Talwalkar, A.: Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization, J. Mach. Learn. Res., 18, 1–52, 2018. a

Lian, J., Zhang, Y., Ma, C., Yang, Y., and Chaima, E.: A review on recent sizing methodologies of hybrid renewable energy systems, Energ. Convers. Manage., 199, 112027, https://doi.org/10.1016/j.enconman.2019.112027, 2019. a

Lin, C., Liang, H., Pang, A., and Zhong, J.: Data-driven method of solving computationally expensive combined economic/emission dispatch problems in large-scale power systems: an improved kriging-assisted optimization approach, Front. Energ. Res., 11, 1273760, https://doi.org/10.3389/fenrg.2023.1273760, 2023. a

Long, Q., Das, K., Pombo, D. V., and Sørensen, P. E.: Hierarchical control architecture of co-located hybrid power plants, Int. J. Elec. Power, 143, 108407, https://doi.org/10.1016/j.ijepes.2022.108407, 2022. a

Luo, F., Meng, K., Dong, Z. Y., Zheng, Y., Chen, Y., and Wong, K. P.: Coordinated operational planning for wind farm with battery energy storage system, IEEE T. Sustain. Energ., 6, 253–262, https://doi.org/10.1109/TSTE.2014.2367550, 2015. a

Murcia Leon, J. P., Koivisto, M. J., Sørensen, P., and Magnant, P.: Power fluctuations in high-installation- density offshore wind fleets, Wind Energ. Sci., 6, 461–476, https://doi.org/10.5194/wes-6-461-2021, 2021. a

Ochoa, T., Gil, E., Angulo, A., and Valle, C.: Multi-agent deep reinforcement learning for efficient multi-timescale bidding of a hybrid power plant in day-ahead and real-time markets, Appl. Energ., 317, 119067, https://doi.org/10.1016/J.APENERGY.2022.119067, 2022. a, b

Pang, A., Liang, H., Lin, C., and Yao, L.: A Surrogate-Assisted Adaptive Bat Algorithm for Large-Scale Economic Dispatch, Energies, 16, 1011, https://doi.org/10.3390/en16021011, 2023. a

Paska, J., Biczel, P., and Kłos, M.: Hybrid power systems – An effective way of utilising primary energy sources, Renew. Energ., 34, 2414–2421, https://doi.org/10.1016/j.renene.2009.02.018, 2009. a

Roy, P., He, J., Zhao, T., and Singh, Y. V.: Recent Advances of Wind-Solar Hybrid Renewable Energy Systems for Power Generation: A Review, IEEE Open Journal of the Industrial Electronics Society, 3, 81–104, https://doi.org/10.1109/OJIES.2022.3144093, 2022. a

Shivarama Krishna, K. and Sathish Kumar, K.: A review on hybrid renewable energy systems, Renewable and Sustainable Energy Reviews, 52, 907–916, https://doi.org/10.1016/j.rser.2015.07.187, 2015. a

Stanley, A. P. and King, J.: Optimizing the physical design and layout of a resilient wind, solar, and storage hybrid power plant, Appl. Energ., 317, 119139, https://doi.org/10.1016/J.APENERGY.2022.119139, 2022. a

Taha, M. S., Abdeltawab, H. H., and Mohamed, Y. A. R. I.: An online energy management system for a grid-connected hybrid energy source, IEEE J. Em. Sel. Top. P., 6, 2015–2030, https://doi.org/10.1109/JESTPE.2018.2828803, 2018.  a

Thirunavukkarasu, M., Sawle, Y., and Lala, H.: A comprehensive review on optimization of hybrid renewable energy systems using various optimization techniques, Renewable and Sustainable Energy Reviews, 176, 113192, https://doi.org/10.1016/j.rser.2023.113192, 2023. a

Toubeau, J. F., Bottieau, J., De Greeve, Z., Vallee, F., and Bruninx, K.: Data-Driven Scheduling of Energy Storage in Day-Ahead Energy and Reserve Markets with Probabilistic Guarantees on Real-Time Delivery, IEEE T. Power Syst., 36, 2815–2828, https://doi.org/10.1109/TPWRS.2020.3046710, 2021. a

Wiese, F., Bramstoft, R., Koduvere, H., Pizarro Alonso, A., Balyk, O., Kirkerud, J. G., Åsa Grytli Tveten, Bolkesjø, T. F., Münster, M., and Ravn, H.: Balmorel open source energy system model, Energy Strateg. Rev., 20, 26–34, https://doi.org/10.1016/j.esr.2018.01.003, 2018. a

Xu, B., Oudalov, A., Ulbig, A., Andersson, G., and Kirschen, D. S.: Modeling of Lithium-Ion Battery Degradation for Cell Life Assessment, IEEE T. Smart Grid, 9, 1131–1140, https://doi.org/10.1109/TSG.2016.2578950, 2018. a

Yang, Y., Guo, S., Liu, D., Li, R., and Chu, Y.: Operation optimization strategy for wind-concentrated solar power hybrid power generation system, Energ. Convers. Manage., 160, 243–250, https://doi.org/10.1016/J.ENCONMAN.2018.01.040, 2018. a

Zhang, G., Hu, W., Cao, D., Liu, W., Huang, R., Huang, Q., Chen, Z., and Blaabjerg, F.: Data-driven optimal energy management for a wind-solar-diesel-battery-reverse osmosis hybrid energy system using a deep reinforcement learning approach, Energ. Convers. Manage., 227, 113608, https://doi.org/10.1016/j.enconman.2020.113608, 2021. a

Zhang, X., Yuan, Y., Hua, L., Cao, Y., and Qian, K.: On Generation Schedule Tracking of Wind Farms With Battery Energy Storage Systems, IEEE T. Sustain. Energ., 8, 341–353, https://doi.org/10.1109/TSTE.2016.2598823, 2017. a

Zhang, X., Gu, J., Yuan, Y., Hua, L., and Shen, Y.: Scheduling wind-battery energy storage hybrid systems in time-of-use pricing schemes, IET Generation, Transmission & Distribution, 12, 4435–4442, https://doi.org/10.1049/IET-GTD.2017.1878, 2018. a

Zhu, L.-P., Yin, X., and Zhu, L.-X.: Dimension Reduction via an Alternating Inverse Regression, J. Comput. Graph. Stat., 19, 887–899, 2010. a

Zhu, R., Das, K., Sørensen, P. E., and Hansen, A. D.: Optimal Participation of Co-Located Wind–Battery Plants in Sequential Electricity Markets, Energies, 16, 5597, https://doi.org/10.3390/en16155597, 2023. a, b, c, d, e, f

Zhu, R., Das, K., Sørensen, P. E., and Hansen, A. D.: Enhancing profits of hybrid wind-battery plants in spot and balancing markets using data-driven two-level optimization, International Journal of Electrical Power and Energy Systems, 159, 110029, https://doi.org/10.1016/j.ijepes.2024.110029, 2024. a, b, c

Download
Short summary
This research develops a new method for assessing hybrid power plant (HPP) profitability, combining wind and battery systems. It addresses the need for an efficient, accurate, and comprehensive operational model by approximating a state-of-the-art energy management system (EMS) for spot market power bidding using machine learning. The approach significantly reduces computational demands while maintaining high accuracy. It thus opens new possibilities in terms of optimizing the design of HPPs.
Share
Altmetrics
Final-revised paper
Preprint