Abstract
Wind farm wake modeling is of great significance for wind turbine layout optimization design and yaw control strategy. In this work, we combine deep neural network (DNN) with spectral proper orthogonal decomposition (SPOD) to discover dynamic characteristics of wake under different inflow conditions. Then an assessment of the proposed SPOD-DNN surrogate modeling method of parameterized fluid is performed by comparing the predicted results. Meanwhile, we demonstrate the robustness of the SPOD-DNN through a comparison with POD-DNN, where SPOD produces fewer modes than POD but can achieve the same cumulative contribution rate and wake prediction accuracy. In the end, the method is developed to predict the wake of single wind turbine in untrained inflow condition and Wake of six wind turbines with different yaw angles. The results reveals that the model has good generalization performance and can robustly reconstruct the wake of multiple wind turbines in different directions.
Keywords
Introduction
Wind energy, as one renewable energy source, has become an important source of power generation. Especially, to achieve the goals of Paris Climate Agreement and net zero carbon emissions, many countries have set targets for wind energy generation, accelerating the development of wind energy Tarafdar Hagh and Khalili (2019). It is estimated that by 2050, wind power alone will provide about 35% of total electricity demand International Renewable Energy Agency (2019). Due to the limitation of energy storage system and substation equipment, the establishment of large-scale wind power plants is conducive to cost control. However, the rotation of wind turbine causes a long and complex vortex wake, and in wind farm, the wake effect of upstream wind turbines will seriously affect the dynamics and power generation efficiency of downstream wind turbines.
Generally speaking, wake effect is an aerodynamic phenomenon characterized by the decrease of wind speed and the increase of turbulence, which leads to the downstream wind turbines to capture less energy and bear more aerodynamic loads. For example, the comparative study of two inline wind turbine with the same rotor diameter shows that the power loss of downstream turbine reached 46% compared with the power output of single turbine under the same operating conditions Adaramola and Krogstad (2011). To mitigate the influence of wake effect, the optimize layout for wind turbine is usually implemented in the process of wind farm design stage while various control techniques are performed to reduce the influence of upstream wind turbine wake Fleming et al. (2015). The layout of wind turbines in the wind field not only considers the influence of topography, but also mainly considers the interference between wind turbine wake and wind turbine rotor. Large-scale wind farms usually arrange more wind turbines, but in this way, wind turbines often run in the wake of upstream or adjacent wind turbines, resulting in less captured wind energy. The model proposed in this paper can quickly predict the diffusion and recovery of wake, which has certain reference significance for the arrangement of wind turbine spacing and is helpful to optimize the layout of wind farm. At the same time, the wake not only affects the power output of the wind turbine, but also affects the structural fatigue of the wind turbine. In the wind farm, the downstream wind turbines are often seriously affected by the wake of the upstream wind turbines. By quickly predicting the wake of a single wind turbine, it is helpful for the downstream wind turbines to yaw and “avoid” the influence of the wake of the upstream wind turbines. Accurate prediction of wind turbine wake contributes to optimized layout of wind turbines and turbine control, significantly improving the power generation efficiency of large-scale wind farm Gao and Hong (2021), Dou et al. (2019).
Great efforts have been made in the numerical computation and analysis of wind turbine wake. The wake interaction is studied based on various low-fidelity and high-fidelity models Boersma et al. (2017). Some analysis models such as Jensen Park model Katic et al. (1986), Jensen (1983), Frandsen et al. (2006) model that are suitable for low fidelity or small-scale data analysis are applied to wind turbine wake computation, providing an effective way for layout optimization design of wind farms. Because the approaches involve sparse data and depend heavily on empirical parameters, their performance is limited by modeling errors. These low fidelity models can only be used for simple analysis of wind turbine wake. Currently, most investigations of the real-time dynamic wake simulation are carried out by solving Navier-Stokes equations with high-fidelity models that use actuator disks model (ADM) Calaf et al. (2010), Nilsson et al. (2015) or actuator lines model (ALM) Lu and Porté-Agel (2011), Churchfield et al. (2012a) to build wind turbine. Moreover, the accuracy difference of wind farm simulation with ADM and ALM methods were also investigated Witha et al. (2014), Martínez-Tossas et al. (2015). And the research of high-fidelity solver based on numerical solution is still quite active Sprague et al. (2020). However, numerical simulation takes a lot of computational time and cannot be used as the immediate feedback of control strategy. For instance, each 1000s large eddy simulation of a
Neural network is usually used as an effective means to study and analyze fluid problems because of the great potential of machine learning in data mining and reducing computational costs Brunton et al. (2019). Several groups have applied machine learning for wake prediction, including establishing the corresponding relationship between inflow parameters and wake using neural network Zilong et al. (2020), realizing the real-time simulation of distributed fluid wake using POD-LSTM model Jincheng and Xiaowei (2020a), and establishing a wind farm wake modeling method using DC-CGAN model Jincheng and Xiaowei (2020). However, it is difficult to directly process massive data by machine learning. For this reason, the research method combining machine learning with reduced-order model (ROM), which takes into account the advantages of data compression and data mining, has attracted more and more attention in the field of fluid mechanics Swischuk et al. (2019), Wang et al. (2019); Lee and You (2019). Lui and Wolf has proved that SPOD has a good application in the problems of circular cylinder flow and airfoil stall Lui and Wolf (2019). This inspired us to establish the surrogate model for wake prediction of wind turbine by using SPOD and POD methods. In this method, the flow field parameters are taken as input, the reduction coefficients are taken as training output, and the flow field is reconstructed by the reduction coefficients.
This work focuses on developing a new wind farm wake model based on non-intrusive ROM and high-fidelity CFD simulation. In the proposed modeling method, the high-fidelity data is first reduced to reduction coefficients by spectral proper orthogonal decomposition technique. Then, a regression model is constructed by deep feedforward neural networks to approximate the mapping of input parameters to reduction coefficients. Lastly, the flow field of corresponding flow parameters is reconstructed by the predicted reduction coefficients. Considering that SPOD can filter the temporal pattern, and redistribute the system energy to higher reduced-order modes to retain more information of fluid field Sieber et al. (2016), we choose SPOD as the tool of order reduction in this paper. Indeed, facing the complex dynamic system with huge amount of data, SPOD can effectively extract the mode of high energy ratio and compress the data dimension, which is extremely suitable for dealing with wind turbine wake. DNN is chosen because of its reliable performance in dealing with regression tasks. In this study, a series of wind turbine wake simulations are carried out under different inflow conditions and yawing conditions, and a high-fidelity LES database is generated. The ability of classical POD and SPOD in reconstructing wind turbine wake data is compared, and the superiority of SPOD in this respect is confirmed. After that, two cases of single wind turbine wake prediction and wind farm wake prediction were carried out.
The main contributions of this article are as follows:
Based on the high-fidelity dataset of wind turbine we computed, a new data-driven wake model is established. SPOD method is used to deal with the complex wake of wind turbines, which can effectively extract high-energy modes. DNN method is used to learn the transient characteristics of wake, which can provide accurate and stable long-term prediction.
Compared with the POD-DNN model, SPOD-DNN model can predict more accurately and stably by redistributing the mode energy to reduce the prediction targets.
Wake on untrained single turbine and six turbines is successfully real-time predicted with the presented model The results are also validated by comparing with high-fidelity SOWFA simulation. This fully show that the established model can accurately and steadily realize the real-time prediction of wind farm wake.
SPOD-DNN based surrogate modeling method
Based on finite-volume method of CFD, this work employs LES method to compute wind turbine wake under a series of yaw conditions, yielding the dataset. Detailed computing environment and computing parameters including regional geometry and boundary conditions will be given in Section 3. This section introduces a modeling method for approximate prediction of flow field data. The whole model is divided into two parts: offline prediction and online prediction. Figure 1 shows the flowchart of SPOD-DNN method, including data generation, DNN regression, and flow field forecast.

The flowchart illustrating the proposed SPOD-DNN based surrogate modeling method.
Training dataset
The flow parameters
Data reduction
In this step, we need to reduce the collected high-fidelity flow field data to obtain the reduced coefficients and the corresponding forward and inverse transforms. The correlation matrix of POD is obtained by snapshot set Volkwein (2011)
where
Here,
After that, we can find the optimal basis vector by determining the eigenvalues and eigenvectors
Arrange the eigenvalue from the largest to the smallest
The original flow field can be approximately represented as
where the reduction coefficients
Regression via DNN
After obtaining the training dataset, deep neural network (DNN) is used as the regression model in this work. As shown in the Figure 2, the correspondence between input and output can be expressed as
where

A deep feedforward network with N inputs, several hidden layers, and one output layer with N outputs.
Model generation process
The parameter update of DNN can be divided into two parts. The first step is forward propagation, which uses several weight coefficient matrices
Deep feedforward network (DNN).
The training goal is to minimize the root mean square error (RMSE) between the predicted result and the simulated target. The neural network is trained using the Adam optimization algorithm in this paper Kingma and Ba (2014), which is based on PyTorch.
Activation function
In a neural network, the information computation among the layers in the network is linear, and the data probably contains linear and nonlinear relationships. Adding additional nonlinear activation functions can make the network capture more nonlinear information and strengthen the learning ability of the network. Commonly used activation functions are sigmoid, tanh, ReLU, etc. As an unsaturated activation function, ELU will not encounter the problem of gradient explosion or disappearance. In fact, compared with other activation functions (such as ReLU and variants, Sigmoid and hyperbolic tangent), the application of ELU as activation function has higher accuracy. The activation functions ELU is used in SPOD-DNN and the formulas are as follows.
Application in wind turbine wake prediction
This section develops a data-driven wind farm wake model by adopting the method introduced in Section 2. The generation of wind turbine wake data is introduced in Section 3.1. In section 3.2, the training and verification of SPOD-DNN model are described in detail, and the comparison with POD-DNN method is made. Finally, two cases are studied to verify the application of SPOD-DNN model in wake prediction of untrained wind turbines and wind farm in Section 3.3.
High-fidelity data generation
The high-fidelity flow field data are generated by SOWFA (Simulator for Wind Farm Applications), which is a numerical solver based on OpenFOAM developed by the National Renewable Energy Laboratory (NREL) of the United States Churchfield and Lee (2012). SOWFA has been sufficiently verified in previous researches, such as turbine dynamics studies Churchfield et al. (2012) and wind farm control Fleming et al. (2013).The NREL 5MW wind turbine is adopted in this paper, and the diameter of rotor is 126 m. Then a series of simulation with three wind turbines are carried out to capture the wake of downstream turbines. The sketch of flow domain is illustrated in Figure 3. The interval between two turbines is about five times of rotor diameter. For the grid, the whole field adopts the grid size of

Sketch of the computational domain.
For the generated wake data to cover a wider range, three inflow velocity, the average free-stream wind velocity of
Model training and validation
The generated LES data contains 270 flow scenarios, and is divided into 210 training scenarios and 60 test scenarios for training and validation of which 30 were verification examples and 30 were test examples, and cross-verification was carried out. The results of error verification are shown in Table 1. The training data is fed into the neural network by a batch processing program. Specifically, the test data set is only utilized to verify the results after training in order to ensure no data leakage.
The optimal hyper-parameters of the two surrogate modeling methods.
The performance of DNN described in algorithm 1 depends to a great extent on the choice of hyperparameter, such as network depth, the number of hidden units in each layer, regularization parameters, and learning rate
The coefficient error of the model is defined as the RMSE between the prediction coefficients and the true coefficients:
And the model error is defined as the average of the RMSE between the predicted flow field and the true flow field:
After training, the predicted values are compared with the test data that is supposed to be unavailable during the training to evaluate its performance. Four typical test cases are shown in Figure 4. The flow conditions for these test studies are: Case1: average velocity = 6m/s, yaw =

The comparisons between the predictions by the developed SPOD-DNN model and the corresponding true values for four randomly selected test cases.
As shown in Figure 4, the main characteristics of wind turbine wake are accurately captured, for example, the wake deflection with change of yaw angle, the wake gradual recovery in the flow direction, the wake expansion in the spanwise direction, and the blocking effect of upstream wake. The root mean square error for the flow and spanwise direction velocity predictions is
To further examine the performance of surrogate model, the wind speed profiles at different positions are extracted. The results are given in Figure 5, including the predicted and the true values. Obviously, the wake flowing through the wind turbine can be observed from a “flat” profile to a “double-peak” profile and then gradually recover. When flow passing through the downstream wind turbine, it can be clearly observed that the wake turbulence increases and the recovery speed slows down. Moreover, the spanwise velocity fluctuation caused by wind turbine yaw can be observed. The predicted profiles at all locations match very well with the corresponding high-fidelity simulation results.

The velocity profiles predicted by SPOD-DNN model for four randomly-selected test cases. The corresponding true values are also shown for comparisons: (a) Case1(u), (b) Case1(v), (c) Case2(u), (d) Case2(v), (e) Case3(u), (f) Case3(v), (g) Case4(u), and (h) Case4(v).
Two case study
Single turbine wake prediction
For proving the generalization performance of the model, we make a wake prediction of single wind turbine which is not in the training set. The average wind speed of the incoming flow is

The flow fields predicted by the developed SPOD-DNN model at a series of turbine yaw angles and wind velocity 12 m/s.
Wind farm wake prediction
The purpose of this model is to further predict the wake of wind farm. Here, we take six 5MW wind turbines as an example for wind farm wake prediction. The high-fidelity simulation is still generated by SOWFA to ensure the reliability of the experiment. Sketch of the wind farm domain is shown in Figure 7. The grid scale of this case is

Sketch of the wind farm domain.
Free wind condition with an average wind speed of

The prediction results by the developed SPOD-DNN model and SOWFA for the wind farm wake case.
In addition, we also conducted an example of six wind turbines with a longitudinal spacing of double diameter. The grid scale of this case and the total number of meshes are as the previous exampleis. Free wind condition with an average wind speed of

The prediction results by the developed SPOD-DNN model and SOWFA for the wind farm wake case under different longitudinal distances.
Moreover, we can observe that when the downstream wind turbine runs in full wake, the wind energy capture will be greatly reduced. If the downstream wind turbine takes timely avoidance measures in the wake change of the upstream wind turbine, the output and power generation efficiency of the wind farm can be effectively increased.
Conclusions
Based on SPOD and DNN, a new wind farm wake model is developed to predict the wake of wind turbines under different inflow conditions. Facing the complicated turbine wake dynamics, it is advantageous to use SPOD to create a stable and accurate ROM. Therefore, the correlation matrix (SPOD method) is applied with a filtering function to remove the high-frequency content existing in classic POD.
The established SPOD-DNN model can well predict the main wake characteristics, including the wake deflection under the change of yaw angle, the wake recovery and expansion. Compared with the POD-DNN model, the performance of SPOD-DNN model with prediction error of only
The case study of the untrained wind turbine with yaw angle and free-flowing wind speed shows that the SPOD-DNN model can well generalize the untrained flow scene, which proves the robustness of the current ROM constructed by DNN. Meanwhile, case study of a small wind farm show that the established model has good consistency with LES model, and can well capture the wake dynamics (including flow direction and spanwise velocity field). And the developed model still has room for improvement to capture the influence of lateral wake. Generally speaking, the established model successfully predicted the unsteady characteristics of turbine wake, which demonstrated the ability of the proposed method in modeling wind farm wake.
One direction for future work is to apply this new wake model to wake control, exploring some yaw control strategies to reduce the load on downstream wind turbines and maximize wind energy capture efficiency. The other direction is the prediction of the real-time power of the wind farm as a whole based on the wake prediction.
Footnotes
Appendix
In this part, we introduce the verification of the actuation line model. According to overview of the simulator for wind fram application, the computation results of the actuating line model will be greatly influenced by the scale of grid nodes
It can be seen from the table that the relative error of power is negative when
The main content of this part is to test the convergence of the model reduction method. The data set to be tested consists of M = [1500; 1700; 1900; 2100; 2300; 2500] snapshot. The convergence of the first 18 modes is tested, as shown in Figure A2. It can be observed that when the number of snapshots is 1500, the first five pairs of modes have converged, and other modes need more snapshots to approximate well. When the number of snapshots reaches 2300, the change of smaller energy modes is already very small.
Acknowledgements
The author would like to thank Fengfeng Zhao, Jing Cao, and Boyang Song for the fruitful discussions.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work has received funding from the National Natural Science Foundation of China No. 11502141. It was also supported by the Scientific Computing Research Technology Platform (SCRTP) at the Shanghai University of Electric Power.
