This article was submitted to Wind Energy, a section of the journal Frontiers in Energy Research
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
With the development of advanced digital infrastructure in new wind power plants in China, the individual wind-turbine level data are available to power operators and can potentially provide more accurate available wind power estimations. In this paper, considering the state of the wind turbine and the loss in the station, a four-layer spatio-temporal neural network is proposed to compute the available power of wind farms. Specifically, the long short-term memory (LSTM) network is built for each wind turbine to extract the time-series correlations in historical data. In addition, the graph convolution network (GCN) is employed to extract the spatial relationship between neighboring wind turbines based on the topology and patterns of historical data. The case studies are performed using actual data from a wind farm in northern China. The study results indicate that the computation error using the proposed model is lower than that using the conventional physics-based methods and is also lower than that using other artificial intelligence methods.
香京julia种子在线播放
In recent years, the installed capacity of wind power has gradually increased, and the proportion of new energy power generation has gradually increased. In July 2022, the Global Wind Energy Council released the “Global Wind Report 2022.” This report shows that the new installed capacity of global wind power is 93.6 GW. By the end of 2021, the cumulative installed capacity of global wind power reached 837 GW, a year-on-year increase of 12.4%. However, due to the random, fluctuating, and intermittent characteristics of wind power, it brings significant challenges to real-time dispatching of power grids (
At present, the theoretical power computation and prediction of wind farms can be divided into physics-based methods, statistics-based methods, artificial intelligence-based methods, and hybrid methods. The physics-based method mainly uses the numerical weather forecast model to calculate the future wind speed. Then, the predicted wind speed is brought into the relevant wind farm power curve (
The statistical method is based on the statistical analysis of the correlation between wind farm power generations, wind speed, and wind direction data and establishes the mapping relationship between wind speed and wind direction and the output power of the wind farm. In the literature (
At present, artificial intelligence-based methods are generally used for wind power forecasting and available power estimations. The artificial intelligence method uses historical power data, NWP data, etc., as input information to establish a non-linear mapping relationship between the output and multi-variables. Compared with statistical methods, the adaptability and self-learning ability of artificial intelligence methods have been significantly improved.
A combined model is a combination of two or more models, which eliminates the limitations of the individual models by combining their advantages in order to maximize the advantages of each method and improve the accuracy of wind farm power prediction. In a study by
Most of the existing wind farm power prediction methods used numerical weather forecast data or measurement data from weather towers in wind farms as model inputs and the theoretical power of the whole wind farm as the model output. Since the state of the wind turbines and the loss in the field are not taken into account, they can only be used for theoretical power generation calculation and theoretical prediction of wind farms. Few studies have been conducted to calculate the available power of wind farms.
The previous theoretical power estimation is aimed at using numerical weather forecasts to estimate the total power generated by the entire wind farm, which cannot be accurate to each wind turbine, and the error is large. The emergence of a stand-alone information system can provide the nacelle wind speed of each wind turbine so that the grid dispatching department can be based on stand-alone actual measurement data to more accurately estimate the wind turbine and wind farm available power generation. Second, in most areas of the country, the only data available to the grid dispatching department are the weather forecast data and the active power of the grid connection point, which does not include the status of waiting for wind, planned shutdown, unit failure shutdown, and other wind turbine operating conditions. However, the active power emitted by the wind turbine in the normal power generation operation state reflects the real power generation capacity of the wind turbine.
Regional control centers in China enhanced the existing SCADA system to collect each wind turbine’s operating status, wind speed, and wind direction from every wind farm, which provides an opportunity for accurate computation of available power of each wind farm. In this paper, we propose a deep spatio-temporal neural network for calculating the available power generation in wind farms based on the data to integrate wind farm’s SCADA into the control center’s SCADA. Long short-term memory (LSTM) is used to extract temporal information, and the graph convolutional network (GCN) is used to describe the topology between wind turbines in wind farms and then solve the problem of spatial correlation and station loss.
In this paper, in order to improve the prediction accuracy, based on the real-time meteorological information of each wind turbine provided by the stand-alone information system, an ultra-short-term available power calculation method is proposed. This method combines the GCN and LSTM and considers the spatio-temporal correlation of wind farms and station losses. The key contributions are as follows: 1) Stand-alone information systems are used in wind power forecasting. They can provide real-time meteorological information such as the wind speed of each wind turbine and the information on the operating status of the wind turbine. 2) A novel graph neural network-based hybrid approach is proposed for ultra-short-term power prediction. LSTM is used to excavate the temporal characteristics of the wind speed of the wind turbine. The spatial position relationship of the wind turbine constitutes graph data, which is used as an input to the GCN to capture spatial dependence. 3) Based on the construction of the electrical topology connection diagram, the station loss problem is solved in combination with the GCN.
The available power of a wind farm refers to the maximum power theoretically available from the wind energy subtracting the power output of a wind turbine under maintenance and line losses in the wind farm. Additionally, instead of the actual power output of the wind farm, it may also be subject to curtailment and dispatch signals. To estimate the maximum available power of a wind farm, the wind speed information is used as the input. In this work, we used the historical wind speed data measured at each turbine and the turbine output power data under the maximum power point tracking working condition. Therefore, the summation of power generations of all the turbines, subtracting the total losses of the wind farm, equals the maximum available power of the wind farm.
The aforementioned available power calculation does not take into account the in-station losses. Since the sources of loss in the station are diverse and difficult to estimate, therefore, it is proposed to construct an electrical connection diagram of wind farm wind turbines based on the GCN using the wind farm electrical main wiring diagram, and the characteristics of the loss in the station are included in the wind turbine electrical connection diagram.
There is a correlation between the available power generation of wind turbines and the historical nacelle wind speed. The LSTM neural network layer can be constructed to effectively mine the time-series correlation information of wind turbine wind speed data. The complex topography and wake effects in the wind farm space have an impact on the power generated by the wind turbines. There is a correlation between adjacent wind turbines. Due to the different spatial distribution of WTGs and their own operating conditions, even the available power generation of WTGs of the same model varies. Therefore, based on the GCN, the fan relationship connection diagram is considered to extract spatial features.
In terms of the structure of the proposed algorithm, in this work, a two-stage deep neural network-based algorithm is proposed, which mainly consists of four major layers. The first stage contains the first two layers, namely, the LSTM layer and the GCN layer. LSTM is a type of recurrent neural network that is proven to be very effective in terms of handling the input data with temporal relationships. The time-series wind speed data are used as the input of the LSTM layer. The GCN layer is implemented to extract the spatial topology of the wind dynamics in the wind farm to help the estimation of the available power. In the second stage, a third GCN layer is used to calculate the losses in the wind farm, where the topology of the wind farm is also used. The last layer is a fully connected layer for the final output. It should be noted that the layer here refers to a section of small network, which contains multiple sub-layers, like single GCN layers, activation function, and pooling layer. The first stage and the second stage can be pre-trained separately under the supervised learning scheme with SGD and then combined together for fine-tuning the parameters.
Model general frame diagram.
There is a correlation between the available power generation of wind turbines and the historical nacelle wind speed (
The long short-term memory network is a variant of the recurrent neural network (RNN), which solves the problem of RNN gradient explosion and gradient disappearance. Based on the RNN, LSTM changes the single neural network layer in the RNN to one with four neural networks.
By changing the structure of neurons, LSTM introduces the gate mechanism and removes and adds information in the neurons through the gate mechanism (
LSTM internal structure diagram.
The LSTM unit consists of four parts: information storage chain, forgetting gate, memory gate, and output gate. The information storage chain runs through all the LSTM units before and after the LSTM network and is responsible for the storage and transmission of wind speed and operation status information of wind turbines at historical moments, and the information in the information storage chain of each LSTM unit is updated.
The role of the forgetting gate is to selectively forget the wind speed and operating status information components of some historical moments and to avoid too much information from the historical moments to affect the neural network’s processing of the wind speed and operating status inputs of the wind turbine at the current moment. By the effect of the forgetting gate, the information in the historical moments of WTGs that are not strongly correlated with the estimated moments can be eliminated. The mathematical expression of the forgetting gate is shown in Eq.
The memory gate is the control unit used to control whether the wind speed and operating status data of the WTGs at time
The output gate integrates the output data processed by the forgetting gate and the memory gate as the output of the power generated by the wind turbine at moment t. The mathematical expression of the output gate is as follows:
The network unit status is used to store information about the current WTG power, wind speed, and operating status and pass it to the next moment, which affects the output of the WTG power at the next moment. The update equation of the network unit status is shown in Eq.
The LSTM neural network layer can be constructed to effectively mine the time-series correlation information of wind turbine wind speed data. The key historical moments of wind speed are first screened out, and then the wind speed and operation status of the key historical moments are passed into the LSTM network as input vectors, and the valid information in the key historical moments is selected through memory gates and forgetting gates to update the network unit states, thus making the LSTM network fully consider the temporal correlation when estimating the available power generation of wind turbines and thus improving the available power estimation accuracy.
The complex topography and wake effects in the wind farm space have an impact on the power generated by the wind turbines. The wind turbines are not distributed in isolation in the wind farm space, and the data of neighboring turbines have a large contribution to the estimation of the available power generation of wind turbines. From the perspective of wind speed in wind turbine nacelle, the nacelle wind speed of a WTG at moment
Traditional convolutional neural networks are limited to modeling Euclidean spatial data only, while graph convolutional networks can process non-Euclidean spatial data using graph representation, which makes them more suitable for modeling all wind turbines in a wind farm. In the wind farm available power estimation, the graph data are first constructed based on the WTG connection relationship graph and the nacelle wind speed and operation status data of WTGs, and then the extraction of spatial features among WTGs is completed based on the graph convolutional neural network.
Before extracting the spatial features of wind farms using the GCN, the wind turbine connection relationship graph should be constructed based on the correlation of all wind turbines in the wind farm. Each wind turbine in the wind farm is abstracted into nodes, and the correlation of nacelle wind speed between all wind turbines in the wind farm is analyzed by the Pearson correlation coefficient, and the units with the correlation coefficient greater than the set threshold can be judged as having a correlation, and the units with a correlation are connected to form edges. Taking a wind farm with six wind turbines as an example, the connection relationship diagram of wind turbines in the wind farm is constructed, as shown in
Schematic diagram of building a connection diagram of wind turbines.
The GCN is a neural network that performs feature extraction on graphs. A graph consists of a set of vertices and edges connecting the vertices [27]; the vertices are the objects studied, and the edges are specific relationships between two objects. In the wind power plant turbine connectivity graph, turbines are abstracted as nodes, and units with a strong correlation with each other constitute edges. The fan connection relationship graph notation can be expressed as
Adjacency matrix and degree matrix.
The graph convolutional neural network uses the structural information of the edge–vertex connections of the WTG connectivity graph
In Eq.
The spatial correlation problem between wind turbines can be effectively solved by using the GCN, and the schematic diagram of extracting spatial features using the GCN is shown in
Schematic diagram of extracting spatial features by the GCN.
Station losses also need to be taken into account when making available power estimates. Wind turbines in the wind farm through the transformer step-up, and the power lines are connected together, and they finally go through the wind farm main transformer step-up voltage to the grid. This process will inevitably produce line losses, box transformer losses, main transformer losses, etc., which are collectively referred to as station losses. Because the station loss is not part of the power sent through the grid connection point, the station loss does not belong to the wind farm available power generation. The calculation of the wind farm available power generation needs to subtract this part of the power loss. However, station losses come from a variety of sources and are difficult to estimate, making it difficult to eliminate the impact of direct calculation on their accuracy.
Using the graph convolutional neural network, the wind turbine electrical connection relationship diagram can be constructed according to the main wind farm electrical wiring diagram, and the characteristics of the station losses are implied in the wind turbine electrical connection relationship diagram so as to indirectly complete the calculation of the losses of the convergence line, the main transformer in the wind farm. The schematic diagram of the wind turbine electrical connection relationship diagram is shown in
Electrical connection topology of the wind turbines.
The wind turbines are abstracted as nodes in the second layer of the GCN graph, and the power transmission paths are abstracted as edges between the nodes. The propagation process of the nodes’ own attributes in the GCN well-simulates the losses occurring in the power transmission process of the WTGs’ power generation. The input of the second GCN layer is the output matrix of the first GCN layer
The actual operation data of a wind farm are used as an example to test the model of this paper. The installed capacity of the wind farm is 100 MW, including 50 turbines with a single capacity of 2 MW, one main transformer with a rated capacity of 120 MW, 50 box transformers with a rated capacity of 2,300 kW, and six convergence lines; the turbine models are consistent. The wind farm data include the actual power of wind turbines, wind speed, and the actual power of wind farm grid connection points, and the data sampling interval is 1 December 2020–31 December 2020, with a sampling frequency of 1 min/point and a total of 44,640 samples. To use the data more efficiently for training, testing, and validation, the 10-fold cross validation is used.
The proposed deep neural network is trained with the stochastic gradient descent (SGD) optimization algorithm and the cross-entropy loss functions. The cross-entropy loss increases as the predicted probability diverges from the actual label. A perfect model has a loss of 0. However, balancing the over and under fitting is important for the training process. Therefore, the loss will not be 0 in reality. The SGD is an iterative method for optimizing an objective function with smoothness properties. It can be considered a stochastic approximation of the conventional gradient decent optimization, where an estimated gradient is used as the replacement of the actual gradient.
In this paper, four models are compared with the proposed method. The proposed model is marked as model_1, which establishes a wind farm available output estimation model based on LSTM and GCN considering the spatial and temporal correlation within the wind farm and the loss within the wind farm; model 2 is the prototype machine method (
After the adjustment of the model structure and parameters, the parameters of the proposed model are as follows: (1) LSTM layer: Since the input variables of the model are only the nacelle wind speed and operation status, the dimension of the input layer is 2; the number of key historical moments is determined to be 4 based on correlation analysis, so the number of time steps of the input layer is 5; for the estimation of available power generation in wind farms, this paper only considers a single-layer LSTM estimation model, so the number of hidden layer parameters is 1; the dimension of the hidden layer is generally chosen to be four times the number of input variables Therefore, the number of hidden layer dimensions of the model is 8. (2) GCN layer: The model in this paper uses two layers of the GCN. The graph topology of the first layer network is shown in
Construction of the wind farm wind turbine connection diagram.
Construction of the electrical connection diagram of the wind turbine.
(3) Fully connected layer: The fully connected layer is selected as the output layer of the spatio-temporal neural network. The input layer dimension of this layer is 50, which represents the available power generation of 50 wind turbines; the output layer dimension is 1, which represents the estimated value of the available power generation of wind farms.
Based on the data of pre-processed samples completed the aforementioned four wind farm available power generation estimation models, the test set in each model estimation results is shown in
Plot of wind speed versus power output for six turbines.
Errors in the test set of the theoretical power generation estimation model for wind turbines.
Model | Root mean square (RMS) error (%) | Mean absolute error (MAE) (%) |
---|---|---|
Proposed method (model_1) | 2.005 | 1.679 |
Prototype machine method ( |
9.486 | 7.269 |
LSTM-based method ( |
3.265 | 2.169 |
Power curve method ( |
5.286 | 4.568 |
LSTM + GCN method (model_5) | 2.925 | 1.354 |
In
In
Estimated power output versus the real power outputs of the six turbines.
From
Comparison of estimated power values and real power values of a part of the test set.
From
To sum up, based on the long short-term memory network and the graphical convolutional neural network, this paper establishes a model (model_1) that considers the temporal order of the wind speed of wind turbines and the spatial correlation of wind turbines inside wind farms and the station loss and has a more stable performance than that of model 2, model 3, model 4, and model_5, and its estimation error is lower and closer to the real value; by comparing model_3 and model_5 with the models proposed in this paper, it can be seen that the two-layer GCN in the model can effectively extract the spatial information inside the wind farm and solve the problem of loss in the station.
We investigate the problem of estimating the available power generation of wind farms and propose a model for calculating the available power generation of wind farms based on long short-term memory networks and graph convolutional networks.
When comparing the proposed method with the four available power generation estimation methods, namely, the sample machine method, the LSTM model, the power curve method, and the available generation power estimation method that considers only the spatio-temporal correlation method in the wind farm, the root mean square error of the proposed method is 2.138%, which is 77.5%, 34.5%, 59.6%, and 26.9% higher than that of the sample machine method, the LSTM model, the power curve method, and the method that considers only the spatio-temporal correlation within the wind farm, respectively, proving the feasibility and superiority of the proposed method.
The improvement of the accuracy of the estimation of the available power generation of wind farms will facilitate the online dispatching and optimization of the strategy of the direct wind power AGC (automatic generation control) system by the dispatching department of the State Grid Corporation, promote the consumption of new energy, and help the country achieve the reduction of carbon emissions.
In terms of the findings, limitations, and recommendations, the authors would like to share the following: 1) Due to the aerodynamics, the output power of wind farms has strong spatial correlations. 2) With the help of the data from individual turbines, more accurate estimation results in terms of available power can be generated because the state of individual turbines can be considered, including the rotating maintenance of turbines within the wind farm. 3) The performance of the machine learning algorithms is limited by the quality of the input/training dataset. Therefore, it is very important to wash and clean the dataset and find the outliers, resulting in improved estimation accuracy.
The limitation of the proposed method is also the data availability. It is fortunate that we have the data of individual wind turbines in terms of wind speed, output power, and operation states. Therefore, the model can be developed in this work, and the available power can be estimated based on individual turbine outputs and losses. Otherwise, the conventional wind forecasting is the best way to predict the output power of a wind farm.
The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.
YL and PZ proposed the algorithm, KH and JL wrote the manuscript, and ZL provided valuable comments.
The work is supported by the State Grid of China, North China Branch, under Project No. SGNCOOOODKJS2000265.
Author YL was employed by North China Branch of the State Grid Corporation of China.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.