Abstract
In order to improve the prediction accuracy of short-term wind speed, a short-term wind speed prediction model based on artificial bee colony algorithm optimized error minimized extreme learning machine model is proposed. The extreme learning machine has the advantages of fast learning speed and strong generalization ability. But many useless neurons of incremental extreme learning machine have little influences on the final output and, at the same time, reduce the efficiency of the algorithm. The optimal parameters of the hidden layer nodes will make network output error of incremental extreme learning machine decrease with fast speed. Based on the error minimized extreme learning machine, artificial bee colony algorithm is introduced to optimize the parameters of the hidden layer nodes, decrease the number of useless neurons, reduce training and prediction error, achieve the goal of reducing the network complexity, and improve the efficiency of the algorithm. The error minimized extreme learning machine prediction model is constructed with the obtained optimal parameters. The stability and convergence property of artificial bee colony algorithm optimized error minimized extreme learning machine model are proved. The practical short-term wind speed time series is used as the research object and to verify the validity of the prediction model. Multi-step prediction simulation of short-term wind speed is carried out. Compared with other prediction models, simulation results show that the prediction model proposed in this article reduces the training time of the prediction model and decreases the number of hidden layer nodes. The prediction model has higher prediction accuracy and reliability performance, meanwhile improves the performance indicators.
Keywords
Introduction
Increasing natural energy resources, the reduction of emissions, climate change, and rising energy costs lead to a change from classic carbon or nuclear-based power supply to a concentration on renewable energy resources (Kramer et al., 2014). Wind energy is a typical renewable energy (Heinermann and Kramer, 2016). Accurate short-term wind speed prediction has important theoretical significance and practical application value for wind power industry (Tascikaraoglu and Uzunoglu, 2014; Troncoso et al., 2015). Short-term wind speed prediction means predict the future wind speed in advance (Ambach and Vetter, 2016). The persistence method is the simplest prediction method, that is, the measured wind speed value at the latest sampling time is taken as the prediction value of the next sampling time (Santamaria-Bonfil et al., 2016). Many prediction methods generally use the persistence method as a benchmark to evaluate the prediction accuracy. Some scholars have set up the mathematical model for wind speed and have achieved good prediction results (Bozkurt et al., 2014; Gottschall et al., 2006; Henriksen et al., 2013; Iversen et al., 2016; Luhur et al., 2015). These mathematical models need to consider a lot of input conditions and states, but in practical applications, these states or conditions are difficult to obtain, which greatly limits the actual implementation.
However, many prediction methods of short-term wind speed are mainly based on historical data and time series model. The author compares the use of wind speed time series as well as differences of subsequent measurements with random forests, support vector regression, and k-nearest neighbors. The results show that time series is suitable for wind speed prediction (Heinermann and Kramer, 2015). These prediction methods using historical data, through some linear models include multivariate adaptive regression model (Staid et al., 2015), autoregressive moving average model (Erdem and Shi, 2011), autoregressive integrated moving average model (Cadenas et al., 2016) or nonlinear model include support vector machine (SVM) (Ortiz-García et al., 2011; Shrivastava et al., 2016), least squares support vector machines (Mathaba et al., 2012), artificial neural network (Doucoure et al., 2016; Mishra and Dash, 2018; Ramasamy et al., 2015; Saleck and von Bremen, 2007; Sivachitra and Vijayachitra, 2014), and so on to predict short-term wind speed time series in the future. The research results show that the short-term wind speed has a strong nonlinear (Baloch et al., 2016; Soman et al., 2010), so the nonlinear model is more suitable for short-term wind speed prediction. But there is no uniform method to determine the parameters of SVM and least squares support vector machine (LSSVM) prediction model. The artificial neural network is difficult to determine the network structure and easy to fall into the local optimal value.
The traditional neural network learning algorithm uses the gradient descent method to train the network. The learning speed of algorithm is slow, and all the parameters of the network need to be adjusted iteratively. This problem seriously restricts the development of forward neural network. Huang et al. (2006b) proposed a new neural network—extreme learning machine (ELM) algorithm. The ELM algorithm uses the random mechanism to reduce the parameter setting and choice. It is one kind of simple feasible fast learning algorithm. Compared with other traditional neural network learning algorithms or SVM, ELM algorithm has the advantages of fast learning speed and strong generalization ability (Huang et al., 2012; Lan et al., 2010). The authors point out that computing time of ELM is usually several thousand times faster than back propagation (BP) neural network or SVM (Naji et al., 2016). Therefore, the ELM algorithm is also applied to the short-term wind speed prediction (Bouzgou, 2014; Salcedo-Sanz et al., 2014; Shamshirband et al., 2015). However, the optimal number of hidden nodes in the standard ELM algorithm is determined by many experiments, so it is not only time-consuming but also random.
In recent years, many scholars have studied the network structure of ELM deeply. From the current research results, the main research work is focused on the node incremental ELM (I-ELM). I-ELM algorithm starts from a relatively small-scale neural network, gradually adds the hidden node to the network. When the output error of the network meets the accuracy of the actual problem, the node increase process will stop. Huang proposed extreme learning incremental machine (I-ELM) algorithm (Huang et al., 2006a) and enhanced incremental extreme learning machine (EI-ELM) algorithm (Huang and Chen, 2007, 2008), Wang proposed enhanced convex incremental extreme learning machine (ECI-ELM) algorithm (Wang and Zhang, 2012), and Feng proposed error minimized extreme learning machine (EM-ELM) algorithm (Feng et al., 2009). All of the above algorithms added hidden layer units in the network. The number of hidden nodes of I-ELM, CI-ELM, EI-ELM, and ECI-ELM algorithms increased with the increase in the number of iterations. The hidden nodes of EM-ELM algorithm can be added one by one or quantity with increase in the number of iterations. Although the incremental ELM algorithm have not the optimal number of hidden layer node and over fitting problems, many neurons in the incremental ELM algorithm have a very small effect on the final output. The useless neurons increase the number of iterations, which reduces the efficiency of the algorithm.
Based on the above discussions and EM-ELM algorithm, this article uses the artificial bee colony (ABC) algorithm to optimize the parameters of the hidden layer nodes in EM-ELM. The proposed algorithm can reduce the number of useless neurons in EM-ELM, decrease the complexity of the network, and improve the efficiency of the algorithm. The simulation results show that the method has better prediction effect through the actual short-term wind speed data.
The preliminaries
EM-ELM algorithm
The structure of ELM neural network algorithm is similar with the single-hidden layer feed forward neural networks (SLFNs). The difference is that ELM can randomly select its training parameters and get a complete network training model only through the output weights obtained by the least squares method.
ELM algorithm contains the following three steps (Wang and Zhang, 2012). For a given training set
where
The initial network structure of EM-ELM neural network has zero hidden layer nodes. The network structure can also be determined by the user. Under the premise of the initial structure determination, before increasing the number of nodes in the hidden layer, the training data should be used to train the network. The number of hidden layer nodes is increased after the initial output weights and the training errors are obtained. There are two methods on how to increase the nodes number of the hidden layer, single or quantity increasing. In the case of a single increase, the algorithm randomly generated input weights and thresholds of node. The node is added to the existing network structure. In the case of quantity increasing, there are two methods to increase the number of hidden layer nodes. The first method is to increase the fixed number of hidden layer nodes each time. The other is to increase the number of hidden layer nodes, which has a linear relationship with the number of existing hidden layer nodes. After obtaining the number of nodes that need to be increased, it will be added to the network and the corresponding input weights and thresholds can be added to the existing input weights and thresholds matrix. After iteration, the network needs to update the output weights and training errors of the whole network. Repeat to increase the number of hidden nodes until achieving the maximum hidden layer nodes number or the desired training error.
But the network output value
ABC algorithm
The ABC algorithm is an intelligent optimization algorithm which comes from the behavior of honey bees (Karaboga and Basturk, 2007). Compared with the genetic algorithm, the differential evolution algorithm, and the particle swarm optimization algorithm, the ABC algorithm is very competitive (Karaboga et al., 2012). This article will use ABC algorithm to optimize optimal parameters of hidden layers of EM-ELM. The EM-ELM prediction model is constructed with the obtained optimal parameters.
The nectar source of ABC algorithm is abstracted as a point in the solution space. The quality of nectar source
At the start of the search phase, a new nectar source around nectar source is generated by employed foragers according to equation (4)
where
That means onlooker bees generate a random number belong to [0,1] and compare it with
The implementation steps of ABC algorithm are as follows:
The short-tem wind speed prediction model based on ABC-optimized EM-ELM
The prediction model
The hidden layer parameters
where
The implementation steps of hidden parameters of EM-ELM optimized by ABC algorithm can be described as following:
Convergence analysis of the algorithm
The following two lemmas are introduced.
Lemma 1
Given arbitrary
where
Lemma 2
Given any bounded continuous or piecewise continuous activation function, for any continuous objective function, if
then there is arbitrary output matrix
Theorem 1
Given discrete sample
Proof
1.If
Assuming the maximum number of iterations of the ABC algorithm is
According to the literature (Huang et al., 2006b), if
Therefore, there exists
2.If
Assuming the maximum number of neurons is
According to Lemma 1, constant
Therefore, there exists
Simulation
In order to verify the predictive performance of the ABC-optimized EM-ELM, the short-term wind speed data are used to validate the predictive ability. In order to illustrate the effectiveness of proposed prediction method, four groups’ short-term wind speed data are collected from a power plant located in Liaoning Province, China. The sampling period is 1 h, 30 min, 20 min, and 10 min, respectively. The length of the four datasets is 300. 300 groups of short-term wind speed data with a sampling period of 1 h were collected from 6 o’clock on 1 March 2013 to 18 o’clock on 13 March 2013. The first 276 groups of data are used to train the model (from 6 o’clock on 1 March 2013 to 18 o’clock on 12 March 2013), and the latter 24 groups of data (from 19 o’clock on 11 March 2013 to 18 o’clock on 12 March 2013) are used as the test set to verify the accuracy of the prediction model. 300 groups of short-term wind speed data with a sampling period of 30 min were collected from 6 o’clock on 15 March 2013 to 12 o’clock on 21 March 2013. The first 276 groups of data are used to train the model (from 6 o’clock on 15 March 2013 to 24 o’clock on 20 March 2013), and the latter 24 groups of data (from 0 o’clock on 21 March 2013 to 12 o’clock on 21 March 2013) are used as the test set. 300 groups of short-term wind speed data with a sampling period of 20 min were collected from 6 o’clock on 15 March 2013 to 10 o’clock on 19 March 2013. The first 276 groups of data are used to train the model (from 6 o’clock on 15 March 2013 to 2 o’clock on 19 March 2013), and the latter 24 groups of data (from 2 o’clock on 19 March 2013 to 10 o’clock on 19 March 2013) are used as the test set. 300 groups of short-term wind speed data with a sampling period of 10 min were collected from 6 o’clock on 15 March 2013 to 8 o’clock on 17 March 2013. The first 276 groups of data are used to train the model (from 6 o’clock on 15 March 2013 to 4 o’clock on 17 March 2013), and the latter 24 groups of data (from 4 o’clock on 17 March 2013 to 8 o’clock on 17 March 2013) are used as the test set. 300 groups’ short-term wind speed data with four sampling periods are shown in Figure 1.

Short-term wind speed time series.
The proposed ABC algorithm optimized EM-ELM prediction model is used to predict four groups’ short-term wind speed data. The prediction steps of wind speed data are 24. The parameters of ABC algorithm–optimized EM-ELM prediction model are
The operation results of several algorithms of wind speed data with a sampling period of 1 h.
ABC: artificial bee colony; EM-ELM: error minimized extreme learning machine; ELM: extreme learning machine; EI-ELM: enhanced incremental extreme learning machine.
The operation results of several algorithms of wind speed data with a sampling period of 30 min.
ABC: artificial bee colony; EM-ELM: error minimized extreme learning machine; ELM: extreme learning machine; EI-ELM: enhanced incremental extreme learning machine.
The operation results of several algorithms of wind speed data with a sampling period of 20 min.
ABC: artificial bee colony; EM-ELM: error minimized extreme learning machine; ELM: extreme learning machine; EI-ELM: enhanced incremental extreme learning machine.
The operation results of several algorithms of wind speed data with a sampling period of 10 min.
ABC: artificial bee colony; EM-ELM: error minimized extreme learning machine; ELM: extreme learning machine; EI-ELM: enhanced incremental extreme learning machine.
It can be seen from Tables 1 to 4, because the ABC algorithm reduces the number of hidden layer nodes in the iteration cycle, the time consuming and training time of the algorithm is better than the standard ELM, EM-ELM, and E-ELM. Also due to the reduction of the hidden layer node numbers, the complexity of the ELM network is reduced. The overall performance of the system is improved, which means that the method can obtain more simplified ELM network structure in the same or less training time.
Figure 2 shows comparison of the predicted values and the actual values of four groups’ short-term wind speed data in test set in one simulation by using four ELM models. From the graph, we can observe that the method in the article is superior to other three kinds of ELM in curve fitting. The prediction model in this article has a better regression prediction ability for short-term wind speed time series.

Comparison results of four groups’ short-term wind speed data by four ELM prediction models.
In order to further compare the predictive effects of this method, method is compared to combined prediction model based on wavelet transform (Tian et al., 2015b), prediction model based on chaotic theory and online least square support vector machine (Tian et al., 2015a), and prediction model based on combination kernel function least squares support vector machine (Tian et al., 2014). At the same time, the proposed method is compared with persistence method (Santamaria-Bonfil et al., 2016), least squares support vector machines (Mathaba et al., 2012), autoregressive integrated moving average model (Cadenas et al., 2016), and neural network (Saleck and von Bremen, 2007). The parameters of least squares support vector machines can be obtained by LSSVM Toolbox. The parameters of autoregressive integrated moving average model can be obtained by Akaike information criterion (AIC) criterion. The simulation results of four groups’ wind speed data are shown in Figure 3. Figure 4 shows absolute error distribution of four groups’ wind speed data of the prediction models mentioned in this article. Figure 5 shows relative error distribution of four groups’ wind speed data of the prediction models mentioned in this article. From the results of Figures 2 to 5, we can see that the proposed prediction model is better than other models in prediction accuracy and prediction error.

Comparison of the short-term wind speed prediction and actual value of four groups’ wind speed data.

Absolute error distribution of four groups’ wind speed data.

Relative error distribution of four groups’ wind speed data.
In this article, we introduce the following four kinds of performance indicators to measure the prediction accuracy of the prediction model:
RMSE (root mean square error)
MAE (mean absolute error)
MAPE (mean absolute percentage error)
Reliability
where
Table 5 shows the comparison of RMSE, MAE, and MAPE performance indicators of these prediction models. The results in Table 5 also show that the prediction model in this article is superior to other prediction models in the performance indicators.
Comparison of performance indicators of four groups’ wind speed data.
RMSE: root mean square error; MAE: mean absolute error; MAPE: mean absolute percentage error; EM-ELM: error minimized extreme learning machine; ELM: extreme learning machine; EI-ELM: enhanced incremental extreme learning machine; LSSVM: least squares support vector machine.
Figure 6 shows reliability and confidence distribution of the prediction models mentioned in this article. It can be seen from this graph that the prediction model in this article has higher reliability under the same confidence level. It can be known that the reliability of ABC-optimized EM-ELM prediction model is better than other prediction models.

Reliability and confidence distribution of four groups’ wind speed data.
In summary, from the point of view of the above modeling training results, curve fitting, error distribution, performance indicators, reliability, and so on, ABC algorithm–optimized EM-ELM of short-term wind speed prediction model has better prediction effect. The main reason for the improvement of prediction performance is that ABC algorithm optimizes the parameters of the hidden layer nodes of EM-ELM. It makes the network output error rapid decline, simplifies the structure of network, reduces the number of useless neurons, and improves the algorithm efficiency.
Conclusion
A prediction model based on ABC algorithm–optimized error minimized ELM is proposed in this article. This prediction model is applied to the prediction of short-term wind speed time series. The I-ELM is the main research direction to improve the performance of ELM, but there exist a large number of useless neurons in the I-ELM, which reduces the efficiency and performance of the algorithm. In order to simplify the network structure, improve the performance of the algorithm, this article uses the ABC algorithm with good performance to optimize the parameters of the hidden layer based on the EM-ELM. At the same time, the stability and convergence of the algorithm are proved. The simulation results are verified by the short-term wind speed time series. Compared with other prediction models, the simulation results show that the prediction model proposed in this article has better prediction effect and higher prediction accuracy.
In this article, the short-term wind speed time series is used as the simulation object. In fact, the proposed prediction model can also be used in the prediction of any time series such as sunspot number, river run-off, stock price, network traffic, and wind power. The future research of this article is to apply the prediction model to other time series and further improve the prediction accuracy of the prediction model.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was partially supported by the Science Research Project of Liaoning Education Department (No. LGD2016009) and the Natural Science Foundation of Liaoning Province of China (No. 20170540686).
