Abstract
In recent decades, the Nonlinear Grey Bernoulli Model “NGBM (1, 1)” has been applied in various fields and achieved positive results. However, its prediction results may be inaccurate in different scenarios. In order to expand the field of application and to improve the predictive quality of the NGBM (1, 1) model, this paper proposes an effective model (named Fourier-NGBM (1, 1)). This model includes two main stages; first, we get the error values based on the actual data and predicted value of NGBM (1, 1). Then, we use a Fourier series to filter out and to select the low-frequency error values. To test the superior ability of the proposed model, two numerical data sets were used. One is the historical data of annual water consumption in Wuhan from 2005 to 2012 in He et al. ’s paper, and the other is example data from Wang et al. ’s paper. The forecasted results prove that the performance of the Fourier-NGBM (1, 1) model is better than three other forecasting models, namely GM (1, 1), NGBM (1, 1) and the improved Grey Regression model. Furthermore, this study also applied the proposed model to forecast the electricity consumption in Vietnam up to the year 2020. The empirical results can offer valuable insights and provide basic information for model building to develop future policies regarding electrical industry management. In subsequent research, more methodologies can be used to reduce the residual error of the NGBM (1, 1) model, such as Markov chain or different kinds of Fourier functions. Additionally, the proposed model can be applied in different industries with fluctuating data and uncertain information.
Keywords
Introduction
The grey prediction model is the central part of grey system theory [1]. It is an efficient method for dealing with uncertain information and building time series forecasting models with limited data. In the early 1980s, the grey model GM (1, 1) was introduced by Deng based on control theory [2, 3]. It was soon more commonly used than other forecasting models in time series because this model is easy to simulate and needs only small sample sizes. It was constructed based on the first-order accumulation method and first-order differential equation. Fundamentally, it follows the exponential growth laws. Based on the parameter value of a, we can make a decision in short-term, mid-term as well as long-term forecasting. Previous studies have evidenced the advantages of the grey forecasting model in dealing with the problem of uncertain information and limited data [3–5]. Therefore, many scientists have used this model in different areas such as tourism demand forecasting [6, 7], cargo throughput or motor vehicle forecasting [8–10], gold or stock index price forecasting [11–13], the output value forecasting of the high technology industry [14–17], energy demand forecasting [18–20], etc.
Recently, many scholars have used different ways to expand the application areas and to improve the prediction quality of grey models. Specifically, Lin et al. [21] and Wang et al. [22] provided new ways to calculate the background values of the grey forecasting model. Hsu [17] and Wang et al. [23] proposed a systematic approach to optimize the development coefficient or the grey input coefficient or both of these parameters. Some researchers focused on the residual error modification of the GM (1, 1) model, such as Hsu [15] and Wang et al. [24]. Additionally, many scientists proposed hybrid models based on the GM (1, 1) model. These include the grey econometric model [25], grey Markov model [26, 27], and the grey fuzzy model [21], etc. All of the aforementioned research concentrated on the modification of background values or residual errors.
On the other hand, some scholars used different equations to modify the GM (1, 1) model to become new models. For example, Chen [28] proposed the nonlinear grey Bernoulli model NGBM (1, 1) to forecast the unemployment rate. Gou et al. [9] proposed a grey Verhulst model to forecast cargo throughput. Chen et al. [29] used the NGBM (1, 1) model to predict the annual unemployment rates of ten countries. Zhou et al. [30] also applied the proposed model to forecast the foreign exchange rates of Taiwan’s major trading partners. The above real case studies evidenced that the NGBM (1, 1) significantly improved the efficient prediction of the original GM (1, 1) model.
With the flexible changing of the exponential parameter n in the NGBM (1, 1) forecasting model, this feature has made the model more suitable where there are linear or nonlinear characteristics of real systems. Relying on this feature, Zhou et al. [30] applied the novel NGBM (1, 1) model to forecast the power load. Hsu [16] proposed the novel NGBM (1, 1) based genetic algorithm to forecast the output values of high tech industries in Taiwan. Chen et al. [31] proposed a Nash NGBM (1, 1) model based on the Nash equilibrium concept to forecast the monthly stock indices in Taiwan. Wang et al. [32] proposed an optimized NGBM (1, 1) model to forecast the qualified discharge rate in 31 provinces in China. Pao et al. [33] proposed a model named the NGBM-OP model to forecast three important indicators of the clean energy economy in China, which are the amount of CO2 emissions, energy consumption, and economic growth. The empirical results proved that the prediction ability of the proposed model was remarkably improved in comparison with GM (1, 1), ARIMA and the NGBM (1, 1) model.
In fact, because the initial condition is a part of the predictive function, this is a main factor that affects the performance of grey prediction, which is the reason why most researchers have concentrated on the modification of the background values. In this paper, the author provides a different approach to reduce the model error by modifying the error sequence obtained from the NGBM (1, 1) with a Fourier series. Two numerical example and the results of a real case show that the Fourier-NGBM (1, 1) model can significantly improve the prediction accuracy.
This paper is an extended version of our conference paper published in the ACIIDS 2019 proceedings [34]. We have significantly expanded the paper, adding the following elements: Firstly, we use highly fluctuating data and lots of noise to test the performance of the presented model and we propose to apply it to a real case related to forecasting electricity consumption in Vietnam. The remainder of this paper is organized as follows. Section 2 analyzes the related work. Section 3 briefly introduces the NGBM (1, 1) model and proposes a systematic approach for improving the prediction accuracy of the NGBM (1, 1) by a Fourier series. Section 4 proves the effectiveness of the proposed model with two numerical examples; one is water consumption forecasting in He and Tao [35] and the other is fluctuation data in Wang et al. Section 5 presents the application of the proposed model in a real case. Finally, the paper concludes with some comments and suggestions.
Related work
As mentioned in section 1, the NGBM (1, 1) model expresses a superior forecasting ability when compared with the GM (1, 1) and grey Verhulst models. This is because it does not require a specific number in the exponential value of n (n ≠ 1). So, it can be flexible in determining the shape of the model’s curve with fluctuating sequence data. This characteristic has attracted the interest of many scientists and prompted further research for improving the prediction accuracy of this model. To deal with the solution of the optimal background interpolation value p, Zhou et al. used a particle swarm optimization algorithm [30]. To show the superior forecasting ability of the proposed model, the long-term power load during 1996 to 2007 is used for the modelling. According to the MAPE in-sample and out-of-sample indexes, the outcome of the simulation clearly indicated that the proposed model produces better forecasted results than the other four mentioned models which are the GM (1, 1), the optimal p value of GM (1, 1), the NGBM (1, 1) and the proposed model. These results once again confirmed that the prediction accuracy of NGBM (1, 1) can be improved by using a PSO algorithm.
Hsu [16] applied a genetic algorithm to get the optimal values in the NGBM (1, 1) model in order to enhance the performance prediction with a small amount of data. Then, the author applied it to forecast the economic trends in the integrated circuit industries in Taiwan during the period 1990 to 2007. The empirical results indicated that the error values of the proposed model in this study were lower than the traditional GM (1, 1) model and grey Verhulst model in terms of the MAPE index, which for in-sample was 12.48% and out-of-sample 4.64%. Chen et al. [31], based on the Nash equilibrium concept, proposed a Nash NGBM (1, 1) model which is an optimization of the exponential value n and the background interpolation value p in the NGBM (1, 1). For testing the proposed model’s adaptability, the monthly stock indices in Taiwan were used. The forecasting results show that the proposed model actually improved the forecasting precision.
Wang et al. [32] proposed a novel NBGM (1, 1) model by optimizing the two parameters p (background interpolation value) and n (exponential value). To solve this problem, the authors used the LINGO software. In order to verify the efficiency of the novel NGBM (1, 1) model in an environment with fluctuating data, the authors used a numerical example from Wen’s work and the output values of the electronics industry in Taiwan. The simulated results demonstrated that the forecast performance of the novel NBGM (1, 1) model is significantly improved compared to the original NGBM (1, 1) model. In addition, the authors also applied the improved model to forecast the annual qualified discharge rate in 31 provinces in China during the period 2001 to 2011. Empirical results show that the proposed model is more effective and reliable in prediction, with an average RPE index (relative percentage error) of less than 4% for 27 provinces and more than 4% for 4 provinces.
In summary, all these previous studies focused on the modification of the background value and parameters by using different mathematical algorithms. Based on the idea of Wang and Phan [24], this study presents a systematic approach and tries to enhance the performance of the NGBM (1, 1) model by using the Fourier series.
Methodology
Nonlinear grey bernoulli model “NGBM (1, 1)”
According to Chen et al. [29, 31], the NGBM (1, 1) is a first-order one-variable grey Bernoulli differential equation. For predictions related to nonlinear time series with a small sample, the forecasted results of NGBM (1, 1) are better than the original grey forecasting models such as GM (1, 1) and grey Verhulst. The algorithm of NGBM (1, 1) is expressed as follows:
m is the total number of modelling data.
b is the grey input,
n is any real number and n ≠ 1.
Let Δt
k
= 1, then
To replace X(1) (t
k
) in Equation (5), we obtain
p is the production coefficient (p ∈ [0, 1]), and normally the p value is set to 0.5.
The Fourier series [6] was used to filter out and to select the low-frequency error values of the NGBM (1, 1) to improve the performance in the prediction. The overall procedure is as follows:
We use the OLS method to calculate the parameters a0, a1, b1, a2, b2 . . . . a
z
, b
z
by the equation below:
Replacing the parameters just estimated in Equation (21) into Equation (17), we can get the modified residual series
In this paper, the Mean Absolute Percentage Error (MAPE) index is used to estimate the accuracy and reliability of the prediction models [36]. It is expressed as follows:
In this section, two examples with highly fluctuating data or lots of noise are given to illustrate the effectiveness of the proposed model. The first example is the historical data of annual water consumption in He et al.’s paper [35] and the second example is the example data in Wang et al.’s paper [32]. The details of the two simulations are presented below.
Example data from He et al. [35]
To show the effectiveness of the Fourier-NGBM (1, 1), this study first takes the annual water consumption demand in Wuhan as a numerical example of data and then compares the performance among the Fourier-NGBM (1, 1), a coupled model of the grey system and multivariate linear regression, and the original NGBM (1, 1) [35].
According to the historical data of the annual water consumption demand from 2005 to 2012 in Wuhan (see Table 1), this paper uses Microsoft Excel to simulate the mathematical algorithm of grey forecasting models. This software was provided by Microsoft Corporation. Excel software not only provides some basic functions but also provides two useful functions, Mmult and Minverse, to calculate the parameters in the grey forecasting model. In particular, the function of Mmult (array 1, array 2) can be used for multiplying two relevant arrays matrices. The function of Minverse (array) is to calculate the inverse matrix. Moreover, Excel software includes the efficiency tool named Excel-solves for finding the optimal values of parameters in forecasting models. For the sake of convenience, this paper just shows the detailed results of NGBM (1, 1) and Fourier-NGBM (1, 1) in Table 1. The other predicted results are omitted here, and only the MAPE indexes of these models are listed in Table 2.
Forecasting results from the NGBM (1, 1) and Fourier-NGBM (1, 1) models
Forecasting results from the NGBM (1, 1) and Fourier-NGBM (1, 1) models
Summary of evaluation indexes of model accuracy
According to the average of MAPE in Table 1, this results reveals that the residual error of Fourier- NGBM (1, 1) with n = -0.11 and p = 0.5 is lower than the original one. The average MAPE index decreased from 0.8% to 0%. This result was also compared with the coupled model of the grey system and multivariate linear regression model in [35], as well as GM (1, 1). The overall MAPE indexes as well as the performance forecasting of these models is shown in Table 2.
The results from Table 2 show that the MAPE of the Fourier-NGBM (1, 1) model is nearly 0%. This feature confirms that the performance of the Fourier-NGBM (1, 1) model is excellent. From comparison with the three remaining models, the Fourier-NGBM (1, 1) shows the highest accuracy in forecasting the total annual water consumption demand in Wuhan.
In this example, the raw data sequence jumps randomly X(0) = (5, 6, 4, 7). In this case, Wang et al. [32] used the example to demonstrate the accuracy of the optimized NGBM (1, 1). In this section, we also used the same example to compare the forecasting performance of the Fourier-NGBM (1, 1) model with the optimized NGBM (1, 1) and the original NGBM (1, 1) in Wang’s paper [32]. The results of the forecasting models are shown in Table 3 and Fig. 1.
Forecasted results from the different grey models
Forecasted results from the different grey models

Original fluctuation sequence curves versus forecasts.

Curves of actual and forecasted values using Holt’s-TCES, GM (1, 1), original NGBM (1, 1) and Fourier-NGBM (1, 1).
The results from Table 3 show that the optimized NGBM (1, 1) with n = -19.58 and p = 0.569 has a higher accuracy than the original NGBM (1, 1) with n = -10 and p = 0.5. However, by using the Fourier-NGBM (1, 1) with parameters n = -10 and p = 0.5, the MAPE of the Fourier-NGBM (1, 1) decreased from 7.16% to 0%. This result clearly indicated that the Fourier-NGBM (1, 1) is the best fitting model among the three forecasting models on the same sequence. A clearer visualization is shown in Fig. 1.
Data collection
The data of electricity consumption from 1990–2015 in Vietnam were obtained from the US Energy Information Administration [39] and are shown in Table 4.
Electricity consumption in Vietnam
Electricity consumption in Vietnam
In this case, four forecasting models, which are Holt’s trend corrected exponential smoothing method, traditional GM (1, 1), the nonlinear grey Bernoulli model NGBM (1, 1) and the proposed model, named Fourier-NGBM (1, 1), were used to find the best model in forecasting the electricity consumption in Vietnam as well as to test the proposed forecasting solution. First, we ran the data with the four forecasting models to find the best model among them based on the MAPE index.
Holt’s trend corrected exponential smoothing method (Holt’s-TCES)
Holt’s trend corrected exponential smoothing method was extended with simple exponential smoothing to allow the forecasting of data with a trend by Holt in 1957 [40]. This method involves a forecast equation and two smoothing equations (one for the level and one for the trend). The specific algorithm of Holt’s trend corrected exponential smoothing method was illustrated in [40]. We ran Holt’s-TCES with the smoothing parameter for the level alpha = 0.4 and the smoothing parameter for the trend beta = 0.3. The forecasted results and residual errors are tabulated in Table 5.
Forecasted value and residual error by the GM (1, 1) and Holts’-TCES models
Forecasted value and residual error by the GM (1, 1) and Holts’-TCES models
As mentioned in the introduction, the traditional GM (1, 1) model is the core of the grey forecasting model. It is constructed from a first-order differential model with one input variable. The overall modelling algorithm of the traditional GM (1, 1) forecasting model was shown in [24]. Based on the mathematical algorithm, this study can calculate the parameters: a = -0.112 and b = 8597.37. The function of GM (1, 1) for the electricity consumption is as follows:
All the forecasted values and the residual error of the traditional GM (1, 1) model are recorded in Table 5. As can be seen in Table 5, the MAPE index of the traditional GM (1, 1) model and Holt’s-TCES model in forecasting the electricity consumption is 4.67% and 7.33%, respectively. This indicates that the accuracy of the traditional GM (1,1) is higher than Holt’s-TCES model.
The NGBM (1, 1) and fourier-NGBM (1, 1) models
According to the historical data (in Table 4), this study can calculate the parameters: a = -0.1078, b = 2203.49, n = 0.123 and p = 0.5 based on the algorithms of the original NGBM (1, 1) model in section 3.1. The modelling of NGBM (1, 1) for the electricity consumption in Vietnam was found as follows:
The residual error series attained from NGBM (1, 1) in Equation (16) is then modified with a Fourier series (in Equation (19)), which forecasted the results of the Fourier-NGBM (1, 1) as per the algorithms stated in section 3.2. The detailed modelling and computing procedures have been omitted due to limited space. The forecasted values of traditional NGBM (1, 1) and Fourier-NGBM (1, 1) are summarized in Table 6.
Forecasted value and residual error by the NGBM (1, 1) and Fourier-NGBM (1, 1) models
Forecasted value and residual error by the NGBM (1, 1) and Fourier-NGBM (1, 1) models
Tables 5 and 6 reveal that the original NGBM (1, 1) with n = 0.123 and p = 0.5 shows a higher accuracy than that of the traditional GM (1, 1) in terms of the MAPE of the NGBM (1, 1) decreased from 4.67% to 2.20 %.
As can be seen in Table 6, the Fourier-NGBM (1, 1) with n = 0.123 and p = 0.5 shows a higher accuracy than the original NGBM (1, 1) as the MAPE of the Fourier-NGBM (1, 1) is 0%. This result clearly indicates that the Fourier-NGBM (1, 1) is the best fitting model for forecasting the electricity consumption in Vietnam.
After comparison of the performance of the four forecasting models in forecasting the electricity consumption in Vietnam, the empirical results clearly showed that the Fourier-NGBM (1, 1) is the best model. Therefore, the study strongly suggests that the Fourier-NGBM (1, 1) is used for estimation of electricity consumption demand in the future. The forecasted value of electricity consumption in Vietnam up to 2020 is shown in Table 7. Because of the real values of electricity consumption demand in 2016 and 2017 are unaccessible. Therefore, this paper cannot compare the forecasted values of Fourier-NGBM (1, 1) model. Forecasted value of the electricity consumption up to the year 2020 are shown in Table 7.
Forecasted value of electricity consumption by Fourier-NGBM (1, 1)
Forecasted value of electricity consumption by Fourier-NGBM (1, 1)
In Table 7, the forecasted electricity consumption in 2019 and 2020 will be over 219,281 and 245,085 GWH, respectively. This figure indicates that electricity consumption in Vietnam will grow significantly in the future. This empirical result will provide a basic scenario for decision-makers to make a good decision in the future regarding electrical industry management and building the developing strategy.
NGBM (1, 1) is an important extension of the traditional grey prediction model, and can be used to predict the nonlinear small sample time sequences encountered in practical data. In this paper, after the original NGBM (1, 1) model is reviewed, a novel systematic approach for improving the forecasting accuracy of NGBM (1, 1) is introduced. The results from the simulation of a numerical example in He et al. ’s and Wang et al. ’s papers indicate that Fourier-NGBM (1, 1) gives a superior modelling performance. To demonstrate further the advantages of the Fourier-NGBM (1, 1) model, electricity consumption demand in Vietnam is forecasted. The simulation results show that Fourier-NGBM (1, 1) is more suitable for electricity consumption demand forecasting than the other models compared with it, and it can also be used to make highly accurate predictions for other nonlinear forecasting tasks.
Furthermore, this study found the model suitable to forecast the electricity consumption in Vietnam update to 2020. The contribution of the current study provides some insights and basic information for decision-makers in model building for developing future policies regarding electrical industry management.
Further work is recommended to apply different methods such as Markov chain or different kinds of Fourier function to modify the residual error of the NGBM (1, 1) model and to expand the application of the proposed model to forecasting performance in different industries.
