Abstract
Gross domestic product (GDP) is the most widely-used tool for measuring the overall situation of a country’s economic activity within a specified period of time. A more accurate forecasting of GDP based on standardized procedures with known samples available is conducive to guide decision making of government, enterprises and individuals. This study devotes to enhance the accuracy regarding GDP forecasting with given sample of historical data. To achieve this purpose, the study incorporates artificial neural network (ANN) into grey Markov chain model to modify the residual error, thus develops a novel hybrid model called grey Markov chain with ANN error correction (abbreviated as GMCM_ANN), which assembles the advantages of three components to fit nonlinear forecasting with limited sample sizes. The new model has been tested by adopting the historical data, which includes the original GDP data of the United States, Japan, China and India from 2000 to 2019, and also provides predications on four countries’ GDP up to 2022. Four models including autoregressive integrated moving average model, back-propagation neural network, the traditional GM(1,1) and grey Markov chain model are as benchmarks for comparison of the predicted accuracy and application scope. The obtained results are satisfactory and indicate superior forecasting performance of the proposed approach in terms of accuracy and universality.
Keywords
Introduction
The Gross Domestic Product (GDP), which measures aggregate monetary value of all finished goods and services produced by an economy during a given period, serves as one of the most widely acknowledged and extremely important indicator to reflect market economic activity of a nation or region, as well as getting insight into the state of an economy. Hence, GDP forecasting in a timely manner which is very meaningful in guiding government policymakers, enterprises administrators and individuals to capture the development of economy and make the correct economic decision-making, has spurred widespread concerns by many researchers and practitioners in the last few decades. During recent years, with facing various downside risks for the global economics and aggravation of international trade disputes, GDP forecasting of an economy has become increasingly complex and unprecedented urgent.
Study on GDP forecasting has experienced explosive growth in recent years. Up to the present, methodologies used in these studies can roughly be summarized into four types: the traditional statistic model, neural network-based model, grey forecasting model and the combination model. As a set of mature procedures, statistics forecast methods including autoregressive integrated moving average (ARIMA) model, regression, exponential smoothing method etc. Box and Jenkins (1970) [1] initiated ARIMA model which as a typical time series method, involves three main stages namely identification, estimation and diagnostic checking. Gujarati and Porter (2004) [2] emphasized that logarithm of the raw GDP data series is adjusted before it is analyzed by ARIMA method. Through some empirical researches, ARIMA model has been proven to be reliable and accurate model in predicting macroeconomic variable GDP [3–9]. Besides, the other common statistical prediction methods like multifactor regression [10], quadratic exponential smoothing [11], are also successfully applied into GDP forecasting. Due to the characteristics of nonlinear categorization, along with capacity of fast and accurate modeling, neural network, especially one of the most popular algorithm, i.e. back propagation (BP) neural network received much attention and achieved great results [12], including some concrete application of GDP forecasting [13–15]. However, it should be noted that both statistical methods and neural network not only require a large sample of historical data but also typical distributions [16], thereby limiting the scope of their practical use.
Grey system theory originated by Deng (1982) [17] is a powerful mathematical tool to settle grey system with partial information known and partial information unknown. A significant advantage of this method embodies it only requires limited discrete data to estimate the behavior of a system with incomplete information and also achieves a certain degree of fitting effect to evaluate the reliability [18, 19]. Cause this strength there emerges a series of studies that applied the grey model to predict GDP in various nations or regions [20–23]. Inadequacy lies in that the single grey model mainly focuses on the linear distributed data, and noise in the nonlinear data may give rise to significant errors in the estimate. Liu et al. (2020) [24] established a generalized grey forecasting technique named the optimized grey prediction model with time power term (abbreviated as OPTGM (1,1,α)) to predict China’s GDP, which shows higher forecasting accuracy compared with the traditional grey prediction model. But an inherent difficulties of this model lies in the determination of parameter α. Subsequently, great endeavor on merging grey model with other techniques has been devoted to extend the application scopes of nonlinear analysis together with increasing the predicting accuracy. An improved grey prediction model and its variants based on grey model and ANN are put forward in succession, arise in a variety of applications, such as transpacific air passenger market [25], flood and wind speed forecasting [26, 27], industrial control systems [28], air pollutant emission [29]. Some studies extended linear GM(1,1) to nonlinear grey Bernoulli model termed NGBM and applied it in predicting the foreign exchange rates [30], main economic indices of high technology enterprises [31], electricity power generation [32] and GDP forecasting [33, 34]. Another mathematical technique known as Markov chain (1997) [35], that due to its advantage of dealing with randomness, has been incorporated into grey model to enhance the predicted accuracy in recent studies. Wang and Meng (2008) [36] investigated forecasting electricity demand by using Grey-Markov model. Kumar and Jain (2010) [37] employed Grey-Markov model to predict energy consumption in India. Chen (2011)[38] put forward a Grey-Markov model to forecast financial crises for an enterprise. Other studies applied Grey-Markov model in prediction of global ICT development [39], fire accidents [40], aircraft accidental damage [41], equipment failure [42], urban medical services demand [43], developments of GDP, population and energy consumption [44], and demonstrated which can be as a viable alternative with a high prediction precision in various fields. The same inherent defect of Grey-Markov model with GM(1,1) lies in that which is hardly able to deal with nonlinearity cases that widely exist in reality.
To sum up, three major limitations existing in these approaches as mentioned above. First, the traditional regression procedures, neural network technique and their variants requires a large database to guarantee forecasting accuracy. Second, GDP is highly influenced by various external shocks and thus performs instability of growth, especially for some emerging economies. Single grey model is incapable for acquiring accurate predication in the situation of random vibration and high noise. Employing Markov chain model highly requires the calculation of GDP transition probability. GDP transaction features vary across categories and nations, and probability matrix of GDP transaction exhibits difference from one nation to another. Third, both traditional grey model, Markov chain and their integrations focus on (approximate) linear predictions, but them are not good at handling of nonlinearity.
This paper aims to address the limitations outlined above and enhance the precision accuracy in GDP forecasting by developing a new hybrid model which is based on GM (1,1) model, combines Markov chain model and artificial neural network (ANN), named Grey-Markov chain model with ANN error correction. In essence the combined model can be seen as a Grey-Markov model with residual modification by ANN sign estimation. It takes advantage of small sample forecasting of GM(1,1), modelling the randomness on the discretized states of Markov chain, and takes advantage of nonlinear treatment capability of ANN. In order to test prediction accuracy and better adaptability of the presented model, we utilize it to GDP forecasting which can be approximate quantified as a nonlinear forecasting under limited data with random vibration and noise. Four countries consisting of two developed economies, i.e. the United States and Japan, two emerging economies, i.e. China and India, are specialized selected which is due to the feature of GDP trend for each alternative in the period 2000-2019 differs significantly. The details are displayed in Fig. 1 and Table 1, so that these sample data possess typicality with sufficient proof to verify the effectiveness of our proposed model.

Comparison of the actual real GDP growth rate in the United States, Japan, China and India, 2000-2019. (Original data is collected from open database of the World Bank.)
Mean, variance and variance coefficient of GDP growth rate in 2000-2019 of the four countries
The remainder of the paper is organized as follows. Section 2 contains background material related to grey model and Markov chain, and describes the procedure of the improved grey Markov chain model with ANN error correction. Section 3 states applications to GDP forecasting, presents empirical results and compares with prediction performance of methods included. The final comments and conclusions are provided in Section 4 and 5.
The improved Grey Markov Chain model with ANN error correction (GMCM_ANN) is assembly in three methods, including GM(1,1) model, Markov chain and Artificial Neural Network (ANN). The purpose of GM (1,1) is to initially fit and predicts GDP in the linear manner by adopting historic data. The technique of ANN is applied to make residual modification in GM(1,1) for adapting nonlinear characteristics of reality. In order to further increase the prediction precision, Markov chain is incorporated in the GM(1,1) with ANN error correction model that revises the estimates by considering the transferring states of previous forecasting sequences.
The GM(1,1) Model
The GM(1,1) as a mature technique is generally developed as the following procedure.
Assume that the original data series is X(0) ={ x(0) (1) , x(0) (2) , …, x(0) (n) }, where x(0) (i) >0, i = 1, 2, …, n. A sequence operator which can weaken the randomness of the source data called one time accumulating generation operation (1-AGO) accumulates the data series and gains a new series, denoted by X(1) ={ x(1) (1) , x(1) (2) , …, x(1) (n) }, satisfying
Furthermore, the next mean generation sequence is defined as: Z(1) (n) ={ z(1) (2) , …, z(1) (n) }, where
Due to the fact that the above monotonically increasing sequence by using 1-AGO is similar to the solution curve of the first order linear differential equation. One can derive the whitenization differential equation in GM (1,1) model as follows:
Another equivalent expression in GM(1,1) is:
In the equations above, symbol ∧ represents the grey predicted value, α represents the developing coefficient, and β represents the grey action. Let u = [α, β]
T
be the parameter list, one can obtain its values by using least square method:
Consequently, the prediction value at step k (k = 1, 2, …, n - 1) is determined by the grey prediction equations:
Practical prediction of GDP involves its composition, which is often affected by some unknown and random factors. Such as unexpected supply and demand shocks, seasonal variations etc. Thus, the traditional GM(1,1) model sometimes is inadequate of predicting accurate GDP with original data series. In order to improve the prediction precision of GDP, a brief reviews regarding GMCM in which Markov chain model combined with GM(1,1) model is introduced in the current section. The integrated model will be more competent for tracking the randomness and volatility of macroeconomic total quantity, and performs a higher accurate prediction on GDP compared to the single model GM(1,1). The main procedures of GMCM are outlined in the following.
The partition of transferring
Based on the actual GDP denoted by x(0) (i) (i = 1, 2, …, n) drawing from the initial data series and the forecasting values
We use the symbol P ij (k) to denote the probability of ratio transferring from state i to j in step k, in which P ij (k) = M ij (k)/M i . Where M ij (k) represents the sample size of the ratio transferring from state i to j in step k, and M i is the sample size of the ratio state i.
Hence the Markov state transition probability matrix, denoted by P (k), with its entries P
ij
(k) can be expressed as:
Let π
j
(t) be probability of state in period t, a state transaction vector with regard to sequence of periods is derived, denote by (π1 (t) , π2 (t) , …, π
m
(t)). In addition, π
j
(0) indicates the initial probability vector of the state transaction, utilizing of the initial probability vector and the state transaction probability matrix, to ascertain the probability vector of forecast objects in any period at any state. A hybrid model of prediction with mixture of GM(1,1) and Markov chain is given by:
Where π
j
(t) = πj-1 (t) P (1), π
j
(t) ⊗ Q
j
= π
j
(t) · L
j
+ (1 - π
j
(t)) U
j
. In essence
The existing of error between raw data and forecasting is unavoidable. Hence, minimizing the error is a practicable way to increase forecasting precision. In this section we are going to propose a new method by using ANN corrects the error to develop the grey Markov chain prediction model. The original forecasting value is obtained by GM(1,1), followed by one calculates the estimates forecast error and updates predictive values the using ANN, and finally investigates information of states transfer by Markov chain to obtain the estimated forecast values. In order to implement the proposed model, the procedure of obtaining the forecasted values by GMCM_ANN is described in the following, showed as Fig. 2: Collect the original series of data and divide into two groups, i.e. the training subset and testing subset. The former is used to train the forecasting model, and the latter is employed to test performance of forecasting model. Use the grey forecasting model GM(1,1) to train the training samples, then detection effect is validated by testing data, and compute the predictive values. Obtain the residual series. Let {e(0) (1) , e(0) (2) , …, e(0) (j) , …, e(0) (n)} be the residual series, where Training the ANN model based on the training subset of the standardized error sequence. Let Γ be the prediction order, let ɛ(0) (j - 1) , …, ɛ(0) (j - Γ) (j = Γ, …, n) be defined as input samples, ɛ(0) (j) be defined as the expected value of forecast by ANN. In this study we set Γ = 3, set learning rate with 0.05 and maximum training times with 5000 for generating neural network. And run twenty experiments on the training data for ANN error correction and select the optimal fitting results as the outputs of ANN error correction. Compute the forecasting values with ANN error correction. It can be expressed as:
Construct a data sequence {s (Γ) , … , s (n) } with its entries Ascertain the state transition probability matrix, and train the forecasting model incorporated in Markov chain with respect to training subset of {s (Γ) , … , s (n) }. By applying Equation 8, one obtains the final predictive values of the improved GMCM_ANN.

Algorithm flowing chart for GMCM_ANN.
Note that GMCM_ANN which we put forward primarily embodies its advantages in three aspects. First, it makes prediction without the restriction of sample size and is qualified to solve the problems of insufficient number of training samples. Second, the traditional GM(1,1) model is as a linear grey forecasting model, and Markov chain mainly adapts to stationary objects, while GMCM_ANN extends the linear grey forecasting to nonlinear forecasting, as well the novel model can deal with non-stationary objects. Third, GMCM_ANN can modify the volatility and noise in prediction. The new method fully plays their respective advantages GM, MC and ANN, can extend the applicable scopes to nonlinear dynamic information prediction and better improve performance of predicated accuracy. To quantitative verify the effectiveness of GMCM_ANN, the following section applies it with empirical data, and compares the performance with other forecasting models.
Data
We use time series with annal data which is with regard to two developed countries United States (US) and Japan (JP), two developing countries China (CN) and India (IN), spanning the years 2000 through 2019, to verify the proposed model. Note that GDP is the market value of all of the final goods and services produced within the country’s borders in a one-year period. It is commonly regarded as the most important indicator to measure the national economic development, and has the features that include approximate linear growth rate, multiple causes, and influence of other random factors. The data used in this paper is collected from open database of the World Bank. Data from 2000 to 2015 is used to train the forecasting models, and the rest periods as testing set that is employed to test the performance of different forecasting models. Original data of GDP from 2000 to 2019 are listed in the Appendix.
Procedure
To assessing forecast performance of the proposed method GMCM_ANN, four forecasting methods, i.e. autoregressive integrated moving average model (ARIMA), Back-Propagation neural network (BP), the traditional GM(1,1) and grey Markov chain prediction Model (GMCM) are as benchmarks for comparison. By importing the original data into these models as mentioned above separately, we successively investigate GDP forecasting of four selected countries. Matlab software package (2010) is employed to implement these models and derives the corresponding estimated results. Plots of the actual data and the predicted values of GDP are contained in Figs 3–6. These blue line reflect the actual GDP situations of various countries, the brown line, red line, yellow line, purple line and green line represents the predictions which are calculated by ARIMA, BP, GM, GMCM and GMCM_ANN, respectively. It is marked in each figure below that one takes 2000-2015 as models training segment, 2016-2019 as models testing segment to examine prediction accuracy of each model.

Forecasting GDP of the United States by ARIMA, BP, GM(1,1), GMCM and GMCM_ANN.

Forecasting GDP of Japan by ARIMA, BP, GM(1,1), GMCM and GMCM_ANN.

Forecasting GDP of China by ARIMA, BP, GM(1,1), GMCM and GMCM_ANN.

Forecasting GDP of India by ARIMA, BP, GM(1,1), GMCM and GMCM_ANN.
The test focuses on three common evaluation criteria, including relative percentage error (RPE), mean absolute percentage error (MAPE) and root mean square error (RMSE), to compare the five models’ prediction performance in these criteria. These formulas are defined as follows:
For predicting GDP of the United States in Fig. 3, it is straightforward to conclude that BP has the worst fitting performance whether in training segment or testing segment due to the significant errors. Forecasting GDP of China in Fig. 5 and India in Fig. 6 also show the same features with accuracy. But forecasting GDP of Japan in Fig. 4 displays a slightly difference that among ARIMA, BP and GM have indistinguishable unsatisfactory performances. Compare prediction performances of the five models by taking into account the testing period, it can be seen that the result by GMCM_ANN has the slightest errors on the whole, subsequent to the GMCM, while the ARIMA, GM and BP have significant residuals. Due to the fact it indicates the hybrid models GMCM and GMCM_ANN have performed more accurate prediction than the rest three single models with remarkable improvements in precision. Especially GMCM_ANN further revising the forecasting results by GMCM to achieve the highest accuracy. The experiment results proves the validity of the proposed method. However, It can be seen roughly that the fitting degrees by GMCM_ANN in Figs. 3–4 are higher than the fitting degrees in Figs. 5–6. Other four models also encounter the common feature of lower accuracy in predicting GDP of China and India. This phenomenon shows that prediction accuracy of models depends on the samples, and more difficulties arise in predicting GDP of China and India rather than the United States and Japan.
It is customary that testing data is used to compare the forecasting accuracy of different models. According to the corresponding procedures above for various models, these estimated values and RPE for GDP of four selected countries during the testing period (2016-2019) are presented. The concrete results are listed in Table 2 as below.
The comparison of forecasting results obtained by five models.(Units: Billion US Dollars)
The comparison of forecasting results obtained by five models.(Units: Billion US Dollars)
Taking absolute RPE as a yardstick to measure prediction performance of different models, several stylized facts can be revealed from Table 2. First, in aggregate the mixture models are more accurate in contrast to the single component model. Second, the model fitting results indicates GMCM_ANN with error correction has lower prediction error than GMCM in most cases, such as in the GDP forecasting of Japan, China and India. Third, the estimated results by GMCM is casually closer to the real data than GMCM_ANN, like forecasting US GDP in 2016-2017. However, to forecast US GDP in 2018, the absolute RPE for GMCM suddenly rises from 0.4% to 1.6%, while the absolute RPE for GMCM_ANN significantly narrows from 1% to 0.1%, the latter performs more stability of prediction accuracy with less fluctuation of RPE.
For a better comparison, MAPE and RMSE as two crucial guidelines are on assessing forecasting accuracy of the four approaches to formally demonstrate the validity of GMCM_ANN model. The results of the comparison based on the testing period (2016-2019) are presented in Table 3 and Fig. 7.

The comparison of fitting results by five models.
As shown in Table 3 and Fig. 7, GMCM_ANN has the highest imitation with the original testing data. Comparing with the three single models ARIMA, BP and GM(1,1), GMCM_ANN has sharply reduced 61.17%, 69.96% and 64.56% in MAPE for forecasting US GDP. Comparing with the mixture model GMCM, GMCM_ANN also has significant declined 35.40% in MAPE for forecasting US GDP. Similar patterns also appear in the GDP forecasting of other three countries. Investigation of RMSE also supports the same conclusions. These results are able to prove the validity of the proposed model, which has a better superiority of prediction precision by comparison among four remaining methods. Consequently, it is reasonable to expect that the predicted result of each country’s GDP in 2020-2022 calculated by GMCM_ANN are good simulations of the actual values with lower forecast error. The concrete results are displayed in Table 4 as below.
The prediction precisions by five mentioned models
The prediction results for the four countries’ GDP by GMCM_ANN.(Units: Billion US Dollars)
The more accurate predicting of future GDP trend is obviously of significant importance to government, corporation and individual decision makers. In this study we employ the proposed model GMCM_ANN and the other four models ARIMA, BP, GM(1,1), GMCM as reference models in GDP forecasting of the United States, Japan, China and India. The empirical results show that GMCM_ANN has superior accuracies for GDP forecasting with different countries. Then we obtain the new predicted values of each country in 2020-2022 by using GMCM_ANN. From the foregoing, some suggestions are provided to be considered in further GDP forecasting.
As the world’s two biggest emerging economies, both China and India are growing at rates noticeably faster than the rest two mature economies the United States and Japan in recent years. And the two exhibit a very obvious nonlinear growth characteristics. The main driving forces could be attributed to these development phases also comparatively primary, rich human resources, huge domestic consumption market, and market-oriented economy system innovation etc. Partly because of that, predicting GDP of China and India are likely to be fraught with more difficulties, as well as a relatively low accuracy.
It should be noted that due to insufficient training model exists in the period 2000 - 2015 which is limited by sample size of database. This results in a larger fitting error in the fitting curve of GDP in the testing period by applying the single prediction models ARIMA, BP and GM(1,1). The prediction model based on BP is inappropriate for GDP forecast because data does not appear as S-type fluctuation characteristics. And the prediction ARIMA and GM(1,1) models also has poor performances because of limited sample size and non-linear economic growth for the four countries, especially for the two emerging economies.
In order to modify the grey forecasting, the GMCM model by considering the interrelationship between the states of time series through using Markov state transition probability matrix, which is tentatively fitted to the data and certainly improves the overall prediction accuracies in our experiment. But inadequate lies in that the model only captures (approximate) linear and stationary information.
The hybrid model GMCM_ANN overcomes the shortage of unsuitability for nonlinear sampled-data in the GM(1,1) model, and weakens the volatility and noisiness of data by using Markov chain, the artificial neural network (ANN) is incorporated in the GMCM error correction model to further achieve reliable and precise predicted results. Taking advantage of three involved components, the GMCM_ANN model has exhibited good adaptability for limited data and various volatilities. With using the same group of historical data of GDP, the proposed model has a better prediction than ARIMA, BP, GM(1,1) and GMCM models. In terms of the noisiness and volatility of databases in GDP forecasts, the GMCM_ANN model is more stable and reliable for predicting economic growth trends. This model enriches methodologies for forecasting that can be easily extended to settle down other prediction problems with limited and volatility data.
Conclusions
Grey predicting only requires limited sample size and has sufficient capacity to dealing with incomplete and uncertain information. Markov chain is accustomed used for extraction rule from data with randomness. ANN is an approximate simulation of biologic nerve system so that has great nonlinear processing capability to adapt to reality. Therefore, a combination prediction model consisted of GM(1,1), Markov chain and ANN is capable for GDP forecasting based on the raw data series with fluctuation and noise, as well as achieving high predicting accuracies. This paper presents a new hybrid prediction model abbreviated as GMCM_ANN with ANN error correction for GDP forecasting of the United States, Japan, China and India. The idea of correcting prediction error in advance makes the new technique flexible in enhancing predicting accuracy. The obtained results in empirical experiment validate the feasibility and effectiveness of the proposed method.
It is worth emphasizing that the development of GMCM_ANN as an assembly of three different models, has a relatively complex process so as to require a large amount of calculation and time consumption. And in reality economic aggregate of one country is affected by multiple influences, accordingly the data form change from single time series data into panel data, even mixed-frequency data. Hence it is often not enough to be convincing that GDP forecast only depends on historical GDP series. Further research includes developing the proposed approach to multi-factor prediction model with a variety of data types, and focusing on comparing predictive capability and precision in different economic scenarios, along with recommending some constructive policy suggestions for improving economic growth.
Footnotes
Acknowledgments
The work was supported by the Research Funds of Collaborative Innovation Center for Urban Industries Development in Chengdu-Chongqing Economic Zone, China (Grants No. 950218032). The authors are highly grateful to the editor and anonymous referees for their valuable comments and constructive suggestions which help to improve the present form of this paper.
Appendix
Original data from 2000 to 2019. (Units: Billion US Dollars)
| Year∖GDP | US | JP | CN | IN |
| 2000 | 12620 | 5349 | 2232 | 873 |
| 2001 | 12746 | 5371 | 2418 | 915 |
| 2002 | 12968 | 5377 | 2639 | 950 |
| 2003 | 13339 | 5459 | 2904 | 1025 |
| 2004 | 13846 | 5580 | 3198 | 1106 |
| 2005 | 14332 | 5672 | 3562 | 1194 |
| 2006 | 14742 | 5753 | 4015 | 1290 |
| 2007 | 15018 | 5848 | 4586 | 1389 |
| 2008 | 14998 | 5784 | 5029 | 1432 |
| 2009 | 14617 | 5471 | 5502 | 1544 |
| 2010 | 14992 | 5700 | 6087 | 1676 |
| 2011 | 15225 | 5694 | 6669 | 1763 |
| 2012 | 15567 | 5779 | 7193 | 1860 |
| 2013 | 15854 | 5894 | 7751 | 1978 |
| 2014 | 16243 | 5916 | 8317 | 2125 |
| 2015 | 16710 | 5989 | 8892 | 2295 |
| 2016 | 16972 | 6025 | 9491 | 2482 |
| 2017 | 17349 | 6141 | 10132 | 2660 |
| 2018 | 17856 | 6190 | 10797 | 2842 |
| 2019 | 18300 | 6211 | 11520 | 2940 |
Source: The World Bank Open Data.
